Detailed information    

In silico (bioinformatically predicted)

Overview


Name               comEC/celB
Type               Machinery gene
Locus tag          ACJVC4_RS07980
Genome accession   NZ_CP177179
Coordinates        1735337..1737577 (-)
Length             746 a.a.
NCBI ID            WP_002891564.1
UniProt ID         J7SHS1
Organism           Streptococcus salivarius strain MRD-NRLLH
Function           ssDNA transport into the cell (predicted from homology)
                   DNA binding and uptake
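
The coordinates and the protein length in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the coding sequence is complete and ends in a single stop codon. The function names are hypothetical and not part of any database tooling.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates of ACJVC4_RS07980 as listed in the overview.
cds_bp = span_length(1735337, 1737577)
protein_aa = expected_protein_length(cds_bp)
print(cds_bp, protein_aa)
```

Under these assumptions the span works out to 2241 bp and 746 residues, which agrees with the lengths given for the nucleotide and protein sequences in this record.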

Related MGE


Note: This gene co-localizes with a putative mobile genetic element (MGE) predicted in the genome by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates    Gene coordinates   Relative position   Distance (bp)
Prophage   1719597..1740191   1735337..1737577   within              0
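
The relative position and distance in the summary can be derived from the two coordinate ranges alone. The following is a minimal sketch, not VRprofile2's actual logic: only the label "within" appears in this record, so the labels "upstream", "downstream" and "overlapping", as well as the function names, are assumptions made for illustration.

```python
def parse_range(text: str) -> tuple[int, int]:
    """Parse a 'start..end' coordinate string into a pair of integers."""
    start, end = text.split("..")
    return int(start), int(end)


def relative_position(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Return a (label, distance in bp) pair for a gene relative to an MGE.

    Labels other than 'within' are hypothetical placeholders.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0


gene = parse_range("1735337..1737577")
prophage = parse_range("1719597..1740191")
result = relative_position(gene, prophage)
print(result)
```

For the coordinates listed above, the gene range lies entirely inside the prophage range, so the sketch reports "within" with a distance of 0.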


Gene organization within MGE regions


Location: 1719597..1740191
Locus tag       Gene name   Coordinates (strand)  Size (bp)  Protein ID      Product                                                    Description
ACJVC4_RS07915  ligA        1719597..1721555 (-)  1959       WP_037598365.1  NAD-dependent DNA ligase LigA                              -
ACJVC4_RS07920  -           1721719..1723060 (-)  1342       Protein_1520    IS3 family transposase                                     -
ACJVC4_RS07925  -           1723145..1723822 (+)  678        WP_002890056.1  helix-turn-helix domain-containing protein                 -
ACJVC4_RS07930  -           1723819..1724784 (+)  966        WP_002890057.1  IS3 family transposase                                     -
ACJVC4_RS07935  tnpA        1725000..1725473 (-)  474        WP_002891552.1  IS200/IS605 family transposase                             -
ACJVC4_RS07940  -           1726189..1727361 (+)  1173       WP_002891554.1  SIR2 family protein                                        -
ACJVC4_RS07945  -           1727354..1728979 (+)  1626       WP_002891555.1  ATP-binding protein                                        -
ACJVC4_RS07950  -           1729644..1730153 (-)  510        WP_002891556.1  QueT transporter family protein                            -
ACJVC4_RS07955  -           1730244..1731203 (-)  960        WP_002891558.1  YihY/virulence factor BrkB family protein                  -
ACJVC4_RS07960  -           1731205..1732065 (-)  861        WP_002891560.1  methionyl aminopeptidase                                   -
ACJVC4_RS07965  spxR        1732105..1733385 (-)  1281       WP_037598368.1  CBS-HotDog domain-containing transcription factor SpxR     -
ACJVC4_RS07970  -           1733378..1733923 (-)  546        WP_037598369.1  GNAT family N-acetyltransferase                            -
ACJVC4_RS07975  -           1733989..1735251 (-)  1263       WP_037598371.1  UDP-N-acetylglucosamine 1-carboxyvinyltransferase          -
ACJVC4_RS07980  comEC/celB  1735337..1737577 (-)  2241       WP_002891564.1  DNA internalization-related competence protein ComEC/Rec2  Machinery gene
ACJVC4_RS07985  comEA       1737567..1738262 (-)  696        WP_002891565.1  helix-hairpin-helix domain-containing protein              Machinery gene
ACJVC4_RS07990  -           1738371..1739126 (-)  756        WP_002891566.1  lysophospholipid acyltransferase family protein            -
ACJVC4_RS07995  -           1739256..1740191 (+)  936        WP_037598582.1  polysaccharide deacetylase family protein                  -

Sequence


Protein


Length: 746 a.a.   Molecular weight: 84350.04 Da   Isoelectric point: 9.6833

>NTDB_id=1085817 ACJVC4_RS07980 WP_002891564.1 1735337..1737577(-) (comEC/celB) [Streptococcus salivarius strain MRD-NRLLH]
MWLKKAPINLFSLSLLIAALYFTIFVTNVYAIGAFVFLMVCFLKHHWKNKAALKLVGLVGSFFLVYFLFLHHRASVQDKQ
APAEINQVTLVADTLSVNGEQLSAIGKAKGQTYQIFYRLKSEKEQHFFKTTSQTLVLKGKINLSQATAQRNFQGFNYQSY
LASQGIYRIAQIERLDQVVPQKSLSPLAFFHQLRRRALVHIQTHFPSPMRHYMTGLLFGYLDKEFDEQSQLYTSLGIIHL
FALSGMQVGFFLGWFRYGLLRLGLPKDYLFIVLLPFSLCYGLMTGWTASVLRSLVQSLLAEFGIKKLDNMGITLLLLFLL
LPHFLLTVGGVLSCSYAFLLCLFDFEDLSSLKKSIYTSLVLSLGILPFLTYYYGTFQPVSLILTAIFSIVFDSFLLPVLT
VFFALSGLVIFSQINPLFEWMESFLTWIQSWIGQPLILGKPSLFQFGLMIAVLVMLFDFWKKPQFRICLLMIFGLLMVWV
KHPLTNEVTVVDVGQGDSIFLRSMKGDTILIDVGGKVTFGTKEKWQESSQTSNAEKTLIPYLKARGVSQIDYLVLTHTDT
DHIGDLEEVAKCFKIKEICVSQGALTKPSFVKRLRTIKCPVHTLKAGDKLPMMGSNLQVLYPNKVGDGGNNDSIVLYGKL
LGSSFLFTGDLEKEGEEELMASYPTLRASVLKAGHHGSKGSSSEAFLDQLHPSLALVSAGENNRYKHPNDETIERFKQRH
IKILRTDKDGAIRFKGWFKWSSETVR

Nucleotide


Length: 2241 bp

>NTDB_id=1085817 ACJVC4_RS07980 WP_002891564.1 1735337..1737577(-) (comEC/celB) [Streptococcus salivarius strain MRD-NRLLH]
ATGTGGCTTAAAAAAGCTCCAATCAATCTTTTTTCCTTGTCTCTTTTAATAGCTGCCCTTTATTTTACGATTTTTGTCAC
TAATGTTTACGCTATTGGTGCTTTTGTCTTTCTTATGGTCTGTTTTTTGAAGCATCATTGGAAGAATAAGGCAGCCCTAA
AGTTGGTGGGACTTGTAGGTAGTTTCTTTTTGGTCTACTTTTTATTCTTGCACCACAGAGCTAGCGTACAAGATAAACAA
GCCCCTGCTGAAATCAATCAGGTGACTCTGGTTGCTGATACGCTATCGGTTAATGGTGAGCAATTATCAGCTATCGGTAA
GGCAAAGGGACAAACCTATCAGATCTTTTACCGACTCAAATCTGAGAAGGAGCAGCATTTTTTCAAGACTACTAGCCAAA
CGTTGGTATTGAAGGGAAAAATAAACTTATCTCAAGCAACTGCTCAGCGTAATTTTCAAGGGTTTAATTATCAGTCTTAT
CTAGCCAGTCAAGGTATTTATCGAATTGCTCAGATTGAGCGCTTGGACCAAGTGGTACCTCAAAAATCTCTATCTCCCCT
AGCTTTTTTCCATCAACTGAGGAGGAGGGCTTTGGTTCATATCCAGACGCACTTTCCTAGTCCTATGAGACACTATATGA
CAGGCCTGCTCTTTGGGTATTTGGATAAGGAGTTTGATGAGCAGAGCCAGCTTTACACAAGTTTAGGTATTATTCATCTA
TTCGCACTTTCGGGGATGCAAGTAGGCTTTTTTCTGGGATGGTTTCGCTACGGTCTCCTACGCTTAGGTCTTCCTAAAGA
TTATCTATTTATCGTCTTGCTGCCTTTTTCCTTATGTTATGGCTTAATGACAGGATGGACAGCTTCAGTCCTACGTTCCT
TGGTTCAAAGTTTGTTGGCGGAGTTTGGTATTAAAAAACTGGACAATATGGGAATAACCTTGCTTCTATTGTTTCTCTTA
TTACCTCATTTTCTCTTAACAGTGGGAGGTGTATTAAGTTGTTCCTATGCCTTCTTGTTGTGTTTGTTTGATTTTGAGGA
CTTGTCATCTTTAAAGAAATCAATCTATACGAGCTTGGTATTGAGTCTTGGGATTTTACCCTTTTTAACCTACTATTATG
GGACTTTTCAACCAGTTAGTTTGATTCTCACTGCAATCTTTTCAATTGTATTTGATAGCTTTCTCTTGCCTGTATTGACA
GTATTCTTTGCCCTCTCAGGATTGGTAATCTTCTCTCAAATCAACCCACTTTTTGAATGGATGGAGTCCTTTTTGACTTG
GATACAATCCTGGATAGGTCAACCTTTGATTTTAGGGAAACCTAGTTTGTTTCAGTTTGGCTTGATGATAGCAGTATTGG
TTATGCTCTTTGATTTTTGGAAAAAGCCTCAGTTTAGGATTTGCCTTTTGATGATTTTTGGACTTTTGATGGTCTGGGTC
AAACATCCTTTAACCAATGAGGTGACTGTGGTTGACGTTGGTCAGGGAGATAGTATTTTTCTGAGGAGTATGAAAGGCGA
TACCATCCTAATTGATGTGGGTGGCAAGGTGACGTTTGGAACTAAGGAAAAATGGCAAGAGTCTAGCCAGACGAGCAATG
CGGAGAAAACCTTGATTCCCTATCTAAAGGCTAGAGGAGTGTCTCAAATTGATTATCTGGTCCTGACGCATACGGACACA
GACCATATTGGTGATTTGGAAGAAGTGGCCAAGTGTTTTAAGATTAAGGAAATCTGTGTCAGTCAGGGGGCTTTGACTAA
GCCTAGCTTTGTTAAAAGACTTCGGACTATAAAATGCCCAGTTCACACTCTAAAGGCTGGCGACAAATTACCCATGATGG
GAAGCAATCTACAGGTTCTTTATCCAAATAAAGTTGGTGATGGCGGTAACAATGATTCGATAGTTCTTTACGGAAAGCTA
TTAGGAAGCAGTTTTCTGTTTACCGGTGATTTGGAAAAAGAAGGAGAGGAGGAACTGATGGCCAGCTATCCAACTTTAAG
GGCAAGCGTCCTCAAAGCTGGACACCACGGTTCAAAAGGTTCTTCGTCTGAAGCTTTTTTGGACCAGCTGCACCCCTCCC
TTGCACTTGTTTCAGCTGGTGAGAACAATCGTTATAAACATCCAAATGATGAAACAATAGAACGTTTTAAGCAACGCCAC
ATTAAAATTTTACGAACAGATAAAGACGGAGCCATCCGTTTTAAGGGCTGGTTTAAATGGTCAAGTGAAACAGTCCGATA
A
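
Both FASTA headers in this record follow the same layout: an NTDB identifier, the locus tag, the protein accession, the coordinates with strand, the gene name in parentheses, and the organism in square brackets. The sketch below shows one way such a header could be split into fields. The regular expression is inferred from the two headers shown here only, so it is an assumption about the format rather than a documented specification, and `parse_header` is a hypothetical helper.

```python
import re

# Pattern inferred from the headers in this record; other records may differ.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) "
    r"\[(?P<organism>.+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=1085817 ACJVC4_RS07980 WP_002891564.1 "
    "1735337..1737577(-) (comEC/celB) "
    "[Streptococcus salivarius strain MRD-NRLLH]"
)
record = parse_header(header)
print(record["locus_tag"], record["gene"], record["strand"])
```

Keeping the strand as a separate field makes it straightforward to decide later whether the nucleotide sequence needs to be reverse-complemented relative to the genome.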


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source         ID
AlphaFold DB   J7SHS1

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein     Organism                                 Identities (%)  Coverage (%)  Ha-value
comEC/celB  Streptococcus mitis SK321                48.133          100           0.484
comEC/celB  Streptococcus mitis NCTC 12261           48.064          100           0.483
comEC/celB  Streptococcus pneumoniae Rx1             46.667          100           0.469
comEC/celB  Streptococcus pneumoniae D39             46.667          100           0.469
comEC/celB  Streptococcus pneumoniae R6              46.667          100           0.469
comEC/celB  Streptococcus pneumoniae TIGR4           46.328          100           0.465
comEC       Lactococcus lactis subsp. cremoris KW2   43.902          98.928        0.434


Multiple sequence alignment