Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGC/cglC   Type   Machinery gene
Locus tag   QSV41_RS09020 Genome accession   NZ_CP128464
Coordinates   1739557..1739832 (-) Length   91 a.a.
NCBI ID   WP_002356991.1    Uniprot ID   A0A1B4XPV0
Organism   Enterococcus faecalis strain RE25     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1695475..1739533 1739557..1739832 flank 24


Gene organization within MGE regions


Location: 1695475..1739832
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QSV41_RS08715 (QSV41_08720) - 1695475..1696482 (-) 1008 WP_002357063.1 class I SAM-dependent methyltransferase -
  QSV41_RS08720 (QSV41_08725) comGG 1696611..1696964 (-) 354 WP_002362054.1 competence type IV pilus minor pilin ComGG -
  QSV41_RS08725 (QSV41_08730) comGF 1696964..1697398 (-) 435 WP_002357060.1 competence type IV pilus minor pilin ComGF -
  QSV41_RS08730 (QSV41_08735) - 1697388..1697714 (-) 327 WP_010775953.1 type II secretion system protein -
  QSV41_RS08735 (QSV41_08740) hemH 1698516..1699457 (-) 942 WP_002357056.1 ferrochelatase -
  QSV41_RS08745 (QSV41_08750) - 1699958..1700917 (+) 960 WP_000221326.1 IS30-like element IS6770 family transposase -
  QSV41_RS08750 (QSV41_08755) - 1701241..1701513 (-) 273 WP_002370964.1 hypothetical protein -
  QSV41_RS08755 (QSV41_08760) - 1701585..1701755 (-) 171 WP_289690528.1 cold-shock protein -
  QSV41_RS08760 (QSV41_08765) - 1702592..1703833 (-) 1242 WP_002370962.1 LysM peptidoglycan-binding domain-containing protein -
  QSV41_RS08765 (QSV41_08770) - 1703834..1704067 (-) 234 WP_002384371.1 phage holin -
  QSV41_RS08770 (QSV41_08775) - 1704060..1704281 (-) 222 WP_002364191.1 hypothetical protein -
  QSV41_RS08775 (QSV41_08780) - 1704316..1704471 (-) 156 WP_002364192.1 XkdX family protein -
  QSV41_RS08780 (QSV41_08785) - 1704473..1704793 (-) 321 WP_002389262.1 hypothetical protein -
  QSV41_RS08785 (QSV41_08790) - 1704807..1705298 (-) 492 WP_289690532.1 hypothetical protein -
  QSV41_RS08790 (QSV41_08795) - 1705298..1705585 (-) 288 WP_002357046.1 collagen-like protein -
  QSV41_RS08795 (QSV41_08800) - 1705582..1706178 (-) 597 WP_002357045.1 hypothetical protein -
  QSV41_RS08800 (QSV41_08805) - 1706171..1707052 (-) 882 WP_002357044.1 phage baseplate upper protein -
  QSV41_RS08805 (QSV41_08810) - 1707071..1709878 (-) 2808 WP_002384367.1 phage tail spike protein -
  QSV41_RS08810 (QSV41_08815) - 1709860..1710594 (-) 735 WP_002357042.1 hypothetical protein -
  QSV41_RS08815 (QSV41_08820) - 1710584..1713481 (-) 2898 WP_202575549.1 tape measure protein -
  QSV41_RS08820 (QSV41_08825) - 1713729..1714079 (-) 351 WP_002370959.1 hypothetical protein -
  QSV41_RS08825 (QSV41_08830) - 1714132..1714980 (-) 849 WP_002384363.1 major tail protein -
  QSV41_RS08830 (QSV41_08835) - 1714981..1715355 (-) 375 WP_002357037.1 DUF6838 family protein -
  QSV41_RS08835 (QSV41_08840) - 1715358..1715756 (-) 399 WP_002357036.1 HK97 gp10 family phage protein -
  QSV41_RS08840 (QSV41_08845) - 1715749..1716117 (-) 369 WP_002357034.1 hypothetical protein -
  QSV41_RS08845 (QSV41_08850) - 1716114..1716458 (-) 345 WP_002357033.1 hypothetical protein -
  QSV41_RS08850 (QSV41_08855) - 1716472..1716654 (-) 183 WP_002357032.1 hypothetical protein -
  QSV41_RS08855 (QSV41_08860) - 1716683..1717570 (-) 888 WP_002357030.1 DUF5309 domain-containing protein -
  QSV41_RS08860 (QSV41_08865) - 1717584..1718207 (-) 624 WP_002389133.1 DUF4355 domain-containing protein -
  QSV41_RS08865 (QSV41_08870) - 1718425..1718745 (-) 321 WP_002389265.1 hypothetical protein -
  QSV41_RS08870 (QSV41_08875) - 1718810..1719031 (-) 222 WP_002357027.1 hypothetical protein -
  QSV41_RS08875 (QSV41_08880) - 1719028..1720785 (-) 1758 WP_002384360.1 head protein -
  QSV41_RS08880 (QSV41_08885) - 1720760..1722247 (-) 1488 WP_002384358.1 phage portal protein -
  QSV41_RS08885 (QSV41_08890) - 1722259..1723548 (-) 1290 WP_002403123.1 PBSX family phage terminase large subunit -
  QSV41_RS08890 (QSV41_08895) terS 1723520..1724323 (-) 804 WP_002389117.1 phage terminase small subunit -
  QSV41_RS08895 (QSV41_08900) - 1724592..1725509 (+) 918 WP_002370955.1 hypothetical protein -
  QSV41_RS08900 (QSV41_08905) - 1725547..1726125 (-) 579 WP_002370953.1 sce7726 family protein -
  QSV41_RS08905 (QSV41_08910) - 1726269..1726502 (-) 234 WP_002389079.1 hypothetical protein -
  QSV41_RS08915 (QSV41_08920) - 1727496..1727912 (-) 417 WP_010816133.1 ArpU family phage packaging/lysis transcriptional regulator -
  QSV41_RS08920 (QSV41_08925) - 1728819..1729244 (-) 426 WP_002389212.1 RusA family crossover junction endodeoxyribonuclease -
  QSV41_RS08925 (QSV41_08930) - 1729262..1729561 (-) 300 WP_002389080.1 MazG-like family protein -
  QSV41_RS08930 (QSV41_08935) - 1729562..1729864 (-) 303 WP_002389299.1 hypothetical protein -
  QSV41_RS08935 (QSV41_08940) - 1729868..1730869 (-) 1002 WP_002389256.1 Lin1244/Lin1753 domain-containing protein -
  QSV41_RS08940 (QSV41_08945) - 1730906..1731799 (-) 894 WP_002369907.1 recombinase RecT -
  QSV41_RS08945 (QSV41_08950) - 1731799..1732743 (-) 945 WP_002389314.1 lambda-exonuclease family protein -
  QSV41_RS08950 (QSV41_08955) - 1732841..1733068 (-) 228 WP_002364220.1 hypothetical protein -
  QSV41_RS08955 (QSV41_08960) - 1733068..1733391 (-) 324 WP_002369793.1 hypothetical protein -
  QSV41_RS08960 (QSV41_08965) - 1733435..1733620 (-) 186 WP_002364222.1 hypothetical protein -
  QSV41_RS08965 (QSV41_08970) - 1733611..1733805 (-) 195 WP_002364223.1 hypothetical protein -
  QSV41_RS08970 (QSV41_08975) - 1733858..1734040 (+) 183 WP_002364224.1 YegP family protein -
  QSV41_RS08975 (QSV41_08980) - 1734080..1734801 (-) 722 Protein_1668 ORF6C domain-containing protein -
  QSV41_RS08980 (QSV41_08985) - 1734840..1735157 (-) 318 WP_002364228.1 hypothetical protein -
  QSV41_RS08985 (QSV41_08990) - 1735163..1735354 (-) 192 WP_002356998.1 hypothetical protein -
  QSV41_RS08990 (QSV41_08995) - 1735645..1735989 (+) 345 WP_002385819.1 helix-turn-helix transcriptional regulator -
  QSV41_RS08995 (QSV41_09000) - 1736002..1736640 (+) 639 WP_002389292.1 ImmA/IrrE family metallo-endopeptidase -
  QSV41_RS09000 (QSV41_09005) - 1736758..1737522 (+) 765 WP_002389030.1 LysM domain-containing protein -
  QSV41_RS09005 (QSV41_09010) - 1737537..1737866 (+) 330 WP_002369911.1 hypothetical protein -
  QSV41_RS09010 (QSV41_09015) - 1737941..1739089 (+) 1149 WP_002389015.1 site-specific integrase -
  QSV41_RS09015 (QSV41_09020) comGD 1739117..1739560 (-) 444 WP_025189135.1 competence type IV pilus minor pilin ComGD -
  QSV41_RS09020 (QSV41_09025) comGC/cglC 1739557..1739832 (-) 276 WP_002356991.1 competence type IV pilus major pilin ComGC Machinery gene

Sequence


Protein


Download         Length: 91 a.a.        Molecular weight: 10464.41 Da        Isoelectric Point: 9.3192

>NTDB_id=847519 QSV41_RS09020 WP_002356991.1 1739557..1739832(-) (comGC/cglC) [Enterococcus faecalis strain RE25]
MKKKQKYAGFTLLEMLIVLLIISVLILLFVPNLAKHKETVDKKGNEAIVKIVESQIELYTLEKNKTPSLNELVNEGYITK
EQLDKYTAEKQ

Nucleotide


Download         Length: 276 bp        

>NTDB_id=847519 QSV41_RS09020 WP_002356991.1 1739557..1739832(-) (comGC/cglC) [Enterococcus faecalis strain RE25]
ATGAAAAAGAAACAAAAATACGCGGGGTTTACATTATTAGAAATGTTGATTGTCTTATTGATTATTTCCGTATTGATTTT
ACTTTTTGTCCCTAACTTAGCGAAACATAAAGAAACAGTTGACAAAAAAGGCAATGAAGCAATCGTAAAAATTGTAGAAT
CACAAATCGAGCTCTACACACTAGAAAAAAATAAGACGCCTTCCTTAAATGAATTAGTCAACGAAGGCTACATTACTAAA
GAGCAGTTAGATAAATATACAGCAGAAAAGCAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1B4XPV0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC/cglC Streptococcus pneumoniae Rx1

63.095

92.308

0.582

  comGC/cglC Streptococcus pneumoniae D39

63.095

92.308

0.582

  comGC/cglC Streptococcus pneumoniae R6

63.095

92.308

0.582

  comGC/cglC Streptococcus pneumoniae TIGR4

63.095

92.308

0.582

  comGC/cglC Streptococcus mitis NCTC 12261

61.905

92.308

0.571

  comGC/cglC Streptococcus mitis SK321

61.176

93.407

0.571

  comGC Lactococcus lactis subsp. cremoris KW2

58.14

94.505

0.549

  comYC Streptococcus gordonii str. Challis substr. CH1

56.322

95.604

0.538

  comYC Streptococcus suis isolate S10

52.326

94.505

0.495

  comYC Streptococcus mutans UA159

57.692

85.714

0.495

  comYC Streptococcus mutans UA140

57.692

85.714

0.495

  comGC Latilactobacillus sakei subsp. sakei 23K

46.154

100

0.462

  comGC Staphylococcus aureus MW2

46.835

86.813

0.407

  comGC Staphylococcus aureus N315

46.835

86.813

0.407

  comGC Bacillus subtilis subsp. subtilis str. 168

48.649

81.319

0.396