Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGC/cglC
Type: Machinery gene
Locus tag: NLG43_RS08555
Genome accession: NZ_CP100596
Coordinates: 1725742..1726017 (-)
Length: 91 a.a.
NCBI ID: WP_002356991.1
UniProt ID: A0A1B4XPV0
Organism: Enterococcus faecalis strain Chr-JH 2-2
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1683277..1725718 1725742..1726017 flank 24
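
The "flank" position and the 24 bp distance in the row above follow directly from the two coordinate ranges. A minimal sketch of that arithmetic, assuming inclusive 1-based coordinates and that the distance is the difference between the nearer endpoints of the two ranges. The helper names `parse_range` and `gap_between` are hypothetical, and the exact convention used by VRprofile2 is not stated on this page.

```python
def parse_range(text):
    """Parse a 'start..end' string into a (start, end) tuple of ints."""
    start, end = text.split("..")
    return int(start), int(end)


def gap_between(a, b):
    """Endpoint difference between two ranges; 0 if they touch or overlap."""
    (_, first_end), (second_start, _) = sorted([a, b])
    return max(0, second_start - first_end)


mge = parse_range("1683277..1725718")   # prophage region from the table
gene = parse_range("1725742..1726017")  # comGC/cglC

distance = gap_between(mge, gene)
gene_length_bp = gene[1] - gene[0] + 1  # inclusive coordinates

print(distance, gene_length_bp)
```

Under these assumptions the gene lies just outside the prophage boundary, which matches the "flank" label, and the inclusive length agrees with the 276 bp reported for the nucleotide sequence.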


Gene organization within MGE regions


Location: 1683277..1726017
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NLG43_RS08250 (NLG43_08230) - 1683277..1684284 (-) 1008 WP_002357063.1 class I SAM-dependent methyltransferase -
  NLG43_RS08255 (NLG43_08235) comGG 1684413..1684766 (-) 354 WP_010820074.1 competence type IV pilus minor pilin ComGG -
  NLG43_RS08260 (NLG43_08240) comGF 1684766..1685200 (-) 435 WP_002378441.1 competence type IV pilus minor pilin ComGF -
  NLG43_RS08265 (NLG43_08245) - 1685190..1685561 (-) 372 WP_171025395.1 type II secretion system protein -
  NLG43_RS08270 - 1685895..1686020 (+) 126 WP_002364184.1 hypothetical protein -
  NLG43_RS08275 (NLG43_08250) hemH 1686317..1687258 (-) 942 WP_002364185.1 ferrochelatase -
  NLG43_RS08285 (NLG43_08260) - 1688002..1688226 (-) 225 WP_010708354.1 hypothetical protein -
  NLG43_RS08290 (NLG43_08265) - 1688324..1688524 (-) 201 WP_002357053.1 cold-shock protein -
  NLG43_RS08295 (NLG43_08270) - 1689361..1690602 (-) 1242 WP_002394567.1 LysM peptidoglycan-binding domain-containing protein -
  NLG43_RS08300 (NLG43_08275) - 1690603..1690836 (-) 234 WP_002384371.1 phage holin -
  NLG43_RS08305 (NLG43_08280) - 1690829..1691050 (-) 222 WP_002364191.1 hypothetical protein -
  NLG43_RS08310 (NLG43_08285) - 1691085..1691240 (-) 156 WP_002364192.1 XkdX family protein -
  NLG43_RS08315 (NLG43_08290) - 1691242..1691562 (-) 321 WP_002389262.1 hypothetical protein -
  NLG43_RS08320 (NLG43_08295) - 1691576..1692067 (-) 492 WP_002364194.1 hypothetical protein -
  NLG43_RS08325 (NLG43_08300) - 1692067..1692354 (-) 288 WP_002357046.1 collagen-like protein -
  NLG43_RS08330 (NLG43_08305) - 1692351..1692947 (-) 597 WP_142427739.1 hypothetical protein -
  NLG43_RS08335 (NLG43_08310) - 1692940..1693821 (-) 882 WP_002387290.1 phage baseplate upper protein -
  NLG43_RS08340 (NLG43_08315) - 1693840..1696647 (-) 2808 WP_185939559.1 phage tail spike protein -
  NLG43_RS08345 (NLG43_08320) - 1696629..1697363 (-) 735 WP_002403868.1 tail protein -
  NLG43_RS08350 (NLG43_08325) - 1697353..1700250 (-) 2898 WP_185939561.1 tape measure protein -
  NLG43_RS08355 (NLG43_08330) - 1700498..1700848 (-) 351 WP_002357039.1 hypothetical protein -
  NLG43_RS08360 (NLG43_08335) - 1700901..1701749 (-) 849 WP_002387287.1 major tail protein -
  NLG43_RS08365 (NLG43_08340) - 1701750..1702124 (-) 375 WP_002387286.1 DUF6838 family protein -
  NLG43_RS08370 (NLG43_08345) - 1702127..1702525 (-) 399 WP_002357036.1 HK97 gp10 family phage protein -
  NLG43_RS08375 (NLG43_08350) - 1702518..1702886 (-) 369 WP_002357034.1 hypothetical protein -
  NLG43_RS08380 (NLG43_08355) - 1702883..1703227 (-) 345 WP_002357033.1 hypothetical protein -
  NLG43_RS08385 (NLG43_08360) - 1703241..1703423 (-) 183 WP_002357032.1 hypothetical protein -
  NLG43_RS08390 (NLG43_08365) - 1703452..1704339 (-) 888 WP_142955395.1 DUF5309 domain-containing protein -
  NLG43_RS08395 (NLG43_08370) - 1704353..1704976 (-) 624 WP_002393977.1 DUF4355 domain-containing protein -
  NLG43_RS08400 (NLG43_08375) - 1705195..1705515 (-) 321 WP_281517316.1 hypothetical protein -
  NLG43_RS08405 (NLG43_08380) - 1705572..1705802 (-) 231 WP_002380437.1 hypothetical protein -
  NLG43_RS08410 (NLG43_08385) - 1705803..1707707 (-) 1905 WP_002393613.1 hypothetical protein -
  NLG43_RS08415 (NLG43_08390) - 1707682..1709169 (-) 1488 WP_002393614.1 phage portal protein -
  NLG43_RS08420 (NLG43_08395) - 1709181..1710470 (-) 1290 WP_061691125.1 PBSX family phage terminase large subunit -
  NLG43_RS08425 (NLG43_08400) terS 1710442..1711245 (-) 804 WP_010823780.1 phage terminase small subunit -
  NLG43_RS08430 (NLG43_08405) - 1711304..1711663 (-) 360 WP_010823781.1 hypothetical protein -
  NLG43_RS08435 (NLG43_08410) - 1712241..1712840 (-) 600 WP_142955394.1 hypothetical protein -
  NLG43_RS08440 (NLG43_08415) - 1712853..1713860 (-) 1008 WP_010823783.1 Kiwa anti-phage protein KwaB-like domain-containing protein -
  NLG43_RS08450 (NLG43_08425) - 1714575..1714991 (-) 417 WP_010823784.1 ArpU family phage packaging/lysis transcriptional regulator -
  NLG43_RS08455 (NLG43_08430) - 1715359..1715619 (-) 261 WP_002357017.1 hypothetical protein -
  NLG43_RS08460 (NLG43_08435) - 1715898..1716332 (-) 435 WP_002402491.1 RusA family crossover junction endodeoxyribonuclease -
  NLG43_RS08465 (NLG43_08440) - 1716341..1716640 (-) 300 WP_002372047.1 MazG-like family protein -
  NLG43_RS08470 (NLG43_08445) - 1716641..1716943 (-) 303 WP_002368214.1 hypothetical protein -
  NLG43_RS08475 (NLG43_08450) - 1716940..1717800 (-) 861 WP_002402489.1 helix-turn-helix domain-containing protein -
  NLG43_RS08480 (NLG43_08455) - 1717800..1718000 (-) 201 WP_010715320.1 hypothetical protein -
  NLG43_RS08485 (NLG43_08460) - 1718005..1718646 (-) 642 WP_002402487.1 putative HNHc nuclease -
  NLG43_RS08490 (NLG43_08465) - 1718651..1719385 (-) 735 WP_002402486.1 ERF family protein -
  NLG43_RS08495 (NLG43_08470) - 1719378..1719695 (-) 318 WP_002402485.1 hypothetical protein -
  NLG43_RS08500 (NLG43_08475) - 1719891..1720229 (-) 339 WP_002402484.1 hypothetical protein -
  NLG43_RS08505 (NLG43_08480) - 1720266..1720475 (-) 210 WP_002378465.1 hypothetical protein -
  NLG43_RS08510 (NLG43_08485) - 1720530..1720718 (+) 189 WP_002357001.1 YegP family protein -
  NLG43_RS08515 (NLG43_08490) - 1720744..1721466 (-) 723 WP_002410531.1 ORF6C domain-containing protein -
  NLG43_RS08520 (NLG43_08495) - 1721489..1721800 (-) 312 WP_002403885.1 hypothetical protein -
  NLG43_RS08525 (NLG43_08500) - 1721812..1722003 (-) 192 WP_002383936.1 hypothetical protein -
  NLG43_RS08530 (NLG43_08505) - 1722299..1722640 (+) 342 WP_002387267.1 helix-turn-helix transcriptional regulator -
  NLG43_RS08535 (NLG43_08510) - 1722645..1723295 (+) 651 WP_002387266.1 ImmA/IrrE family metallo-endopeptidase -
  NLG43_RS08540 (NLG43_08515) - 1723391..1724020 (+) 630 WP_002387265.1 SHOCT domain-containing protein -
  NLG43_RS08545 (NLG43_08520) - 1724126..1725274 (+) 1149 WP_002387264.1 site-specific integrase -
  NLG43_RS08550 (NLG43_08525) comGD 1725308..1725745 (-) 438 WP_281517317.1 competence type IV pilus minor pilin ComGD -
  NLG43_RS08555 (NLG43_08530) comGC/cglC 1725742..1726017 (-) 276 WP_002356991.1 competence type IV pilus major pilin ComGC Machinery gene

Sequence


Protein


Length: 91 a.a.        Molecular weight: 10464.41 Da        Isoelectric point: 9.3192

>NTDB_id=705873 NLG43_RS08555 WP_002356991.1 1725742..1726017(-) (comGC/cglC) [Enterococcus faecalis strain Chr-JH 2-2]
MKKKQKYAGFTLLEMLIVLLIISVLILLFVPNLAKHKETVDKKGNEAIVKIVESQIELYTLEKNKTPSLNELVNEGYITK
EQLDKYTAEKQ

Nucleotide


Length: 276 bp

>NTDB_id=705873 NLG43_RS08555 WP_002356991.1 1725742..1726017(-) (comGC/cglC) [Enterococcus faecalis strain Chr-JH 2-2]
ATGAAAAAGAAACAAAAATACGCAGGGTTTACATTATTAGAAATGTTGATTGTCTTATTGATTATTTCCGTATTGATTTT
ACTTTTTGTTCCTAACTTAGCGAAACATAAAGAAACAGTTGATAAAAAAGGCAATGAAGCAATCGTAAAAATTGTAGAAT
CACAAATCGAGCTCTACACACTAGAAAAAAATAAGACGCCTTCCTTAAATGAATTAGTCAACGAAGGCTACATTACTAAA
GAGCAGTTAGATAAATATACAGCAGAAAAGCAATGA
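
The nucleotide record is given in coding orientation (it starts with ATG even though the gene is on the minus strand), so it can be checked against the protein record by direct translation. The sketch below is a standard-library illustration, assuming the standard genetic code (which agrees with the bacterial table for these codons). `translate` and the constant names are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop."""
    codons = (cds[i:i + 3] for i in range(0, len(cds) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# Sequences copied from the FASTA records above.
CDS = (
    "ATGAAAAAGAAACAAAAATACGCAGGGTTTACATTATTAGAAATGTTGATTGTCTTATTGATTATTTCCGTATTGATTTT"
    "ACTTTTTGTTCCTAACTTAGCGAAACATAAAGAAACAGTTGATAAAAAAGGCAATGAAGCAATCGTAAAAATTGTAGAAT"
    "CACAAATCGAGCTCTACACACTAGAAAAAAATAAGACGCCTTCCTTAAATGAATTAGTCAACGAAGGCTACATTACTAAA"
    "GAGCAGTTAGATAAATATACAGCAGAAAAGCAATGA"
)
PROTEIN = (
    "MKKKQKYAGFTLLEMLIVLLIISVLILLFVPNLAKHKETVDKKGNEAIVKIVESQIELYTLEKNKTPSLNELVNEGYITK"
    "EQLDKYTAEKQ"
)

translated = translate(CDS)
print(len(CDS), len(PROTEIN), translated[:-1] == PROTEIN, translated[-1] == "*")
```

The 276 bp sequence corresponds to 92 codons: 91 residues plus the terminal stop codon, which is why the protein length is reported as 91 a.a.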


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1B4XPV0

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC/cglC Streptococcus pneumoniae Rx1 63.095 92.308 0.582
  comGC/cglC Streptococcus pneumoniae D39 63.095 92.308 0.582
  comGC/cglC Streptococcus pneumoniae R6 63.095 92.308 0.582
  comGC/cglC Streptococcus pneumoniae TIGR4 63.095 92.308 0.582
  comGC/cglC Streptococcus mitis NCTC 12261 61.905 92.308 0.571
  comGC/cglC Streptococcus mitis SK321 61.176 93.407 0.571
  comGC Lactococcus lactis subsp. cremoris KW2 58.14 94.505 0.549
  comYC Streptococcus gordonii str. Challis substr. CH1 56.322 95.604 0.538
  comYC Streptococcus suis isolate S10 52.326 94.505 0.495
  comYC Streptococcus mutans UA159 57.692 85.714 0.495
  comYC Streptococcus mutans UA140 57.692 85.714 0.495
  comGC Latilactobacillus sakei subsp. sakei 23K 46.154 100 0.462
  comGC Staphylococcus aureus MW2 46.835 86.813 0.407
  comGC Staphylococcus aureus N315 46.835 86.813 0.407
  comGC Bacillus subtilis subsp. subtilis str. 168 48.649 81.319 0.396
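
The page does not define the Ha-value. The listed numbers are, however, consistent with identity multiplied by coverage, both expressed as fractions and rounded to three decimals. The sketch below checks that reading against a few rows. This is an inference from the table, not a documented definition, and `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed definition: (identity / 100) * (coverage / 100), 3 decimals."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


# (organism, identities %, coverage %, listed Ha-value), copied from the table.
rows = [
    ("Streptococcus pneumoniae Rx1", 63.095, 92.308, 0.582),
    ("Lactococcus lactis subsp. cremoris KW2", 58.14, 94.505, 0.549),
    ("Latilactobacillus sakei subsp. sakei 23K", 46.154, 100, 0.462),
    ("Bacillus subtilis subsp. subtilis str. 168", 48.649, 81.319, 0.396),
]

matches = [ha_value(ident, cov) == listed for _, ident, cov, listed in rows]
print(all(matches))
```

If this reading is right, the score rewards hits that are both similar and cover most of the query, which would explain why the Streptococcus mutans entries rank below Streptococcus gordonii despite a higher identity.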