Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGC/cglC   Type   Machinery gene
Locus tag   QQS48_RS08335 Genome accession   NZ_CP127177
Coordinates   1724040..1724315 (-) Length   91 a.a.
NCBI ID   WP_002356991.1    Uniprot ID   A0A1B4XPV0
Organism   Enterococcus faecalis strain 90_2023     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1685066..1724016 1724040..1724315 flank 24


Gene organization within MGE regions


Location: 1685066..1724315
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QQS48_RS08045 (QQS48_08045) - 1685066..1686073 (-) 1008 WP_002357063.1 class I SAM-dependent methyltransferase -
  QQS48_RS08050 (QQS48_08050) comGG 1686202..1686555 (-) 354 WP_002362054.1 competence type IV pilus minor pilin ComGG -
  QQS48_RS08055 (QQS48_08055) comGF 1686555..1686989 (-) 435 WP_002357060.1 competence type IV pilus minor pilin ComGF -
  QQS48_RS08060 (QQS48_08060) - 1686979..1687380 (-) 402 WP_002397440.1 type II secretion system protein -
  QQS48_RS08065 (QQS48_08065) - 1687501..1687986 (-) 486 WP_174084474.1 hypothetical protein -
  QQS48_RS08070 (QQS48_08070) - 1688392..1688835 (-) 444 WP_126266804.1 type II toxin-antitoxin system PemK/MazF family toxin -
  QQS48_RS08075 (QQS48_08075) - 1688828..1689655 (-) 828 WP_164549170.1 DUF3800 domain-containing protein -
  QQS48_RS08080 (QQS48_08080) - 1689805..1691064 (-) 1260 WP_267570864.1 LysM peptidoglycan-binding domain-containing protein -
  QQS48_RS08085 (QQS48_08085) - 1691065..1691460 (-) 396 WP_267570863.1 phage holin family protein -
  QQS48_RS08090 (QQS48_08090) - 1691490..1692707 (-) 1218 WP_267570862.1 glycerophosphodiester phosphodiesterase -
  QQS48_RS08095 (QQS48_08095) - 1692707..1693024 (-) 318 WP_267570861.1 hypothetical protein -
  QQS48_RS08100 (QQS48_08100) - 1693017..1693331 (-) 315 WP_280196307.1 hypothetical protein -
  QQS48_RS08105 (QQS48_08105) - 1693331..1694233 (-) 903 WP_174084357.1 phage baseplate upper protein -
  QQS48_RS08110 (QQS48_08110) - 1694250..1697054 (-) 2805 WP_174084356.1 phage tail tip lysozyme -
  QQS48_RS08115 (QQS48_08115) - 1697051..1697764 (-) 714 WP_174084355.1 phage tail protein -
  QQS48_RS08120 (QQS48_08120) - 1697761..1700064 (-) 2304 WP_174084354.1 phage tail tape measure protein -
  QQS48_RS08125 (QQS48_08125) - 1700256..1700711 (-) 456 WP_174084353.1 hypothetical protein -
  QQS48_RS08130 (QQS48_08130) - 1700711..1701358 (-) 648 WP_002393025.1 tail protein -
  QQS48_RS08135 (QQS48_08135) - 1701355..1701717 (-) 363 WP_016634652.1 HK97-gp10 family putative phage morphogenesis protein -
  QQS48_RS08140 (QQS48_08140) - 1701707..1702045 (-) 339 WP_002406253.1 hypothetical protein -
  QQS48_RS08145 (QQS48_08145) - 1702035..1702355 (-) 321 WP_267570859.1 phage head closure protein -
  QQS48_RS08150 (QQS48_08150) - 1702339..1702608 (-) 270 WP_285534243.1 phage gp6-like head-tail connector protein -
  QQS48_RS08155 (QQS48_08155) - 1702750..1703928 (-) 1179 WP_174084350.1 phage major capsid protein -
  QQS48_RS08160 (QQS48_08160) - 1703925..1704599 (-) 675 WP_174084349.1 head maturation protease, ClpP-related -
  QQS48_RS08165 (QQS48_08165) - 1704580..1705761 (-) 1182 WP_174084348.1 phage portal protein -
  QQS48_RS08170 (QQS48_08170) - 1705776..1707467 (-) 1692 WP_262120590.1 terminase TerL endonuclease subunit -
  QQS48_RS08175 (QQS48_08175) - 1707469..1707831 (-) 363 WP_104876160.1 hypothetical protein -
  QQS48_RS08185 (QQS48_08185) - 1709535..1709672 (+) 138 WP_010773976.1 hypothetical protein -
  QQS48_RS08190 (QQS48_08190) - 1709708..1709971 (-) 264 WP_174084463.1 hypothetical protein -
  QQS48_RS08195 (QQS48_08195) - 1710324..1710791 (-) 468 WP_002380819.1 ArpU family phage packaging/lysis transcriptional regulator -
  QQS48_RS08200 (QQS48_08200) - 1710976..1711200 (-) 225 WP_016632373.1 hypothetical protein -
  QQS48_RS08205 (QQS48_08205) - 1711383..1711931 (-) 549 WP_100966712.1 DUF1642 domain-containing protein -
  QQS48_RS08210 (QQS48_08210) - 1711924..1712091 (-) 168 WP_048948834.1 hypothetical protein -
  QQS48_RS08215 (QQS48_08215) - 1712088..1712498 (-) 411 WP_100966713.1 YopX family protein -
  QQS48_RS08220 (QQS48_08220) - 1712532..1712843 (-) 312 WP_002393129.1 hypothetical protein -
  QQS48_RS08225 (QQS48_08225) - 1712864..1713184 (-) 321 WP_016024806.1 hypothetical protein -
  QQS48_RS08230 (QQS48_08230) - 1713186..1713650 (-) 465 Protein_1598 class I SAM-dependent methyltransferase -
  QQS48_RS08235 (QQS48_08235) - 1713707..1713934 (-) 228 WP_002389214.1 hypothetical protein -
  QQS48_RS08240 (QQS48_08240) - 1714062..1714586 (-) 525 WP_010711643.1 hypothetical protein -
  QQS48_RS08245 (QQS48_08245) - 1714590..1715522 (-) 933 WP_280196305.1 phage replisome organizer N-terminal domain-containing protein -
  QQS48_RS08250 (QQS48_08250) - 1715527..1716207 (-) 681 WP_002389197.1 putative HNHc nuclease -
  QQS48_RS08255 (QQS48_08255) - 1716204..1717142 (-) 939 WP_002410850.1 DUF1351 domain-containing protein -
  QQS48_RS08260 (QQS48_08260) - 1717127..1717804 (-) 678 WP_002389124.1 ERF family protein -
  QQS48_RS08265 (QQS48_08265) - 1717805..1718041 (-) 237 WP_002389004.1 hypothetical protein -
  QQS48_RS08270 (QQS48_08270) - 1718098..1718253 (-) 156 WP_002389151.1 hypothetical protein -
  QQS48_RS08275 (QQS48_08275) - 1718418..1718603 (+) 186 WP_025188165.1 DUF1508 domain-containing protein -
  QQS48_RS08280 (QQS48_08280) - 1718768..1718908 (-) 141 WP_280196304.1 hypothetical protein -
  QQS48_RS08285 (QQS48_08285) - 1718920..1719111 (-) 192 WP_236547317.1 hypothetical protein -
  QQS48_RS08290 (QQS48_08290) - 1719122..1719307 (-) 186 WP_033598023.1 hypothetical protein -
  QQS48_RS08295 (QQS48_08295) - 1719318..1720031 (-) 714 WP_267570857.1 Rha family transcriptional regulator -
  QQS48_RS08300 (QQS48_08300) - 1720070..1720381 (-) 312 WP_280196303.1 hypothetical protein -
  QQS48_RS08305 (QQS48_08305) - 1720392..1720568 (-) 177 WP_002364354.1 helix-turn-helix transcriptional regulator -
  QQS48_RS08310 (QQS48_08310) - 1720877..1721200 (+) 324 WP_010826721.1 helix-turn-helix transcriptional regulator -
  QQS48_RS08315 (QQS48_08315) - 1721218..1721562 (+) 345 WP_267570856.1 ImmA/IrrE family metallo-endopeptidase -
  QQS48_RS08320 (QQS48_08320) - 1721596..1722324 (+) 729 WP_126266796.1 potassium channel family protein -
  QQS48_RS08325 (QQS48_08325) - 1722424..1723572 (+) 1149 WP_010711674.1 site-specific integrase -
  QQS48_RS08330 (QQS48_08330) comGD 1723609..1724043 (-) 435 Protein_1618 competence type IV pilus minor pilin ComGD -
  QQS48_RS08335 (QQS48_08335) comGC/cglC 1724040..1724315 (-) 276 WP_002356991.1 competence type IV pilus major pilin ComGC Machinery gene

Sequence


Protein


Download         Length: 91 a.a.        Molecular weight: 10464.41 Da        Isoelectric Point: 9.3192

>NTDB_id=842378 QQS48_RS08335 WP_002356991.1 1724040..1724315(-) (comGC/cglC) [Enterococcus faecalis strain 90_2023]
MKKKQKYAGFTLLEMLIVLLIISVLILLFVPNLAKHKETVDKKGNEAIVKIVESQIELYTLEKNKTPSLNELVNEGYITK
EQLDKYTAEKQ

Nucleotide


Download         Length: 276 bp        

>NTDB_id=842378 QQS48_RS08335 WP_002356991.1 1724040..1724315(-) (comGC/cglC) [Enterococcus faecalis strain 90_2023]
ATGAAAAAGAAACAAAAATACGCAGGGTTTACATTATTAGAAATGTTGATTGTCTTATTGATTATTTCCGTATTGATTTT
ACTTTTTGTTCCTAACTTAGCGAAACATAAAGAAACAGTTGATAAAAAAGGCAATGAAGCAATCGTAAAAATTGTAGAAT
CACAAATCGAGCTCTACACACTAGAAAAAAATAAGACGCCTTCCTTAAATGAATTAGTCAACGAAGGCTACATTACTAAA
GAGCAGTTAGATAAATATACAGCAGAAAAGCAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1B4XPV0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC/cglC Streptococcus pneumoniae Rx1

63.095

92.308

0.582

  comGC/cglC Streptococcus pneumoniae D39

63.095

92.308

0.582

  comGC/cglC Streptococcus pneumoniae R6

63.095

92.308

0.582

  comGC/cglC Streptococcus pneumoniae TIGR4

63.095

92.308

0.582

  comGC/cglC Streptococcus mitis NCTC 12261

61.905

92.308

0.571

  comGC/cglC Streptococcus mitis SK321

61.176

93.407

0.571

  comGC Lactococcus lactis subsp. cremoris KW2

58.14

94.505

0.549

  comYC Streptococcus gordonii str. Challis substr. CH1

56.322

95.604

0.538

  comYC Streptococcus suis isolate S10

52.326

94.505

0.495

  comYC Streptococcus mutans UA159

57.692

85.714

0.495

  comYC Streptococcus mutans UA140

57.692

85.714

0.495

  comGC Latilactobacillus sakei subsp. sakei 23K

46.154

100

0.462

  comGC Staphylococcus aureus MW2

46.835

86.813

0.407

  comGC Staphylococcus aureus N315

46.835

86.813

0.407

  comGC Bacillus subtilis subsp. subtilis str. 168

48.649

81.319

0.396