Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGD/cglD   Type   Machinery gene
Locus tag   EQH24_RS10210 Genome accession   NZ_CP035256
Coordinates   1967498..1967902 (-) Length   134 a.a.
NCBI ID   WP_000588023.1    Uniprot ID   -
Organism   Streptococcus pneumoniae strain TVO_1901930     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1968652..2019658 1967498..1967902 flank 750


Gene organization within MGE regions


Location: 1967498..2019658
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH24_RS10210 (EQH24_10735) comGD/cglD 1967498..1967902 (-) 405 WP_000588023.1 competence type IV pilus minor pilin ComGD Machinery gene
  EQH24_RS10215 (EQH24_10740) comGC/cglC 1967895..1968161 (-) 267 WP_050129046.1 competence type IV pilus major pilin ComGC Machinery gene
  EQH24_RS10220 (EQH24_10745) - 1968163..1968420 (-) 258 WP_238101663.1 hypothetical protein -
  EQH24_RS10225 (EQH24_10750) - 1968652..1969608 (-) 957 WP_000350480.1 N-acetylmuramoyl-L-alanine amidase family protein -
  EQH24_RS10230 (EQH24_10755) - 1969612..1969944 (-) 333 WP_001186219.1 phage holin -
  EQH24_RS10235 (EQH24_10760) - 1969948..1970247 (-) 300 WP_001811580.1 hypothetical protein -
  EQH24_RS10240 (EQH24_10765) - 1970256..1970606 (-) 351 WP_000852245.1 hypothetical protein -
  EQH24_RS10245 (EQH24_10770) - 1970609..1970812 (-) 204 WP_001091123.1 hypothetical protein -
  EQH24_RS10250 (EQH24_10775) - 1970793..1970909 (-) 117 WP_001063632.1 hypothetical protein -
  EQH24_RS11915 - 1970906..1977322 (-) 6417 WP_409202213.1 tail fiber domain-containing protein -
  EQH24_RS11920 - 1978268..1981630 (-) 3363 Protein_2027 peptidase S74 -
  EQH24_RS10260 (EQH24_10785) - 1981635..1981985 (-) 351 WP_000068031.1 DUF6711 family protein -
  EQH24_RS10265 (EQH24_10790) - 1981994..1985668 (-) 3675 WP_238101665.1 hypothetical protein -
  EQH24_RS10270 (EQH24_10795) - 1985655..1986005 (-) 351 WP_000478007.1 hypothetical protein -
  EQH24_RS10275 (EQH24_10800) - 1986044..1986424 (-) 381 WP_001185629.1 DUF6096 family protein -
  EQH24_RS10280 (EQH24_10805) - 1986429..1986842 (-) 414 WP_000880676.1 phage tail tube protein -
  EQH24_RS10285 (EQH24_10810) - 1986845..1987213 (-) 369 WP_000608232.1 hypothetical protein -
  EQH24_RS10290 (EQH24_10815) - 1987210..1987725 (-) 516 WP_000015941.1 HK97-gp10 family putative phage morphogenesis protein -
  EQH24_RS10295 (EQH24_10820) - 1987700..1988038 (-) 339 WP_000478945.1 hypothetical protein -
  EQH24_RS10300 (EQH24_10825) - 1988019..1988330 (-) 312 WP_000021222.1 phage head-tail connector protein -
  EQH24_RS10305 (EQH24_10830) - 1988332..1988520 (-) 189 WP_000669349.1 hypothetical protein -
  EQH24_RS10310 (EQH24_10835) - 1988510..1988692 (-) 183 WP_000054934.1 Rho termination factor N-terminal domain-containing protein -
  EQH24_RS10315 (EQH24_10840) - 1988704..1989549 (-) 846 WP_000123890.1 N4-gp56 family major capsid protein -
  EQH24_RS10320 (EQH24_10845) - 1989556..1990140 (-) 585 WP_001288024.1 DUF4355 domain-containing protein -
  EQH24_RS10325 (EQH24_10850) - 1990310..1990522 (-) 213 WP_000393349.1 crAss001_48 related protein -
  EQH24_RS10330 - 1990667..1990840 (-) 174 WP_000379086.1 hypothetical protein -
  EQH24_RS10335 (EQH24_10855) - 1990991..1992541 (-) 1551 WP_179208665.1 minor capsid protein -
  EQH24_RS10340 (EQH24_10860) - 1992450..1993919 (-) 1470 WP_238101666.1 phage portal protein -
  EQH24_RS10345 (EQH24_10865) - 1993931..1995229 (-) 1299 WP_000084429.1 PBSX family phage terminase large subunit -
  EQH24_RS10350 (EQH24_10870) - 1995207..1995647 (-) 441 WP_014931818.1 terminase small subunit -
  EQH24_RS10360 (EQH24_10875) - 1996121..1996543 (-) 423 WP_001030244.1 DUF1492 domain-containing protein -
  EQH24_RS10365 (EQH24_10880) - 1996613..1996984 (-) 372 WP_001247151.1 hypothetical protein -
  EQH24_RS10370 (EQH24_10885) - 1996981..1997418 (-) 438 WP_000612395.1 YopX family protein -
  EQH24_RS10375 (EQH24_10890) - 1997437..1997886 (-) 450 WP_001132423.1 hypothetical protein -
  EQH24_RS10380 (EQH24_10895) - 1997889..1998821 (-) 933 WP_228114830.1 DUF1642 domain-containing protein -
  EQH24_RS10385 (EQH24_10900) - 1998823..1999140 (-) 318 WP_179132006.1 hypothetical protein -
  EQH24_RS10390 (EQH24_10905) - 1999165..1999347 (-) 183 WP_000796349.1 hypothetical protein -
  EQH24_RS10395 (EQH24_10910) - 1999363..1999794 (-) 432 WP_000779141.1 RusA family crossover junction endodeoxyribonuclease -
  EQH24_RS10400 (EQH24_10915) - 1999791..2000120 (-) 330 WP_001864270.1 hypothetical protein -
  EQH24_RS10405 (EQH24_10920) - 2000134..2000343 (-) 210 WP_000455269.1 hypothetical protein -
  EQH24_RS10410 (EQH24_10925) - 2000309..2000815 (-) 507 WP_000034831.1 class I SAM-dependent methyltransferase -
  EQH24_RS10415 (EQH24_10930) ssbA 2000825..2001241 (-) 417 WP_000609562.1 single-stranded DNA-binding protein Machinery gene
  EQH24_RS10420 (EQH24_10935) - 2001336..2001671 (-) 336 WP_000598345.1 sporulation protein Cse60 -
  EQH24_RS10425 (EQH24_10940) - 2001664..2001987 (-) 324 WP_001828022.1 hypothetical protein -
  EQH24_RS10430 (EQH24_10945) - 2002155..2003195 (-) 1041 WP_001157037.1 DUF1351 domain-containing protein -
  EQH24_RS10435 (EQH24_10950) bet 2003205..2003957 (-) 753 WP_050208850.1 phage recombination protein Bet -
  EQH24_RS10440 (EQH24_10955) - 2003975..2004160 (-) 186 WP_000746960.1 hypothetical protein -
  EQH24_RS10445 (EQH24_10960) - 2004464..2004718 (-) 255 WP_050250076.1 hypothetical protein -
  EQH24_RS10450 (EQH24_10965) - 2004711..2004914 (-) 204 WP_050250075.1 hypothetical protein -
  EQH24_RS10455 - 2004914..2005075 (-) 162 WP_000823399.1 BOW99_gp33 family protein -
  EQH24_RS10460 - 2005140..2005283 (-) 144 WP_000389589.1 hypothetical protein -
  EQH24_RS10465 (EQH24_10970) - 2005402..2005593 (+) 192 WP_000834563.1 hypothetical protein -
  EQH24_RS10470 (EQH24_10980) - 2005868..2006194 (-) 327 WP_050250036.1 replication protein -
  EQH24_RS10475 (EQH24_10985) - 2006358..2006546 (-) 189 WP_001161207.1 helix-turn-helix transcriptional regulator -
  EQH24_RS10480 (EQH24_10990) - 2006620..2006826 (+) 207 WP_000129515.1 hypothetical protein -
  EQH24_RS10485 - 2007051..2007221 (-) 171 WP_000660186.1 hypothetical protein -
  EQH24_RS10490 (EQH24_11000) - 2007420..2008109 (+) 690 WP_000577515.1 DUF4145 domain-containing protein -
  EQH24_RS10495 (EQH24_11005) - 2008269..2008544 (-) 276 WP_001094375.1 hypothetical protein -
  EQH24_RS10500 (EQH24_11010) - 2008699..2009487 (+) 789 WP_050116194.1 S24 family peptidase -
  EQH24_RS10505 (EQH24_11015) - 2009489..2009689 (+) 201 WP_000064302.1 hypothetical protein -
  EQH24_RS10510 (EQH24_11020) - 2009871..2011316 (+) 1446 WP_061385035.1 recombinase family protein -
  EQH24_RS10520 (EQH24_11030) comGB/cglB 2011398..2012414 (-) 1017 WP_073177425.1 competence type IV pilus assembly protein ComGB Machinery gene
  EQH24_RS10525 (EQH24_11035) comGA/cglA/cilD 2012362..2013303 (-) 942 WP_000249559.1 competence type IV pilus ATPase ComGA Machinery gene
  EQH24_RS10530 (EQH24_11040) - 2013379..2013744 (-) 366 WP_000286415.1 DUF1033 family protein -
  EQH24_RS10535 (EQH24_11045) - 2014012..2015267 (+) 1256 WP_408604980.1 ISL3 family transposase -
  EQH24_RS10540 (EQH24_11050) - 2015316..2016374 (-) 1059 WP_000649468.1 zinc-dependent alcohol dehydrogenase family protein -
  EQH24_RS10545 (EQH24_11055) nagA 2016537..2017688 (-) 1152 WP_001134457.1 N-acetylglucosamine-6-phosphate deacetylase -
  EQH24_RS10550 (EQH24_11060) - 2017841..2019658 (-) 1818 WP_001220838.1 acyltransferase family protein -

Sequence


Protein


Download         Length: 134 a.a.        Molecular weight: 14638.82 Da        Isoelectric Point: 10.2164

>NTDB_id=338027 EQH24_RS10210 WP_000588023.1 1967498..1967902(-) (comGD/cglD) [Streptococcus pneumoniae strain TVO_1901930]
MIKAFTMLESLLVLGLVSILALGLSGSVQSTFAAVEEQIFFMEFEELYRETQKRSVASQQKTSLNLDGQTLSNGSQKLTV
PKGIQAPSGQSITFDRAGGNSSLAKVEFQTSKGAIRYQLYLGNGKIKRIKETKN

Nucleotide


Download         Length: 405 bp        

>NTDB_id=338027 EQH24_RS10210 WP_000588023.1 1967498..1967902(-) (comGD/cglD) [Streptococcus pneumoniae strain TVO_1901930]
ATGATTAAGGCCTTTACCATGCTGGAAAGTCTCTTGGTTTTGGGTCTTGTGAGTATCCTTGCCTTGGGCTTGTCCGGCTC
TGTTCAGTCCACTTTTGCGGCGGTAGAGGAACAGATTTTCTTTATGGAGTTTGAAGAACTCTATCGGGAAACCCAAAAAC
GCAGTGTAGCCAGTCAGCAAAAGACTAGTCTGAACTTAGATGGGCAGACGCTTAGCAATGGCAGTCAAAAGTTGACAGTT
CCTAAAGGAATTCAGGCACCATCAGGCCAAAGTATTACATTTGACCGAGCTGGGGGCAATTCGTCCCTGGCTAAGGTTGA
ATTTCAGACCAGTAAAGGAGCGATTCGCTATCAATTATATCTAGGAAATGGAAAAATTAAACGCATTAAGGAAACAAAAA
ATTAG

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGD/cglD Streptococcus pneumoniae TIGR4

98.507

100

0.985

  comGD/cglD Streptococcus pneumoniae Rx1

97.761

100

0.978

  comGD/cglD Streptococcus pneumoniae D39

97.761

100

0.978

  comGD/cglD Streptococcus pneumoniae R6

97.761

100

0.978

  comGD/cglD Streptococcus mitis SK321

97.015

100

0.97

  comGD/cglD Streptococcus mitis NCTC 12261

97.744

99.254

0.97

  comYD Streptococcus gordonii str. Challis substr. CH1

58.268

94.776

0.552

  comYD Streptococcus mutans UA140

49.219

95.522

0.47

  comYD Streptococcus mutans UA159

49.219

95.522

0.47


Multiple sequence alignment