Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGD/cglD
Type: Machinery gene
Locus tag: E0F40_RS10015
Genome accession: NZ_LR216065
Coordinates: 1862474..1862878 (-)
Length: 134 a.a.
NCBI ID: WP_000588002.1
UniProt ID: -
Organism: Streptococcus pneumoniae strain GPSC18 substr. ST13 isolate 55896440-41bd-11e5-998e-3c4a9275d6c6
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp)
Prophage | 1863667..1909866 | 1862474..1862878 | flank | 789
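
The 789 bp distance in the summary above can be reproduced from the two coordinate ranges. The sketch below assumes the distance is the difference between the nearest boundary coordinates of the gene and the MGE. That convention is inferred from the listed numbers, not documented in this record. The function name `flank_distance` is hypothetical.

```python
def flank_distance(gene, mge):
    """Gap between two closed coordinate ranges, or 0 if they overlap.

    Assumption: the distance is the difference between the nearest
    boundary coordinates (inferred from the table, not documented).
    """
    (g_start, g_end), (m_start, m_end) = gene, mge
    if g_end < m_start:      # gene lies upstream of the MGE
        return m_start - g_end
    if m_end < g_start:      # gene lies downstream of the MGE
        return g_start - m_end
    return 0                 # ranges overlap: the gene is inside the MGE


gene = (1862474, 1862878)       # comGD/cglD
prophage = (1863667, 1909866)   # predicted prophage
print(flank_distance(gene, prophage))  # 789
```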


Gene organization within MGE regions


Location: 1862474..1909866
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E0F40_RS10015 (SAMEA3714487_01900) comGD/cglD 1862474..1862878 (-) 405 WP_000588002.1 competence type IV pilus minor pilin ComGD Machinery gene
  E0F40_RS10020 (SAMEA3714487_01901) comGC/cglC 1862871..1863137 (-) 267 WP_000962026.1 competence type IV pilus major pilin ComGC Machinery gene
  E0F40_RS10025 (SAMEA3714487_01902) - 1863139..1863396 (-) 258 WP_000698513.1 hypothetical protein -
  E0F40_RS10030 (SAMEA3714487_01903) - 1863667..1864623 (-) 957 WP_054385177.1 N-acetylmuramoyl-L-alanine amidase -
  E0F40_RS10035 (SAMEA3714487_01904) - 1864627..1864959 (-) 333 WP_001186216.1 phage holin -
  E0F40_RS10040 (SAMEA3714487_01905) - 1864963..1865262 (-) 300 WP_050203193.1 hypothetical protein -
  E0F40_RS10045 (SAMEA3714487_01906) - 1865271..1865621 (-) 351 WP_000852245.1 hypothetical protein -
  E0F40_RS10050 (SAMEA3714487_01907) - 1865624..1865827 (-) 204 WP_001091112.1 hypothetical protein -
  E0F40_RS12520 (SAMEA3714487_01908) - 1865808..1865924 (-) 117 WP_001063633.1 hypothetical protein -
  E0F40_RS12230 (SAMEA3714487_01909) - 1865921..1872481 (-) 6561 WP_232035519.1 phage head spike fiber domain-containing protein -
  E0F40_RS10070 (SAMEA3714487_01910) - 1872486..1872836 (-) 351 WP_000068026.1 DUF6711 family protein -
  E0F40_RS10075 (SAMEA3714487_01911) - 1872845..1876498 (-) 3654 WP_054376500.1 hypothetical protein -
  E0F40_RS10080 (SAMEA3714487_01912) - 1876485..1876835 (-) 351 WP_000478010.1 hypothetical protein -
  E0F40_RS10085 - 1876874..1877254 (-) 381 Protein_1882 DUF6096 family protein -
  E0F40_RS10090 (SAMEA3714487_01914) - 1877259..1877672 (-) 414 WP_000880678.1 phage tail tube protein -
  E0F40_RS10095 (SAMEA3714487_01915) - 1877675..1878043 (-) 369 WP_000608233.1 hypothetical protein -
  E0F40_RS10100 (SAMEA3714487_01916) - 1878040..1878555 (-) 516 WP_000015941.1 HK97-gp10 family putative phage morphogenesis protein -
  E0F40_RS10105 (SAMEA3714487_01917) - 1878530..1878868 (-) 339 WP_000478943.1 hypothetical protein -
  E0F40_RS10110 (SAMEA3714487_01918) - 1878849..1879160 (-) 312 WP_000021219.1 phage head-tail connector protein -
  E0F40_RS10115 (SAMEA3714487_01919) - 1879162..1879350 (-) 189 WP_000669346.1 hypothetical protein -
  E0F40_RS10120 (SAMEA3714487_01920) - 1879360..1880385 (-) 1026 WP_000863391.1 sugar-binding protein -
  E0F40_RS10125 (SAMEA3714487_01921) - 1880408..1880917 (-) 510 WP_054376499.1 DUF4355 domain-containing protein -
  E0F40_RS10130 (SAMEA3714487_01922) - 1881063..1881275 (-) 213 WP_000393349.1 crAss001_48 related protein -
  E0F40_RS10135 (SAMEA3714487_01924) - 1881416..1881703 (-) 288 WP_001046058.1 hypothetical protein -
  E0F40_RS10140 (SAMEA3714487_01925) - 1881746..1882159 (-) 414 WP_000565276.1 HD domain-containing protein -
  E0F40_RS10145 (SAMEA3714487_01926) - 1882156..1882365 (-) 210 WP_000651747.1 hypothetical protein -
  E0F40_RS10150 (SAMEA3714487_01927) - 1882367..1884004 (-) 1638 WP_174222386.1 minor capsid protein -
  E0F40_RS10155 (SAMEA3714487_01928) - 1883913..1885382 (-) 1470 WP_078136275.1 phage portal protein -
  E0F40_RS10160 (SAMEA3714487_01929) - 1885394..1886608 (-) 1215 WP_050140470.1 PBSX family phage terminase large subunit -
  E0F40_RS10165 (SAMEA3714487_01930) - 1886598..1887092 (-) 495 WP_000351060.1 terminase small subunit -
  E0F40_RS10170 (SAMEA3714487_01931) - 1887553..1887957 (-) 405 WP_050110650.1 DUF1492 domain-containing protein -
  E0F40_RS10175 (SAMEA3714487_01932) - 1888029..1888394 (-) 366 WP_050110649.1 hypothetical protein -
  E0F40_RS10180 (SAMEA3714487_01933) - 1888391..1888711 (-) 321 WP_054376498.1 hypothetical protein -
  E0F40_RS10185 (SAMEA3714487_01934) - 1888708..1889148 (-) 441 WP_050138382.1 YopX family protein -
  E0F40_RS10190 (SAMEA3714487_01935) - 1889145..1889648 (-) 504 WP_001021771.1 DUF1642 domain-containing protein -
  E0F40_RS10195 (SAMEA3714487_01936) - 1889650..1889967 (-) 318 WP_174222387.1 hypothetical protein -
  E0F40_RS10200 (SAMEA3714487_01937) - 1889992..1890174 (-) 183 WP_000796349.1 hypothetical protein -
  E0F40_RS10205 (SAMEA3714487_01938) - 1890190..1890621 (-) 432 WP_000779143.1 RusA family crossover junction endodeoxyribonuclease -
  E0F40_RS11800 (SAMEA3714487_01939) - 1890618..1890779 (-) 162 WP_164993621.1 hypothetical protein -
  E0F40_RS10210 (SAMEA3714487_01940) - 1890793..1891002 (-) 210 WP_000455269.1 hypothetical protein -
  E0F40_RS10215 (SAMEA3714487_01941) - 1891004..1891699 (-) 696 WP_130898913.1 DNA-methyltransferase -
  E0F40_RS10220 (SAMEA3714487_01942) ssb 1891743..1891988 (-) 246 Protein_1910 single-stranded DNA-binding protein -
  E0F40_RS10225 (SAMEA3714487_01944) - 1892083..1892418 (-) 336 WP_000598345.1 sporulation protein Cse60 -
  E0F40_RS10230 (SAMEA3714487_01945) - 1892411..1892734 (-) 324 WP_029743260.1 hypothetical protein -
  E0F40_RS11805 (SAMEA3714487_01946) - 1892737..1892898 (-) 162 WP_174222388.1 hypothetical protein -
  E0F40_RS10235 (SAMEA3714487_01947) - 1892901..1893941 (-) 1041 WP_001157038.1 DUF1351 domain-containing protein -
  E0F40_RS10240 (SAMEA3714487_01948) bet 1893951..1894715 (-) 765 WP_130898914.1 phage recombination protein Bet -
  E0F40_RS10245 (SAMEA3714487_01949) - 1894727..1894957 (-) 231 WP_000192920.1 hypothetical protein -
  E0F40_RS11810 (SAMEA3714487_01951) - 1895077..1895220 (-) 144 WP_000161124.1 hypothetical protein -
  E0F40_RS10250 (SAMEA3714487_01952) - 1895207..1895467 (-) 261 WP_000471463.1 hypothetical protein -
  E0F40_RS10255 (SAMEA3714487_01953) - 1895460..1895666 (-) 207 WP_000839221.1 hypothetical protein -
  E0F40_RS11815 (SAMEA3714487_01954) - 1895666..1895827 (-) 162 WP_000823400.1 BOW99_gp33 family protein -
  E0F40_RS10265 (SAMEA3714487_01956) - 1896045..1896899 (-) 855 WP_001198473.1 ATP-binding protein -
  E0F40_RS10270 (SAMEA3714487_01957) - 1896909..1897760 (-) 852 WP_050167223.1 DnaD domain protein -
  E0F40_RS10275 (SAMEA3714487_01958) - 1897757..1897984 (-) 228 WP_001125555.1 helix-turn-helix domain-containing protein -
  E0F40_RS10285 (SAMEA3714487_01960) - 1898175..1898465 (-) 291 WP_001815531.1 hypothetical protein -
  E0F40_RS11820 - 1898462..1898608 (-) 147 WP_000389580.1 hypothetical protein -
  E0F40_RS12365 (SAMEA3714487_01961) - 1899003..1899125 (-) 123 WP_000343850.1 hypothetical protein -
  E0F40_RS10290 (SAMEA3714487_01962) - 1899199..1899474 (-) 276 WP_001094372.1 hypothetical protein -
  E0F40_RS10295 (SAMEA3714487_01963) - 1899649..1900455 (+) 807 WP_000090700.1 XRE family transcriptional regulator -
  E0F40_RS10300 (SAMEA3714487_01964) - 1900457..1901221 (+) 765 WP_000032361.1 type II toxin-antitoxin system PemK/MazF family toxin -
  E0F40_RS10305 (SAMEA3714487_01965) - 1901500..1902945 (+) 1446 WP_050105991.1 recombinase family protein -
  E0F40_RS10315 (SAMEA3714487_01966) comGB/cglB 1903027..1904043 (-) 1017 WP_077141332.1 competence type IV pilus assembly protein ComGB Machinery gene
  E0F40_RS10320 (SAMEA3714487_01967) comGA/cglA/cilD 1903991..1904932 (-) 942 WP_000249550.1 competence type IV pilus ATPase ComGA Machinery gene
  E0F40_RS10325 (SAMEA3714487_01968) - 1905008..1905373 (-) 366 WP_000286415.1 DUF1033 family protein -
  E0F40_RS10330 (SAMEA3714487_01969) - 1905524..1906582 (-) 1059 WP_000649468.1 zinc-dependent alcohol dehydrogenase family protein -
  E0F40_RS10335 (SAMEA3714487_01970) nagA 1906745..1907896 (-) 1152 WP_001134457.1 N-acetylglucosamine-6-phosphate deacetylase -
  E0F40_RS10340 (SAMEA3714487_01971) - 1908049..1909866 (-) 1818 WP_001220850.1 acyltransferase family protein -

Sequence


Protein


Length: 134 a.a.; Molecular weight: 14667.82 Da; Isoelectric point: 10.2164

>NTDB_id=1126372 E0F40_RS10015 WP_000588002.1 1862474..1862878(-) (comGD/cglD) [Streptococcus pneumoniae strain GPSC18 substr. ST13 isolate 55896440-41bd-11e5-998e-3c4a9275d6c6]
MIKAFTMLESLLALSLVSILALGLSGSVQSTFAAVEEQIFFMEFEELYRETQKRSVASQQKTNLNLDGQTLSNGSQKLTV
PKGIQAPSGQSITFDRAGGNSSLAKVEFQTSKGAIRYQLYLGNGKIKRIKETKN

Nucleotide


Length: 405 bp

>NTDB_id=1126372 E0F40_RS10015 WP_000588002.1 1862474..1862878(-) (comGD/cglD) [Streptococcus pneumoniae strain GPSC18 substr. ST13 isolate 55896440-41bd-11e5-998e-3c4a9275d6c6]
ATGATTAAGGCCTTTACCATGCTGGAAAGTCTCTTGGCTTTGAGTCTTGTGAGTATCCTTGCCTTGGGCTTGTCCGGCTC
TGTTCAGTCCACTTTTGCGGCAGTAGAGGAACAGATTTTCTTTATGGAGTTTGAAGAACTCTATCGGGAAACCCAAAAAC
GCAGTGTAGCCAGTCAGCAAAAGACTAATCTAAATTTAGATGGGCAGACGCTTAGCAATGGCAGTCAAAAGTTGACAGTT
CCTAAAGGAATTCAGGCACCATCAGGCCAAAGTATTACATTTGACCGAGCTGGGGGCAATTCGTCCCTGGCTAAGGTTGA
ATTTCAGACCAGTAAAGGAGCGATTCGCTATCAATTATATCTAGGAAATGGAAAAATTAAACGCATTAAGGAAACAAAAA
ATTAG
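
As a consistency check on the two sequences above, the sketch below translates the nucleotide record and compares it with the protein record. It assumes the nucleotide sequence is given in coding orientation (it starts with ATG and ends with TAG, although the gene lies on the minus strand) and that the standard codon assignments apply. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))


# Nucleotide record, copied line by line from the FASTA block above.
CDS = (
    "ATGATTAAGGCCTTTACCATGCTGGAAAGTCTCTTGGCTTTGAGTCTTGTGAGTATCCTTGCCTTGGGCTTGTCCGGCTC"
    "TGTTCAGTCCACTTTTGCGGCAGTAGAGGAACAGATTTTCTTTATGGAGTTTGAAGAACTCTATCGGGAAACCCAAAAAC"
    "GCAGTGTAGCCAGTCAGCAAAAGACTAATCTAAATTTAGATGGGCAGACGCTTAGCAATGGCAGTCAAAAGTTGACAGTT"
    "CCTAAAGGAATTCAGGCACCATCAGGCCAAAGTATTACATTTGACCGAGCTGGGGGCAATTCGTCCCTGGCTAAGGTTGA"
    "ATTTCAGACCAGTAAAGGAGCGATTCGCTATCAATTATATCTAGGAAATGGAAAAATTAAACGCATTAAGGAAACAAAAA"
    "ATTAG"
)

# Protein record, copied from the FASTA block above.
PROTEIN = (
    "MIKAFTMLESLLALSLVSILALGLSGSVQSTFAAVEEQIFFMEFEELYRETQKRSVASQQKTNLNLDGQTLSNGSQKLTV"
    "PKGIQAPSGQSITFDRAGGNSSLAKVEFQTSKGAIRYQLYLGNGKIKRIKETKN"
)

start, end = 1862474, 1862878
span_bp = end - start + 1          # closed interval, so add 1
translated = translate(CDS)

print(span_bp, len(CDS), len(PROTEIN))   # 405 405 134
print(translated == PROTEIN + "*")       # True
```

The 405 bp span corresponds to 135 codons: 134 residues plus the terminal TAG stop codon.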

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comGD/cglD | Streptococcus pneumoniae TIGR4 | 96.269 | 100 | 0.963
comGD/cglD | Streptococcus pneumoniae Rx1 | 95.522 | 100 | 0.955
comGD/cglD | Streptococcus pneumoniae D39 | 95.522 | 100 | 0.955
comGD/cglD | Streptococcus pneumoniae R6 | 95.522 | 100 | 0.955
comGD/cglD | Streptococcus mitis NCTC 12261 | 95.489 | 99.254 | 0.948
comGD/cglD | Streptococcus mitis SK321 | 94.776 | 100 | 0.948
comYD | Streptococcus gordonii str. Challis substr. CH1 | 57.48 | 94.776 | 0.545
comYD | Streptococcus mutans UA140 | 49.219 | 95.522 | 0.47
comYD | Streptococcus mutans UA159 | 49.219 | 95.522 | 0.47
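
The listed Ha-values look consistent with the product of the identity and coverage fractions. This relation is inferred from the numbers in the table and is not stated in the record, so the sketch below should be read as a plausibility check and not as the database's definition. The function name `ha_value` is hypothetical.

```python
# (protein, organism, identity %, coverage %, listed Ha-value), from the table above.
ROWS = [
    ("comGD/cglD", "Streptococcus pneumoniae TIGR4", 96.269, 100.0, 0.963),
    ("comGD/cglD", "Streptococcus pneumoniae Rx1", 95.522, 100.0, 0.955),
    ("comGD/cglD", "Streptococcus pneumoniae D39", 95.522, 100.0, 0.955),
    ("comGD/cglD", "Streptococcus pneumoniae R6", 95.522, 100.0, 0.955),
    ("comGD/cglD", "Streptococcus mitis NCTC 12261", 95.489, 99.254, 0.948),
    ("comGD/cglD", "Streptococcus mitis SK321", 94.776, 100.0, 0.948),
    ("comYD", "Streptococcus gordonii str. Challis substr. CH1", 57.48, 94.776, 0.545),
    ("comYD", "Streptococcus mutans UA140", 49.219, 95.522, 0.47),
    ("comYD", "Streptococcus mutans UA159", 49.219, 95.522, 0.47),
]


def ha_value(identity_pct, coverage_pct):
    """Assumed relation: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


for protein, organism, identity, coverage, listed in ROWS:
    computed = ha_value(identity, coverage)
    # Listed values are rounded to three decimals, so allow half a unit
    # in the third decimal place when comparing.
    agrees = abs(computed - listed) < 0.0005
    print(f"{protein} ({organism}): computed {computed:.3f}, listed {listed}, agrees={agrees}")
```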


Multiple sequence alignment