Detailed information
Overview
| Field | Value |
|---|---|
| Name | comGD/cglD |
| Type | Machinery gene |
| Locus tag | E0F40_RS10015 |
| Genome accession | NZ_LR216065 |
| Coordinates | 1862474..1862878 (-) |
| Length | 134 a.a. |
| NCBI ID | WP_000588002.1 |
| Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain GPSC18 substr. ST13 isolate 55896440-41bd-11e5-998e-3c4a9275d6c6 |
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology); DNA binding and uptake |
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicts in the genome, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 1863667..1909866 | 1862474..1862878 | flank | 789 |
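The distance column can be reproduced from the two coordinate ranges. The following is a minimal Python sketch, assuming the distance is the start of the downstream range minus the end of the upstream range. That convention matches the 789 bp reported above but is not stated on the page, and `gap_between` is a hypothetical helper name.

```python
def gap_between(first, second):
    """Distance between two closed (start, end) ranges; 0 if they overlap."""
    # Sorting by start coordinate puts the upstream range first.
    (_, upstream_end), (downstream_start, _) = sorted([first, second])
    return max(0, downstream_start - upstream_end)


gene = (1862474, 1862878)      # comGD/cglD coordinates from the table
prophage = (1863667, 1909866)  # predicted prophage coordinates from the table

distance = gap_between(gene, prophage)
print(distance)  # 789
```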
Gene organization within MGE regions
Location: 1862474..1909866
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| E0F40_RS10015 (SAMEA3714487_01900) | comGD/cglD | 1862474..1862878 (-) | 405 | WP_000588002.1 | competence type IV pilus minor pilin ComGD | Machinery gene |
| E0F40_RS10020 (SAMEA3714487_01901) | comGC/cglC | 1862871..1863137 (-) | 267 | WP_000962026.1 | competence type IV pilus major pilin ComGC | Machinery gene |
| E0F40_RS10025 (SAMEA3714487_01902) | - | 1863139..1863396 (-) | 258 | WP_000698513.1 | hypothetical protein | - |
| E0F40_RS10030 (SAMEA3714487_01903) | - | 1863667..1864623 (-) | 957 | WP_054385177.1 | N-acetylmuramoyl-L-alanine amidase | - |
| E0F40_RS10035 (SAMEA3714487_01904) | - | 1864627..1864959 (-) | 333 | WP_001186216.1 | phage holin | - |
| E0F40_RS10040 (SAMEA3714487_01905) | - | 1864963..1865262 (-) | 300 | WP_050203193.1 | hypothetical protein | - |
| E0F40_RS10045 (SAMEA3714487_01906) | - | 1865271..1865621 (-) | 351 | WP_000852245.1 | hypothetical protein | - |
| E0F40_RS10050 (SAMEA3714487_01907) | - | 1865624..1865827 (-) | 204 | WP_001091112.1 | hypothetical protein | - |
| E0F40_RS12520 (SAMEA3714487_01908) | - | 1865808..1865924 (-) | 117 | WP_001063633.1 | hypothetical protein | - |
| E0F40_RS12230 (SAMEA3714487_01909) | - | 1865921..1872481 (-) | 6561 | WP_232035519.1 | phage head spike fiber domain-containing protein | - |
| E0F40_RS10070 (SAMEA3714487_01910) | - | 1872486..1872836 (-) | 351 | WP_000068026.1 | DUF6711 family protein | - |
| E0F40_RS10075 (SAMEA3714487_01911) | - | 1872845..1876498 (-) | 3654 | WP_054376500.1 | hypothetical protein | - |
| E0F40_RS10080 (SAMEA3714487_01912) | - | 1876485..1876835 (-) | 351 | WP_000478010.1 | hypothetical protein | - |
| E0F40_RS10085 | - | 1876874..1877254 (-) | 381 | Protein_1882 | DUF6096 family protein | - |
| E0F40_RS10090 (SAMEA3714487_01914) | - | 1877259..1877672 (-) | 414 | WP_000880678.1 | phage tail tube protein | - |
| E0F40_RS10095 (SAMEA3714487_01915) | - | 1877675..1878043 (-) | 369 | WP_000608233.1 | hypothetical protein | - |
| E0F40_RS10100 (SAMEA3714487_01916) | - | 1878040..1878555 (-) | 516 | WP_000015941.1 | HK97-gp10 family putative phage morphogenesis protein | - |
| E0F40_RS10105 (SAMEA3714487_01917) | - | 1878530..1878868 (-) | 339 | WP_000478943.1 | hypothetical protein | - |
| E0F40_RS10110 (SAMEA3714487_01918) | - | 1878849..1879160 (-) | 312 | WP_000021219.1 | phage head-tail connector protein | - |
| E0F40_RS10115 (SAMEA3714487_01919) | - | 1879162..1879350 (-) | 189 | WP_000669346.1 | hypothetical protein | - |
| E0F40_RS10120 (SAMEA3714487_01920) | - | 1879360..1880385 (-) | 1026 | WP_000863391.1 | sugar-binding protein | - |
| E0F40_RS10125 (SAMEA3714487_01921) | - | 1880408..1880917 (-) | 510 | WP_054376499.1 | DUF4355 domain-containing protein | - |
| E0F40_RS10130 (SAMEA3714487_01922) | - | 1881063..1881275 (-) | 213 | WP_000393349.1 | crAss001_48 related protein | - |
| E0F40_RS10135 (SAMEA3714487_01924) | - | 1881416..1881703 (-) | 288 | WP_001046058.1 | hypothetical protein | - |
| E0F40_RS10140 (SAMEA3714487_01925) | - | 1881746..1882159 (-) | 414 | WP_000565276.1 | HD domain-containing protein | - |
| E0F40_RS10145 (SAMEA3714487_01926) | - | 1882156..1882365 (-) | 210 | WP_000651747.1 | hypothetical protein | - |
| E0F40_RS10150 (SAMEA3714487_01927) | - | 1882367..1884004 (-) | 1638 | WP_174222386.1 | minor capsid protein | - |
| E0F40_RS10155 (SAMEA3714487_01928) | - | 1883913..1885382 (-) | 1470 | WP_078136275.1 | phage portal protein | - |
| E0F40_RS10160 (SAMEA3714487_01929) | - | 1885394..1886608 (-) | 1215 | WP_050140470.1 | PBSX family phage terminase large subunit | - |
| E0F40_RS10165 (SAMEA3714487_01930) | - | 1886598..1887092 (-) | 495 | WP_000351060.1 | terminase small subunit | - |
| E0F40_RS10170 (SAMEA3714487_01931) | - | 1887553..1887957 (-) | 405 | WP_050110650.1 | DUF1492 domain-containing protein | - |
| E0F40_RS10175 (SAMEA3714487_01932) | - | 1888029..1888394 (-) | 366 | WP_050110649.1 | hypothetical protein | - |
| E0F40_RS10180 (SAMEA3714487_01933) | - | 1888391..1888711 (-) | 321 | WP_054376498.1 | hypothetical protein | - |
| E0F40_RS10185 (SAMEA3714487_01934) | - | 1888708..1889148 (-) | 441 | WP_050138382.1 | YopX family protein | - |
| E0F40_RS10190 (SAMEA3714487_01935) | - | 1889145..1889648 (-) | 504 | WP_001021771.1 | DUF1642 domain-containing protein | - |
| E0F40_RS10195 (SAMEA3714487_01936) | - | 1889650..1889967 (-) | 318 | WP_174222387.1 | hypothetical protein | - |
| E0F40_RS10200 (SAMEA3714487_01937) | - | 1889992..1890174 (-) | 183 | WP_000796349.1 | hypothetical protein | - |
| E0F40_RS10205 (SAMEA3714487_01938) | - | 1890190..1890621 (-) | 432 | WP_000779143.1 | RusA family crossover junction endodeoxyribonuclease | - |
| E0F40_RS11800 (SAMEA3714487_01939) | - | 1890618..1890779 (-) | 162 | WP_164993621.1 | hypothetical protein | - |
| E0F40_RS10210 (SAMEA3714487_01940) | - | 1890793..1891002 (-) | 210 | WP_000455269.1 | hypothetical protein | - |
| E0F40_RS10215 (SAMEA3714487_01941) | - | 1891004..1891699 (-) | 696 | WP_130898913.1 | DNA-methyltransferase | - |
| E0F40_RS10220 (SAMEA3714487_01942) | ssb | 1891743..1891988 (-) | 246 | Protein_1910 | single-stranded DNA-binding protein | - |
| E0F40_RS10225 (SAMEA3714487_01944) | - | 1892083..1892418 (-) | 336 | WP_000598345.1 | sporulation protein Cse60 | - |
| E0F40_RS10230 (SAMEA3714487_01945) | - | 1892411..1892734 (-) | 324 | WP_029743260.1 | hypothetical protein | - |
| E0F40_RS11805 (SAMEA3714487_01946) | - | 1892737..1892898 (-) | 162 | WP_174222388.1 | hypothetical protein | - |
| E0F40_RS10235 (SAMEA3714487_01947) | - | 1892901..1893941 (-) | 1041 | WP_001157038.1 | DUF1351 domain-containing protein | - |
| E0F40_RS10240 (SAMEA3714487_01948) | bet | 1893951..1894715 (-) | 765 | WP_130898914.1 | phage recombination protein Bet | - |
| E0F40_RS10245 (SAMEA3714487_01949) | - | 1894727..1894957 (-) | 231 | WP_000192920.1 | hypothetical protein | - |
| E0F40_RS11810 (SAMEA3714487_01951) | - | 1895077..1895220 (-) | 144 | WP_000161124.1 | hypothetical protein | - |
| E0F40_RS10250 (SAMEA3714487_01952) | - | 1895207..1895467 (-) | 261 | WP_000471463.1 | hypothetical protein | - |
| E0F40_RS10255 (SAMEA3714487_01953) | - | 1895460..1895666 (-) | 207 | WP_000839221.1 | hypothetical protein | - |
| E0F40_RS11815 (SAMEA3714487_01954) | - | 1895666..1895827 (-) | 162 | WP_000823400.1 | BOW99_gp33 family protein | - |
| E0F40_RS10265 (SAMEA3714487_01956) | - | 1896045..1896899 (-) | 855 | WP_001198473.1 | ATP-binding protein | - |
| E0F40_RS10270 (SAMEA3714487_01957) | - | 1896909..1897760 (-) | 852 | WP_050167223.1 | DnaD domain protein | - |
| E0F40_RS10275 (SAMEA3714487_01958) | - | 1897757..1897984 (-) | 228 | WP_001125555.1 | helix-turn-helix domain-containing protein | - |
| E0F40_RS10285 (SAMEA3714487_01960) | - | 1898175..1898465 (-) | 291 | WP_001815531.1 | hypothetical protein | - |
| E0F40_RS11820 | - | 1898462..1898608 (-) | 147 | WP_000389580.1 | hypothetical protein | - |
| E0F40_RS12365 (SAMEA3714487_01961) | - | 1899003..1899125 (-) | 123 | WP_000343850.1 | hypothetical protein | - |
| E0F40_RS10290 (SAMEA3714487_01962) | - | 1899199..1899474 (-) | 276 | WP_001094372.1 | hypothetical protein | - |
| E0F40_RS10295 (SAMEA3714487_01963) | - | 1899649..1900455 (+) | 807 | WP_000090700.1 | XRE family transcriptional regulator | - |
| E0F40_RS10300 (SAMEA3714487_01964) | - | 1900457..1901221 (+) | 765 | WP_000032361.1 | type II toxin-antitoxin system PemK/MazF family toxin | - |
| E0F40_RS10305 (SAMEA3714487_01965) | - | 1901500..1902945 (+) | 1446 | WP_050105991.1 | recombinase family protein | - |
| E0F40_RS10315 (SAMEA3714487_01966) | comGB/cglB | 1903027..1904043 (-) | 1017 | WP_077141332.1 | competence type IV pilus assembly protein ComGB | Machinery gene |
| E0F40_RS10320 (SAMEA3714487_01967) | comGA/cglA/cilD | 1903991..1904932 (-) | 942 | WP_000249550.1 | competence type IV pilus ATPase ComGA | Machinery gene |
| E0F40_RS10325 (SAMEA3714487_01968) | - | 1905008..1905373 (-) | 366 | WP_000286415.1 | DUF1033 family protein | - |
| E0F40_RS10330 (SAMEA3714487_01969) | - | 1905524..1906582 (-) | 1059 | WP_000649468.1 | zinc-dependent alcohol dehydrogenase family protein | - |
| E0F40_RS10335 (SAMEA3714487_01970) | nagA | 1906745..1907896 (-) | 1152 | WP_001134457.1 | N-acetylglucosamine-6-phosphate deacetylase | - |
| E0F40_RS10340 (SAMEA3714487_01971) | - | 1908049..1909866 (-) | 1818 | WP_001220850.1 | acyltransferase family protein | - |
Sequence
Protein
Length: 134 a.a.; Molecular weight: 14667.82 Da; Isoelectric point: 10.2164
>NTDB_id=1126372 E0F40_RS10015 WP_000588002.1 1862474..1862878(-) (comGD/cglD) [Streptococcus pneumoniae strain GPSC18 substr. ST13 isolate 55896440-41bd-11e5-998e-3c4a9275d6c6]
MIKAFTMLESLLALSLVSILALGLSGSVQSTFAAVEEQIFFMEFEELYRETQKRSVASQQKTNLNLDGQTLSNGSQKLTV
PKGIQAPSGQSITFDRAGGNSSLAKVEFQTSKGAIRYQLYLGNGKIKRIKETKN
Nucleotide
Length: 405 bp
>NTDB_id=1126372 E0F40_RS10015 WP_000588002.1 1862474..1862878(-) (comGD/cglD) [Streptococcus pneumoniae strain GPSC18 substr. ST13 isolate 55896440-41bd-11e5-998e-3c4a9275d6c6]
ATGATTAAGGCCTTTACCATGCTGGAAAGTCTCTTGGCTTTGAGTCTTGTGAGTATCCTTGCCTTGGGCTTGTCCGGCTC
TGTTCAGTCCACTTTTGCGGCAGTAGAGGAACAGATTTTCTTTATGGAGTTTGAAGAACTCTATCGGGAAACCCAAAAAC
GCAGTGTAGCCAGTCAGCAAAAGACTAATCTAAATTTAGATGGGCAGACGCTTAGCAATGGCAGTCAAAAGTTGACAGTT
CCTAAAGGAATTCAGGCACCATCAGGCCAAAGTATTACATTTGACCGAGCTGGGGGCAATTCGTCCCTGGCTAAGGTTGA
ATTTCAGACCAGTAAAGGAGCGATTCGCTATCAATTATATCTAGGAAATGGAAAAATTAAACGCATTAAGGAAACAAAAA
ATTAG
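The nucleotide and protein records can be cross-checked by translating the coding sequence. The following is a minimal sketch in plain Python, assuming the standard codon assignments (the page does not state which translation table was used). `translate` is a hypothetical helper, and the two sequences are copied from the FASTA records on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Coding sequence of E0F40_RS10015, as given in the nucleotide record.
CDS = (
    "ATGATTAAGGCCTTTACCATGCTGGAAAGTCTCTTGGCTTTGAGTCTTGTGAGTATCCTTGCCTTGGGCTTGTCCGGCTC"
    "TGTTCAGTCCACTTTTGCGGCAGTAGAGGAACAGATTTTCTTTATGGAGTTTGAAGAACTCTATCGGGAAACCCAAAAAC"
    "GCAGTGTAGCCAGTCAGCAAAAGACTAATCTAAATTTAGATGGGCAGACGCTTAGCAATGGCAGTCAAAAGTTGACAGTT"
    "CCTAAAGGAATTCAGGCACCATCAGGCCAAAGTATTACATTTGACCGAGCTGGGGGCAATTCGTCCCTGGCTAAGGTTGA"
    "ATTTCAGACCAGTAAAGGAGCGATTCGCTATCAATTATATCTAGGAAATGGAAAAATTAAACGCATTAAGGAAACAAAAA"
    "ATTAG"
)

# Protein sequence of WP_000588002.1, as given in the protein record.
PROTEIN = (
    "MIKAFTMLESLLALSLVSILALGLSGSVQSTFAAVEEQIFFMEFEELYRETQKRSVASQQKTNLNLDGQTLSNGSQKLTV"
    "PKGIQAPSGQSITFDRAGGNSSLAKVEFQTSKGAIRYQLYLGNGKIKRIKETKN"
)


def translate(cds):
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


translated = translate(CDS)

print(len(CDS), len(PROTEIN))        # nucleotide and protein lengths
print(translated[:-1] == PROTEIN)    # True if the records agree
```

With 405 bp, the coding sequence holds 135 codons: 134 amino acids plus the terminal stop codon, which agrees with the stated protein length.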
Domains
No domain identified.
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comGD/cglD | Streptococcus pneumoniae TIGR4 | 96.269 | 100 | 0.963 |
| comGD/cglD | Streptococcus pneumoniae Rx1 | 95.522 | 100 | 0.955 |
| comGD/cglD | Streptococcus pneumoniae D39 | 95.522 | 100 | 0.955 |
| comGD/cglD | Streptococcus pneumoniae R6 | 95.522 | 100 | 0.955 |
| comGD/cglD | Streptococcus mitis NCTC 12261 | 95.489 | 99.254 | 0.948 |
| comGD/cglD | Streptococcus mitis SK321 | 94.776 | 100 | 0.948 |
| comYD | Streptococcus gordonii str. Challis substr. CH1 | 57.48 | 94.776 | 0.545 |
| comYD | Streptococcus mutans UA140 | 49.219 | 95.522 | 0.47 |
| comYD | Streptococcus mutans UA159 | 49.219 | 95.522 | 0.47 |
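The listed Ha-values are numerically consistent with the product of identity and coverage, both taken as fractions and rounded to three decimals. This is an observation from the rows above, not a definition given on this page. The sketch below checks that assumption in Python; `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed relationship: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (organism, identities %, coverage %, reported Ha-value) from the table.
rows = [
    ("Streptococcus pneumoniae TIGR4", 96.269, 100.0, 0.963),
    ("Streptococcus pneumoniae Rx1", 95.522, 100.0, 0.955),
    ("Streptococcus pneumoniae D39", 95.522, 100.0, 0.955),
    ("Streptococcus pneumoniae R6", 95.522, 100.0, 0.955),
    ("Streptococcus mitis NCTC 12261", 95.489, 99.254, 0.948),
    ("Streptococcus mitis SK321", 94.776, 100.0, 0.948),
    ("Streptococcus gordonii str. Challis substr. CH1", 57.48, 94.776, 0.545),
    ("Streptococcus mutans UA140", 49.219, 95.522, 0.47),
    ("Streptococcus mutans UA159", 49.219, 95.522, 0.47),
]

# Largest difference between the computed and reported values.
max_deviation = max(abs(ha_value(i, c) - ha) for _, i, c, ha in rows)

for organism, identity, coverage, reported in rows:
    print(f"{organism}: computed {ha_value(identity, coverage):.3f}, reported {reported}")
```

Every computed value falls within rounding distance (0.0005) of the reported one, so the assumed relationship holds for all nine rows.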