Detailed information    

In silico: bioinformatically predicted

Overview


Name   comGD/cglD
Type   Machinery gene
Locus tag   R8625_RS09590
Genome accession   NZ_AP026923
Coordinates   1847983..1848417 (-)
Length   144 a.a.
NCBI ID   WP_000608686.1
UniProt ID   -
Organism   Streptococcus pneumoniae strain PZ900700054
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in this genome, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1849173..1903405 1847983..1848417 flank 756
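
The reported distance appears to be the difference between the prophage start and the gene end (1849173 - 1848417 = 756). A minimal sketch of that arithmetic follows; the function name `flank_distance` is hypothetical and is not part of VRprofile2 or this database.

```python
def flank_distance(gene, mge):
    """Distance in bp between two (start, end) intervals; 0 if they overlap.

    Assumption: distance = start of the downstream interval minus end of the
    upstream interval, which reproduces the value in the table above.
    """
    (_, first_end), (second_start, _) = sorted([gene, mge])
    return max(0, second_start - first_end)


gene = (1847983, 1848417)      # comGD/cglD coordinates from the table
prophage = (1849173, 1903405)  # prophage coordinates from the table

distance = flank_distance(gene, prophage)
print(distance)  # 756
```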


Gene organization within MGE regions


Location: 1847983..1903405
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R8625_RS09590 (PC0054_18660) comGD/cglD 1847983..1848417 (-) 435 WP_000608686.1 competence type IV pilus minor pilin ComGD Machinery gene
  R8625_RS09595 (PC0054_18670) comGC/cglC 1848380..1848643 (-) 264 WP_000962025.1 competence type IV pilus major pilin ComGC Machinery gene
  R8625_RS09600 (PC0054_18680) - 1848645..1848902 (-) 258 WP_000698511.1 hypothetical protein -
  R8625_RS09605 (PC0054_18690) - 1849173..1850129 (-) 957 WP_219601358.1 N-acetylmuramoyl-L-alanine amidase family protein -
  R8625_RS09610 (PC0054_18700) - 1850133..1850465 (-) 333 WP_050085269.1 phage holin -
  R8625_RS09615 (PC0054_18710) - 1850469..1850768 (-) 300 WP_001811580.1 hypothetical protein -
  R8625_RS09620 (PC0054_18720) - 1850777..1851127 (-) 351 WP_000852245.1 hypothetical protein -
  R8625_RS09625 (PC0054_18730) - 1851130..1851333 (-) 204 WP_001091109.1 hypothetical protein -
  R8625_RS09630 - 1851314..1851430 (-) 117 WP_001063633.1 hypothetical protein -
  R8625_RS09635 (PC0054_18740) - 1851427..1858692 (-) 7266 WP_317649160.1 tail fiber domain-containing protein -
  R8625_RS09640 (PC0054_18750) - 1858697..1859047 (-) 351 WP_000068032.1 DUF6711 family protein -
  R8625_RS09645 (PC0054_18760) - 1859056..1862709 (-) 3654 WP_277776316.1 hypothetical protein -
  R8625_RS09650 (PC0054_18770) - 1862696..1863046 (-) 351 WP_000478011.1 hypothetical protein -
  R8625_RS09655 (PC0054_18780) - 1863085..1863465 (-) 381 WP_001185637.1 DUF6096 family protein -
  R8625_RS09660 (PC0054_18790) - 1863470..1863883 (-) 414 WP_000880666.1 phage tail tube protein -
  R8625_RS09665 (PC0054_18800) - 1863886..1864254 (-) 369 WP_000608235.1 hypothetical protein -
  R8625_RS09670 (PC0054_18810) - 1864251..1864766 (-) 516 WP_050202026.1 HK97-gp10 family putative phage morphogenesis protein -
  R8625_RS09675 (PC0054_18820) - 1864741..1865079 (-) 339 WP_000478945.1 hypothetical protein -
  R8625_RS09680 (PC0054_18830) - 1865060..1865371 (-) 312 WP_000021222.1 phage head-tail connector protein -
  R8625_RS09685 (PC0054_18840) - 1865373..1865561 (-) 189 WP_000669348.1 hypothetical protein -
  R8625_RS09690 (PC0054_18850) - 1865571..1866596 (-) 1026 WP_000863391.1 sugar-binding protein -
  R8625_RS09695 (PC0054_18860) - 1866619..1867134 (-) 516 WP_050168042.1 DUF4355 domain-containing protein -
  R8625_RS09700 (PC0054_18870) - 1867302..1867553 (-) 252 WP_050204613.1 DUF6275 family protein -
  R8625_RS09705 (PC0054_18880) - 1867555..1867813 (-) 259 Protein_1884 hypothetical protein -
  R8625_RS09710 (PC0054_18890) - 1867958..1868131 (-) 174 WP_000379086.1 hypothetical protein -
  R8625_RS09715 (PC0054_18900) - 1868182..1868409 (-) 228 WP_050168037.1 hypothetical protein -
  R8625_RS09720 (PC0054_18910) - 1868397..1869800 (-) 1404 WP_225791266.1 minor capsid protein -
  R8625_RS09725 (PC0054_18920) - 1869763..1871178 (-) 1416 WP_317649163.1 phage portal protein -
  R8625_RS09730 (PC0054_18930) - 1871190..1872413 (-) 1224 WP_001864388.1 PBSX family phage terminase large subunit -
  R8625_RS09735 (PC0054_18940) - 1872403..1872861 (-) 459 WP_061385061.1 hypothetical protein -
  R8625_RS09740 (PC0054_18950) - 1872890..1874152 (-) 1263 WP_061632308.1 DNA modification methylase -
  R8625_RS09750 (PC0054_18960) - 1874648..1875109 (-) 462 WP_001030245.1 DUF1492 domain-containing protein -
  R8625_RS09755 (PC0054_18970) - 1875185..1875448 (-) 264 WP_050207369.1 hypothetical protein -
  R8625_RS09760 (PC0054_18980) - 1875445..1875765 (-) 321 WP_001268497.1 hypothetical protein -
  R8625_RS09765 (PC0054_18990) - 1875762..1876184 (-) 423 WP_317649164.1 YopX family protein -
  R8625_RS09770 (PC0054_19000) - 1876181..1876375 (-) 195 WP_050200029.1 hypothetical protein -
  R8625_RS09775 (PC0054_19010) - 1876372..1876632 (-) 261 WP_050200028.1 DUF1372 family protein -
  R8625_RS09780 (PC0054_19020) - 1876629..1876793 (-) 165 WP_317649166.1 hypothetical protein -
  R8625_RS09785 (PC0054_19030) - 1876790..1877485 (-) 696 WP_317649167.1 DUF1642 domain-containing protein -
  R8625_RS09790 (PC0054_19040) - 1877487..1877804 (-) 318 WP_317649168.1 hypothetical protein -
  R8625_RS09795 (PC0054_19050) - 1877829..1878011 (-) 183 WP_000796349.1 hypothetical protein -
  R8625_RS09800 (PC0054_19060) - 1878027..1878458 (-) 432 WP_000779141.1 RusA family crossover junction endodeoxyribonuclease -
  R8625_RS09805 (PC0054_19070) - 1878455..1878784 (-) 330 WP_288171468.1 hypothetical protein -
  R8625_RS09810 (PC0054_19080) - 1878798..1879007 (-) 210 WP_050199665.1 hypothetical protein -
  R8625_RS09815 (PC0054_19090) - 1878973..1879479 (-) 507 WP_000034832.1 class I SAM-dependent methyltransferase -
  R8625_RS09820 (PC0054_19100) ssbA 1879489..1879905 (-) 417 WP_000609561.1 single-stranded DNA-binding protein Machinery gene
  R8625_RS09825 (PC0054_19110) - 1879895..1880038 (-) 144 WP_153277088.1 hypothetical protein -
  R8625_RS09830 (PC0054_19120) - 1880041..1881081 (-) 1041 WP_288171457.1 DUF1351 domain-containing protein -
  R8625_RS09835 (PC0054_19130) bet 1881091..1881855 (-) 765 WP_000184008.1 phage recombination protein Bet -
  R8625_RS09840 (PC0054_19140) - 1881867..1882097 (-) 231 WP_000192920.1 hypothetical protein -
  R8625_RS09845 (PC0054_19150) - 1882217..1882360 (-) 144 WP_000196747.1 hypothetical protein -
  R8625_RS09850 (PC0054_19160) - 1882353..1882607 (-) 255 WP_317649171.1 hypothetical protein -
  R8625_RS09855 (PC0054_19170) - 1882600..1882806 (-) 207 WP_317649172.1 hypothetical protein -
  R8625_RS09860 (PC0054_19180) - 1882806..1882967 (-) 162 WP_000823399.1 BOW99_gp33 family protein -
  R8625_RS09865 (PC0054_19200) - 1883185..1884039 (-) 855 WP_001198473.1 ATP-binding protein -
  R8625_RS09870 (PC0054_19210) - 1884049..1884900 (-) 852 WP_050167223.1 DnaD domain protein -
  R8625_RS09875 (PC0054_19220) - 1884897..1885124 (-) 228 WP_001125555.1 helix-turn-helix transcriptional regulator -
  R8625_RS09880 (PC0054_19240) - 1885315..1885605 (-) 291 WP_001815531.1 hypothetical protein -
  R8625_RS09885 (PC0054_19250) - 1885602..1885748 (-) 147 WP_000389580.1 hypothetical protein -
  R8625_RS09890 (PC0054_19260) - 1886143..1886265 (-) 123 WP_000343850.1 hypothetical protein -
  R8625_RS09895 (PC0054_19270) - 1886339..1886614 (-) 276 WP_001094372.1 hypothetical protein -
  R8625_RS09900 (PC0054_19280) - 1886789..1887595 (+) 807 WP_054422885.1 XRE family transcriptional regulator -
  R8625_RS09905 (PC0054_19290) - 1887597..1887797 (+) 201 WP_000064302.1 hypothetical protein -
  R8625_RS09910 (PC0054_19300) - 1887971..1889416 (+) 1446 WP_024478469.1 recombinase family protein -
  R8625_RS09920 (PC0054_19310) comGB/cglB 1889498..1890514 (-) 1017 WP_138028521.1 competence type IV pilus assembly protein ComGB Machinery gene
  R8625_RS09925 (PC0054_19320) comGA/cglA/cilD 1890462..1891403 (-) 942 WP_016398028.1 competence type IV pilus ATPase ComGA Machinery gene
  R8625_RS09930 (PC0054_19330) - 1891479..1891844 (-) 366 WP_000286415.1 DUF1033 family protein -
  R8625_RS09935 (PC0054_19340) - 1891995..1893053 (-) 1059 WP_000649473.1 zinc-dependent alcohol dehydrogenase family protein -
  R8625_RS09940 (PC0054_19350) nagA 1893216..1894367 (-) 1152 WP_001134454.1 N-acetylglucosamine-6-phosphate deacetylase -
  R8625_RS09945 (PC0054_19360) - 1894520..1896337 (-) 1818 WP_317649173.1 acyltransferase family protein -
  R8625_RS09950 (PC0054_19370) tgt 1896435..1897577 (-) 1143 WP_317649174.1 tRNA guanosine(34) transglycosylase Tgt -
  R8625_RS09955 (PC0054_19380) - 1897707..1898564 (+) 858 WP_001108863.1 DUF975 family protein -
  R8625_RS09960 (PC0054_19390) pcp 1898592..1899236 (-) 645 Protein_1933 pyroglutamyl-peptidase I -
  R8625_RS09965 (PC0054_19400) - 1899343..1899692 (-) 350 Protein_1934 DUF1304 domain-containing protein -
  R8625_RS09970 - 1899692..1899795 (-) 104 Protein_1935 transcriptional regulator -
  R8625_RS09975 (PC0054_19410) - 1900223..1901335 (-) 1113 WP_317649175.1 LysM domain-containing protein -
  R8625_RS09980 (PC0054_19420) - 1901501..1902121 (-) 621 WP_001172823.1 HAD family hydrolase -
  R8625_RS09985 (PC0054_19430) - 1902125..1903405 (-) 1281 WP_317649176.1 MATE family efflux transporter -
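
The Size (bp) column is consistent with inclusive coordinates (end - start + 1), and adjacent genes can overlap, as comGD/cglD and comGC/cglC do. The sketch below checks both points on the first three rows of the table; the helper names are hypothetical and only illustrate the arithmetic.

```python
# (locus tag, gene name, start, end, size in bp) copied from the table above
rows = [
    ("R8625_RS09590", "comGD/cglD", 1847983, 1848417, 435),
    ("R8625_RS09595", "comGC/cglC", 1848380, 1848643, 264),
    ("R8625_RS09600", "-", 1848645, 1848902, 258),
]


def span_length(start, end):
    """Length of an inclusive coordinate range."""
    return end - start + 1


def overlap_bp(a_start, a_end, b_start, b_end):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


sizes_ok = all(span_length(s, e) == size for _, _, s, e, size in rows)
comgd_comgc_overlap = overlap_bp(*rows[0][2:4], *rows[1][2:4])

print(sizes_ok)             # True
print(comgd_comgc_overlap)  # 38
```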

Sequence


Protein


Length: 144 a.a.        Molecular weight: 15786.23 Da        Isoelectric Point: 10.3971

>NTDB_id=98219 R8625_RS09590 WP_000608686.1 1847983..1848417(-) (comGD/cglD) [Streptococcus pneumoniae strain PZ900700054]
MINKILVKSLKIKAFTILESLLVLGLVSILALGLSGSVQSTFAAVEEQIFFMEFEELYRETQKRSVASQQKTSLNSDGQT
ISNGSQKLPVPKGIQAPSDQSITFDRAGGNSSLAKVEFQTSKGAIRYQLYLGNGKIKRIKETKN
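
The FASTA header packs several fields from the overview into one line. The sketch below splits them with a regular expression and checks the sequence length. The pattern is inferred from this single record only, so the layout it assumes (ID, locus tag, protein ID, coordinates with strand, gene name in parentheses, organism in brackets) may not hold for every entry.

```python
import re

# Header layout inferred from this one record; treat it as an assumption.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+(?P<locus_tag>\S+)\s+(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+\[(?P<organism>.+)\]$"
)

header = (
    ">NTDB_id=98219 R8625_RS09590 WP_000608686.1 1847983..1848417(-) "
    "(comGD/cglD) [Streptococcus pneumoniae strain PZ900700054]"
)
protein = (
    "MINKILVKSLKIKAFTILESLLVLGLVSILALGLSGSVQSTFAAVEEQIFFMEFEELYRETQKRSVASQQKTSLNSDGQT"
    "ISNGSQKLPVPKGIQAPSDQSITFDRAGGNSSLAKVEFQTSKGAIRYQLYLGNGKIKRIKETKN"
)

match = HEADER_RE.match(header)
record = match.groupdict() if match else {}

print(record["locus_tag"], record["gene"], record["strand"])
print(len(protein))  # 144, matching the stated length
```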

Nucleotide


Length: 435 bp

>NTDB_id=98219 R8625_RS09590 WP_000608686.1 1847983..1848417(-) (comGD/cglD) [Streptococcus pneumoniae strain PZ900700054]
ATGATAAACAAAATACTAGTCAAATCGTTAAAGATTAAGGCCTTTACCATACTTGAAAGTCTCTTGGTTTTGGGTCTTGT
GAGTATCCTTGCCCTGGGCTTGTCCGGCTCTGTTCAGTCCACTTTTGCGGCGGTAGAGGAACAGATTTTCTTTATGGAGT
TTGAAGAACTCTATAGGGAAACCCAAAAACGCAGTGTAGCCAGTCAGCAAAAGACTAGTCTGAACTCAGATGGGCAGACG
ATTAGCAATGGCAGTCAAAAGTTGCCAGTCCCTAAAGGAATTCAGGCCCCATCAGATCAAAGTATTACATTTGACCGTGC
TGGGGGCAATTCGTCCCTGGCTAAGGTTGAATTTCAGACCAGTAAAGGAGCGATTCGCTATCAATTATATCTAGGAAATG
GAAAAATTAAACGCATTAAGGAAACAAAAAATTAG
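
The two lengths agree: 435 bp is 145 codons, that is 144 amino acids plus the TAG stop codon. As a rough consistency check, the sketch below translates the first 78 bases of the nucleotide record with the standard codon assignments and compares the result with the start of the protein record. The sequence is used as given, already in coding orientation. The helper `translate` is a hypothetical stand-in for a library routine such as Biopython's.

```python
from itertools import product

# Standard codon table built in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate complete codons from position 0; any trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 78 bases (26 codons) of the nucleotide record above.
cds_start = (
    "ATGATAAACAAAATACTAGTCAAATCGTTAAAGATTAAG"
    "GCCTTTACCATACTTGAAAGTCTCTTGGTTTTGGGTCTT"
)
peptide = translate(cds_start)
print(peptide)  # MINKILVKSLKIKAFTILESLLVLGL

# Length check: 435 bp -> 145 codons -> 144 residues plus one stop codon.
residues = 435 // 3 - 1
print(residues)  # 144
```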

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGD/cglD Streptococcus mitis NCTC 12261 93.056 100 0.931
  comGD/cglD Streptococcus pneumoniae TIGR4 96.241 92.361 0.889
  comGD/cglD Streptococcus pneumoniae Rx1 95.489 92.361 0.882
  comGD/cglD Streptococcus pneumoniae D39 95.489 92.361 0.882
  comGD/cglD Streptococcus pneumoniae R6 95.489 92.361 0.882
  comGD/cglD Streptococcus mitis SK321 94.737 92.361 0.875
  comYD Streptococcus gordonii str. Challis substr. CH1 56.835 96.528 0.549
  comYD Streptococcus mutans UA140 48.837 89.583 0.437
  comYD Streptococcus mutans UA159 48.837 89.583 0.437
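
This page does not define the Ha-value. The listed values are numerically consistent with identity multiplied by coverage, both taken as fractions, and the sketch below checks that assumption against the nine rows above. It is an inference from the numbers, not a documented formula, and the name `ha_value` is hypothetical.

```python
# (identity %, coverage %, listed Ha-value) for the nine hits listed above
hits = [
    (93.056, 100.0, 0.931),
    (96.241, 92.361, 0.889),
    (95.489, 92.361, 0.882),
    (95.489, 92.361, 0.882),
    (95.489, 92.361, 0.882),
    (94.737, 92.361, 0.875),
    (56.835, 96.528, 0.549),
    (48.837, 89.583, 0.437),
    (48.837, 89.583, 0.437),
]


def ha_value(identity_pct, coverage_pct):
    """Assumed definition: product of identity and coverage as fractions."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Compare with the listed values, allowing for rounding to three decimals.
consistent = all(abs(ha_value(i, c) - listed) < 0.0006 for i, c, listed in hits)
print(consistent)  # True
```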


Multiple sequence alignment