Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comYH
Type   Machinery gene
Locus tag   ES276_RS09710
Genome accession   NZ_CP038252
Coordinates   1926384..1927337 (-)
Length   317 a.a.
NCBI ID   WP_000345135.1
UniProt ID   A0A2U3RW99
Organism   Streptococcus pneumoniae strain TVO_1901936
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates     Gene coordinates    Relative position   Distance (bp)
ICE        1863587..1937685    1926384..1927337    within              0
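
The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates; the function name and the labels other than "within" are hypothetical and are not taken from VRprofile2 or from this record.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both intervals are (start, end) tuples, assumed 1-based and inclusive.
    Only the "within" label appears in the record; the other labels are
    illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "before", m_start - g_end
    if g_start > m_end:
        return "after", g_start - m_end
    return "overlapping", 0


# Coordinates from the association summary: comYH against the predicted ICE.
result = relative_position((1926384, 1927337), (1863587, 1937685))
print(result)
```

For the coordinates in this record the gene lies entirely inside the ICE interval, which matches the reported position "within" and distance 0.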


Gene organization within MGE regions


Location: 1863587..1937685
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ES276_RS09400 (ES276_10000) mraY 1866147..1867127 (-) 981 WP_000470785.1 phospho-N-acetylmuramoyl-pentapeptide-transferase -
  ES276_RS09405 (ES276_10005) pbp2X 1867129..1869381 (-) 2253 WP_000872275.1 penicillin-binding protein 2X -
  ES276_RS09410 (ES276_10010) ftsL 1869385..1869702 (-) 318 WP_000818547.1 cell division protein FtsL -
  ES276_RS09415 (ES276_10015) rsmH 1869714..1870664 (-) 951 WP_000159382.1 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH -
  ES276_RS09420 (ES276_10020) - 1870830..1871024 (+) 195 WP_001082472.1 helix-turn-helix transcriptional regulator -
  ES276_RS09425 (ES276_10025) - 1871040..1871582 (+) 543 WP_000712374.1 DUF3278 domain-containing protein -
  ES276_RS09430 (ES276_10030) - 1871593..1871835 (+) 243 WP_000711739.1 hypothetical protein -
  ES276_RS09435 (ES276_10035) - 1871846..1872181 (+) 336 WP_000838121.1 hypothetical protein -
  ES276_RS09440 (ES276_10040) - 1872291..1873292 (-) 1002 WP_000412453.1 LacI family DNA-binding transcriptional regulator -
  ES276_RS09445 (ES276_10045) - 1873352..1875253 (-) 1902 WP_000657091.1 alginate lyase family protein -
  ES276_RS09450 (ES276_10050) yajC 1875275..1875568 (-) 294 WP_000381734.1 preprotein translocase subunit YajC -
  ES276_RS09455 (ES276_10055) - 1875568..1876386 (-) 819 WP_000148012.1 PTS system mannose/fructose/sorbose family transporter subunit IID -
  ES276_RS09460 (ES276_10060) - 1876373..1877152 (-) 780 WP_000026610.1 PTS mannose/fructose/sorbose/N-acetylgalactosamine transporter subunit IIC -
  ES276_RS09465 (ES276_10065) - 1877167..1877658 (-) 492 WP_000178625.1 PTS system mannose/fructose/N-acetylgalactosamine-transporter subunit IIB -
  ES276_RS09470 (ES276_10070) - 1877669..1878859 (-) 1191 WP_000592948.1 glycoside hydrolase family 88 protein -
  ES276_RS09475 (ES276_10075) - 1878871..1879305 (-) 435 WP_000706826.1 PTS sugar transporter subunit IIA -
  ES276_RS09480 (ES276_10080) - 1879576..1880391 (+) 816 WP_000185879.1 gluconate 5-dehydrogenase -
  ES276_RS09485 (ES276_10085) - 1880410..1881051 (+) 642 WP_000684785.1 RpiB/LacA/LacB family sugar-phosphate isomerase -
  ES276_RS09490 (ES276_10090) - 1881082..1882083 (+) 1002 WP_000161485.1 sugar kinase -
  ES276_RS09495 (ES276_10095) - 1882093..1882722 (+) 630 WP_000167294.1 bifunctional 4-hydroxy-2-oxoglutarate aldolase/2-dehydro-3-deoxy-phosphogluconate aldolase -
  ES276_RS09505 (ES276_10110) tnpA 1883110..1883486 (-) 377 Protein_1873 IS200/IS605 family transposase -
  ES276_RS09510 (ES276_10115) - 1883870..1887070 (-) 3201 WP_001193687.1 LPXTG-anchored hyaluronate lyase -
  ES276_RS09515 (ES276_10120) - 1887293..1887769 (+) 477 WP_000203065.1 glutathione peroxidase -
  ES276_RS09520 (ES276_10125) - 1887991..1890204 (-) 2214 WP_238092994.1 glycoside hydrolase family 31 protein -
  ES276_RS09525 (ES276_10135) - 1890605..1890781 (-) 177 WP_000050748.1 hypothetical protein -
  ES276_RS09530 (ES276_10140) celB 1891160..1892518 (-) 1359 WP_238092995.1 PTS cellobiose transporter subunit IIC -
  ES276_RS09535 (ES276_10145) - 1892598..1893104 (-) 507 WP_000358307.1 hypothetical protein -
  ES276_RS09540 (ES276_10150) - 1893135..1893449 (-) 315 WP_001070943.1 PTS cellobiose transporter subunit IIA -
  ES276_RS09545 (ES276_10155) - 1893459..1895438 (-) 1980 Protein_1881 BglG family transcription antiterminator -
  ES276_RS09550 (ES276_10160) - 1895556..1895870 (-) 315 WP_001029637.1 PTS cellobiose transporter subunit IIB -
  ES276_RS09555 (ES276_10165) - 1896000..1896239 (-) 240 WP_000580382.1 SemiSWEET transporter -
  ES276_RS09560 (ES276_10170) - 1896291..1897727 (-) 1437 WP_000206471.1 6-phospho-beta-glucosidase -
  ES276_RS09565 (ES276_10175) - 1897917..1898255 (-) 339 WP_000176382.1 antibiotic biosynthesis monooxygenase family protein -
  ES276_RS09570 (ES276_10185) - 1898411..1899257 (-) 847 Protein_1886 IS630 family transposase -
  ES276_RS09575 (ES276_10190) nadC 1899388..1900260 (-) 873 WP_000022748.1 carboxylating nicotinate-nucleotide diphosphorylase -
  ES276_RS09580 (ES276_10195) - 1900497..1901807 (+) 1311 WP_000581530.1 SLC13 family permease -
  ES276_RS11120 - 1901947..1902075 (+) 129 WP_001833084.1 hypothetical protein -
  ES276_RS09590 (ES276_10205) - 1902173..1902367 (-) 195 WP_000676515.1 ABC transporter ATP-binding protein -
  ES276_RS09595 (ES276_10210) - 1902826..1903554 (-) 729 WP_000105270.1 GntR family transcriptional regulator -
  ES276_RS09600 (ES276_10215) - 1903712..1905121 (+) 1410 WP_000728302.1 glycoside hydrolase family 1 protein -
  ES276_RS09605 (ES276_10220) - 1905139..1906434 (+) 1296 WP_000798146.1 PTS sugar transporter subunit IIC -
  ES276_RS09610 (ES276_10225) - 1906439..1906747 (+) 309 WP_000809624.1 PTS sugar transporter subunit IIB -
  ES276_RS09615 (ES276_10230) - 1906744..1907052 (+) 309 WP_000134124.1 PTS lactose/cellobiose transporter subunit IIA -
  ES276_RS09620 (ES276_10235) adhE 1908038..1910689 (-) 2652 WP_000763969.1 bifunctional acetaldehyde-CoA/alcohol dehydrogenase -
  ES276_RS09625 (ES276_10240) - 1910989..1911399 (-) 411 WP_000429499.1 MORN repeat-containing protein -
  ES276_RS09630 (ES276_10245) - 1911401..1911829 (-) 429 WP_000737440.1 low molecular weight protein-tyrosine-phosphatase -
  ES276_RS09635 (ES276_10250) yajC 1911875..1912174 (-) 300 WP_001069051.1 preprotein translocase subunit YajC -
  ES276_RS09640 (ES276_10255) tkt 1912291..1914267 (-) 1977 WP_000067853.1 transketolase -
  ES276_RS09645 (ES276_10260) ulaG 1914381..1915472 (-) 1092 WP_001134220.1 L-ascorbate 6-phosphate lactonase -
  ES276_RS09650 (ES276_10265) - 1915584..1917257 (-) 1674 WP_000242103.1 transcription antiterminator -
  ES276_RS09655 (ES276_10270) - 1917436..1918140 (-) 705 WP_001077561.1 L-ribulose-5-phosphate 4-epimerase -
  ES276_RS09660 (ES276_10275) - 1918142..1919005 (-) 864 WP_000252125.1 L-ribulose-5-phosphate 3-epimerase -
  ES276_RS09665 (ES276_10280) - 1919009..1919674 (-) 666 WP_000166725.1 3-keto-L-gulonate-6-phosphate decarboxylase UlaD -
  ES276_RS09670 (ES276_10285) - 1919691..1920176 (-) 486 WP_001049932.1 PTS sugar transporter subunit IIA -
  ES276_RS09675 (ES276_10290) - 1920253..1920534 (-) 282 WP_000241469.1 PTS sugar transporter subunit IIB -
  ES276_RS09680 (ES276_10295) - 1920557..1922014 (-) 1458 WP_000454435.1 PTS ascorbate transporter subunit IIC -
  ES276_RS09685 (ES276_10300) - 1922145..1922768 (-) 624 WP_000932644.1 hypothetical protein -
  ES276_RS09690 (ES276_10305) jag 1922819..1923805 (-) 987 WP_000260012.1 RNA-binding cell elongation regulator Jag/EloR -
  ES276_RS09695 (ES276_10310) - 1923824..1924648 (-) 825 WP_000727904.1 membrane protein insertase YidC -
  ES276_RS09700 (ES276_10315) rnpA 1924623..1924994 (-) 372 WP_000739253.1 ribonuclease P protein component -
  ES276_RS10870 - 1925011..1925142 (-) 132 WP_000768904.1 hypothetical protein -
  ES276_RS09705 (ES276_10320) - 1925143..1926333 (-) 1191 WP_000167757.1 acetate kinase -
  ES276_RS09710 (ES276_10325) comYH 1926384..1927337 (-) 954 WP_000345135.1 class I SAM-dependent methyltransferase Machinery gene
  ES276_RS09715 (ES276_10330) - 1927398..1927992 (-) 595 Protein_1916 class I SAM-dependent methyltransferase -
  ES276_RS09720 (ES276_10335) comGG/cglG 1928129..1928542 (-) 414 WP_000265622.1 competence type IV pilus minor pilin ComGG Machinery gene
  ES276_RS09725 (ES276_10340) comGF/cglF 1928520..1928981 (-) 462 WP_000250534.1 competence type IV pilus minor pilin ComGF Machinery gene
  ES276_RS09730 (ES276_10345) comGE/cglE 1928944..1929246 (-) 303 WP_000413382.1 competence type IV pilus minor pilin ComGE Machinery gene
  ES276_RS09735 (ES276_10350) comGD/cglD 1929209..1929613 (-) 405 WP_000588028.1 competence type IV pilus minor pilin ComGD Machinery gene
  ES276_RS09740 (ES276_10355) comGC/cglC 1929606..1929932 (-) 327 WP_000738627.1 comG operon protein ComGC Machinery gene
  ES276_RS09745 (ES276_10360) comGB/cglB 1929934..1930950 (-) 1017 WP_074196785.1 competence type IV pilus assembly protein ComGB Machinery gene
  ES276_RS09750 (ES276_10365) comGA/cglA/cilD 1930898..1931839 (-) 942 WP_000249564.1 competence type IV pilus ATPase ComGA Machinery gene
  ES276_RS09755 (ES276_10370) - 1931915..1932280 (-) 366 WP_000286415.1 DUF1033 family protein -
  ES276_RS09760 (ES276_10375) - 1932431..1933489 (-) 1059 WP_000649468.1 zinc-dependent alcohol dehydrogenase family protein -
  ES276_RS09765 (ES276_10380) nagA 1933652..1934803 (-) 1152 WP_001134457.1 N-acetylglucosamine-6-phosphate deacetylase -
  ES276_RS09770 (ES276_10385) - 1934956..1936773 (-) 1818 WP_001220855.1 acyltransferase family protein -

Sequence


Protein


Length: 317 a.a.   Molecular weight: 35756.99 Da   Isoelectric point: 4.2748

>NTDB_id=352650 ES276_RS09710 WP_000345135.1 1926384..1927337(-) (comYH) [Streptococcus pneumoniae strain TVO_1901936]
MDFEKIEQAYTYLLENVQVIQSDLATNFYDALVEQNSIYLDGETELNQVKENNQTLKRLALRKEEWLKTYQFLLMKAGQT
EPLQANHQFTPDAIALLLVFIVEELFKEEEITILEMGSGMGILGAIFLTSLTKKVDYLGMEVDDLLIDLAASMADVIGLQ
AGFVQGDAVRPQMLKESDVVISDLPVGYYPDDAVASRHQVASSQEHTYAHHLLMEQGLKYLKSDGYAIFLAPSDLLTSPQ
SDLLKEWLKEEASLVAMISLPENLFANAKQSKTIFILQKKNEIAVEPFVYPLASLQDASVLMKFKENFQKWTQGTEI
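
The FASTA header used in this record packs several fields into one line. The sketch below shows one way to split it, assuming the layout seen here (ID, locus tag, protein ID, coordinates with strand, gene name in parentheses, organism in brackets); the regular expression and the field names are hypothetical and may not cover other records.

```python
import re

# Pattern inferred from the single header in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) "
    r"\[(?P<organism>.+)\]$"
)


def parse_header(line):
    """Return the header fields as a dict, or raise ValueError if no match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=352650 ES276_RS09710 WP_000345135.1 "
    "1926384..1927337(-) (comYH) "
    "[Streptococcus pneumoniae strain TVO_1901936]"
)
fields = parse_header(header)
print(fields["gene"], fields["strand"], fields["start"], fields["end"])
```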

Nucleotide


Length: 954 bp

>NTDB_id=352650 ES276_RS09710 WP_000345135.1 1926384..1927337(-) (comYH) [Streptococcus pneumoniae strain TVO_1901936]
ATGGATTTTGAAAAAATTGAACAAGCTTATACCTATTTACTAGAGAATGTCCAAGTCATCCAAAGTGATTTGGCGACCAA
CTTTTATGACGCCTTGGTGGAGCAAAATAGCATCTATCTGGATGGTGAAACTGAGCTAAACCAGGTCAAGGAGAACAATC
AAACCCTTAAGCGTTTAGCACTACGCAAAGAAGAATGGCTCAAGACCTACCAGTTTCTCTTGATGAAGGCTGGGCAAACA
GAACCCTTGCAGGCCAATCACCAGTTTACACCGGATGCTATTGCTTTGCTTTTGGTGTTTATTGTGGAAGAGTTGTTTAA
AGAGGAGGAAATTACTATCCTCGAAATGGGTTCTGGGATGGGAATTCTAGGCGCTATTTTCTTGACCTCGCTTACTAAAA
AGGTGGATTACTTGGGAATGGAAGTGGATGATTTGCTGATTGATCTGGCAGCTAGCATGGCAGATGTAATTGGTTTGCAG
GCTGGCTTTGTCCAAGGAGATGCCGTTCGCCCACAAATGCTCAAAGAAAGCGATGTGGTCATCAGTGACTTGCCTGTCGG
CTATTATCCTGATGATGCCGTTGCGTCGCGCCATCAAGTTGCTTCTAGCCAAGAACATACTTACGCCCATCACTTGCTCA
TGGAACAAGGGCTTAAGTACCTCAAGTCAGACGGATACGCTATTTTTCTAGCTCCGAGTGATTTGTTGACCAGTCCTCAA
AGTGATTTGTTAAAAGAATGGCTGAAAGAAGAGGCGAGTCTGGTTGCTATGATTAGTCTGCCTGAAAATCTCTTTGCTAA
TGCCAAACAATCTAAGACTATTTTTATCTTACAGAAGAAAAATGAAATAGCAGTAGAGCCTTTTGTTTATCCACTTGCTA
GCTTGCAAGATGCAAGTGTTTTAATGAAATTTAAAGAAAATTTTCAAAAATGGACTCAAGGTACTGAAATATAA
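
The reported lengths follow from the coordinates: an inclusive span of 1926384..1927337 is 954 bp, and a 954 bp coding sequence holds 318 codons, one of which is the terminal stop codon (TAA in the sequence above), leaving 317 residues. A minimal check, with hypothetical function names:

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(1926384, 1927337)
aa = protein_length(cds_bp)
print(cds_bp, aa)
```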

Domains


Predicted by InterProScan.

Predicted domain: residues 69-282


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID
  AlphaFold DB   A0A2U3RW99

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140 54.633 98.738 0.539
  comYH Streptococcus mutans UA159 54.313 98.738 0.536
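
The Ha-values listed for the two Streptococcus mutans homologs are numerically consistent with the product of the identity and coverage fractions. The record does not define the Ha-value, so treating it as identity × coverage is an assumption; the sketch below only shows that the arithmetic agrees for these two rows.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, both given as percentages.

    Assumed relationship; the record itself does not state the formula.
    """
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


rows = [
    ("Streptococcus mutans UA140", 54.633, 98.738),
    ("Streptococcus mutans UA159", 54.313, 98.738),
]
computed = [round(ha_value(ident, cov), 3) for _, ident, cov in rows]
print(computed)
```

Rounded to three decimals, both products agree with the Ha-values listed for the two homologs.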


Multiple sequence alignment