Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   ACFKHK_RS09955 Genome accession   NZ_CP170760
Coordinates   1894092..1895045 (-) Length   317 a.a.
NCBI ID   WP_392462068.1    Uniprot ID   -
Organism   Streptococcus parasuis strain FZ2     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 1852027..1927113 1894092..1895045 within 0


Gene organization within MGE regions


Location: 1852027..1927113
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACFKHK_RS09730 tcrZ 1852584..1852790 (-) 207 WP_002335389.1 copper chaperone TcrZ -
  ACFKHK_RS09735 - 1852815..1853015 (-) 201 Protein_1889 HAD family hydrolase -
  ACFKHK_RS09740 gap 1853310..1854311 (-) 1002 WP_171989921.1 type I glyceraldehyde-3-phosphate dehydrogenase -
  ACFKHK_RS09745 - 1854609..1855199 (-) 591 WP_174846436.1 YdhK family protein -
  ACFKHK_RS09750 - 1855512..1855967 (-) 456 WP_039671262.1 CopY/TcrY family copper transport repressor -
  ACFKHK_RS09755 - 1856535..1856900 (-) 366 WP_253952800.1 hypothetical protein -
  ACFKHK_RS09760 - 1856900..1857364 (-) 465 WP_171989923.1 replication initiator protein A -
  ACFKHK_RS09765 - 1857366..1857556 (-) 191 Protein_1895 hypothetical protein -
  ACFKHK_RS09770 - 1857749..1858921 (+) 1173 WP_392462037.1 NAD(P)/FAD-dependent oxidoreductase -
  ACFKHK_RS09775 - 1859015..1859713 (+) 699 WP_171989926.1 ATP-binding cassette domain-containing protein -
  ACFKHK_RS09780 - 1859723..1861348 (+) 1626 WP_392462040.1 hypothetical protein -
  ACFKHK_RS09785 ald 1861450..1862547 (+) 1098 WP_392462042.1 alanine dehydrogenase -
  ACFKHK_RS09790 - 1862669..1863691 (-) 1023 WP_216806357.1 YeiH family protein -
  ACFKHK_RS09795 - 1863701..1864024 (-) 324 WP_174846424.1 AzlD domain-containing protein -
  ACFKHK_RS09800 - 1864011..1864712 (-) 702 WP_171989929.1 AzlC family ABC transporter permease -
  ACFKHK_RS09805 - 1865043..1865831 (-) 789 WP_277745447.1 ABC transporter permease subunit -
  ACFKHK_RS09810 - 1865828..1866661 (-) 834 WP_274504954.1 ABC transporter ATP-binding protein -
  ACFKHK_RS09815 tsaD 1866651..1867688 (-) 1038 WP_392462045.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD -
  ACFKHK_RS09820 rimI 1867678..1868118 (-) 441 WP_130554967.1 ribosomal protein S18-alanine N-acetyltransferase -
  ACFKHK_RS09825 tsaB 1868115..1868798 (-) 684 WP_392462047.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex dimerization subunit type 1 TsaB -
  ACFKHK_RS09830 - 1869034..1869213 (-) 180 WP_277940764.1 hypothetical protein -
  ACFKHK_RS09835 - 1869536..1869766 (+) 231 WP_024392378.1 RNA polymerase epsilon subunit -
  ACFKHK_RS09840 rnjA 1869770..1871449 (+) 1680 WP_239604291.1 ribonuclease J1 -
  ACFKHK_RS09845 glnA 1871781..1873127 (-) 1347 WP_171989726.1 type I glutamate--ammonia ligase -
  ACFKHK_RS09850 - 1873155..1873526 (-) 372 WP_024392055.1 MerR family transcriptional regulator -
  ACFKHK_RS09855 - 1873602..1874117 (-) 516 WP_024392054.1 aromatic acid exporter family protein -
  ACFKHK_RS09860 - 1874766..1875965 (-) 1200 WP_392462051.1 phosphoglycerate kinase -
  ACFKHK_RS09865 - 1876103..1876945 (-) 843 WP_392462053.1 aldo/keto reductase -
  ACFKHK_RS09870 - 1876963..1878159 (-) 1197 WP_216806349.1 iron-containing alcohol dehydrogenase -
  ACFKHK_RS09875 gap 1878347..1879357 (-) 1011 WP_174846416.1 type I glyceraldehyde-3-phosphate dehydrogenase -
  ACFKHK_RS09880 fusA 1879584..1881665 (-) 2082 WP_171989721.1 elongation factor G -
  ACFKHK_RS09885 rpsG 1881996..1882466 (-) 471 WP_105182784.1 30S ribosomal protein S7 -
  ACFKHK_RS09890 rpsL 1882483..1882896 (-) 414 WP_105130854.1 30S ribosomal protein S12 -
  ACFKHK_RS09895 groL 1883126..1884748 (-) 1623 WP_392462056.1 chaperonin GroEL -
  ACFKHK_RS09900 groES 1884847..1885128 (-) 282 WP_171989719.1 co-chaperone GroES -
  ACFKHK_RS09905 ssbA 1885698..1886093 (-) 396 WP_392462058.1 single-stranded DNA-binding protein Machinery gene
  ACFKHK_RS09910 - 1886175..1887107 (+) 933 WP_392462059.1 alpha/beta fold hydrolase -
  ACFKHK_RS09915 ytpR 1887176..1887799 (-) 624 WP_392462060.1 YtpR family tRNA-binding protein -
  ACFKHK_RS09920 - 1887818..1888774 (-) 957 WP_392462062.1 DUF1002 domain-containing protein -
  ACFKHK_RS09925 - 1888818..1889138 (-) 321 WP_392462064.1 thioredoxin family protein -
  ACFKHK_RS09930 - 1889135..1889419 (-) 285 WP_392462066.1 DUF4651 domain-containing protein -
  ACFKHK_RS09935 pepA 1889632..1890693 (+) 1062 WP_277940770.1 glutamyl aminopeptidase -
  ACFKHK_RS09940 - 1890738..1891994 (-) 1257 WP_392462675.1 bifunctional folylpolyglutamate synthase/dihydrofolate synthase -
  ACFKHK_RS09945 - 1892049..1892600 (-) 552 WP_216806342.1 folate family ECF transporter S component -
  ACFKHK_RS09950 - 1892855..1894042 (-) 1188 WP_239604301.1 acetate kinase -
  ACFKHK_RS09955 comYH 1894092..1895045 (-) 954 WP_392462068.1 class I SAM-dependent methyltransferase Machinery gene
  ACFKHK_RS09960 comGG 1895065..1895565 (-) 501 WP_171989710.1 competence type IV pilus minor pilin ComGG -
  ACFKHK_RS09965 comYF 1895543..1895977 (-) 435 WP_174846408.1 competence type IV pilus minor pilin ComGF Machinery gene
  ACFKHK_RS09970 comGE 1895964..1896257 (-) 294 WP_171989708.1 competence type IV pilus minor pilin ComGE -
  ACFKHK_RS09975 comGD 1896229..1896633 (-) 405 WP_171989707.1 competence type IV pilus minor pilin ComGD -
  ACFKHK_RS09980 comYC 1896617..1896901 (-) 285 WP_171989706.1 competence type IV pilus major pilin ComGC Machinery gene
  ACFKHK_RS09985 comYB 1896903..1897940 (-) 1038 WP_171989705.1 competence type IV pilus assembly protein ComGB Machinery gene
  ACFKHK_RS09990 comYA 1897852..1898802 (-) 951 WP_392462071.1 competence type IV pilus ATPase ComGA Machinery gene
  ACFKHK_RS09995 - 1898890..1899840 (+) 951 WP_277940773.1 S66 peptidase family protein -
  ACFKHK_RS10000 - 1899873..1900241 (-) 369 WP_277943115.1 DUF1033 family protein -
  ACFKHK_RS10005 rpoC 1900366..1904010 (-) 3645 WP_171989701.1 DNA-directed RNA polymerase subunit beta' -
  ACFKHK_RS10010 rpoB 1904188..1907760 (-) 3573 WP_174846402.1 DNA-directed RNA polymerase subunit beta -
  ACFKHK_RS10015 pbp1b 1908420..1910825 (-) 2406 WP_239604306.1 penicillin-binding protein PBP1B -
  ACFKHK_RS10020 tyrS 1910972..1912231 (+) 1260 WP_392462074.1 tyrosine--tRNA ligase -
  ACFKHK_RS10025 - 1912529..1912966 (-) 438 WP_392462076.1 DUF1492 domain-containing protein -
  ACFKHK_RS10030 - 1913370..1913933 (-) 564 Protein_1948 hypothetical protein -
  ACFKHK_RS10035 - 1914114..1914287 (-) 174 WP_392462078.1 hypothetical protein -
  ACFKHK_RS10040 - 1914624..1916126 (-) 1503 WP_392462080.1 phage/plasmid primase, P4 family -
  ACFKHK_RS10045 - 1916146..1917015 (-) 870 WP_392462082.1 primase alpha helix C-terminal domain-containing protein -
  ACFKHK_RS10050 - 1917150..1917431 (-) 282 WP_392462084.1 XRE family transcriptional regulator -
  ACFKHK_RS10055 - 1917424..1917759 (-) 336 WP_392462085.1 hypothetical protein -
  ACFKHK_RS10060 - 1917756..1917905 (-) 150 WP_392462087.1 hypothetical protein -
  ACFKHK_RS10065 - 1917892..1918107 (-) 216 WP_392462089.1 hypothetical protein -
  ACFKHK_RS10070 - 1918333..1918536 (-) 204 WP_277945849.1 hypothetical protein -
  ACFKHK_RS10075 - 1918533..1918766 (-) 234 WP_392462091.1 hypothetical protein -
  ACFKHK_RS10080 - 1919219..1919848 (-) 630 WP_392462093.1 Rha family transcriptional regulator -
  ACFKHK_RS10085 - 1919866..1920066 (-) 201 WP_029188910.1 helix-turn-helix transcriptional regulator -
  ACFKHK_RS10090 - 1920275..1920769 (+) 495 WP_392462095.1 helix-turn-helix domain-containing protein -
  ACFKHK_RS10095 - 1921504..1922649 (+) 1146 WP_392462097.1 tyrosine-type recombinase/integrase -
  ACFKHK_RS10100 - 1922822..1923199 (+) 378 WP_239604307.1 HIT family protein -
  ACFKHK_RS10105 - 1923311..1924021 (+) 711 WP_239604308.1 PASTA domain-containing protein -
  ACFKHK_RS10110 - 1924070..1924279 (-) 210 WP_130554922.1 heavy-metal-associated domain-containing protein -
  ACFKHK_RS10115 - 1924447..1924896 (-) 450 WP_392462100.1 CopY/TcrY family copper transport repressor -
  ACFKHK_RS10120 - 1924997..1926505 (-) 1509 WP_392462676.1 zinc ABC transporter substrate-binding protein AdcA -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 35592.78 Da        Isoelectric Point: 4.5696

>NTDB_id=1059286 ACFKHK_RS09955 WP_392462068.1 1894092..1895045(-) (comYH) [Streptococcus parasuis strain FZ2]
MKYEKIEQAYNLLLENVQTIQNQLGTNIYDAMIEQNAAYVAGQHEPELVLNNNQVLRELALTKEEWRRAYQFLLIKANQT
EPMQYNHQFTPDSIGFILSFLVDQLVSTPRVTVLEIGSGTGNLAQTVLNASQKELDYLGIEIDDLLIDLSASIADVMGAA
ISFAQGDAVRPQILKESQVILGDLPIGYYPDDQIASRYQVASPNEHTYAHHLLMEQSLKYLEKDGFAILLAPNDLLTSPQ
SDLLKGWLQEQANIVAMIALPPNLFGKAAMAKSIFVLQKKAARPLTPFVYPLQSLQEPEAIQKFMVNFKNWKQENAI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=1059286 ACFKHK_RS09955 WP_392462068.1 1894092..1895045(-) (comYH) [Streptococcus parasuis strain FZ2]
ATGAAATATGAAAAAATTGAACAGGCCTACAACCTGCTTTTAGAAAATGTACAGACAATTCAAAATCAACTTGGGACCAA
TATTTATGATGCTATGATTGAGCAAAATGCTGCCTATGTAGCTGGTCAACATGAACCCGAACTTGTTCTCAACAACAATC
AAGTCTTGAGAGAATTAGCCTTGACAAAGGAAGAATGGCGTCGTGCTTACCAATTCTTGCTCATTAAAGCTAACCAGACA
GAACCGATGCAGTACAACCATCAGTTCACACCTGACTCTATCGGATTTATCTTATCTTTCCTTGTGGACCAGTTGGTCTC
TACTCCAAGAGTGACGGTTTTGGAAATTGGTTCTGGTACAGGAAACTTAGCTCAGACCGTTCTCAACGCCAGCCAGAAGG
AATTAGACTATTTAGGGATTGAAATAGATGACCTCTTGATTGATTTATCGGCTAGTATCGCGGATGTCATGGGAGCAGCT
ATTTCATTTGCTCAGGGAGATGCGGTACGTCCGCAGATTTTGAAAGAAAGCCAAGTGATTTTGGGAGATTTGCCTATTGG
CTACTATCCAGATGACCAGATTGCTAGCCGTTATCAGGTAGCTAGTCCAAATGAACATACCTACGCCCATCATTTGCTCA
TGGAACAGTCCCTCAAATATCTGGAAAAAGATGGATTTGCGATTTTACTGGCTCCAAATGATTTATTAACGAGTCCGCAA
AGCGATTTATTAAAAGGTTGGTTACAAGAACAAGCCAATATTGTTGCCATGATTGCCCTGCCGCCAAATCTTTTTGGGAA
GGCTGCCATGGCCAAGTCCATTTTTGTGTTACAAAAGAAAGCTGCCAGACCTTTAACTCCTTTCGTTTATCCTTTGCAAA
GCCTGCAAGAACCAGAAGCGATTCAGAAGTTCATGGTCAATTTCAAAAATTGGAAGCAAGAGAATGCAATTTGA

Domains


Predicted by InterproScan.

(68-283)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

59.177

99.685

0.59

  comYH Streptococcus mutans UA159

58.861

99.685

0.587


Multiple sequence alignment