Detailed information
Overview
| Name | comYH | Type | Machinery gene |
| Locus tag | PFZ59_RS01895 | Genome accession | NZ_CP116393 |
| Coordinates | 391224..392177 (-) | Length | 317 a.a. |
| NCBI ID | WP_277697384.1 | Uniprot ID | - |
| Organism | Streptococcus suis strain SS/UPM/MY/F001 | ||
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| ICE | 319717..402487 | 391224..392177 | within | 0 |
Gene organization within MGE regions
Location: 319717..402487
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| PFZ59_RS01560 (PFZ59_01560) | rnhC | 320137..321027 (+) | 891 | WP_136671550.1 | ribonuclease HIII | - |
| PFZ59_RS01565 (PFZ59_01565) | lepB | 321037..321666 (+) | 630 | WP_063077047.1 | signal peptidase I | - |
| PFZ59_RS01570 (PFZ59_01570) | - | 321731..324223 (+) | 2493 | WP_208581203.1 | ATP-dependent RecD-like DNA helicase | - |
| PFZ59_RS01575 (PFZ59_01575) | - | 324341..325045 (+) | 705 | WP_277697357.1 | hypothetical protein | - |
| PFZ59_RS01580 (PFZ59_01580) | - | 325215..326135 (-) | 921 | WP_277697358.1 | PfkB family carbohydrate kinase | - |
| PFZ59_RS01585 (PFZ59_01585) | - | 326240..326866 (-) | 627 | WP_208582643.1 | NAD(P)-dependent oxidoreductase | - |
| PFZ59_RS01590 (PFZ59_01590) | - | 327002..327421 (-) | 420 | WP_208582645.1 | Rrf2 family transcriptional regulator | - |
| PFZ59_RS01595 (PFZ59_01595) | dinB | 327464..328531 (-) | 1068 | WP_208582647.1 | DNA polymerase IV | - |
| PFZ59_RS01600 (PFZ59_01600) | pflB | 328847..331192 (+) | 2346 | WP_024376804.1 | formate C-acetyltransferase | - |
| PFZ59_RS01605 (PFZ59_01605) | - | 331376..332627 (+) | 1252 | Protein_296 | ISL3 family transposase | - |
| PFZ59_RS01610 (PFZ59_01610) | - | 333064..334011 (-) | 948 | WP_277697359.1 | serine hydrolase domain-containing protein | - |
| PFZ59_RS01615 (PFZ59_01615) | - | 334008..334739 (-) | 732 | WP_244229275.1 | CppA family protein | - |
| PFZ59_RS01620 (PFZ59_01620) | - | 334924..337251 (+) | 2328 | WP_277697360.1 | Xaa-Pro dipeptidyl-peptidase | - |
| PFZ59_RS01625 (PFZ59_01625) | - | 337236..338082 (+) | 847 | WP_277697006.1 | IS630 family transposase | - |
| PFZ59_RS01630 (PFZ59_01630) | - | 338125..339291 (-) | 1167 | WP_277697361.1 | SIS domain-containing protein | - |
| PFZ59_RS01635 (PFZ59_01635) | - | 339557..339856 (+) | 300 | WP_002941492.1 | YbaB/EbfC family nucleoid-associated protein | - |
| PFZ59_RS01640 (PFZ59_01640) | - | 339897..341663 (-) | 1767 | WP_277697362.1 | glycerophosphodiester phosphodiesterase | - |
| PFZ59_RS01645 (PFZ59_01645) | - | 341848..342396 (+) | 549 | Protein_304 | GNAT family protein | - |
| PFZ59_RS01650 (PFZ59_01650) | - | 342417..342911 (-) | 495 | WP_015646325.1 | DUF536 domain-containing protein | - |
| PFZ59_RS01655 (PFZ59_01655) | - | 343081..344481 (-) | 1401 | WP_277697363.1 | glycoside hydrolase family 1 protein | - |
| PFZ59_RS01660 (PFZ59_01660) | - | 344631..345478 (+) | 848 | WP_208580884.1 | IS630 family transposase | - |
| PFZ59_RS01665 (PFZ59_01665) | - | 345617..346789 (+) | 1173 | WP_277697364.1 | NAD(P)/FAD-dependent oxidoreductase | - |
| PFZ59_RS01670 (PFZ59_01670) | - | 346799..347491 (+) | 693 | WP_277697365.1 | GNAT family protein | - |
| PFZ59_RS01675 (PFZ59_01675) | - | 347572..348270 (+) | 699 | WP_277697366.1 | ABC transporter ATP-binding protein | - |
| PFZ59_RS01680 (PFZ59_01680) | - | 348280..349905 (+) | 1626 | WP_277697367.1 | hypothetical protein | - |
| PFZ59_RS01685 (PFZ59_01685) | - | 350048..351070 (-) | 1023 | WP_277697368.1 | YeiH family protein | - |
| PFZ59_RS01690 (PFZ59_01690) | - | 351080..351403 (-) | 324 | WP_208581183.1 | AzlD domain-containing protein | - |
| PFZ59_RS01695 (PFZ59_01695) | - | 351390..352091 (-) | 702 | WP_208581186.1 | AzlC family ABC transporter permease | - |
| PFZ59_RS01700 (PFZ59_01700) | - | 352266..354467 (-) | 2202 | WP_277697369.1 | alpha-galactosidase | - |
| PFZ59_RS01705 (PFZ59_01705) | - | 354477..355307 (-) | 831 | WP_044756438.1 | carbohydrate ABC transporter permease | - |
| PFZ59_RS01710 (PFZ59_01710) | - | 355318..356211 (-) | 894 | WP_277697761.1 | sugar ABC transporter permease | - |
| PFZ59_RS01715 (PFZ59_01715) | - | 356279..357553 (-) | 1275 | WP_277697370.1 | sugar ABC transporter substrate-binding protein | - |
| PFZ59_RS01720 (PFZ59_01720) | - | 357768..358604 (+) | 837 | WP_014637344.1 | AraC family transcriptional regulator | - |
| PFZ59_RS01725 (PFZ59_01725) | tsaD | 358782..359789 (-) | 1008 | WP_002938526.1 | tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD | - |
| PFZ59_RS01730 (PFZ59_01730) | rimI | 359779..360219 (-) | 441 | WP_277697371.1 | ribosomal protein S18-alanine N-acetyltransferase | - |
| PFZ59_RS01735 (PFZ59_01735) | tsaB | 360216..360899 (-) | 684 | WP_277697372.1 | tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex dimerization subunit type 1 TsaB | - |
| PFZ59_RS01740 (PFZ59_01740) | - | 361308..361592 (-) | 285 | WP_277697373.1 | hypothetical protein | - |
| PFZ59_RS01745 (PFZ59_01745) | - | 361645..362492 (+) | 848 | WP_208580884.1 | IS630 family transposase | - |
| PFZ59_RS01750 (PFZ59_01750) | - | 362738..363742 (-) | 1005 | WP_277696681.1 | IS5 family transposase | - |
| PFZ59_RS01755 (PFZ59_01755) | - | 363793..364056 (-) | 264 | WP_277697374.1 | hypothetical protein | - |
| PFZ59_RS01760 (PFZ59_01760) | - | 364059..364271 (-) | 213 | WP_208581196.1 | hypothetical protein | - |
| PFZ59_RS01765 (PFZ59_01765) | - | 364646..365515 (-) | 870 | WP_208582853.1 | Rgg/GadR/MutR family transcriptional regulator | - |
| PFZ59_RS01770 (PFZ59_01770) | - | 365814..366044 (+) | 231 | WP_002938523.1 | DNA-dependent RNA polymerase subunit epsilon | - |
| PFZ59_RS01775 (PFZ59_01775) | - | 366048..367727 (+) | 1680 | WP_002938522.1 | ribonuclease J | - |
| PFZ59_RS01780 (PFZ59_01780) | glnA | 368104..369450 (-) | 1347 | WP_011921751.1 | type I glutamate--ammonia ligase | - |
| PFZ59_RS01785 (PFZ59_01785) | - | 369479..369850 (-) | 372 | WP_002940041.1 | MerR family transcriptional regulator | - |
| PFZ59_RS01790 (PFZ59_01790) | - | 369926..370441 (-) | 516 | WP_277697375.1 | aromatic acid exporter family protein | - |
| PFZ59_RS01795 (PFZ59_01795) | - | 371198..372454 (+) | 1257 | WP_277697762.1 | ISL3 family transposase | - |
| PFZ59_RS01800 (PFZ59_01800) | - | 372549..373748 (-) | 1200 | WP_014637337.1 | phosphoglycerate kinase | - |
| PFZ59_RS01805 (PFZ59_01805) | gap | 374008..375018 (-) | 1011 | WP_002938507.1 | type I glyceraldehyde-3-phosphate dehydrogenase | - |
| PFZ59_RS01810 (PFZ59_01810) | fusA | 375225..377306 (-) | 2082 | WP_011921743.1 | elongation factor G | - |
| PFZ59_RS01815 (PFZ59_01815) | rpsG | 377736..378206 (-) | 471 | WP_044775067.1 | 30S ribosomal protein S7 | - |
| PFZ59_RS01820 (PFZ59_01820) | rpsL | 378223..378636 (-) | 414 | WP_002940030.1 | 30S ribosomal protein S12 | - |
| PFZ59_RS01825 (PFZ59_01825) | - | 378942..380105 (+) | 1164 | WP_277697376.1 | IS30 family transposase | - |
| PFZ59_RS01830 (PFZ59_01830) | groL | 380421..382043 (-) | 1623 | WP_277697377.1 | chaperonin GroEL | - |
| PFZ59_RS01835 (PFZ59_01835) | groES | 382055..382336 (-) | 282 | WP_014637330.1 | co-chaperone GroES | - |
| PFZ59_RS01840 (PFZ59_01840) | - | 382535..382780 (+) | 246 | WP_277697378.1 | hypothetical protein | - |
| PFZ59_RS01845 (PFZ59_01845) | ssbA | 382899..383294 (-) | 396 | WP_277697379.1 | single-stranded DNA-binding protein | Machinery gene |
| PFZ59_RS01850 (PFZ59_01850) | - | 383349..384128 (+) | 780 | WP_277697380.1 | DUF2785 domain-containing protein | - |
| PFZ59_RS01855 (PFZ59_01855) | ytpR | 384161..384784 (-) | 624 | WP_277697381.1 | YtpR family tRNA-binding protein | - |
| PFZ59_RS01860 (PFZ59_01860) | - | 384803..385759 (-) | 957 | WP_277697382.1 | DUF1002 domain-containing protein | - |
| PFZ59_RS01865 (PFZ59_01865) | - | 385957..386277 (-) | 321 | WP_044669967.1 | thioredoxin family protein | - |
| PFZ59_RS01870 (PFZ59_01870) | - | 386274..386558 (-) | 285 | WP_015646299.1 | DUF4651 domain-containing protein | - |
| PFZ59_RS01875 (PFZ59_01875) | pepA | 386705..387766 (+) | 1062 | WP_208580731.1 | glutamyl aminopeptidase | - |
| PFZ59_RS01880 (PFZ59_01880) | - | 387811..389067 (-) | 1257 | WP_277697383.1 | folylpolyglutamate synthase/dihydrofolate synthase family protein | - |
| PFZ59_RS01885 (PFZ59_01885) | - | 389123..389674 (-) | 552 | WP_208580735.1 | folate family ECF transporter S component | - |
| PFZ59_RS01890 (PFZ59_01890) | - | 389987..391174 (-) | 1188 | WP_208580737.1 | acetate kinase | - |
| PFZ59_RS01895 (PFZ59_01895) | comYH | 391224..392177 (-) | 954 | WP_277697384.1 | class I SAM-dependent methyltransferase | Machinery gene |
| PFZ59_RS01900 (PFZ59_01900) | comGG | 392227..392745 (-) | 519 | WP_277697385.1 | competence type IV pilus minor pilin ComGG | - |
| PFZ59_RS01905 (PFZ59_01905) | comGF/cglF | 392723..393157 (-) | 435 | WP_277697386.1 | competence type IV pilus minor pilin ComGF | Machinery gene |
| PFZ59_RS01910 (PFZ59_01910) | comYE | 393144..393437 (-) | 294 | WP_024405248.1 | competence type IV pilus minor pilin ComGE | Machinery gene |
| PFZ59_RS01915 (PFZ59_01915) | comGD | 393409..393816 (-) | 408 | WP_277697387.1 | competence type IV pilus minor pilin ComGD | - |
| PFZ59_RS01920 (PFZ59_01920) | comYC | 393797..394078 (-) | 282 | WP_024387069.1 | competence type IV pilus major pilin ComGC | Machinery gene |
| PFZ59_RS01925 (PFZ59_01925) | comYB | 394080..395117 (-) | 1038 | WP_277697388.1 | competence type IV pilus assembly protein ComGB | Machinery gene |
| PFZ59_RS01930 (PFZ59_01930) | comYA | 395029..395979 (-) | 951 | WP_105156412.1 | competence type IV pilus ATPase ComGA | Machinery gene |
| PFZ59_RS01935 (PFZ59_01935) | - | 396065..397015 (+) | 951 | WP_277697389.1 | S66 peptidase family protein | - |
| PFZ59_RS01940 (PFZ59_01940) | - | 397050..397415 (-) | 366 | WP_277697390.1 | DUF1033 family protein | - |
| PFZ59_RS01945 (PFZ59_01945) | - | 397488..397943 (-) | 456 | WP_347176380.1 | transposase | - |
| PFZ59_RS01950 (PFZ59_01950) | - | 398008..398337 (-) | 330 | WP_277696633.1 | IS630 transposase-related protein | - |
| PFZ59_RS01955 (PFZ59_01955) | - | 398360..398815 (-) | 456 | WP_347176380.1 | transposase | - |
| PFZ59_RS01960 (PFZ59_01960) | - | 398880..399209 (-) | 330 | WP_277696633.1 | IS630 transposase-related protein | - |
Sequence
Protein
Download Length: 317 a.a. Molecular weight: 35754.88 Da Isoelectric Point: 4.4571
>NTDB_id=777073 PFZ59_RS01895 WP_277697384.1 391224..392177(-) (comYH) [Streptococcus suis strain SS/UPM/MY/F001]
MNFEKIEQAYDLLLENVQTIQNQLGTNIYDAMIEQNAAYVANQHETDLIINNNKTLKQLDLTKEEWRRAYQFLLIKANQT
EPMQYNHQFTPDSIGFILSFLVDQLVPTQKVTVLEIGSGTGNLAQTILNASQKELDYLGIEVDDLLIDLSASIADVMQAD
ISFAQGDAVRPQILKESQVILGDLPIGYYPDDQIASRYQVASPNEHTYAHHLLMEQSLKYLEKDGFAILLAPNDLLTSPQ
SDLLKGWLQEQANIVAMIALPPSLFGKAAMAKSIFVLQRKAARPLAPFVYPLQSLQEPEAIQKFMLNFKNWKQENAI
MNFEKIEQAYDLLLENVQTIQNQLGTNIYDAMIEQNAAYVANQHETDLIINNNKTLKQLDLTKEEWRRAYQFLLIKANQT
EPMQYNHQFTPDSIGFILSFLVDQLVPTQKVTVLEIGSGTGNLAQTILNASQKELDYLGIEVDDLLIDLSASIADVMQAD
ISFAQGDAVRPQILKESQVILGDLPIGYYPDDQIASRYQVASPNEHTYAHHLLMEQSLKYLEKDGFAILLAPNDLLTSPQ
SDLLKGWLQEQANIVAMIALPPSLFGKAAMAKSIFVLQRKAARPLAPFVYPLQSLQEPEAIQKFMLNFKNWKQENAI
Nucleotide
Download Length: 954 bp
>NTDB_id=777073 PFZ59_RS01895 WP_277697384.1 391224..392177(-) (comYH) [Streptococcus suis strain SS/UPM/MY/F001]
ATGAATTTTGAAAAGATCGAACAGGCTTACGACCTGCTATTAGAAAACGTACAGACTATCCAAAACCAGCTAGGTACCAA
TATCTATGATGCCATGATTGAGCAAAATGCTGCTTACGTAGCTAATCAGCATGAGACGGACCTTATTATCAATAATAACA
AGACCTTGAAACAACTAGATTTAACCAAGGAAGAATGGCGTCGTGCCTACCAATTCCTGCTCATCAAGGCCAATCAGACT
GAACCCATGCAGTACAATCACCAGTTCACACCAGACTCTATCGGATTTATCCTATCTTTTCTAGTAGACCAATTGGTGCC
GACTCAAAAGGTGACGGTTCTGGAAATTGGTTCGGGGACAGGCAATCTAGCGCAGACCATTCTCAACGCCAGCCAGAAAG
AATTGGATTACTTGGGGATTGAAGTGGACGACCTCTTGATTGATTTGTCGGCAAGTATTGCTGATGTCATGCAGGCAGAT
ATTTCTTTTGCTCAGGGAGATGCGGTACGTCCGCAGATTTTGAAGGAAAGTCAAGTAATTTTGGGAGATTTGCCTATTGG
TTACTATCCAGATGACCAGATTGCTAGCCGCTATCAGGTCGCCAGCCCAAATGAACATACCTACGCCCATCATTTACTCA
TGGAACAATCCCTGAAATATCTGGAAAAAGATGGCTTTGCGATTTTGTTGGCTCCAAATGATTTATTGACTAGCCCGCAA
AGCGATTTGCTGAAAGGTTGGTTACAGGAGCAAGCCAATATTGTTGCCATGATTGCCCTGCCACCAAGTCTCTTTGGGAA
GGCTGCTATGGCCAAGTCTATTTTTGTCTTGCAAAGGAAAGCAGCTAGACCTCTAGCGCCGTTTGTTTATCCCTTGCAAA
GTCTTCAAGAACCAGAAGCTATTCAGAAGTTCATGCTCAATTTCAAAAATTGGAAGCAAGAGAATGCAATTTAA
ATGAATTTTGAAAAGATCGAACAGGCTTACGACCTGCTATTAGAAAACGTACAGACTATCCAAAACCAGCTAGGTACCAA
TATCTATGATGCCATGATTGAGCAAAATGCTGCTTACGTAGCTAATCAGCATGAGACGGACCTTATTATCAATAATAACA
AGACCTTGAAACAACTAGATTTAACCAAGGAAGAATGGCGTCGTGCCTACCAATTCCTGCTCATCAAGGCCAATCAGACT
GAACCCATGCAGTACAATCACCAGTTCACACCAGACTCTATCGGATTTATCCTATCTTTTCTAGTAGACCAATTGGTGCC
GACTCAAAAGGTGACGGTTCTGGAAATTGGTTCGGGGACAGGCAATCTAGCGCAGACCATTCTCAACGCCAGCCAGAAAG
AATTGGATTACTTGGGGATTGAAGTGGACGACCTCTTGATTGATTTGTCGGCAAGTATTGCTGATGTCATGCAGGCAGAT
ATTTCTTTTGCTCAGGGAGATGCGGTACGTCCGCAGATTTTGAAGGAAAGTCAAGTAATTTTGGGAGATTTGCCTATTGG
TTACTATCCAGATGACCAGATTGCTAGCCGCTATCAGGTCGCCAGCCCAAATGAACATACCTACGCCCATCATTTACTCA
TGGAACAATCCCTGAAATATCTGGAAAAAGATGGCTTTGCGATTTTGTTGGCTCCAAATGATTTATTGACTAGCCCGCAA
AGCGATTTGCTGAAAGGTTGGTTACAGGAGCAAGCCAATATTGTTGCCATGATTGCCCTGCCACCAAGTCTCTTTGGGAA
GGCTGCTATGGCCAAGTCTATTTTTGTCTTGCAAAGGAAAGCAGCTAGACCTCTAGCGCCGTTTGTTTATCCCTTGCAAA
GTCTTCAAGAACCAGAAGCTATTCAGAAGTTCATGCTCAATTTCAAAAATTGGAAGCAAGAGAATGCAATTTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comYH | Streptococcus mutans UA140 |
60.443 |
99.685 |
0.603 |
| comYH | Streptococcus mutans UA159 |
60.127 |
99.685 |
0.599 |