Detailed information
Overview
| Name | comYH | Type | Machinery gene |
| Locus tag | H7679_RS01070 | Genome accession | NZ_AP023392 |
| Coordinates | 163938..164891 (+) | Length | 317 a.a. |
| NCBI ID | WP_184495879.1 | Uniprot ID | - |
| Organism | Streptococcus suis strain DAT300 | ||
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 123402..175862 | 163938..164891 | within | 0 |
Gene organization within MGE regions
Location: 123402..175862
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| H7679_RS00760 (DAT300_01170) | - | 123402..124841 (-) | 1440 | WP_172077518.1 | recombinase family protein | - |
| H7679_RS00765 (DAT300_01180) | - | 124976..125545 (-) | 570 | WP_197970253.1 | hypothetical protein | - |
| H7679_RS00770 (DAT300_01190) | - | 125559..126317 (-) | 759 | WP_105131553.1 | helix-turn-helix domain-containing protein | - |
| H7679_RS00775 (DAT300_01200) | - | 126517..126720 (+) | 204 | WP_105131554.1 | helix-turn-helix domain-containing protein | - |
| H7679_RS00780 (DAT300_01210) | - | 126775..127032 (+) | 258 | WP_105131555.1 | hypothetical protein | - |
| H7679_RS00785 (DAT300_01220) | - | 127008..127154 (+) | 147 | WP_184495818.1 | BOW99_gp33 family protein | - |
| H7679_RS00790 (DAT300_01230) | - | 127197..127544 (+) | 348 | WP_105131556.1 | transcriptional regulator | - |
| H7679_RS00795 (DAT300_01240) | - | 127755..128387 (+) | 633 | WP_184495821.1 | polymer-forming cytoskeletal protein | - |
| H7679_RS00800 (DAT300_01250) | - | 128391..129083 (+) | 693 | WP_074412752.1 | ERF family protein | - |
| H7679_RS00805 (DAT300_01260) | - | 129087..130109 (+) | 1023 | WP_184495823.1 | DUF1351 domain-containing protein | - |
| H7679_RS00810 (DAT300_01270) | - | 130109..130756 (+) | 648 | WP_184495824.1 | hypothetical protein | - |
| H7679_RS00815 (DAT300_01280) | - | 130758..131291 (+) | 534 | WP_184495827.1 | MazG-like family protein | - |
| H7679_RS00820 (DAT300_01290) | - | 131291..131467 (+) | 177 | WP_171841111.1 | hypothetical protein | - |
| H7679_RS00825 (DAT300_01300) | ssbA | 131457..131987 (+) | 531 | WP_184495829.1 | single-stranded DNA-binding protein | Machinery gene |
| H7679_RS00830 (DAT300_01310) | - | 132003..132281 (+) | 279 | WP_184495831.1 | hypothetical protein | - |
| H7679_RS00835 (DAT300_01320) | - | 132282..133070 (+) | 789 | WP_184495833.1 | DNA methyltransferase | - |
| H7679_RS00840 (DAT300_01330) | - | 133067..133270 (+) | 204 | WP_074392405.1 | hypothetical protein | - |
| H7679_RS00845 (DAT300_01340) | - | 133267..133746 (+) | 480 | WP_184495836.1 | helix-turn-helix domain-containing protein | - |
| H7679_RS00850 (DAT300_01350) | - | 133758..134507 (+) | 750 | WP_029752016.1 | DNA-methyltransferase | - |
| H7679_RS00855 (DAT300_01360) | - | 134524..134961 (+) | 438 | WP_024383995.1 | hypothetical protein | - |
| H7679_RS00860 (DAT300_01370) | - | 134963..135697 (+) | 735 | WP_184495839.1 | hypothetical protein | - |
| H7679_RS00865 (DAT300_01380) | - | 135718..136134 (+) | 417 | WP_105119255.1 | hypothetical protein | - |
| H7679_RS11505 (DAT300_01390) | - | 136191..136316 (+) | 126 | WP_269088801.1 | hypothetical protein | - |
| H7679_RS00870 (DAT300_01400) | - | 136329..137180 (+) | 852 | WP_024410597.1 | prohibitin family protein | - |
| H7679_RS00875 (DAT300_01410) | - | 137181..137483 (+) | 303 | WP_014736101.1 | DUF1372 family protein | - |
| H7679_RS00880 (DAT300_01420) | - | 137484..137714 (+) | 231 | WP_014736100.1 | hypothetical protein | - |
| H7679_RS00885 (DAT300_01430) | - | 137707..137937 (+) | 231 | WP_074412312.1 | hypothetical protein | - |
| H7679_RS00890 (DAT300_01440) | - | 137934..138344 (+) | 411 | WP_074415751.1 | endodeoxyribonuclease RusA | - |
| H7679_RS00895 (DAT300_01450) | - | 138337..138675 (+) | 339 | WP_044767980.1 | hypothetical protein | - |
| H7679_RS00900 (DAT300_01460) | - | 138668..138904 (+) | 237 | WP_044767978.1 | hypothetical protein | - |
| H7679_RS00905 (DAT300_01470) | - | 138905..139285 (+) | 381 | WP_014736097.1 | hypothetical protein | - |
| H7679_RS00910 (DAT300_01480) | - | 139354..140025 (+) | 672 | WP_184495841.1 | DUF4417 domain-containing protein | - |
| H7679_RS00915 (DAT300_01490) | - | 140022..140405 (+) | 384 | WP_014736095.1 | hypothetical protein | - |
| H7679_RS00920 (DAT300_01500) | - | 140595..141074 (+) | 480 | WP_044767977.1 | terminase small subunit | - |
| H7679_RS00925 (DAT300_01510) | - | 141061..142374 (+) | 1314 | WP_184495844.1 | PBSX family phage terminase large subunit | - |
| H7679_RS00930 (DAT300_01520) | - | 142386..143975 (+) | 1590 | WP_184495846.1 | phage portal protein | - |
| H7679_RS00935 (DAT300_01530) | - | 143959..144213 (+) | 255 | WP_415669128.1 | hypothetical protein | - |
| H7679_RS00940 (DAT300_01540) | - | 144216..145364 (+) | 1149 | WP_184495850.1 | phage minor capsid protein | - |
| H7679_RS00945 (DAT300_01550) | - | 145511..146128 (+) | 618 | WP_184495852.1 | phage scaffolding protein | - |
| H7679_RS00950 (DAT300_01560) | - | 146132..146971 (+) | 840 | WP_016056180.1 | N4-gp56 family major capsid protein | - |
| H7679_RS00955 (DAT300_01570) | - | 146971..147156 (+) | 186 | WP_074412321.1 | Rho termination factor N-terminal domain-containing protein | - |
| H7679_RS00960 (DAT300_01580) | - | 147134..147361 (+) | 228 | WP_184495855.1 | hypothetical protein | - |
| H7679_RS00965 (DAT300_01590) | - | 147403..147798 (+) | 396 | WP_184495858.1 | hypothetical protein | - |
| H7679_RS00970 (DAT300_01600) | - | 147788..148114 (+) | 327 | WP_184495860.1 | putative minor capsid protein | - |
| H7679_RS00975 (DAT300_01610) | - | 148114..148464 (+) | 351 | WP_029751987.1 | minor capsid protein | - |
| H7679_RS00980 (DAT300_01620) | - | 148464..148871 (+) | 408 | WP_014736083.1 | minor capsid protein | - |
| H7679_RS00985 (DAT300_01630) | - | 148875..149333 (+) | 459 | WP_014736082.1 | phage tail tube protein | - |
| H7679_RS00990 (DAT300_01640) | - | 149356..149724 (+) | 369 | WP_029751986.1 | hypothetical protein | - |
| H7679_RS00995 (DAT300_01650) | - | 149724..150311 (+) | 588 | WP_184495862.1 | Gp15 family bacteriophage protein | - |
| H7679_RS01000 (DAT300_01660) | - | 150331..153744 (+) | 3414 | WP_184495864.1 | hypothetical protein | - |
| H7679_RS01005 (DAT300_01670) | - | 153741..155243 (+) | 1503 | WP_174854417.1 | distal tail protein Dit | - |
| H7679_RS01010 (DAT300_01680) | - | 155243..157927 (+) | 2685 | WP_184495866.1 | phage tail spike protein | - |
| H7679_RS01015 (DAT300_01690) | - | 157941..160013 (+) | 2073 | WP_184495869.1 | DUF859 family phage minor structural protein | - |
| H7679_RS01020 (DAT300_01700) | - | 160075..160341 (+) | 267 | WP_184495871.1 | hypothetical protein | - |
| H7679_RS01025 (DAT300_01710) | - | 160354..160809 (+) | 456 | WP_184495873.1 | hypothetical protein | - |
| H7679_RS01030 (DAT300_01720) | - | 160784..160987 (+) | 204 | WP_029751977.1 | phage holin | - |
| H7679_RS01035 (DAT300_01730) | - | 161117..161827 (+) | 711 | WP_184495875.1 | CHAP domain-containing protein | - |
| H7679_RS01040 (DAT300_01740) | prx | 161973..162158 (+) | 186 | WP_016056161.1 | hypothetical protein | Regulator |
| H7679_RS01045 (DAT300_01750) | comYC | 162159..162419 (+) | 261 | WP_184495877.1 | competence type IV pilus major pilin ComGC | Machinery gene |
| H7679_RS01050 (DAT300_01760) | comGD | 162400..162807 (+) | 408 | WP_024395446.1 | competence type IV pilus minor pilin ComGD | - |
| H7679_RS01055 (DAT300_01770) | comYE | 162779..163072 (+) | 294 | WP_024394703.1 | competence type IV pilus minor pilin ComGE | Machinery gene |
| H7679_RS01060 (DAT300_01780) | comGF/cglF | 163059..163493 (+) | 435 | WP_014637316.1 | competence type IV pilus minor pilin ComGF | Machinery gene |
| H7679_RS01065 (DAT300_01790) | comGG | 163471..163881 (+) | 411 | WP_044676305.1 | competence type IV pilus minor pilin ComGG | - |
| H7679_RS01070 (DAT300_01800) | comYH | 163938..164891 (+) | 954 | WP_184495879.1 | class I SAM-dependent methyltransferase | Machinery gene |
| H7679_RS01075 (DAT300_01810) | - | 164941..166128 (+) | 1188 | WP_024401510.1 | acetate kinase | - |
| H7679_RS01080 (DAT300_01820) | - | 166443..166994 (+) | 552 | WP_024395442.1 | folate family ECF transporter S component | - |
| H7679_RS01085 (DAT300_01830) | - | 167050..168306 (+) | 1257 | WP_024395441.1 | bifunctional folylpolyglutamate synthase/dihydrofolate synthase | - |
| H7679_RS01090 (DAT300_01840) | pepA | 168351..169412 (-) | 1062 | WP_024395440.1 | glutamyl aminopeptidase | - |
| H7679_RS01095 (DAT300_01850) | - | 169559..169843 (+) | 285 | WP_024395439.1 | DUF4651 domain-containing protein | - |
| H7679_RS01100 (DAT300_01860) | - | 169840..170160 (+) | 321 | WP_105143421.1 | thioredoxin family protein | - |
| H7679_RS01105 (DAT300_01870) | - | 170205..171161 (+) | 957 | WP_024395438.1 | DUF1002 domain-containing protein | - |
| H7679_RS01110 (DAT300_01880) | ytpR | 171180..171803 (+) | 624 | WP_105104958.1 | YtpR family tRNA-binding protein | - |
| H7679_RS01115 (DAT300_01890) | - | 171836..172615 (-) | 780 | WP_105104959.1 | DUF2785 domain-containing protein | - |
| H7679_RS01120 (DAT300_01900) | ssbA | 172670..173065 (+) | 396 | WP_011921737.1 | single-stranded DNA-binding protein | Machinery gene |
| H7679_RS01125 (DAT300_01910) | - | 173539..173751 (-) | 213 | WP_162495426.1 | hypothetical protein | - |
| H7679_RS01130 (DAT300_01920) | groES | 173948..174229 (+) | 282 | WP_014637330.1 | co-chaperone GroES | - |
| H7679_RS01135 (DAT300_01930) | groL | 174240..175862 (+) | 1623 | WP_044692542.1 | chaperonin GroEL | - |
Sequence
Protein
Download Length: 317 a.a. Molecular weight: 35774.88 Da Isoelectric Point: 4.4442
>NTDB_id=82934 H7679_RS01070 WP_184495879.1 163938..164891(+) (comYH) [Streptococcus suis strain DAT300]
MNFEKIEQAYDLLLENVQTIQNQLGTNIYDAMIEQNAAYVANQHETDLVVNNNQTLKQLDLTKEEWRRAYQFLLIKANQT
EPMQYNHQFTPDSIGFILSFLVDQLVPTQKVTVLEIGSGTGNLAQTILNASQKELDYLGIEVDDLLIDLSASIADVMQAD
ISFAQGDAVRPQILKESQVILGDLPIGYYPDDQIASRYQVASPNEHTYAHHLLMEQSLKYLEKNGFAILLAPNDLLTSPQ
SDLLKGWLQEQANIVAMIALPPNLFGKAAMAKSIFVLQKKAARSLMPFVYPLQSLQEPEAIQKFMLNFKNWKQENAI
MNFEKIEQAYDLLLENVQTIQNQLGTNIYDAMIEQNAAYVANQHETDLVVNNNQTLKQLDLTKEEWRRAYQFLLIKANQT
EPMQYNHQFTPDSIGFILSFLVDQLVPTQKVTVLEIGSGTGNLAQTILNASQKELDYLGIEVDDLLIDLSASIADVMQAD
ISFAQGDAVRPQILKESQVILGDLPIGYYPDDQIASRYQVASPNEHTYAHHLLMEQSLKYLEKNGFAILLAPNDLLTSPQ
SDLLKGWLQEQANIVAMIALPPNLFGKAAMAKSIFVLQKKAARSLMPFVYPLQSLQEPEAIQKFMLNFKNWKQENAI
Nucleotide
Download Length: 954 bp
>NTDB_id=82934 H7679_RS01070 WP_184495879.1 163938..164891(+) (comYH) [Streptococcus suis strain DAT300]
ATGAATTTTGAAAAGATCGAACAGGCTTACGACCTGCTATTAGAAAACGTACAGACTATCCAAAACCAGCTAGGTACCAA
TATCTATGATGCCATGATTGAGCAAAATGCTGCTTACGTAGCTAATCAGCATGAGACGGACCTTGTTGTCAATAACAATC
AGACCTTGAAACAACTAGATTTAACCAAGGAAGAATGGCGTCGTGCCTACCAATTCCTGCTCATCAAGGCCAATCAGACT
GAACCCATGCAGTATAATCACCAGTTCACACCAGACTCTATCGGATTTATCCTATCTTTTCTAGTAGACCAATTGGTGCC
GACTCAAAAGGTGACGGTTCTGGAAATTGGTTCGGGGACAGGCAATCTAGCTCAGACCATTCTCAACGCCAGCCAGAAAG
AATTGGATTACTTGGGGATCGAAGTGGACGACCTCTTGATTGATTTGTCGGCAAGTATTGCGGATGTCATGCAGGCAGAT
ATTTCTTTTGCTCAGGGAGATGCGGTACGTCCGCAGATTTTGAAGGAAAGTCAAGTAATTTTGGGAGATTTGCCTATTGG
CTACTATCCAGATGACCAGATTGCTAGCCGCTATCAGGTCGCCAGCCCAAATGAACATACCTACGCCCATCATTTACTCA
TGGAACAATCCCTCAAATATCTGGAAAAAAATGGTTTTGCGATTTTGTTGGCTCCAAATGATTTATTGACTAGTCCGCAA
AGCGATTTGCTGAAAGGTTGGTTACAGGAGCAAGCCAATATTGTTGCCATGATTGCCCTGCCACCAAATCTCTTTGGGAA
GGCTGCTATGGCCAAGTCTATTTTTGTCTTGCAAAAGAAAGCTGCAAGATCGTTGATGCCGTTTGTTTATCCCTTACAAA
GTCTTCAAGAACCAGAAGCTATTCAGAAGTTCATGCTCAATTTCAAAAATTGGAAGCAAGAGAATGCAATTTAA
ATGAATTTTGAAAAGATCGAACAGGCTTACGACCTGCTATTAGAAAACGTACAGACTATCCAAAACCAGCTAGGTACCAA
TATCTATGATGCCATGATTGAGCAAAATGCTGCTTACGTAGCTAATCAGCATGAGACGGACCTTGTTGTCAATAACAATC
AGACCTTGAAACAACTAGATTTAACCAAGGAAGAATGGCGTCGTGCCTACCAATTCCTGCTCATCAAGGCCAATCAGACT
GAACCCATGCAGTATAATCACCAGTTCACACCAGACTCTATCGGATTTATCCTATCTTTTCTAGTAGACCAATTGGTGCC
GACTCAAAAGGTGACGGTTCTGGAAATTGGTTCGGGGACAGGCAATCTAGCTCAGACCATTCTCAACGCCAGCCAGAAAG
AATTGGATTACTTGGGGATCGAAGTGGACGACCTCTTGATTGATTTGTCGGCAAGTATTGCGGATGTCATGCAGGCAGAT
ATTTCTTTTGCTCAGGGAGATGCGGTACGTCCGCAGATTTTGAAGGAAAGTCAAGTAATTTTGGGAGATTTGCCTATTGG
CTACTATCCAGATGACCAGATTGCTAGCCGCTATCAGGTCGCCAGCCCAAATGAACATACCTACGCCCATCATTTACTCA
TGGAACAATCCCTCAAATATCTGGAAAAAAATGGTTTTGCGATTTTGTTGGCTCCAAATGATTTATTGACTAGTCCGCAA
AGCGATTTGCTGAAAGGTTGGTTACAGGAGCAAGCCAATATTGTTGCCATGATTGCCCTGCCACCAAATCTCTTTGGGAA
GGCTGCTATGGCCAAGTCTATTTTTGTCTTGCAAAAGAAAGCTGCAAGATCGTTGATGCCGTTTGTTTATCCCTTACAAA
GTCTTCAAGAACCAGAAGCTATTCAGAAGTTCATGCTCAATTTCAAAAATTGGAAGCAAGAGAATGCAATTTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comYH | Streptococcus mutans UA140 |
60.443 |
99.685 |
0.603 |
| comYH | Streptococcus mutans UA159 |
60.127 |
99.685 |
0.599 |