Detailed information
Overview
| Field | Value |
|---|---|
| Name | comYH |
| Type | Machinery gene |
| Locus tag | EQH38_RS02180 |
| Genome accession | NZ_CP035242 |
| Coordinates | 394668..395621 (+) |
| Length | 317 a.a. |
| NCBI ID | WP_061367493.1 |
| Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain TVO_1901947 |
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology); DNA binding and uptake |
Genomic Context
Location: 389668..400621
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EQH38_RS02135 (EQH38_02250) | - | 389736..390101 (+) | 366 | WP_000286415.1 | DUF1033 family protein | - |
| EQH38_RS02140 (EQH38_02255) | comGA/cglA/cilD | 390177..391118 (+) | 942 | WP_000249548.1 | competence type IV pilus ATPase ComGA | Machinery gene |
| EQH38_RS02145 (EQH38_02260) | comGB/cglB | 391066..392082 (+) | 1017 | WP_074017570.1 | competence type IV pilus assembly protein ComGB | Machinery gene |
| EQH38_RS02150 (EQH38_02265) | comGC/cglC | 392084..392410 (+) | 327 | WP_000738629.1 | comG operon protein ComGC | Machinery gene |
| EQH38_RS02155 (EQH38_02270) | comGD/cglD | 392403..392807 (+) | 405 | WP_000588013.1 | competence type IV pilus minor pilin ComGD | Machinery gene |
| EQH38_RS02160 (EQH38_02275) | comGE | 392770..393071 (+) | 302 | Protein_423 | competence type IV pilus minor pilin ComGE | - |
| EQH38_RS02165 (EQH38_02280) | comGF/cglF | 393034..393495 (+) | 462 | WP_000250540.1 | competence type IV pilus minor pilin ComGF | Machinery gene |
| EQH38_RS02170 (EQH38_02285) | comGG/cglG | 393473..393883 (+) | 411 | WP_000265621.1 | competence type IV pilus minor pilin ComGG | Machinery gene |
| EQH38_RS02175 | - | 394020..394607 (+) | 588 | Protein_426 | class I SAM-dependent methyltransferase | - |
| EQH38_RS02180 (EQH38_02300) | comYH | 394668..395621 (+) | 954 | WP_061367493.1 | class I SAM-dependent methyltransferase | Machinery gene |
| EQH38_RS02185 (EQH38_02305) | - | 395672..396862 (+) | 1191 | WP_000167756.1 | acetate kinase | - |
| EQH38_RS10785 | - | 396863..396994 (+) | 132 | WP_000768904.1 | hypothetical protein | - |
| EQH38_RS02190 (EQH38_02310) | rnpA | 397011..397382 (+) | 372 | WP_000739244.1 | ribonuclease P protein component | - |
| EQH38_RS02195 (EQH38_02315) | - | 397357..398181 (+) | 825 | WP_201454401.1 | membrane protein insertase YidC | - |
| EQH38_RS02200 (EQH38_02320) | jag | 398200..399186 (+) | 987 | WP_044727692.1 | RNA-binding cell elongation regulator Jag/EloR | - |
| EQH38_RS02205 (EQH38_02325) | - | 399237..399860 (+) | 624 | WP_000932660.1 | hypothetical protein | - |
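The coordinates in this record are 1-based and inclusive, so the lengths listed above can be checked with simple arithmetic. The sketch below is illustrative only: the helper name `span_length` is hypothetical, the 5000 bp flank is inferred from the Location window rather than stated by the database, and the protein length assumes a single terminal stop codon.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate span."""
    return end - start + 1


# Coordinates of comYH (EQH38_RS02180) as listed in the record.
gene_start, gene_end = 394668, 395621

nt_len = span_length(gene_start, gene_end)

# Assumption: the coding sequence ends with one stop codon,
# which is not counted in the protein length.
aa_len = nt_len // 3 - 1

# Assumption: the genomic context window extends 5000 bp on each side
# of the gene (inferred from the Location field, not documented).
flank = 5000
window = (gene_start - flank, gene_end + flank)

print(nt_len, aa_len, window)
```

Under these assumptions the computed values agree with the 954 bp and 317 a.a. lengths and the 389668..400621 Location window reported in this entry.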
Sequence
Protein
Length: 317 a.a.; Molecular weight: 35724.99 Da; Isoelectric point: 4.2706
>NTDB_id=337116 EQH38_RS02180 WP_061367493.1 394668..395621(+) (comYH) [Streptococcus pneumoniae strain TVO_1901947]
MDFEKIEQAYIYLLENVQVIQSDLATNFYDALVEQNSIYLDGETELNQVKDNNQALKRLALRKEEWLKTYQFLLMKAGQT
EPLQANHQFTPDAIALLLVFIVEELFKEEEITILEMGSGMGILGAIFLTSLTKKVDYLGMEVDDLLIDLAASMADVIGLQ
AGFVQGDAVRPQMLKESDVVISDLPVGYYPDDAVASRHQVASSQEHTYAHHLLMEQGLKYLKSDGYAIFLAPSDLLTSPQ
SDLLKEWLKEEASLVAMISLPENLFANAKQSKTIFILQKKNEIAVEPFVYPLASLQDASVLMKFKENFQKWTQGTEI
Nucleotide
Length: 954 bp
>NTDB_id=337116 EQH38_RS02180 WP_061367493.1 394668..395621(+) (comYH) [Streptococcus pneumoniae strain TVO_1901947]
ATGGATTTTGAAAAAATTGAACAAGCTTATATCTATTTACTAGAGAATGTCCAAGTCATCCAAAGTGATTTGGCGACCAA
CTTTTATGACGCCTTGGTGGAGCAAAATAGCATCTATCTGGATGGTGAAACTGAGCTAAACCAGGTCAAAGACAACAATC
AGGCCCTTAAGCGTTTAGCACTACGCAAAGAAGAATGGCTCAAGACCTACCAGTTTCTCTTGATGAAGGCTGGGCAAACA
GAACCCTTGCAGGCCAATCACCAGTTTACGCCAGATGCTATTGCCTTACTTTTGGTATTTATTGTGGAAGAGTTGTTTAA
AGAGGAGGAAATTACTATCCTCGAAATGGGTTCTGGGATGGGAATTCTAGGCGCTATTTTCTTGACCTCGCTTACTAAAA
AGGTGGATTACTTGGGAATGGAAGTGGATGATTTGCTGATTGATCTGGCAGCTAGCATGGCAGATGTAATTGGTTTGCAG
GCTGGCTTTGTCCAAGGAGATGCCGTTCGCCCACAAATGCTCAAAGAAAGCGATGTGGTCATCAGTGACTTGCCTGTCGG
CTATTATCCTGATGATGCCGTTGCGTCGCGCCATCAAGTTGCTTCTAGCCAAGAACATACTTACGCCCATCACTTGCTCA
TGGAACAAGGGCTTAAGTACCTCAAGTCAGACGGATACGCTATTTTTCTAGCTCCGAGTGATTTGTTGACCAGTCCTCAA
AGTGATTTGTTAAAAGAATGGCTGAAAGAAGAGGCGAGTCTGGTTGCTATGATTAGTCTGCCTGAAAATCTCTTTGCTAA
TGCCAAACAATCTAAGACTATTTTTATCTTACAGAAGAAAAATGAAATAGCAGTAGAGCCTTTTGTTTATCCACTTGCTA
GCTTGCAAGATGCAAGTGTTTTAATGAAATTTAAAGAAAATTTTCAAAAATGGACTCAAGGTACTGAAATATAA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comYH | Streptococcus mutans UA140 | 54.313 | 98.738 | 0.536 |
| comYH | Streptococcus mutans UA159 | 53.994 | 98.738 | 0.533 |
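In both rows above, the Ha-value is numerically consistent with the product of the identity and coverage fractions. This is an observation from these two rows only, not a definition taken from the database, and the function name `ha_value` is hypothetical. A minimal sketch of that check:

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, each converted from percent to a fraction.

    Assumption: this reproduces the Ha-value column for the two rows in
    this record; the database's actual definition is not stated here.
    """
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (organism, identities %, coverage %, reported Ha-value)
rows = [
    ("Streptococcus mutans UA140", 54.313, 98.738, 0.536),
    ("Streptococcus mutans UA159", 53.994, 98.738, 0.533),
]

for organism, identity, coverage, reported in rows:
    computed = round(ha_value(identity, coverage), 3)
    print(organism, computed, reported)
```

If the relationship holds more generally, a hit with high identity over a short alignment would score lower than a hit with moderate identity over nearly the full length.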