Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | EQH19_RS04530 | Genome accession | NZ_CP035261 |
| Coordinates | 900123..902363 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942425.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain TVO_1901924 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 895123..907363
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EQH19_RS04500 (EQH19_04775) | - | 895273..896241 (+) | 969 | WP_000658183.1 | PhoH family protein | - |
| EQH19_RS04505 (EQH19_04785) | - | 896434..896934 (+) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| EQH19_RS04510 (EQH19_04790) | - | 896937..897263 (+) | 327 | Protein_900 | TfoX/Sxy family protein | - |
| EQH19_RS04515 | ald | 897564..898675 (-) | 1112 | Protein_901 | alanine dehydrogenase | - |
| EQH19_RS04520 (EQH19_04810) | - | 898852..899421 (+) | 570 | WP_000443775.1 | GNAT family N-acetyltransferase | - |
| EQH19_RS04525 (EQH19_04815) | comEA/celA/cilE | 899489..900139 (+) | 651 | WP_000387332.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| EQH19_RS04530 (EQH19_04820) | comEC/celB | 900123..902363 (+) | 2241 | WP_000942425.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| EQH19_RS10840 (EQH19_04825) | - | 902512..902729 (+) | 218 | Protein_905 | hypothetical protein | - |
| EQH19_RS04535 (EQH19_04830) | - | 902762..903349 (+) | 588 | WP_000939880.1 | ATP-binding cassette domain-containing protein | - |
| EQH19_RS04540 (EQH19_04835) | - | 903353..904534 (+) | 1182 | WP_000655935.1 | membrane protein | - |
| EQH19_RS04545 (EQH19_04840) | infC | 904841..905371 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| EQH19_RS04550 (EQH19_04845) | rpmI | 905404..905604 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| EQH19_RS04555 (EQH19_04850) | rplT | 905656..906015 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| EQH19_RS04560 (EQH19_04855) | - | 906073..906453 (+) | 381 | WP_000157154.1 | VOC family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84595.12 Da Isoelectric Point: 9.5123
>NTDB_id=338378 EQH19_RS04530 WP_000942425.1 900123..902363(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901924]
MLQWIKNIPIPLIYLSFLLLWFYYAIFSASYLALLGFVFLLVCLFFQFPWKSAGKVLVICGVFGFWFLFQNWQQSQASQN
LADSVERVRILPDTVKVNGDSLSFRGKADGRIFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVFLLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASRPLVFGQPNEWLLILLLISLPLVYDLRKNIKGLTVLSLLITGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNIPIPLIYLSFLLLWFYYAIFSASYLALLGFVFLLVCLFFQFPWKSAGKVLVICGVFGFWFLFQNWQQSQASQN
LADSVERVRILPDTVKVNGDSLSFRGKADGRIFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVFLLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASRPLVFGQPNEWLLILLLISLPLVYDLRKNIKGLTVLSLLITGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=338378 EQH19_RS04530 WP_000942425.1 900123..902363(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901924]
ATGTTACAGTGGATTAAGAACATCCCCATTCCCCTAATTTACCTGAGTTTTTTGTTACTTTGGTTTTATTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCATGGAAATCAGCAG
GCAAAGTTTTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAAATTGGCAACAGAGCCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTCTGCCTGACACTGTTAAGGTCAATGGTGATAGTCTGTCCTTTCGCGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACAGTGTTTCTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
CTCACAGGTGGCAAGTAGGCCTCTAGTCTTTGGACAACCCAATGAATGGCTTTTAATCCTATTGTTAATTTCTTTGCCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGGGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAACATCCCCATTCCCCTAATTTACCTGAGTTTTTTGTTACTTTGGTTTTATTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCATGGAAATCAGCAG
GCAAAGTTTTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAAATTGGCAACAGAGCCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTCTGCCTGACACTGTTAAGGTCAATGGTGATAGTCTGTCCTTTCGCGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACAGTGTTTCTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
CTCACAGGTGGCAAGTAGGCCTCTAGTCTTTGGACAACCCAATGAATGGCTTTTAATCCTATTGTTAATTTCTTTGCCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGGGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae Rx1 |
97.453 |
100 |
0.975 |
| comEC/celB | Streptococcus pneumoniae D39 |
97.453 |
100 |
0.975 |
| comEC/celB | Streptococcus pneumoniae R6 |
97.453 |
100 |
0.975 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
97.319 |
100 |
0.973 |
| comEC/celB | Streptococcus mitis SK321 |
91.689 |
100 |
0.917 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
91.275 |
99.866 |
0.912 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
43.952 |
99.732 |
0.438 |