Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | EQH29_RS04380 | Genome accession | NZ_CP035251 |
| Coordinates | 873801..876041 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942387.1 | Uniprot ID | A0AAX3HGB1 |
| Organism | Streptococcus pneumoniae strain TVO_1901935 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 868801..881041
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EQH29_RS04350 (EQH29_04635) | - | 868951..869919 (+) | 969 | WP_000658183.1 | PhoH family protein | - |
| EQH29_RS04355 (EQH29_04645) | - | 870112..870612 (+) | 501 | WP_000566983.1 | GNAT family N-acetyltransferase | - |
| EQH29_RS04360 (EQH29_04650) | - | 870615..870941 (+) | 327 | Protein_872 | TfoX/Sxy family protein | - |
| EQH29_RS04365 | ald | 871242..872353 (-) | 1112 | Protein_873 | alanine dehydrogenase | - |
| EQH29_RS04370 (EQH29_04670) | - | 872530..873099 (+) | 570 | WP_000443744.1 | GNAT family N-acetyltransferase | - |
| EQH29_RS04375 (EQH29_04675) | comEA/celA/cilE | 873167..873817 (+) | 651 | WP_000387331.1 | ComEA family DNA-binding protein | Machinery gene |
| EQH29_RS04380 (EQH29_04680) | comEC/celB | 873801..876041 (+) | 2241 | WP_000942387.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| EQH29_RS04385 (EQH29_04685) | - | 876183..876407 (+) | 225 | WP_001813351.1 | hypothetical protein | - |
| EQH29_RS04390 (EQH29_04690) | - | 876439..877026 (+) | 588 | WP_000945253.1 | ATP-binding cassette domain-containing protein | - |
| EQH29_RS04395 (EQH29_04695) | - | 877030..878211 (+) | 1182 | WP_000655954.1 | hypothetical protein | - |
| EQH29_RS04400 (EQH29_04700) | infC | 878518..879048 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| EQH29_RS04405 (EQH29_04705) | rpmI | 879081..879281 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| EQH29_RS04410 (EQH29_04710) | rplT | 879333..879692 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| EQH29_RS04415 (EQH29_04715) | - | 879750..880130 (+) | 381 | WP_000157154.1 | VOC family protein | - |
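The coordinates in the table can be cross-checked against the reported sizes. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the comEC/celB coding sequence ends in a single stop codon; the helper names (`span_length`, `protein_length`, `overlap_bp`) are hypothetical and not part of any database tooling.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def overlap_bp(a: tuple, b: tuple) -> int:
    """Number of bases shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Coordinates copied from the Genomic Context table.
comEA = (873167, 873817)
comEC = (873801, 876041)

print(span_length(*comEC))                  # size of comEC/celB in bp
print(protein_length(span_length(*comEC)))  # residues, excluding the stop codon
print(overlap_bp(comEA, comEC))             # bases shared by comEA and comEC
```

Under these assumptions the span of comEC/celB comes out at 2241 bp and 746 residues, matching the Overview, and the comEA and comEC ranges share 17 bp.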
Sequence
Protein
Length: 746 a.a.; Molecular weight: 84545.20 Da; Isoelectric point: 9.7371
>NTDB_id=337745 EQH29_RS04380 WP_000942387.1 873801..876041(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901935]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKKIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPLSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKQFVVELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGWNSWKIESVR
Nucleotide
Length: 2241 bp
>NTDB_id=337745 EQH29_RS04380 WP_000942387.1 873801..876041(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901935]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAAAAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGTATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTTGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAAATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAACATGTCGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTTTAGTATCAAAAGGCAGTTTGAAGCA
GAAGCAATTTGTGGTAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCCCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGGTTTAAAGGTTGGAATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae Rx1 | 97.319 | 100 | 0.973 |
| comEC/celB | Streptococcus pneumoniae D39 | 97.319 | 100 | 0.973 |
| comEC/celB | Streptococcus pneumoniae R6 | 97.319 | 100 | 0.973 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 97.319 | 100 | 0.973 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 92.483 | 99.866 | 0.924 |
| comEC/celB | Streptococcus mitis SK321 | 92.225 | 100 | 0.922 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.804 | 99.33 | 0.445 |
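The Ha-values listed above are numerically consistent with the product of fractional identity and fractional coverage, rounded to three decimals. This is an observation about the listed numbers, not a documented definition, so the sketch below should be read as an assumption; the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relation: (identity / 100) * (coverage / 100), 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (organism, identities %, coverage %, listed Ha-value) copied from the table.
rows = [
    ("Streptococcus pneumoniae Rx1", 97.319, 100.0, 0.973),
    ("Streptococcus mitis NCTC 12261", 92.483, 99.866, 0.924),
    ("Streptococcus mitis SK321", 92.225, 100.0, 0.922),
    ("Lactococcus lactis subsp. cremoris KW2", 44.804, 99.33, 0.445),
]

for organism, identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    print(f"{organism}: computed {computed}, listed {listed}")
```

If the assumed relation holds, a hit with high identity but partial coverage would score lower than its identity alone suggests, which is why the Lactococcus lactis entry sits slightly below 0.448.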