Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | AAYT23_RS04225 | Genome accession | NZ_CP154877 |
| Coordinates | 829435..831675 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942413.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain 20234295 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 824435..836675
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| AAYT23_RS04195 (AAYT23_04180) | - | 824586..825554 (+) | 969 | WP_000658183.1 | PhoH family protein | - |
| AAYT23_RS04200 (AAYT23_04185) | - | 825747..826247 (+) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| AAYT23_RS04205 (AAYT23_04190) | - | 826250..826576 (+) | 327 | Protein_828 | TfoX/Sxy family protein | - |
| AAYT23_RS04210 (AAYT23_04195) | ald | 826877..827988 (-) | 1112 | Protein_829 | alanine dehydrogenase | - |
| AAYT23_RS04215 (AAYT23_04200) | - | 828165..828734 (+) | 570 | WP_000443743.1 | GNAT family N-acetyltransferase | - |
| AAYT23_RS04220 (AAYT23_04205) | comEA/celA/cilE | 828801..829451 (+) | 651 | WP_000387324.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| AAYT23_RS04225 (AAYT23_04210) | comEC/celB | 829435..831675 (+) | 2241 | WP_000942413.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| AAYT23_RS04230 (AAYT23_04215) | - | 831854..832042 (+) | 189 | WP_001809102.1 | hypothetical protein | - |
| AAYT23_RS04235 (AAYT23_04220) | - | 832074..832661 (+) | 588 | WP_000933542.1 | ATP-binding cassette domain-containing protein | - |
| AAYT23_RS04240 (AAYT23_04225) | - | 832665..833846 (+) | 1182 | WP_000655952.1 | membrane protein | - |
| AAYT23_RS04245 (AAYT23_04230) | infC | 834153..834683 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| AAYT23_RS04250 (AAYT23_04235) | rpmI | 834716..834916 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| AAYT23_RS04255 (AAYT23_04240) | rplT | 834968..835327 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| AAYT23_RS04260 (AAYT23_04245) | - | 835385..835765 (+) | 381 | WP_000157138.1 | VOC family protein | - |
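The sizes in the table follow from the inclusive, 1-based coordinates. The sketch below is a minimal illustration of that arithmetic, not part of the database record. The helper names `span_length` and `overlap` are hypothetical. It also checks that a 2241 bp coding sequence (stop codon included) corresponds to 746 amino acids, and that the listed coordinates of comEA/celA/cilE and comEC/celB overlap.

```python
import re

def parse_span(coords: str) -> tuple[int, int]:
    """Extract (start, end) from a string such as '829435..831675 (+)'."""
    match = re.match(r"(\d+)\.\.(\d+)", coords)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {coords!r}")
    return int(match.group(1)), int(match.group(2))

def span_length(coords: str) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    start, end = parse_span(coords)
    return end - start + 1

def overlap(a: str, b: str) -> int:
    """Number of positions shared by two ranges (0 if they do not overlap)."""
    a_start, a_end = parse_span(a)
    b_start, b_end = parse_span(b)
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)

com_ea = "828801..829451 (+)"  # comEA/celA/cilE, from the table above
com_ec = "829435..831675 (+)"  # comEC/celB, from the table above

cds_bp = span_length(com_ec)
protein_aa = cds_bp // 3 - 1   # subtract the stop codon

print(span_length(com_ea), cds_bp, protein_aa, overlap(com_ea, com_ec))
```

With the coordinates listed above, the two competence genes share 17 bp, which is why the comEC/celB start (829435) precedes the comEA/celA/cilE end (829451).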
Sequence
Protein
Length: 746 a.a. Molecular weight: 84642.24 Da Isoelectric Point: 9.6735
>NTDB_id=994444 AAYT23_RS04225 WP_000942413.1 829435..831675(+) (comEC/celB) [Streptococcus pneumoniae strain 20234295]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSTSYLALLGFVFLLVCLFFQFPWKSASKVLVICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFFTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFIFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASRPLVFGQPNTWLLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKNSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPYQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Length: 2241 bp
>NTDB_id=994444 AAYT23_RS04225 WP_000942413.1 829435..831675(+) (comEC/celB) [Streptococcus pneumoniae strain 20234295]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTTTGGCTTTACTACGCCATTTT
TTCAACATCCTATCTTGCTTTGTTGGGATTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCTGCTA
GCAAAGTTCTAGTGATTTGTGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTTACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTATCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGTAGACCGCTTGTCTTTGGACAACCCAATACATGGCTTTTAATCCTATTGTTAATTTCTTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAAGCTTGATACCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAAACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCTATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae Rx1 | 98.391 | 100 | 0.984 |
| comEC/celB | Streptococcus pneumoniae D39 | 98.391 | 100 | 0.984 |
| comEC/celB | Streptococcus pneumoniae R6 | 98.391 | 100 | 0.984 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 97.587 | 100 | 0.976 |
| comEC/celB | Streptococcus mitis SK321 | 91.287 | 100 | 0.913 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 90.872 | 99.866 | 0.908 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.13 | 99.33 | 0.438 |
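The record does not define the Ha-value. As an assumption only, the listed values are consistent with identity multiplied by coverage, both taken as fractions and rounded to three decimals. The sketch below checks that assumed relationship against the rows above. The function name `ha_value` is hypothetical and is not the database's own implementation.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: (identity % x coverage %) / 10000, rounded to 3 decimals."""
    return round(identity_pct * coverage_pct / 10_000, 3)

# (organism, identities %, coverage %, listed Ha-value) copied from the table above
rows = [
    ("Streptococcus pneumoniae Rx1", 98.391, 100.0, 0.984),
    ("Streptococcus pneumoniae TIGR4", 97.587, 100.0, 0.976),
    ("Streptococcus mitis SK321", 91.287, 100.0, 0.913),
    ("Streptococcus mitis NCTC 12261", 90.872, 99.866, 0.908),
    ("Lactococcus lactis subsp. cremoris KW2", 44.13, 99.33, 0.438),
]

for organism, identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    status = "matches" if abs(computed - listed) < 1e-9 else "differs"
    print(f"{organism}: computed {computed:.3f}, listed {listed:.3f} ({status})")
```

Under this assumption, a hit with high identity but partial coverage scores lower than the identity figure alone would suggest, as with the Streptococcus mitis NCTC 12261 row.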