Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | A66_RS04340 | Genome accession | NZ_LN847353 |
| Coordinates | 834636..836876 (+) | Length | 746 a.a. |
| NCBI ID | WP_053039576.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain A66 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 829636..841876
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| A66_RS04300 (A66_00864) | - | 829788..830756 (+) | 969 | WP_000658183.1 | PhoH family protein | - |
| A66_RS04305 (A66_00866) | - | 830949..831449 (+) | 501 | WP_000566982.1 | GNAT family N-acetyltransferase | - |
| A66_RS11690 | - | 831452..831777 (+) | 326 | Protein_853 | TfoX/Sxy family protein | - |
| A66_RS12485 | ald | 832078..833189 (-) | 1112 | Protein_854 | alanine dehydrogenase | - |
| A66_RS12760 | - | 833366..833934 (+) | 569 | Protein_855 | GNAT family N-acetyltransferase | - |
| A66_RS04335 (A66_00873) | comEA/celA/cilE | 834002..834652 (+) | 651 | WP_000387342.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| A66_RS04340 (A66_00874) | comEC/celB | 834636..836876 (+) | 2241 | WP_053039576.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| A66_RS12985 | - | 837055..837243 (+) | 189 | Protein_858 | hypothetical protein | - |
| A66_RS04350 (A66_00875) | - | 837276..837863 (+) | 588 | WP_000945250.1 | ATP-binding cassette domain-containing protein | - |
| A66_RS04355 (A66_00876) | - | 837867..839051 (+) | 1185 | WP_000655967.1 | hypothetical protein | - |
| A66_RS04360 (A66_00877) | infC | 839365..839895 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| A66_RS04365 (A66_00878) | rpmI | 839928..840128 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| A66_RS04370 (A66_00879) | rplT | 840180..840539 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| A66_RS04375 (A66_00880) | - | 840597..840977 (+) | 381 | WP_000157154.1 | VOC family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84596.13 Da Isoelectric Point: 9.4491
>NTDB_id=1114566 A66_RS04340 WP_053039576.1 834636..836876(+) (comEC/celB) [Streptococcus pneumoniae strain A66]
MLQWIKNIPIPLIYLSFLLLWFYYAIFSASYLALLGFVFLLVCLFFQFPWKSAGKVLVICGVFGFWFLFQNWQQSQASQN
LADSVERVRILPDTVKVNGDSLSFRGKADGRIFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKKIQSLQNIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDIGQGESIFLRDMTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNIPIPLIYLSFLLLWFYYAIFSASYLALLGFVFLLVCLFFQFPWKSAGKVLVICGVFGFWFLFQNWQQSQASQN
LADSVERVRILPDTVKVNGDSLSFRGKADGRIFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKKIQSLQNIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDIGQGESIFLRDMTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=1114566 A66_RS04340 WP_053039576.1 834636..836876(+) (comEC/celB) [Streptococcus pneumoniae strain A66]
ATGTTACAGTGGATTAAGAACATCCCCATTCCCCTAATTTACCTGAGTTTTTTGTTACTTTGGTTTTATTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCAGCAG
GCAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAAATTGGCAACAGAGCCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTCTGCCTGACACTGTTAAGGTCAATGGTGATAGTCTGTCCTTTCGCGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAAAAATCCAGTCACTTCAAAATATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATATTGGGCAAGGAGAAAGTATTTTCCTACGGGATATGACTGGTAAAAC
CATTCTCATAGATGTCGGTGGCAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAACATCCCCATTCCCCTAATTTACCTGAGTTTTTTGTTACTTTGGTTTTATTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCAGCAG
GCAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAAATTGGCAACAGAGCCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTCTGCCTGACACTGTTAAGGTCAATGGTGATAGTCTGTCCTTTCGCGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAAAAATCCAGTCACTTCAAAATATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATATTGGGCAAGGAGAAAGTATTTTCCTACGGGATATGACTGGTAAAAC
CATTCTCATAGATGTCGGTGGCAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae Rx1 |
97.587 |
100 |
0.976 |
| comEC/celB | Streptococcus pneumoniae D39 |
97.587 |
100 |
0.976 |
| comEC/celB | Streptococcus pneumoniae R6 |
97.587 |
100 |
0.976 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
96.247 |
100 |
0.962 |
| comEC/celB | Streptococcus mitis SK321 |
91.689 |
100 |
0.917 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
91.409 |
99.866 |
0.913 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.22 |
99.732 |
0.441 |