Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | R8566_RS04640 | Genome accession | NZ_AP026920 |
| Coordinates | 916310..918550 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942383.1 | Uniprot ID | A0A0T8TEA1 |
| Organism | Streptococcus pneumoniae strain PZ900700014 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
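The coordinates and lengths in the Overview table can be cross-checked with simple arithmetic: the inclusive range 916310..918550 spans 2241 bp, which matches 746 codons plus one stop codon. The following is a minimal Python sketch of that check; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def cds_length(n_residues: int) -> int:
    """Expected CDS length in bp: one codon per residue plus one stop codon."""
    return 3 * (n_residues + 1)


# Values taken from the Overview table for R8566_RS04640.
gene_bp = span_length(916310, 918550)
expected_bp = cds_length(746)
print(gene_bp, expected_bp, gene_bp == expected_bp)
```

Both values come out to 2241, so the protein length and the genomic coordinates agree.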
Genomic Context
Location: 911310..923550
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| R8566_RS04615 | - | 911417..911742 (+) | 326 | Protein_912 | TfoX/Sxy family protein | - |
| R8566_RS04620 (PC0014_09220) | - | 912135..913481 (+) | 1347 | WP_001824784.1 | IS1380-like element ISSpn5 family transposase | - |
| R8566_RS04625 | ald | 913751..914862 (-) | 1112 | Protein_914 | alanine dehydrogenase | - |
| R8566_RS04630 (PC0014_09260) | - | 915039..915608 (+) | 570 | WP_000443775.1 | GNAT family N-acetyltransferase | - |
| R8566_RS04635 (PC0014_09270) | comEA/celA/cilE | 915676..916326 (+) | 651 | WP_000387330.1 | ComEA family DNA-binding protein | Machinery gene |
| R8566_RS04640 (PC0014_09280) | comEC/celB | 916310..918550 (+) | 2241 | WP_000942383.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| R8566_RS04645 | - | 918699..918916 (+) | 218 | Protein_918 | hypothetical protein | - |
| R8566_RS04650 (PC0014_09290) | - | 918949..919536 (+) | 588 | WP_000939880.1 | ATP-binding cassette domain-containing protein | - |
| R8566_RS04655 (PC0014_09300) | - | 919540..920715 (+) | 1176 | WP_000655944.1 | hypothetical protein | - |
| R8566_RS04660 (PC0014_09310) | infC | 921036..921566 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| R8566_RS04665 (PC0014_09320) | rpmI | 921599..921799 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| R8566_RS04670 (PC0014_09330) | rplT | 921851..922210 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| R8566_RS04675 (PC0014_09340) | - | 922268..922648 (+) | 381 | WP_000157154.1 | VOC family protein | - |
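The Location range shown for this table equals the comEC/celB coordinates extended by 5,000 bp on each side, and the comEA and comEC entries overlap by a few bases. The sketch below reproduces both numbers. The 5,000 bp flank is inferred from the values shown here rather than from a documented parameter, and the function names are hypothetical.

```python
def flank_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend an inclusive range by `flank` bp on each side (assumed window size)."""
    return (max(1, start - flank), end + flank)


def overlap_bp(a_start: int, a_end: int, b_start: int, b_end: int) -> int:
    """Number of bases shared by two inclusive ranges (0 if they do not overlap)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


# comEC/celB (R8566_RS04640) with its flanking window.
print(flank_window(916310, 918550))

# Overlap between comEA (R8566_RS04635) and comEC (R8566_RS04640).
print(overlap_bp(915676, 916326, 916310, 918550))
```

With the coordinates from the table, the window is 911310..923550 and the comEA/comEC overlap is 17 bp.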
Sequence
Protein
Length: 746 a.a.; Molecular weight: 84620.31 Da; Isoelectric point: 9.8066
>NTDB_id=97947 R8566_RS04640 WP_000942383.1 916310..918550(+) (comEC/celB) [Streptococcus pneumoniae strain PZ900700014]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFFQFPWKSASKVLVICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKRIQSLQKVGSWDIGEKLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTIKEGKGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNEWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEVTKAFHVGEILVSKGSLKQKQFVVELQATKTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Length: 2241 bp
>NTDB_id=97947 R8566_RS04640 WP_000942383.1 916310..918550(+) (comEC/celB) [Streptococcus pneumoniae strain PZ900700014]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTTTGGCTTTACTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGATTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCTGCTA
GCAAAGTTCTAGTGATTTGTGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTCAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCACTCCAAAAGGTTGGCAGTTGGGATATAGG
TGAAAAACTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACTTCCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTAGGACATCTGGACACCGATTTTGAGGAGATGAATGAGCTTTATTCCAGTTTAGGAATTATTCACCTT
TTTGCCTTGTCAGGTATGCAGGTAGGGTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCATTAAAGAAGG
GAAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCCTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
CTCACAGGTGACAAGTAGGCCTCTAGTCTTTGGACAACCCAATGAATGGCTTTTAATCCTATTGTTAATTTCTTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCTTGACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTTTAGTATCAAAAGGCAGTTTGAAGCA
GAAGCAATTTGTGGTAGAACTACAGGCGACTAAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGCCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae TIGR4 | 97.051 | 100 | 0.971 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 96.247 | 100 | 0.962 |
| comEC/celB | Streptococcus pneumoniae D39 | 96.247 | 100 | 0.962 |
| comEC/celB | Streptococcus pneumoniae R6 | 96.247 | 100 | 0.962 |
| comEC/celB | Streptococcus mitis SK321 | 91.689 | 100 | 0.917 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 91.544 | 99.866 | 0.914 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.669 | 99.33 | 0.444 |
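The Ha-value column is not defined on this page, but every row listed is consistent with the product of identity and coverage expressed as fractions and rounded to three decimals. The sketch below reproduces the column under that assumption; treat the formula as an inference from the listed values, not as the database's documented definition. The function name is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed formula: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round(identity_pct * coverage_pct / 10_000, 3)


# (identity %, coverage %, listed Ha-value) for each row of the table.
rows = [
    (97.051, 100, 0.971),
    (96.247, 100, 0.962),
    (91.689, 100, 0.917),
    (91.544, 99.866, 0.914),
    (44.669, 99.33, 0.444),
]

for identity, coverage, listed in rows:
    print(identity, coverage, ha_value(identity, coverage), listed)
```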