Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | OS912_RS10795 | Genome accession | NZ_CP113266 |
| Coordinates | 2115369..2117609 (+) | Length | 746 a.a. |
| NCBI ID | WP_050227511.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain BC1 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 2110369..2122609
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| OS912_RS10760 (OS912_10765) | pyrH | 2110459..2111196 (+) | 738 | WP_000002997.1 | UMP kinase | - |
| OS912_RS10765 (OS912_10770) | frr | 2111205..2111762 (+) | 558 | WP_000024409.1 | ribosome recycling factor | - |
| OS912_RS10770 (OS912_10775) | cvfB | 2111822..2112676 (+) | 855 | WP_044791153.1 | RNA-binding virulence regulatory protein CvfB | - |
| OS912_RS10775 (OS912_10780) | - | 2112685..2112900 (+) | 216 | WP_001232085.1 | YozE family protein | - |
| OS912_RS10780 (OS912_10785) | - | 2112986..2113990 (+) | 1005 | WP_050096410.1 | PhoH family protein | - |
| OS912_RS10785 (OS912_10790) | - | 2114098..2114667 (+) | 570 | WP_050227512.1 | GNAT family N-acetyltransferase | - |
| OS912_RS10790 (OS912_10795) | comEA/celA/cilE | 2114735..2115385 (+) | 651 | WP_001866227.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| OS912_RS10795 (OS912_10800) | comEC/celB | 2115369..2117609 (+) | 2241 | WP_050227511.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| OS912_RS10800 (OS912_10805) | - | 2117788..2117976 (+) | 189 | WP_001809102.1 | hypothetical protein | - |
| OS912_RS10805 (OS912_10810) | - | 2118008..2118595 (+) | 588 | WP_000933542.1 | ATP-binding cassette domain-containing protein | - |
| OS912_RS10810 (OS912_10815) | - | 2118599..2119780 (+) | 1182 | WP_000655951.1 | membrane protein | - |
| OS912_RS10815 (OS912_10820) | infC | 2120087..2120617 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| OS912_RS10820 (OS912_10825) | rpmI | 2120650..2120850 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| OS912_RS10825 (OS912_10830) | rplT | 2120902..2121261 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| OS912_RS10830 (OS912_10835) | - | 2121319..2121699 (+) | 381 | WP_000157154.1 | VOC family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84687.38 Da Isoelectric Point: 9.5773
>NTDB_id=761452 OS912_RS10795 WP_050227511.1 2115369..2117609(+) (comEC/celB) [Streptococcus pneumoniae strain BC1]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSVSYFALLGFVFLLVCLFIQFPWKSAGKVLVICGVFGFWFLFQTWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSDGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQGY
LKTQGIYQTLNIKRIQSLQKVGSWDIGENLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSVSYFALLGFVFLLVCLFIQFPWKSAGKVLVICGVFGFWFLFQTWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSDGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQGY
LKTQGIYQTLNIKRIQSLQKVGSWDIGENLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=761452 OS912_RS10795 WP_050227511.1 2115369..2117609(+) (comEC/celB) [Streptococcus pneumoniae strain BC1]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTATGGCTTTACTACGCCATTTT
CTCAGTATCCTATTTTGCTTTGTTGGGTTTTGTTTTTCTGCTAGTCTGCCTCTTTATCCAATTTCCTTGGAAATCAGCAG
GTAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAACTTGGCAACAGAGTCAAGTGAGTCAAAAT
TTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTCAACGGTGACAGTCTGTCCTTTCGTGGTAA
GTCTGATGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TTCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGGCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCACTCCAAAAGGTTGGTAGTTGGGATATAGG
TGAAAACCTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACTTCCCAGACCCTATGCGCAATTACATGA
CAGGACTTTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTAACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGGGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTATGGCTTTACTACGCCATTTT
CTCAGTATCCTATTTTGCTTTGTTGGGTTTTGTTTTTCTGCTAGTCTGCCTCTTTATCCAATTTCCTTGGAAATCAGCAG
GTAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAACTTGGCAACAGAGTCAAGTGAGTCAAAAT
TTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTCAACGGTGACAGTCTGTCCTTTCGTGGTAA
GTCTGATGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TTCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGGCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCACTCCAAAAGGTTGGTAGTTGGGATATAGG
TGAAAACCTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACTTCCCAGACCCTATGCGCAATTACATGA
CAGGACTTTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTAACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGGGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae Rx1 |
97.185 |
100 |
0.972 |
| comEC/celB | Streptococcus pneumoniae D39 |
97.185 |
100 |
0.972 |
| comEC/celB | Streptococcus pneumoniae R6 |
97.185 |
100 |
0.972 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
96.783 |
100 |
0.968 |
| comEC/celB | Streptococcus mitis SK321 |
91.555 |
100 |
0.916 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
91.409 |
99.866 |
0.913 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.399 |
99.33 |
0.441 |