Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | R4710_RS10330 | Genome accession | NZ_CP137101 |
| Coordinates | 1969294..1971534 (+) | Length | 746 a.a. |
| NCBI ID | WP_317804697.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain 16P2012-2 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1964294..1976534
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| R4710_RS10300 | - | 1964445..1965413 (+) | 969 | WP_000658183.1 | PhoH family protein | - |
| R4710_RS10305 | - | 1965606..1966106 (+) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| R4710_RS10310 | - | 1966109..1966435 (+) | 327 | Protein_1997 | TfoX/Sxy family protein | - |
| R4710_RS10315 | ald | 1966736..1967847 (-) | 1112 | Protein_1998 | alanine dehydrogenase | - |
| R4710_RS10320 | - | 1968024..1968593 (+) | 570 | WP_000443743.1 | GNAT family N-acetyltransferase | - |
| R4710_RS10325 | comEA/celA/cilE | 1968660..1969310 (+) | 651 | WP_317804695.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| R4710_RS10330 | comEC/celB | 1969294..1971534 (+) | 2241 | WP_317804697.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| R4710_RS10335 | - | 1971713..1971901 (+) | 189 | WP_001809102.1 | hypothetical protein | - |
| R4710_RS10340 | - | 1971933..1972520 (+) | 588 | WP_000933542.1 | ATP-binding cassette domain-containing protein | - |
| R4710_RS10345 | - | 1972524..1973705 (+) | 1182 | WP_050245203.1 | ABC transporter permease | - |
| R4710_RS10350 | infC | 1974012..1974542 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| R4710_RS10355 | rpmI | 1974575..1974775 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| R4710_RS10360 | rplT | 1974827..1975186 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| R4710_RS10365 | - | 1975244..1975624 (+) | 381 | WP_000157154.1 | VOC family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84691.27 Da Isoelectric Point: 9.7227
>NTDB_id=895218 R4710_RS10330 WP_317804697.1 1969294..1971534(+) (comEC/celB) [Streptococcus pneumoniae strain 16P2012-2]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSTSYLALLGFVFLLVCLFFQFPWKSASKVLVICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSESEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFFTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFIFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASRPLVFGQPNTWLLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKNSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSRRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPYQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSTSYLALLGFVFLLVCLFFQFPWKSASKVLVICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSESEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFFTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFIFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASRPLVFGQPNTWLLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKNSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSRRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPYQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=895218 R4710_RS10330 WP_317804697.1 1969294..1971534(+) (comEC/celB) [Streptococcus pneumoniae strain 16P2012-2]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTTTGGCTTTACTACGCCATTTT
TTCAACATCCTATCTTGCTTTGTTGGGATTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCTGCTA
GCAAAGTTCTAGTGATTTGTGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGTCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTTACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTATCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGTAGACCGCTTGTCTTTGGACAACCCAATACATGGCTTTTAATCCTATTGTTAATTTCTTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAAGCTTGATACCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAAACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCGAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCTATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTTTGGCTTTACTACGCCATTTT
TTCAACATCCTATCTTGCTTTGTTGGGATTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCTGCTA
GCAAAGTTCTAGTGATTTGTGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGTCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTTACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTATCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGTAGACCGCTTGTCTTTGGACAACCCAATACATGGCTTTTAATCCTATTGTTAATTTCTTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAAGCTTGATACCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAAACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCGAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCTATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae Rx1 |
98.123 |
100 |
0.981 |
| comEC/celB | Streptococcus pneumoniae D39 |
98.123 |
100 |
0.981 |
| comEC/celB | Streptococcus pneumoniae R6 |
98.123 |
100 |
0.981 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
97.319 |
100 |
0.973 |
| comEC/celB | Streptococcus mitis SK321 |
91.019 |
100 |
0.91 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
90.604 |
99.866 |
0.905 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
43.995 |
99.33 |
0.437 |