Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | R4704_RS10240 | Genome accession | NZ_CP137109 |
| Coordinates | 1971068..1973308 (+) | Length | 746 a.a. |
| NCBI ID | WP_033705318.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain 11012 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1966068..1978308
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| R4704_RS10210 | - | 1966219..1967187 (+) | 969 | WP_000658183.1 | PhoH family protein | - |
| R4704_RS10215 | - | 1967380..1967880 (+) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| R4704_RS10220 | - | 1967883..1968208 (+) | 326 | Protein_1975 | TfoX/Sxy family protein | - |
| R4704_RS10225 | ald | 1968509..1969620 (-) | 1112 | Protein_1976 | alanine dehydrogenase | - |
| R4704_RS10230 | - | 1969797..1970366 (+) | 570 | WP_001864145.1 | GNAT family N-acetyltransferase | - |
| R4704_RS10235 | comEA/celA/cilE | 1970434..1971084 (+) | 651 | WP_001864147.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| R4704_RS10240 | comEC/celB | 1971068..1973308 (+) | 2241 | WP_033705318.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| R4704_RS10245 | - | 1973450..1973674 (+) | 225 | WP_015898758.1 | hypothetical protein | - |
| R4704_RS10250 | - | 1973706..1974293 (+) | 588 | WP_000933540.1 | ATP-binding cassette domain-containing protein | - |
| R4704_RS10255 | - | 1974297..1975475 (+) | 1179 | WP_033705319.1 | membrane protein | - |
| R4704_RS10260 | infC | 1975786..1976316 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| R4704_RS10265 | rpmI | 1976349..1976549 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| R4704_RS10270 | rplT | 1976601..1976960 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| R4704_RS10275 | - | 1977018..1977398 (+) | 381 | WP_033705321.1 | VOC family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84448.87 Da Isoelectric Point: 9.3992
>NTDB_id=895832 R4704_RS10240 WP_033705318.1 1971068..1973308(+) (comEC/celB) [Streptococcus pneumoniae strain 11012]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASSPLVFGQPNEWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKTTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKQFVAELQVTQTKVRSMTVGEYLPIFGSQLEVLSPREIGYGGHEDSLVLYGKL
LNNHFLFIGNLEEKGEKDLLKYYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASSPLVFGQPNEWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKTTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKQFVAELQVTQTKVRSMTVGEYLPIFGSQLEVLSPREIGYGGHEDSLVLYGKL
LNNHFLFIGNLEEKGEKDLLKYYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=895832 R4704_RS10240 WP_033705318.1 1971068..1973308(+) (comEC/celB) [Streptococcus pneumoniae strain 11012]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCGAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGTATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGCTTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGGCAAGTAGTCCTCTAGTCTTTGGACAACCCAATGAATGGCTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGACGACGACCAGCAATG
CACAGAGAAGCTTGATACCCTATCTTAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTGATTTTGACAAATACGGACAAG
GAACATGTCGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTTTAGTATCAAAAGGCAGTTTGAAGCA
GAAGCAATTTGTGGCAGAACTACAGGTGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGTACTTGCCCATTTTCG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGGAAATTGGATATGGAGGTCATGAAGATTCCCTGGTTCTTTATGGGAAACTC
TTGAATAATCACTTTCTCTTCATAGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGTACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGCTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCGAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGTATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGCTTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGGCAAGTAGTCCTCTAGTCTTTGGACAACCCAATGAATGGCTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGACGACGACCAGCAATG
CACAGAGAAGCTTGATACCCTATCTTAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTGATTTTGACAAATACGGACAAG
GAACATGTCGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTTTAGTATCAAAAGGCAGTTTGAAGCA
GAAGCAATTTGTGGCAGAACTACAGGTGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGTACTTGCCCATTTTCG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGGAAATTGGATATGGAGGTCATGAAGATTCCCTGGTTCTTTATGGGAAACTC
TTGAATAATCACTTTCTCTTCATAGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGTACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGCTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae TIGR4 |
96.649 |
100 |
0.966 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
95.174 |
100 |
0.952 |
| comEC/celB | Streptococcus pneumoniae D39 |
95.174 |
100 |
0.952 |
| comEC/celB | Streptococcus pneumoniae R6 |
95.174 |
100 |
0.952 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
92.081 |
99.866 |
0.92 |
| comEC/celB | Streptococcus mitis SK321 |
91.957 |
100 |
0.92 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.399 |
99.33 |
0.441 |