Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | DV129_RS03235 | Genome accession | NZ_CP031247 |
| Coordinates | 604130..606370 (-) | Length | 746 a.a. |
| NCBI ID | WP_114865969.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain M23734 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 599130..611370
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DV129_RS03200 | - | 600030..600410 (-) | 381 | WP_000157154.1 | VOC family protein | - |
| DV129_RS03205 | rplT | 600468..600827 (-) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| DV129_RS03210 | rpmI | 600879..601079 (-) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| DV129_RS03215 | infC | 601112..601642 (-) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| DV129_RS03220 | - | 601955..603139 (-) | 1185 | WP_000655953.1 | hypothetical protein | - |
| DV129_RS03225 | - | 603143..603730 (-) | 588 | WP_000939895.1 | ABC transporter ATP-binding protein | - |
| DV129_RS03230 | - | 603763..603951 (-) | 189 | WP_001812421.1 | hypothetical protein | - |
| DV129_RS03235 | comEC/celB | 604130..606370 (-) | 2241 | WP_114865969.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| DV129_RS03240 | comEA/celA/cilE | 606354..607004 (-) | 651 | WP_000387352.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| DV129_RS03245 | - | 607072..607641 (-) | 570 | WP_000443745.1 | GNAT family N-acetyltransferase | - |
| DV129_RS03250 | ald | 607818..608929 (+) | 1112 | Protein_620 | alanine dehydrogenase | - |
| DV129_RS03255 | - | 609230..609556 (-) | 327 | Protein_621 | TfoX/Sxy family protein | - |
| DV129_RS03260 | - | 609559..610059 (-) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| DV129_RS03265 | - | 610239..611207 (-) | 969 | WP_000658180.1 | PhoH family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84697.16 Da Isoelectric Point: 9.3519
>NTDB_id=304753 DV129_RS03235 WP_114865969.1 604130..606370(-) (comEC/celB) [Streptococcus pneumoniae strain M23734]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSVSYFALLGFVFLLVCLFIQFPWKSADKVLVICGVFGFWFLFQTWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSDGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQGY
LKTQGIYQTLNIKRIQSFQKVGSWDIGENLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVARRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
ENVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQQGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGVDSWKIESVR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSVSYFALLGFVFLLVCLFIQFPWKSADKVLVICGVFGFWFLFQTWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSDGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQGY
LKTQGIYQTLNIKRIQSFQKVGSWDIGENLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVARRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
ENVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQQGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGVDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=304753 DV129_RS03235 WP_114865969.1 604130..606370(-) (comEC/celB) [Streptococcus pneumoniae strain M23734]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTATGGCTTTACTACGCCATTTT
CTCAGTATCCTATTTTGCTTTGTTGGGTTTTGTTTTTCTGCTAGTCTGCCTCTTTATCCAATTTCCTTGGAAATCAGCAG
ATAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAACTTGGCAACAGAGTCAAGTGAGTCAAAAT
TTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTTAACGGTGACAGTCTGTCCTTTCGTGGTAA
GTCTGATGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TTCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGGCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCATTCCAAAAGGTTGGCAGTTGGGATATAGG
TGAAAACCTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACTTCCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTAGGACATCTGGACACCGATTTTGAGGAGATGAATGAGCTTTATTCCAGTTTAGGAATTATTCACCTT
TTTGCCTTGTCAGGTATGCAGGTAGGGTTTTTCATGGACGGATTTAAGAAACTACTTTTACGATTGGGCTTGACACAAGA
AAAGTTGAAGTGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGTGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGGAGACCGCTTGTCTTTGGTCAACCCAACGCATGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGAATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACAAGGAAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGGTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTATGGCTTTACTACGCCATTTT
CTCAGTATCCTATTTTGCTTTGTTGGGTTTTGTTTTTCTGCTAGTCTGCCTCTTTATCCAATTTCCTTGGAAATCAGCAG
ATAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAACTTGGCAACAGAGTCAAGTGAGTCAAAAT
TTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTTAACGGTGACAGTCTGTCCTTTCGTGGTAA
GTCTGATGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TTCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGGCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCATTCCAAAAGGTTGGCAGTTGGGATATAGG
TGAAAACCTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACTTCCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTAGGACATCTGGACACCGATTTTGAGGAGATGAATGAGCTTTATTCCAGTTTAGGAATTATTCACCTT
TTTGCCTTGTCAGGTATGCAGGTAGGGTTTTTCATGGACGGATTTAAGAAACTACTTTTACGATTGGGCTTGACACAAGA
AAAGTTGAAGTGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGTGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGGAGACCGCTTGTCTTTGGTCAACCCAACGCATGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGAATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACAAGGAAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGGTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae TIGR4 |
96.783 |
100 |
0.968 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
95.04 |
100 |
0.95 |
| comEC/celB | Streptococcus pneumoniae D39 |
95.04 |
100 |
0.95 |
| comEC/celB | Streptococcus pneumoniae R6 |
95.04 |
100 |
0.95 |
| comEC/celB | Streptococcus mitis SK321 |
91.957 |
100 |
0.92 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
91.678 |
99.866 |
0.916 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.534 |
99.33 |
0.442 |