Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | SPT_RS06190 | Genome accession | NC_012469 |
| Coordinates | 1188568..1190808 (-) | Length | 746 a.a. |
| NCBI ID | WP_000942405.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae Taiwan19F-14 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1183568..1195808
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SPT_RS06155 (SPT_1241) | - | 1184471..1184851 (-) | 381 | WP_000157154.1 | VOC family protein | - |
| SPT_RS06160 (SPT_1242) | rplT | 1184909..1185268 (-) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| SPT_RS06165 (SPT_1243) | rpmI | 1185320..1185520 (-) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| SPT_RS06170 (SPT_1244) | infC | 1185553..1186083 (-) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| SPT_RS06175 (SPT_1245) | - | 1186394..1187578 (-) | 1185 | WP_000655961.1 | membrane protein | - |
| SPT_RS06180 (SPT_1246) | - | 1187582..1188169 (-) | 588 | WP_000939881.1 | ATP-binding cassette domain-containing protein | - |
| SPT_RS13365 | - | 1188202..1188419 (-) | 218 | Protein_1192 | hypothetical protein | - |
| SPT_RS06190 (SPT_1247) | comEC/celB | 1188568..1190808 (-) | 2241 | WP_000942405.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| SPT_RS06195 (SPT_1248) | comEA/celA/cilE | 1190792..1191442 (-) | 651 | WP_000387336.1 | ComEA family DNA-binding protein | Machinery gene |
| SPT_RS06200 (SPT_1249) | - | 1191510..1192079 (-) | 570 | WP_000443774.1 | GNAT family N-acetyltransferase | - |
| SPT_RS13020 (SPT_1252) | ald | 1192256..1193367 (+) | 1112 | Protein_1196 | alanine dehydrogenase | - |
| SPT_RS06220 (SPT_1253) | - | 1193668..1193994 (-) | 327 | Protein_1197 | TfoX/Sxy family protein | - |
| SPT_RS06225 (SPT_1254) | - | 1193997..1194497 (-) | 501 | WP_000566986.1 | GNAT family N-acetyltransferase | - |
| SPT_RS06230 (SPT_1255) | - | 1194690..1195658 (-) | 969 | WP_000658198.1 | PhoH family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84537.15 Da Isoelectric Point: 9.6382
>NTDB_id=33382 SPT_RS06190 WP_000942405.1 1188568..1190808(-) (comEC/celB) [Streptococcus pneumoniae Taiwan19F-14]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKTEKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKQFMVELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKTEKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKQFMVELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=33382 SPT_RS06190 WP_000942405.1 1188568..1190808(-) (comEC/celB) [Streptococcus pneumoniae Taiwan19F-14]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGTATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAACCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CACAGAGAAGCTTGATACCCTATCTTAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTGATTTTGACAAATACGGACAAG
GAACATGTCGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGGCAGTTTGAAGCA
GAAGCAATTTATGGTAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGTATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAACCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CACAGAGAAGCTTGATACCCTATCTTAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTGATTTTGACAAATACGGACAAG
GAACATGTCGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGGCAGTTTGAAGCA
GAAGCAATTTATGGTAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae TIGR4 |
97.587 |
100 |
0.976 |
| comEC/celB | Streptococcus pneumoniae D39 |
97.587 |
100 |
0.976 |
| comEC/celB | Streptococcus pneumoniae R6 |
97.587 |
100 |
0.976 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
97.587 |
100 |
0.976 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
91.812 |
99.866 |
0.917 |
| comEC/celB | Streptococcus mitis SK321 |
91.555 |
100 |
0.916 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.265 |
99.33 |
0.44 |