Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | Q7627_RS04565 | Genome accession | NZ_CP131706 |
| Coordinates | 887148..889388 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942408.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain 2018N21-288 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 882148..894388
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| Q7627_RS04540 (Q7627_04535) | - | 882540..882866 (+) | 327 | Protein_895 | TfoX/Sxy family protein | - |
| Q7627_RS04545 (Q7627_04540) | ald | 883167..884254 (-) | 1088 | Protein_896 | alanine dehydrogenase | - |
| Q7627_RS04550 (Q7627_04545) | - | 884288..885544 (-) | 1257 | WP_000530081.1 | ISL3 family transposase | - |
| Q7627_RS04555 (Q7627_04550) | - | 885877..886446 (+) | 570 | WP_000443772.1 | GNAT family N-acetyltransferase | - |
| Q7627_RS04560 (Q7627_04555) | comEA/celA/cilE | 886514..887164 (+) | 651 | WP_000387329.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| Q7627_RS04565 (Q7627_04560) | comEC/celB | 887148..889388 (+) | 2241 | WP_000942408.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| Q7627_RS04570 (Q7627_04565) | - | 889567..889755 (+) | 189 | WP_001810030.1 | hypothetical protein | - |
| Q7627_RS04575 (Q7627_04570) | - | 889788..890375 (+) | 588 | WP_000939886.1 | ABC transporter ATP-binding protein | - |
| Q7627_RS04580 (Q7627_04575) | - | 890379..891563 (+) | 1185 | WP_000655949.1 | ABC transporter permease | - |
| Q7627_RS04585 (Q7627_04580) | infC | 891871..892401 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| Q7627_RS04590 (Q7627_04585) | rpmI | 892434..892634 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| Q7627_RS04595 (Q7627_04590) | rplT | 892686..893045 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| Q7627_RS04600 (Q7627_04595) | - | 893103..893483 (+) | 381 | WP_000157154.1 | VOC family protein | - |
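The Size (bp) column can be reproduced from the coordinates. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (which agrees with every size listed in the table). The helper names `parse_span`, `span_length`, and `overlap` are hypothetical and only for illustration. It also shows that the comEA/celA/cilE and comEC/celB reading frames share a few bases, and that the 2241 bp comEC/celB gene corresponds to 746 codons plus one stop codon.

```python
def parse_span(span):
    """Split a 'start..end' coordinate string into two integers."""
    start, end = span.split("..")
    return int(start), int(end)


def span_length(span):
    """Length in bp of a 1-based, inclusive interval (assumed convention)."""
    start, end = parse_span(span)
    return end - start + 1


def overlap(span_a, span_b):
    """Number of positions shared by two intervals (0 if they are disjoint)."""
    a_start, a_end = parse_span(span_a)
    b_start, b_end = parse_span(span_b)
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


# Coordinates copied from the Genomic Context table.
comEA = "886514..887164"
comEC = "887148..889388"

comEC_bp = span_length(comEC)      # gene length in bp
comEC_aa = comEC_bp // 3 - 1       # codons minus the stop codon
shared_bp = overlap(comEA, comEC)  # bases common to both genes

print(comEC_bp, comEC_aa, shared_bp)
```

The same `span_length` call can be applied to any row of the table to cross-check its Size (bp) entry.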
Sequence
Protein
Length: 746 a.a.; Molecular weight: 84561.22 Da; Isoelectric point: 9.7561
>NTDB_id=863801 Q7627_RS04565 WP_000942408.1 887148..889388(+) (comEC/celB) [Streptococcus pneumoniae strain 2018N21-288]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKKIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGYLNTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVARRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKQFVVELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Length: 2241 bp
>NTDB_id=863801 Q7627_RS04565 WP_000942408.1 887148..889388(+) (comEC/celB) [Streptococcus pneumoniae strain 2018N21-288]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGTGAGTCAAAAT
TTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTTAACGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAAAAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGATATCTGAACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGGAGACCGCTTGTCTTTGGTCAACCCAACGCATGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CACAGAGAAGCTTGATACCCTATCTTAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTGATTTTGACAAATACGGACAAG
GAACATGTCGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTTTAGTATCAAAAGGCAGTTTGAAGCA
GAAGCAATTTGTGGTAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCCCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGCTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae TIGR4 | 97.855 | 100 | 0.979 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 96.783 | 100 | 0.968 |
| comEC/celB | Streptococcus pneumoniae D39 | 96.783 | 100 | 0.968 |
| comEC/celB | Streptococcus pneumoniae R6 | 96.783 | 100 | 0.968 |
| comEC/celB | Streptococcus mitis SK321 | 92.225 | 100 | 0.922 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 91.946 | 99.866 | 0.918 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.534 | 99.33 | 0.442 |
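
This page does not define the Ha-value. In the rows listed above, it is numerically consistent with identity (%) × coverage (%) / 10,000, rounded to three decimals. The Python sketch below checks that observation. The formula is an assumption inferred from these seven rows, not a documented definition, and `ha_estimate` and `matches_listed` are hypothetical helper names.

```python
# (identity %, coverage %, listed Ha-value), copied from the table above.
rows = [
    (97.855, 100.0, 0.979),   # S. pneumoniae TIGR4
    (96.783, 100.0, 0.968),   # S. pneumoniae Rx1
    (96.783, 100.0, 0.968),   # S. pneumoniae D39
    (96.783, 100.0, 0.968),   # S. pneumoniae R6
    (92.225, 100.0, 0.922),   # S. mitis SK321
    (91.946, 99.866, 0.918),  # S. mitis NCTC 12261
    (44.534, 99.33, 0.442),   # L. lactis subsp. cremoris KW2
]


def ha_estimate(identity_pct, coverage_pct):
    """Assumed relation: product of the two percentages as fractions."""
    return identity_pct * coverage_pct / 10_000


def matches_listed(identity_pct, coverage_pct, listed, tol=5e-4 + 1e-9):
    """True if the estimate agrees with the listed value to 3 decimals."""
    return abs(ha_estimate(identity_pct, coverage_pct) - listed) <= tol


all_match = all(matches_listed(*row) for row in rows)
print(all_match)
```

If the relation holds in general, the Ha-value penalizes hits that are highly identical over only part of the query, which would explain why the S. mitis NCTC 12261 entry scores slightly below its identity figure.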