Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | HKM25_RS04520 | Genome accession | NZ_CP053210 |
| Coordinates | 891434..893674 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942396.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain 6A-10 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 886434..898674
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HKM25_RS04485 (HKM25_918) | - | 886584..887552 (+) | 969 | WP_000658191.1 | PhoH family protein | - |
| HKM25_RS04495 (HKM25_919) | - | 887745..888245 (+) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| HKM25_RS04500 (HKM25_920) | - | 888248..888622 (+) | 375 | Protein_904 | TfoX/Sxy family protein | - |
| HKM25_RS04505 | ald | 888876..889986 (-) | 1111 | Protein_905 | alanine dehydrogenase | - |
| HKM25_RS04510 (HKM25_924) | - | 890163..890732 (+) | 570 | WP_000443749.1 | GNAT family N-acetyltransferase | - |
| HKM25_RS04515 (HKM25_925) | comEA/celA/cilE | 890800..891450 (+) | 651 | WP_000387336.1 | ComEA family DNA-binding protein | Machinery gene |
| HKM25_RS04520 (HKM25_926) | comEC/celB | 891434..893674 (+) | 2241 | WP_000942396.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| HKM25_RS04525 (HKM25_927) | - | 893801..894064 (+) | 264 | WP_000747853.1 | hypothetical protein | - |
| HKM25_RS04530 (HKM25_928) | - | 894096..894683 (+) | 588 | WP_000933543.1 | ABC transporter ATP-binding protein | - |
| HKM25_RS04535 (HKM25_929) | - | 894687..895868 (+) | 1182 | WP_000655948.1 | ABC transporter permease | - |
| HKM25_RS04540 (HKM25_931) | infC | 896175..896705 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| HKM25_RS04545 (HKM25_932) | rpmI | 896738..896938 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| HKM25_RS04550 (HKM25_933) | rplT | 896990..897349 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| HKM25_RS04555 (HKM25_934) | - | 897407..897787 (+) | 381 | WP_000157154.1 | VOC family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84472.80 Da Isoelectric Point: 9.2649
>NTDB_id=443436 HKM25_RS04520 WP_000942396.1 891434..893674(+) (comEC/celB) [Streptococcus pneumoniae strain 6A-10]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLQLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMTVGEYLPIFGSQLEVLSPREIGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLQLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMTVGEYLPIFGSQLEVLSPREIGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=443436 HKM25_RS04520 WP_000942396.1 891434..893674(+) (comEC/celB) [Streptococcus pneumoniae strain 6A-10]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGTATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCAATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
CTCACAGGTGACAAGTAGGCCTCTAGTCTTTGGACAACCCAATGCATGGCTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTTGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTCGGTGGCAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGTACTTGCCCATTTTCG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGGAAATTGGAGATGGAGGACACGATGACACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGCGTTCGATA
G
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGTATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCAATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
CTCACAGGTGACAAGTAGGCCTCTAGTCTTTGGACAACCCAATGCATGGCTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTTGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTCGGTGGCAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGTACTTGCCCATTTTCG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGGAAATTGGAGATGGAGGACACGATGACACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGCGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae TIGR4 |
98.525 |
100 |
0.985 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
97.453 |
100 |
0.975 |
| comEC/celB | Streptococcus pneumoniae D39 |
97.453 |
100 |
0.975 |
| comEC/celB | Streptococcus pneumoniae R6 |
97.453 |
100 |
0.975 |
| comEC/celB | Streptococcus mitis SK321 |
91.957 |
100 |
0.92 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
91.946 |
99.866 |
0.918 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.265 |
99.33 |
0.44 |