Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | HMPREF1038_RS04800 | Genome accession | NC_018630 |
| Coordinates | 887653..889893 (+) | Length | 746 a.a. |
| NCBI ID | WP_014947326.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae gamPNI0373 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 882653..894893
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HMPREF1038_RS04765 (HMPREF1038_00968) | - | 882816..883784 (+) | 969 | WP_000658180.1 | PhoH family protein | - |
| HMPREF1038_RS04770 (HMPREF1038_00969) | - | 883964..884464 (+) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| HMPREF1038_RS04775 (HMPREF1038_00970) | - | 884467..884793 (+) | 327 | Protein_908 | TfoX/Sxy family protein | - |
| HMPREF1038_RS11620 (HMPREF1038_00971) | ald | 885094..886205 (-) | 1112 | Protein_909 | alanine dehydrogenase | - |
| HMPREF1038_RS04790 (HMPREF1038_00973) | - | 886382..886951 (+) | 570 | WP_000443745.1 | GNAT family N-acetyltransferase | - |
| HMPREF1038_RS04795 (HMPREF1038_00974) | comEA/celA/cilE | 887019..887669 (+) | 651 | WP_000387352.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| HMPREF1038_RS04800 (HMPREF1038_00975) | comEC/celB | 887653..889893 (+) | 2241 | WP_014947326.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| HMPREF1038_RS04805 (HMPREF1038_00976) | - | 890073..890261 (+) | 189 | WP_010976492.1 | hypothetical protein | - |
| HMPREF1038_RS04810 (HMPREF1038_00977) | - | 890293..890880 (+) | 588 | WP_000933542.1 | ATP-binding cassette domain-containing protein | - |
| HMPREF1038_RS04815 (HMPREF1038_00978) | - | 890884..892059 (+) | 1176 | WP_000655959.1 | hypothetical protein | - |
| HMPREF1038_RS04820 (HMPREF1038_00979) | infC | 892380..892910 (+) | 531 | WP_025172502.1 | translation initiation factor IF-3 | - |
| HMPREF1038_RS04825 (HMPREF1038_00980) | rpmI | 892943..893143 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| HMPREF1038_RS04830 (HMPREF1038_00981) | rplT | 893195..893554 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| HMPREF1038_RS04835 (HMPREF1038_00982) | - | 893612..893992 (+) | 381 | WP_000157154.1 | VOC family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84724.44 Da Isoelectric Point: 9.4461
>NTDB_id=53341 HMPREF1038_RS04800 WP_014947326.1 887653..889893(+) (comEC/celB) [Streptococcus pneumoniae gamPNI0373]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSVSYFALLGFVFLLVCLFIQFPWKSAGKVLVICGVFGFWFLFQTWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSDGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQGY
LKTQGIYQTLNIKRIQSFQKVGSWDIGENLSSLRRKAVVWIKMHLPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCVYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASRPLVFGQPNEWLLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSVSYFALLGFVFLLVCLFIQFPWKSAGKVLVICGVFGFWFLFQTWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSDGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQGY
LKTQGIYQTLNIKRIQSFQKVGSWDIGENLSSLRRKAVVWIKMHLPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCVYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASRPLVFGQPNEWLLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=53341 HMPREF1038_RS04800 WP_014947326.1 887653..889893(+) (comEC/celB) [Streptococcus pneumoniae gamPNI0373]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTATGGCTTTACTACGCCATTTT
CTCAGTATCCTATTTTGCTTTGTTGGGTTTTGTTTTTCTGCTAGTCTGCCTCTTTATCCAATTTCCTTGGAAATCAGCAG
GTAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAACTTGGCAACAGAGTCAAGTGAGTCAAAAT
TTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTTAACGGTGACAGTCTGTCCTTTCGTGGTAA
GTCTGATGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TTCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGGCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCATTCCAAAAGGTTGGCAGTTGGGATATAGG
TGAAAACCTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACCTCCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTAGGACATCTGGACACCGATTTTGAGGAGATGAATGAGCTTTATTCCAGTTTAGGAATTATTCACCTT
TTTGCCTTGTCAGGTATGCAGGTAGGGTTTTTCATGGACGGATTTAAGAAACTACTTTTACGATTGGGCTTGACACAAGA
AAAGTTGAAGTGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGTGTTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGTAGACCGCTTGTCTTTGGACAACCCAATGAATGGCTTTTAATCCTATTGTTAATTTCTTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAAGCTTGATACCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTATGGCTTTACTACGCCATTTT
CTCAGTATCCTATTTTGCTTTGTTGGGTTTTGTTTTTCTGCTAGTCTGCCTCTTTATCCAATTTCCTTGGAAATCAGCAG
GTAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAACTTGGCAACAGAGTCAAGTGAGTCAAAAT
TTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTTAACGGTGACAGTCTGTCCTTTCGTGGTAA
GTCTGATGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TTCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGGCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCATTCCAAAAGGTTGGCAGTTGGGATATAGG
TGAAAACCTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACCTCCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTAGGACATCTGGACACCGATTTTGAGGAGATGAATGAGCTTTATTCCAGTTTAGGAATTATTCACCTT
TTTGCCTTGTCAGGTATGCAGGTAGGGTTTTTCATGGACGGATTTAAGAAACTACTTTTACGATTGGGCTTGACACAAGA
AAAGTTGAAGTGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGTGTTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGTAGACCGCTTGTCTTTGGACAACCCAATGAATGGCTTTTAATCCTATTGTTAATTTCTTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAAGCTTGATACCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae Rx1 |
96.515 |
100 |
0.965 |
| comEC/celB | Streptococcus pneumoniae D39 |
96.515 |
100 |
0.965 |
| comEC/celB | Streptococcus pneumoniae R6 |
96.515 |
100 |
0.965 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
96.515 |
100 |
0.965 |
| comEC/celB | Streptococcus mitis SK321 |
91.287 |
100 |
0.913 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
91.007 |
99.866 |
0.909 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.265 |
99.33 |
0.44 |