Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | EQH31_RS04630 | Genome accession | NZ_CP035249 |
| Coordinates | 906858..909098 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942422.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain TVO_1901938 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 901858..914098
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EQH31_RS04600 (EQH31_04895) | - | 902021..902989 (+) | 969 | WP_000658181.1 | PhoH family protein | - |
| EQH31_RS04605 (EQH31_04900) | - | 903169..903669 (+) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| EQH31_RS04610 (EQH31_04905) | - | 903672..903998 (+) | 327 | Protein_927 | TfoX/Sxy family protein | - |
| EQH31_RS04615 (EQH31_04910) | ald | 904299..905410 (-) | 1112 | Protein_928 | alanine dehydrogenase | - |
| EQH31_RS04620 (EQH31_04915) | - | 905587..906156 (+) | 570 | WP_000443745.1 | GNAT family N-acetyltransferase | - |
| EQH31_RS04625 (EQH31_04920) | comEA/celA/cilE | 906224..906874 (+) | 651 | WP_000387352.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| EQH31_RS04630 (EQH31_04925) | comEC/celB | 906858..909098 (+) | 2241 | WP_000942422.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| EQH31_RS04635 (EQH31_04930) | - | 909278..909466 (+) | 189 | WP_010976492.1 | hypothetical protein | - |
| EQH31_RS04640 (EQH31_04935) | - | 909498..910085 (+) | 588 | WP_000933542.1 | ATP-binding cassette domain-containing protein | - |
| EQH31_RS04645 (EQH31_04940) | - | 910089..911264 (+) | 1176 | WP_000655940.1 | hypothetical protein | - |
| EQH31_RS04650 (EQH31_04945) | infC | 911585..912115 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| EQH31_RS04655 (EQH31_04950) | rpmI | 912148..912348 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| EQH31_RS04660 (EQH31_04955) | rplT | 912400..912759 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| EQH31_RS04665 (EQH31_04960) | - | 912817..913197 (+) | 381 | WP_000157154.1 | VOC family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84696.39 Da Isoelectric Point: 9.4461
>NTDB_id=337666 EQH31_RS04630 WP_000942422.1 906858..909098(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901938]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSVSYFALLGFVFLLVCLFIQFPWKSAGKVLVICGVFGFWFLFQTWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSDGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQGY
LKTQGIYQTLNIKRIQSLQKVGSWDIGENLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASRPLVFGQPNEWLLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSVSYFALLGFVFLLVCLFIQFPWKSAGKVLVICGVFGFWFLFQTWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSDGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQGY
LKTQGIYQTLNIKRIQSLQKVGSWDIGENLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASRPLVFGQPNEWLLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=337666 EQH31_RS04630 WP_000942422.1 906858..909098(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901938]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTATGGCTTTACTACGCCATTTT
CTCAGTATCCTATTTTGCTTTGTTGGGTTTTGTTTTTCTGCTAGTCTGCCTCTTTATCCAATTTCCTTGGAAATCAGCAG
GTAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAACTTGGCAACAGAGTCAAGTGAGTCAAAAT
TTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTTAACGGTGACAGTCTGTCCTTTCGTGGTAA
GTCTGATGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TTCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGGCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCACTCCAAAAGGTTGGCAGTTGGGATATAGG
TGAAAACCTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACTTCCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTAGGACATCTGGACACCGATTTTGAGGAGATGAATGAGCTTTATTCCAGTTTAGGAATTATTCACCTT
TTTGCCTTGTCAGGTATGCAGGTAGGGTTTTTCATGGACGGATTTAAGAAACTACTTTTACGATTGGGCTTGACACAAGA
AAAGTTGAAGTGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGTGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGTAGACCGCTTGTCTTTGGACAACCCAATGAATGGCTTTTAATCCTATTGTTAATTTCTTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAAGCTTGATACCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTACCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTATGGCTTTACTACGCCATTTT
CTCAGTATCCTATTTTGCTTTGTTGGGTTTTGTTTTTCTGCTAGTCTGCCTCTTTATCCAATTTCCTTGGAAATCAGCAG
GTAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAACTTGGCAACAGAGTCAAGTGAGTCAAAAT
TTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTTAACGGTGACAGTCTGTCCTTTCGTGGTAA
GTCTGATGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TTCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGGCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCACTCCAAAAGGTTGGCAGTTGGGATATAGG
TGAAAACCTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACTTCCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTAGGACATCTGGACACCGATTTTGAGGAGATGAATGAGCTTTATTCCAGTTTAGGAATTATTCACCTT
TTTGCCTTGTCAGGTATGCAGGTAGGGTTTTTCATGGACGGATTTAAGAAACTACTTTTACGATTGGGCTTGACACAAGA
AAAGTTGAAGTGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGTGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGTAGACCGCTTGTCTTTGGACAACCCAATGAATGGCTTTTAATCCTATTGTTAATTTCTTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAAGCTTGATACCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTACCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae Rx1 |
96.917 |
100 |
0.969 |
| comEC/celB | Streptococcus pneumoniae D39 |
96.917 |
100 |
0.969 |
| comEC/celB | Streptococcus pneumoniae R6 |
96.917 |
100 |
0.969 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
96.917 |
100 |
0.969 |
| comEC/celB | Streptococcus mitis SK321 |
91.689 |
100 |
0.917 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
91.409 |
99.866 |
0.913 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.399 |
99.33 |
0.441 |