Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | EQH18_RS04475 | Genome accession | NZ_CP035262 |
| Coordinates | 891018..893258 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942394.1 | Uniprot ID | A0A166WJ67 |
| Organism | Streptococcus pneumoniae strain TVO_1901923 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 886018..898258
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EQH18_RS04445 (EQH18_04695) | - | 886364..887154 (+) | 791 | Protein_896 | IS5 family transposase | - |
| EQH18_RS04450 (EQH18_04705) | - | 887337..887837 (+) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| EQH18_RS04455 (EQH18_04710) | - | 887840..888166 (+) | 327 | Protein_898 | TfoX/Sxy family protein | - |
| EQH18_RS04460 (EQH18_04715) | ald | 888471..889570 (-) | 1100 | Protein_899 | alanine dehydrogenase | - |
| EQH18_RS04465 (EQH18_04720) | - | 889747..890316 (+) | 570 | WP_000443750.1 | GNAT family N-acetyltransferase | - |
| EQH18_RS04470 (EQH18_04725) | comEA/celA/cilE | 890384..891034 (+) | 651 | WP_000387344.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| EQH18_RS04475 (EQH18_04730) | comEC/celB | 891018..893258 (+) | 2241 | WP_000942394.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| EQH18_RS04480 (EQH18_04735) | - | 893437..893625 (+) | 189 | WP_001808514.1 | hypothetical protein | - |
| EQH18_RS04485 (EQH18_04740) | - | 893658..894245 (+) | 588 | WP_000939895.1 | ATP-binding cassette domain-containing protein | - |
| EQH18_RS04490 (EQH18_04745) | - | 894249..895433 (+) | 1185 | WP_000655960.1 | ABC transporter permease | - |
| EQH18_RS04495 (EQH18_04750) | infC | 895744..896274 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| EQH18_RS04500 (EQH18_04755) | rpmI | 896307..896507 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| EQH18_RS04505 (EQH18_04760) | rplT | 896559..896918 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| EQH18_RS04510 (EQH18_04765) | - | 896976..897356 (+) | 381 | WP_000157154.1 | VOC family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84444.00 Da Isoelectric Point: 9.6202
>NTDB_id=338455 EQH18_RS04475 WP_000942394.1 891018..893258(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901923]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMGNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVARRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMGNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVARRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=338455 EQH18_RS04475 WP_000942394.1 891018..893258(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901923]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGGGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGGAGACCACTTGTCTTTGGTCAACCCAACGCATGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGGGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGGAGACCACTTGTCTTTGGTCAACCCAACGCATGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae TIGR4 |
100 |
100 |
1 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
97.587 |
100 |
0.976 |
| comEC/celB | Streptococcus pneumoniae D39 |
97.587 |
100 |
0.976 |
| comEC/celB | Streptococcus pneumoniae R6 |
97.587 |
100 |
0.976 |
| comEC/celB | Streptococcus mitis SK321 |
92.225 |
100 |
0.922 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
91.812 |
99.866 |
0.917 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.399 |
99.33 |
0.441 |