Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | EQH35_RS04480 | Genome accession | NZ_CP035245 |
| Coordinates | 868892..871132 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942414.1 | Uniprot ID | A0A4J1ZQU0 |
| Organism | Streptococcus pneumoniae strain TVO_1901943 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 863892..876132
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EQH35_RS04450 (EQH35_04690) | - | 864043..865011 (+) | 969 | WP_000658183.1 | PhoH family protein | - |
| EQH35_RS04455 (EQH35_04700) | - | 865204..865704 (+) | 501 | WP_000566982.1 | GNAT family N-acetyltransferase | - |
| EQH35_RS04460 (EQH35_04705) | - | 865707..866032 (+) | 326 | Protein_897 | TfoX/Sxy family protein | - |
| EQH35_RS04465 | ald | 866333..867444 (-) | 1112 | Protein_898 | alanine dehydrogenase | - |
| EQH35_RS04470 (EQH35_04725) | - | 867621..868190 (+) | 570 | WP_238125046.1 | GNAT family N-acetyltransferase | - |
| EQH35_RS04475 (EQH35_04730) | comEA/celA/cilE | 868258..868908 (+) | 651 | WP_000387335.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| EQH35_RS04480 (EQH35_04735) | comEC/celB | 868892..871132 (+) | 2241 | WP_000942414.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| EQH35_RS04485 (EQH35_04740) | - | 871311..871499 (+) | 189 | WP_001809102.1 | hypothetical protein | - |
| EQH35_RS04490 (EQH35_04745) | - | 871531..872118 (+) | 588 | WP_000933542.1 | ATP-binding cassette domain-containing protein | - |
| EQH35_RS04495 (EQH35_04750) | - | 872122..873297 (+) | 1176 | WP_000655957.1 | ABC transporter permease | - |
| EQH35_RS04500 (EQH35_04755) | infC | 873618..874148 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| EQH35_RS04505 (EQH35_04760) | rpmI | 874181..874381 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| EQH35_RS04510 (EQH35_04765) | rplT | 874433..874792 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| EQH35_RS04515 (EQH35_04770) | - | 874850..875230 (+) | 381 | WP_000157157.1 | VOC family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84540.03 Da Isoelectric Point: 9.6773
>NTDB_id=337354 EQH35_RS04480 WP_000942414.1 868892..871132(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901943]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSTSYLALLGFVFLLVCLFFQFPWKSASKVLVICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFFTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVARRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKQFVVELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGSKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSTSYLALLGFVFLLVCLFFQFPWKSASKVLVICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFFTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVARRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKQFVVELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGSKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=337354 EQH35_RS04480 WP_000942414.1 868892..871132(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901943]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTTTGGCTTTACTACGCCATTTT
TTCAACATCCTATCTTGCTTTGTTGGGATTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCTGCTA
GCAAAGTTCTAGTGATTTGTGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTTACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGGAGACCGCTTGTCTTTGGTCAACCCAACGCATGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CACAGAGAAGCTTGATACCCTATCTTAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTGATTTTGACAAATACGGACAAG
GAACATGTCGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTTTAGTATCAAAAGGCAGTTTGAAGCA
GAAGCAATTTGTGGTAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCTCTAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTTTGGCTTTACTACGCCATTTT
TTCAACATCCTATCTTGCTTTGTTGGGATTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCTGCTA
GCAAAGTTCTAGTGATTTGTGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTTACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGGAGACCGCTTGTCTTTGGTCAACCCAACGCATGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CACAGAGAAGCTTGATACCCTATCTTAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTGATTTTGACAAATACGGACAAG
GAACATGTCGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTTTAGTATCAAAAGGCAGTTTGAAGCA
GAAGCAATTTGTGGTAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCTCTAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae TIGR4 |
97.721 |
100 |
0.977 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
96.649 |
100 |
0.966 |
| comEC/celB | Streptococcus pneumoniae D39 |
96.649 |
100 |
0.966 |
| comEC/celB | Streptococcus pneumoniae R6 |
96.649 |
100 |
0.966 |
| comEC/celB | Streptococcus mitis SK321 |
92.359 |
100 |
0.924 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
92.081 |
99.866 |
0.92 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.534 |
99.33 |
0.442 |