Detailed information
Overview
| Name | comEA/celA/cilE | Type | Machinery gene |
| Locus tag | KK0981_RS04680 | Genome accession | NZ_AP017971 |
| Coordinates | 870157..870807 (+) | Length | 216 a.a. |
| NCBI ID | WP_000387330.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain KK0981 | ||
| Function | dsDNA binding to the cell surface (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 865157..875807
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| KK0981_RS04650 (KK0981_29150) | pyrH | 865881..866618 (+) | 738 | WP_000002997.1 | UMP kinase | - |
| KK0981_RS04655 (KK0981_29160) | frr | 866627..867184 (+) | 558 | WP_000024409.1 | ribosome recycling factor | - |
| KK0981_RS04660 (KK0981_29170) | cvfB | 867244..868098 (+) | 855 | WP_001095450.1 | RNA-binding virulence regulatory protein CvfB | - |
| KK0981_RS04665 (KK0981_29180) | - | 868107..868322 (+) | 216 | WP_001232082.1 | YozE family protein | - |
| KK0981_RS04670 (KK0981_29190) | - | 868408..869412 (+) | 1005 | WP_000658177.1 | PhoH family protein | - |
| KK0981_RS04675 (KK0981_29200) | - | 869520..870089 (+) | 570 | WP_000443770.1 | GNAT family N-acetyltransferase | - |
| KK0981_RS04680 (KK0981_29210) | comEA/celA/cilE | 870157..870807 (+) | 651 | WP_000387330.1 | ComEA family DNA-binding protein | Machinery gene |
| KK0981_RS04685 (KK0981_29220) | comEC/celB | 870791..873031 (+) | 2241 | WP_000942411.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| KK0981_RS04690 (KK0981_29230) | - | 873075..874028 (-) | 954 | WP_001155321.1 | IS30-like element ISSpn8 family transposase | - |
| KK0981_RS04695 (KK0981_29240) | infC | 874515..875045 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| KK0981_RS04700 (KK0981_29250) | rpmI | 875078..875278 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| KK0981_RS04705 (KK0981_29260) | rplT | 875330..875689 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
Sequence
Protein
Download Length: 216 a.a. Molecular weight: 23229.65 Da Isoelectric Point: 5.5171
>NTDB_id=68678 KK0981_RS04680 WP_000387330.1 870157..870807(+) (comEA/celA/cilE) [Streptococcus pneumoniae strain KK0981]
MEAIIEKIKEYKIIVICTGLGLLVGGFFLLKPAPQTPVKETNLQAEVAAVSKDLVSEKEVNKEEKEEPLEQDLITVDVKG
AVKSPGIYDLPVGSRVNDAVQKAGGLTEQADSKSLNLAQKVSDEALVYVPTKGEEAVSQQTGLGTASSISKEKKVNLNKA
SLEELKQVKGLGGKRAQDIIDHREANGKFKSVDELKKVSGIGGKTIEKLKDYVTVD
MEAIIEKIKEYKIIVICTGLGLLVGGFFLLKPAPQTPVKETNLQAEVAAVSKDLVSEKEVNKEEKEEPLEQDLITVDVKG
AVKSPGIYDLPVGSRVNDAVQKAGGLTEQADSKSLNLAQKVSDEALVYVPTKGEEAVSQQTGLGTASSISKEKKVNLNKA
SLEELKQVKGLGGKRAQDIIDHREANGKFKSVDELKKVSGIGGKTIEKLKDYVTVD
Nucleotide
Download Length: 651 bp
>NTDB_id=68678 KK0981_RS04680 WP_000387330.1 870157..870807(+) (comEA/celA/cilE) [Streptococcus pneumoniae strain KK0981]
ATGGAAGCAATTATCGAGAAAATCAAAGAGTATAAAATCATCGTCATCTGTACTGGTCTGGGCTTGCTTGTAGGAGGATT
TTTCCTGCTAAAACCAGCTCCACAAACACCTGTCAAAGAGACGAATTTGCAGGCTGAAGTCGCAGCTGTTTCCAAGGATT
TGGTATCCGAAAAGGAAGTGAACAAGGAAGAAAAGGAAGAACCCCTTGAACAAGATCTAATCACAGTAGATGTCAAAGGT
GCTGTCAAATCGCCAGGGATTTATGACTTGCCTGTAGGTAGTCGAGTCAATGATGCTGTTCAGAAGGCTGGTGGCTTGAC
AGAGCAAGCAGACAGCAAGTCGCTCAATCTAGCTCAGAAAGTTAGTGATGAGGCTCTGGTTTACGTTCCTACTAAGGGAG
AAGAAGCAGTTAGCCAACAGACTGGTTTGGGGACAGCTTCTTCAATAAGCAAGGAAAAGAAGGTCAATCTCAACAAGGCC
AGTCTGGAAGAACTCAAGCAGGTCAAGGGACTGGGAGGAAAACGAGCTCAGGACATTATCGACCATCGTGAGGCAAATGG
CAAGTTCAAGTCAGTAGACGAGCTCAAGAAGGTCTCTGGCATTGGTGGCAAAACAATAGAAAAGCTTAAAGACTATGTTA
CAGTGGATTAA
ATGGAAGCAATTATCGAGAAAATCAAAGAGTATAAAATCATCGTCATCTGTACTGGTCTGGGCTTGCTTGTAGGAGGATT
TTTCCTGCTAAAACCAGCTCCACAAACACCTGTCAAAGAGACGAATTTGCAGGCTGAAGTCGCAGCTGTTTCCAAGGATT
TGGTATCCGAAAAGGAAGTGAACAAGGAAGAAAAGGAAGAACCCCTTGAACAAGATCTAATCACAGTAGATGTCAAAGGT
GCTGTCAAATCGCCAGGGATTTATGACTTGCCTGTAGGTAGTCGAGTCAATGATGCTGTTCAGAAGGCTGGTGGCTTGAC
AGAGCAAGCAGACAGCAAGTCGCTCAATCTAGCTCAGAAAGTTAGTGATGAGGCTCTGGTTTACGTTCCTACTAAGGGAG
AAGAAGCAGTTAGCCAACAGACTGGTTTGGGGACAGCTTCTTCAATAAGCAAGGAAAAGAAGGTCAATCTCAACAAGGCC
AGTCTGGAAGAACTCAAGCAGGTCAAGGGACTGGGAGGAAAACGAGCTCAGGACATTATCGACCATCGTGAGGCAAATGG
CAAGTTCAAGTCAGTAGACGAGCTCAAGAAGGTCTCTGGCATTGGTGGCAAAACAATAGAAAAGCTTAAAGACTATGTTA
CAGTGGATTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEA/celA/cilE | Streptococcus pneumoniae Rx1 |
100 |
100 |
1 |
| comEA/celA/cilE | Streptococcus pneumoniae D39 |
100 |
100 |
1 |
| comEA/celA/cilE | Streptococcus pneumoniae R6 |
100 |
100 |
1 |
| comEA/celA/cilE | Streptococcus pneumoniae TIGR4 |
97.222 |
100 |
0.972 |
| comEA/celA/cilE | Streptococcus mitis NCTC 12261 |
96.296 |
100 |
0.963 |
| comEA/celA/cilE | Streptococcus mitis SK321 |
90.278 |
100 |
0.903 |
| comEA | Lactococcus lactis subsp. cremoris KW2 |
43.172 |
100 |
0.454 |
| comEA | Bacillus subtilis subsp. subtilis str. 168 |
41.579 |
87.963 |
0.366 |
| comEA | Latilactobacillus sakei subsp. sakei 23K |
33.476 |
100 |
0.361 |