Detailed information
Overview
| Name | clpE | Type | Regulator |
| Locus tag | ES276_RS05895 | Genome accession | NZ_CP038252 |
| Coordinates | 1145196..1147454 (-) | Length | 752 a.a. |
| NCBI ID | WP_238092956.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain TVO_1901936 | ||
| Function | degradation of ComX (predicted from homology) Competence regulation |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| IScluster/Tn | 1140171..1144958 | 1145196..1147454 | flank | 238 |
Gene organization within MGE regions
Location: 1140171..1147454
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ES276_RS05870 (ES276_06310) | tnpB | 1141169..1141456 (-) | 288 | WP_000586646.1 | IS66 family insertion sequence element accessory protein TnpB | - |
| ES276_RS05875 (ES276_06315) | - | 1141697..1143034 (+) | 1338 | Protein_1150 | IS3 family transposase | - |
| ES276_RS05880 (ES276_06320) | - | 1143142..1143441 (-) | 300 | WP_000767193.1 | DUF1827 family protein | - |
| ES276_RS05885 (ES276_06330) | - | 1143606..1144028 (-) | 423 | WP_001849729.1 | NUDIX hydrolase | - |
| ES276_RS05890 (ES276_06335) | - | 1144139..1144989 (+) | 851 | Protein_1153 | IS630 family transposase | - |
| ES276_RS05895 (ES276_06340) | clpE | 1145196..1147454 (-) | 2259 | WP_238092956.1 | ATP-dependent Clp protease ATP-binding subunit | Regulator |
Sequence
Protein
Download Length: 752 a.a. Molecular weight: 83854.29 Da Isoelectric Point: 5.3713
>NTDB_id=352635 ES276_RS05895 WP_238092956.1 1145196..1147454(-) (clpE) [Streptococcus pneumoniae strain TVO_1901936]
MLCQNCKINDSTIHLYTNLNGKQKQIDLCQNCYKIIKTDPNNSLFKGMTDLNNRDFDPFGDFFNDLNNFRPSSNTPPIPP
TQSGGGYGGNGGYGSQNRGSAQTPPPSQEKGLLEEFGINVTEIARRGDIDPVIGRDDEIIRVIEILNRRTKNNPVLIGEP
GVGKTAVVEGLAQKIVDGDVPHKLQGKQVIRLDVVSLVQGTGIRGQFEERMQKLMEEIRKREDIILFIDEIHEIVGAGSA
SDGNMDAGNILKPALARGELQLVGATTLNEYRIIEKDAALERRMQPVKVDEPTVDETITILKGIQKKYEDYHHVQYTDAA
IEAAATLSNRYIQDRFLPDKAIDLLDEAGSKMNLTLNFVDPKVIDQRLIEAENLKSQATREEDFEKAAYFRDQIAKYKEM
QKKKITDQDTPIISEKTIEHIIEQKTNIPVGDLKEKEQSQLIHLAEDLKSHVIGQDDAVDKIAKAIRRNRVGLGTPNRPI
GSFLFVGPTGVGKTELSKQLAIELFGSADSMIRFDMSEYMEKHSVAKLVGAPPGYVGYDEAGQLTEKVRHNPYSLILLDE
VEKAHPDVMHMFLQILDDGRLTDGQGRTVSFKDAIIIMTSNAGTGKTEASVGFGAAREGRTNSVLGELGNFFSPEFMNRF
DGIIEFKALSKDNLLQIVELMLADVNKRLSSNNIRLDVTDKVKEKLVDLGYDPKMGARPLRRTIQDYIEDTITDYYLENP
SEKDLKAVMTSKGNIQIKSAKKAEVKSSEKEK
MLCQNCKINDSTIHLYTNLNGKQKQIDLCQNCYKIIKTDPNNSLFKGMTDLNNRDFDPFGDFFNDLNNFRPSSNTPPIPP
TQSGGGYGGNGGYGSQNRGSAQTPPPSQEKGLLEEFGINVTEIARRGDIDPVIGRDDEIIRVIEILNRRTKNNPVLIGEP
GVGKTAVVEGLAQKIVDGDVPHKLQGKQVIRLDVVSLVQGTGIRGQFEERMQKLMEEIRKREDIILFIDEIHEIVGAGSA
SDGNMDAGNILKPALARGELQLVGATTLNEYRIIEKDAALERRMQPVKVDEPTVDETITILKGIQKKYEDYHHVQYTDAA
IEAAATLSNRYIQDRFLPDKAIDLLDEAGSKMNLTLNFVDPKVIDQRLIEAENLKSQATREEDFEKAAYFRDQIAKYKEM
QKKKITDQDTPIISEKTIEHIIEQKTNIPVGDLKEKEQSQLIHLAEDLKSHVIGQDDAVDKIAKAIRRNRVGLGTPNRPI
GSFLFVGPTGVGKTELSKQLAIELFGSADSMIRFDMSEYMEKHSVAKLVGAPPGYVGYDEAGQLTEKVRHNPYSLILLDE
VEKAHPDVMHMFLQILDDGRLTDGQGRTVSFKDAIIIMTSNAGTGKTEASVGFGAAREGRTNSVLGELGNFFSPEFMNRF
DGIIEFKALSKDNLLQIVELMLADVNKRLSSNNIRLDVTDKVKEKLVDLGYDPKMGARPLRRTIQDYIEDTITDYYLENP
SEKDLKAVMTSKGNIQIKSAKKAEVKSSEKEK
Nucleotide
Download Length: 2259 bp
>NTDB_id=352635 ES276_RS05895 WP_238092956.1 1145196..1147454(-) (clpE) [Streptococcus pneumoniae strain TVO_1901936]
ATGCTTTGTCAAAACTGTAAAATTAACGACTCAACAATTCATCTTTACACCAATCTCAATGGAAAACAAAAACAAATTGA
CCTCTGTCAAAACTGCTATAAGATTATCAAAACAGATCCTAACAATAGCCTCTTCAAAGGTATGACGGATCTGAACAATC
GTGACTTCGATCCCTTTGGTGATTTCTTCAATGATCTAAACAATTTCAGACCTTCTAGCAATACTCCTCCTATTCCCCCA
ACCCAATCAGGTGGAGGTTACGGTGGAAACGGCGGTTATGGTTCCCAAAATCGTGGATCTGCTCAAACTCCGCCACCTAG
CCAAGAAAAAGGCCTGCTGGAAGAATTTGGTATTAATGTAACTGAAATTGCCCGTCGTGGAGACATTGACCCCGTTATTG
GGCGCGACGATGAGATTATCCGTGTCATCGAGATTCTCAATCGTAGAACCAAGAATAATCCTGTCCTTATCGGTGAACCT
GGTGTCGGAAAAACGGCCGTTGTCGAAGGTCTAGCTCAGAAAATTGTCGATGGCGATGTGCCACATAAACTCCAAGGTAA
ACAAGTCATCCGTCTGGATGTGGTTAGCTTAGTTCAAGGAACGGGGATTCGAGGACAATTTGAAGAACGCATGCAAAAAC
TCATGGAAGAAATTCGCAAACGTGAAGACATCATCCTCTTTATCGATGAAATCCATGAAATTGTTGGTGCTGGTTCTGCG
AGTGATGGTAATATGGACGCAGGAAATATCCTCAAGCCAGCCCTTGCTCGTGGAGAACTGCAACTAGTCGGTGCTACTAC
CCTCAATGAATACCGTATCATTGAAAAGGATGCTGCCCTCGAGCGTCGTATGCAGCCTGTTAAAGTCGATGAACCAACGG
TGGACGAAACAATCACTATTCTCAAAGGGATTCAAAAGAAATACGAAGATTACCACCACGTTCAATATACCGATGCTGCG
ATTGAAGCAGCTGCAACTCTTTCCAATCGCTACATCCAAGATCGCTTCTTGCCTGACAAGGCCATTGACCTCCTAGATGA
AGCTGGTTCTAAGATGAACTTGACCTTGAATTTTGTGGATCCTAAAGTAATTGATCAGCGCTTGATTGAGGCTGAAAATC
TCAAGTCTCAAGCTACACGAGAAGAAGATTTTGAGAAGGCGGCCTACTTCCGCGACCAGATTGCCAAGTATAAGGAAATG
CAAAAGAAAAAGATCACAGACCAGGATACTCCTATCATCAGCGAGAAAACTATTGAGCACATTATCGAGCAGAAAACCAA
TATCCCTGTTGGTGATTTGAAAGAGAAAGAACAATCTCAACTCATCCATCTAGCCGAAGATCTCAAGTCTCATGTTATTG
GCCAAGATGATGCAGTCGATAAGATTGCCAAGGCTATTCGCCGTAATCGTGTCGGACTTGGTACCCCTAACCGCCCAATC
GGAAGCTTCCTCTTCGTTGGGCCAACTGGTGTCGGTAAGACAGAACTTTCCAAACAACTGGCTATCGAACTTTTTGGTTC
TGCTGATAGTATGATTCGCTTTGATATGAGTGAATACATGGAAAAACATAGTGTGGCTAAGTTGGTCGGCGCCCCTCCAG
GTTATGTTGGCTATGATGAGGCTGGTCAATTAACTGAAAAAGTTCGCCACAATCCATATTCTCTCATCCTTCTCGATGAA
GTGGAAAAAGCTCACCCAGATGTTATGCACATGTTTCTTCAAATCTTGGACGATGGTCGTTTGACAGACGGGCAAGGACG
CACCGTTAGCTTCAAGGATGCCATCATTATCATGACCTCAAATGCAGGTACAGGAAAGACCGAAGCTAGCGTTGGATTTG
GTGCTGCTAGAGAAGGACGTACCAATTCTGTCCTCGGTGAACTCGGTAACTTCTTTAGCCCAGAGTTTATGAACCGTTTT
GATGGCATTATCGAATTTAAGGCTCTCAGCAAGGATAACCTCCTTCAGATTGTCGAGCTCATGCTAGCAGATGTTAACAA
GCGCCTCTCTAGCAACAACATTCGTTTGGATGTAACTGATAAGGTCAAGGAAAAGTTGGTTGACCTAGGTTATGATCCAA
AAATGGGAGCACGCCCACTTCGTCGGACTATTCAAGACTATATTGAGGACACAATCACTGACTACTACCTTGAAAATCCA
AGCGAAAAAGATCTCAAAGCAGTTATGACTAGCAAGGGAAACATTCAGATTAAATCTGCCAAAAAAGCTGAAGTTAAAAG
TTCTGAAAAAGAAAAATAA
ATGCTTTGTCAAAACTGTAAAATTAACGACTCAACAATTCATCTTTACACCAATCTCAATGGAAAACAAAAACAAATTGA
CCTCTGTCAAAACTGCTATAAGATTATCAAAACAGATCCTAACAATAGCCTCTTCAAAGGTATGACGGATCTGAACAATC
GTGACTTCGATCCCTTTGGTGATTTCTTCAATGATCTAAACAATTTCAGACCTTCTAGCAATACTCCTCCTATTCCCCCA
ACCCAATCAGGTGGAGGTTACGGTGGAAACGGCGGTTATGGTTCCCAAAATCGTGGATCTGCTCAAACTCCGCCACCTAG
CCAAGAAAAAGGCCTGCTGGAAGAATTTGGTATTAATGTAACTGAAATTGCCCGTCGTGGAGACATTGACCCCGTTATTG
GGCGCGACGATGAGATTATCCGTGTCATCGAGATTCTCAATCGTAGAACCAAGAATAATCCTGTCCTTATCGGTGAACCT
GGTGTCGGAAAAACGGCCGTTGTCGAAGGTCTAGCTCAGAAAATTGTCGATGGCGATGTGCCACATAAACTCCAAGGTAA
ACAAGTCATCCGTCTGGATGTGGTTAGCTTAGTTCAAGGAACGGGGATTCGAGGACAATTTGAAGAACGCATGCAAAAAC
TCATGGAAGAAATTCGCAAACGTGAAGACATCATCCTCTTTATCGATGAAATCCATGAAATTGTTGGTGCTGGTTCTGCG
AGTGATGGTAATATGGACGCAGGAAATATCCTCAAGCCAGCCCTTGCTCGTGGAGAACTGCAACTAGTCGGTGCTACTAC
CCTCAATGAATACCGTATCATTGAAAAGGATGCTGCCCTCGAGCGTCGTATGCAGCCTGTTAAAGTCGATGAACCAACGG
TGGACGAAACAATCACTATTCTCAAAGGGATTCAAAAGAAATACGAAGATTACCACCACGTTCAATATACCGATGCTGCG
ATTGAAGCAGCTGCAACTCTTTCCAATCGCTACATCCAAGATCGCTTCTTGCCTGACAAGGCCATTGACCTCCTAGATGA
AGCTGGTTCTAAGATGAACTTGACCTTGAATTTTGTGGATCCTAAAGTAATTGATCAGCGCTTGATTGAGGCTGAAAATC
TCAAGTCTCAAGCTACACGAGAAGAAGATTTTGAGAAGGCGGCCTACTTCCGCGACCAGATTGCCAAGTATAAGGAAATG
CAAAAGAAAAAGATCACAGACCAGGATACTCCTATCATCAGCGAGAAAACTATTGAGCACATTATCGAGCAGAAAACCAA
TATCCCTGTTGGTGATTTGAAAGAGAAAGAACAATCTCAACTCATCCATCTAGCCGAAGATCTCAAGTCTCATGTTATTG
GCCAAGATGATGCAGTCGATAAGATTGCCAAGGCTATTCGCCGTAATCGTGTCGGACTTGGTACCCCTAACCGCCCAATC
GGAAGCTTCCTCTTCGTTGGGCCAACTGGTGTCGGTAAGACAGAACTTTCCAAACAACTGGCTATCGAACTTTTTGGTTC
TGCTGATAGTATGATTCGCTTTGATATGAGTGAATACATGGAAAAACATAGTGTGGCTAAGTTGGTCGGCGCCCCTCCAG
GTTATGTTGGCTATGATGAGGCTGGTCAATTAACTGAAAAAGTTCGCCACAATCCATATTCTCTCATCCTTCTCGATGAA
GTGGAAAAAGCTCACCCAGATGTTATGCACATGTTTCTTCAAATCTTGGACGATGGTCGTTTGACAGACGGGCAAGGACG
CACCGTTAGCTTCAAGGATGCCATCATTATCATGACCTCAAATGCAGGTACAGGAAAGACCGAAGCTAGCGTTGGATTTG
GTGCTGCTAGAGAAGGACGTACCAATTCTGTCCTCGGTGAACTCGGTAACTTCTTTAGCCCAGAGTTTATGAACCGTTTT
GATGGCATTATCGAATTTAAGGCTCTCAGCAAGGATAACCTCCTTCAGATTGTCGAGCTCATGCTAGCAGATGTTAACAA
GCGCCTCTCTAGCAACAACATTCGTTTGGATGTAACTGATAAGGTCAAGGAAAAGTTGGTTGACCTAGGTTATGATCCAA
AAATGGGAGCACGCCCACTTCGTCGGACTATTCAAGACTATATTGAGGACACAATCACTGACTACTACCTTGAAAATCCA
AGCGAAAAAGATCTCAAAGCAGTTATGACTAGCAAGGGAAACATTCAGATTAAATCTGCCAAAAAAGCTGAAGTTAAAAG
TTCTGAAAAAGAAAAATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| clpE | Streptococcus pneumoniae Rx1 |
99.867 |
100 |
0.999 |
| clpE | Streptococcus pneumoniae D39 |
99.867 |
100 |
0.999 |
| clpE | Streptococcus pneumoniae R6 |
99.867 |
100 |
0.999 |
| clpE | Streptococcus pneumoniae TIGR4 |
99.734 |
100 |
0.997 |
| clpE | Streptococcus mutans UA159 |
82.597 |
99.335 |
0.82 |
| clpC | Lactococcus lactis subsp. cremoris KW2 |
76.975 |
97.606 |
0.751 |
| clpC | Bacillus subtilis subsp. subtilis str. 168 |
52.154 |
86.436 |
0.451 |
| clpC | Lactococcus lactis subsp. lactis strain DGCC12653 |
47.847 |
83.378 |
0.399 |
| clpC | Streptococcus pneumoniae TIGR4 |
46.751 |
83.91 |
0.392 |
| clpC | Streptococcus pneumoniae Rx1 |
46.751 |
83.91 |
0.392 |
| clpC | Streptococcus pneumoniae D39 |
46.751 |
83.91 |
0.392 |
| clpC | Streptococcus thermophilus LMD-9 |
46.635 |
82.979 |
0.387 |
| clpC | Streptococcus mutans UA159 |
46.411 |
83.378 |
0.387 |
| clpC | Streptococcus thermophilus LMG 18311 |
46.24 |
83.112 |
0.384 |