Detailed information
Overview
| Name | clpE | Type | Regulator |
| Locus tag | EQB41_RS04200 | Genome accession | NZ_LR129840 |
| Coordinates | 766180..768438 (-) | Length | 752 a.a. |
| NCBI ID | WP_127820753.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain 4496 isolate 4496 | ||
| Function | degradation of ComX (predicted from homology) Competence regulation |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| IScluster/Tn | 759283..765453 | 766180..768438 | flank | 727 |
Gene organization within MGE regions
Location: 759283..768438
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EQB41_RS04155 | - | 759283..760589 (-) | 1307 | Protein_784 | transposase | - |
| EQB41_RS04165 | - | 761238..762140 (+) | 903 | Protein_785 | IS630 family transposase | - |
| EQB41_RS04170 | tnpB | 762163..762513 (-) | 351 | WP_000586654.1 | IS66 family insertion sequence element accessory protein TnpB | - |
| EQB41_RS04175 | - | 762755..764015 (+) | 1261 | Protein_787 | IS3 family transposase | - |
| EQB41_RS04180 | - | 764127..764426 (-) | 300 | WP_000767195.1 | DUF1827 family protein | - |
| EQB41_RS04190 | - | 764591..765042 (-) | 452 | Protein_789 | NUDIX hydrolase | - |
| EQB41_RS04195 | - | 765124..765973 (+) | 850 | Protein_790 | IS630 family transposase | - |
| EQB41_RS04200 | clpE | 766180..768438 (-) | 2259 | WP_127820753.1 | ATP-dependent Clp protease ATP-binding subunit | Regulator |
Sequence
Protein
Download Length: 752 a.a. Molecular weight: 83874.28 Da Isoelectric Point: 5.3713
>NTDB_id=1116188 EQB41_RS04200 WP_127820753.1 766180..768438(-) (clpE) [Streptococcus pneumoniae strain 4496 isolate 4496]
MLCQNCKINDSTIHLYTNLNGKQKQIDLCQNCYKIIKTDPNNSLFKGMTDLNNRDFDPFGDFFNDLNNFRPSSNTPPIPP
TQSGGGYGGNGGYGSQNRGSAQTPPPSQEKGLLEEFGINVTEIARRGDIDPVIGRDDEIIRVIEILNRRTKNNPVLIGEP
GVGKTAVVEGLAQKIVDGDVPHKLQGKQVIRLDVVSLVQGTGIRGQFEERMQKLMEEIRKREDIILFIDEIHEIVGAGSA
SDGNMDAGNILKPALARGELQLVGATTLNEYRIIEKDAALERRMQPVKVDEPTVDETITILKGIQKKYEDYHHVQYTDAA
IEAAATLSNRYIQDRFFPDKAIDLLDEAGSKMNLTLNFVDPKVIDQRLIEAENLKSQATREEDFEKAAYFRDQIAKYKEM
QKKKITDQDTPIISEKTIEHIIEQKTNIPVGDLKEKEQSQLIHLAEDLKSHVIGQDDAVDKIAKAIRRNRVGLGTPNRPI
GSFLFVGPTGVGKTELSKQLAIELFGSADSMIRFDMSEYMEKHSVAKLVGAPPGYVGYDEAGQLTEKVRHNPYSLILLDE
VEKAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDAIIIMTSNAGTGKTEASVGFGAAREGRTNSVLGELGNFFSPEFMNRF
DGIIEFKALSKDNLLQIVELMLADVNKRLSSNNIRLDVTDKVKEKLVDLGYDPKMGARPLRRTIQDYIEDTITDYYLENP
SEKDLKAVMTSKGNIQIKSAKKAEVKSSEKEK
MLCQNCKINDSTIHLYTNLNGKQKQIDLCQNCYKIIKTDPNNSLFKGMTDLNNRDFDPFGDFFNDLNNFRPSSNTPPIPP
TQSGGGYGGNGGYGSQNRGSAQTPPPSQEKGLLEEFGINVTEIARRGDIDPVIGRDDEIIRVIEILNRRTKNNPVLIGEP
GVGKTAVVEGLAQKIVDGDVPHKLQGKQVIRLDVVSLVQGTGIRGQFEERMQKLMEEIRKREDIILFIDEIHEIVGAGSA
SDGNMDAGNILKPALARGELQLVGATTLNEYRIIEKDAALERRMQPVKVDEPTVDETITILKGIQKKYEDYHHVQYTDAA
IEAAATLSNRYIQDRFFPDKAIDLLDEAGSKMNLTLNFVDPKVIDQRLIEAENLKSQATREEDFEKAAYFRDQIAKYKEM
QKKKITDQDTPIISEKTIEHIIEQKTNIPVGDLKEKEQSQLIHLAEDLKSHVIGQDDAVDKIAKAIRRNRVGLGTPNRPI
GSFLFVGPTGVGKTELSKQLAIELFGSADSMIRFDMSEYMEKHSVAKLVGAPPGYVGYDEAGQLTEKVRHNPYSLILLDE
VEKAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDAIIIMTSNAGTGKTEASVGFGAAREGRTNSVLGELGNFFSPEFMNRF
DGIIEFKALSKDNLLQIVELMLADVNKRLSSNNIRLDVTDKVKEKLVDLGYDPKMGARPLRRTIQDYIEDTITDYYLENP
SEKDLKAVMTSKGNIQIKSAKKAEVKSSEKEK
Nucleotide
Download Length: 2259 bp
>NTDB_id=1116188 EQB41_RS04200 WP_127820753.1 766180..768438(-) (clpE) [Streptococcus pneumoniae strain 4496 isolate 4496]
ATGCTTTGTCAAAACTGTAAAATTAACGACTCAACAATTCATCTTTACACCAATCTCAATGGAAAACAAAAACAAATTGA
CCTCTGTCAAAACTGCTATAAGATTATCAAAACAGATCCTAACAATAGCCTCTTCAAAGGTATGACGGATCTGAACAATC
GTGACTTCGATCCCTTTGGTGATTTCTTCAATGATCTAAACAATTTCAGACCTTCTAGCAATACTCCTCCTATTCCCCCA
ACCCAATCAGGTGGAGGTTACGGTGGAAACGGCGGTTATGGTTCCCAAAATCGTGGATCTGCTCAAACTCCGCCACCTAG
CCAAGAAAAAGGCCTGCTGGAAGAATTTGGTATTAATGTAACTGAAATTGCCCGTCGTGGAGACATTGACCCCGTTATTG
GGCGCGACGATGAGATTATCCGTGTCATCGAGATTCTCAATCGTAGAACCAAGAATAATCCTGTCCTTATCGGTGAACCT
GGTGTCGGAAAAACGGCCGTTGTCGAAGGTCTAGCTCAGAAAATTGTCGATGGCGATGTGCCACATAAACTCCAAGGTAA
ACAAGTCATCCGTCTGGATGTGGTTAGCTTAGTTCAAGGAACGGGGATTCGAGGACAATTTGAAGAACGCATGCAAAAAC
TCATGGAAGAAATTCGCAAACGTGAAGACATCATCCTCTTTATCGATGAAATCCATGAAATTGTTGGTGCTGGTTCTGCG
AGTGATGGTAATATGGACGCAGGAAATATCCTCAAGCCAGCCCTTGCTCGTGGAGAACTGCAACTAGTCGGTGCTACTAC
CCTCAATGAATACCGTATCATTGAAAAGGATGCTGCCCTCGAGCGTCGTATGCAGCCTGTTAAAGTCGATGAACCAACGG
TGGATGAAACAATCACTATTCTCAAAGGGATTCAAAAGAAATACGAAGATTACCACCACGTTCAATATACCGATGCTGCG
ATTGAAGCAGCTGCAACTCTTTCCAATCGCTACATCCAAGATCGCTTCTTTCCTGACAAGGCCATTGACCTCCTAGATGA
AGCTGGTTCTAAGATGAACTTGACCTTGAATTTTGTGGATCCTAAAGTAATTGATCAGCGCTTGATTGAGGCTGAAAATC
TCAAGTCTCAAGCTACACGAGAAGAAGATTTTGAGAAGGCGGCCTACTTCCGCGACCAGATTGCCAAGTATAAGGAAATG
CAAAAGAAAAAGATCACAGACCAGGATACTCCTATCATCAGCGAGAAAACTATTGAGCACATTATCGAGCAGAAAACCAA
TATCCCTGTTGGTGATTTGAAAGAGAAAGAACAATCTCAACTCATCCATCTAGCCGAAGATCTCAAGTCTCATGTTATTG
GCCAAGATGATGCAGTCGATAAGATTGCCAAGGCTATTCGCCGTAATCGTGTCGGACTTGGTACCCCTAACCGCCCAATC
GGAAGCTTCCTCTTCGTTGGGCCAACTGGTGTCGGTAAGACAGAACTTTCCAAACAACTGGCTATCGAACTTTTTGGTTC
TGCTGATAGTATGATTCGCTTTGATATGAGTGAATACATGGAAAAACATAGTGTAGCTAAGTTGGTCGGCGCTCCTCCAG
GTTATGTTGGCTATGATGAGGCTGGTCAATTAACTGAAAAAGTTCGCCACAATCCATATTCTCTCATCCTTCTCGATGAA
GTGGAAAAAGCTCACCCAGATGTTATGCACATGTTTCTTCAAGTCTTGGACGATGGTCGTTTGACAGACGGGCAAGGACG
CACCGTTAGCTTCAAGGATGCCATCATTATCATGACCTCAAATGCAGGTACAGGAAAGACCGAAGCTAGCGTTGGATTTG
GTGCTGCTAGAGAAGGACGTACCAATTCTGTCCTCGGTGAACTCGGTAACTTCTTTAGCCCAGAGTTTATGAACCGTTTT
GATGGCATTATCGAATTTAAGGCTCTCAGCAAGGATAACCTCCTTCAGATTGTCGAGCTCATGCTAGCAGATGTTAACAA
GCGCCTCTCTAGCAACAACATTCGTTTGGATGTAACTGACAAGGTCAAGGAAAAGTTGGTTGACCTAGGTTATGATCCAA
AAATGGGAGCACGCCCACTTCGTCGGACTATTCAAGACTATATTGAGGACACAATCACTGACTACTACCTTGAAAATCCA
AGCGAAAAAGATCTCAAAGCAGTTATGACTAGCAAGGGAAACATTCAGATTAAATCTGCCAAAAAAGCTGAAGTTAAAAG
TTCTGAAAAAGAAAAATAA
ATGCTTTGTCAAAACTGTAAAATTAACGACTCAACAATTCATCTTTACACCAATCTCAATGGAAAACAAAAACAAATTGA
CCTCTGTCAAAACTGCTATAAGATTATCAAAACAGATCCTAACAATAGCCTCTTCAAAGGTATGACGGATCTGAACAATC
GTGACTTCGATCCCTTTGGTGATTTCTTCAATGATCTAAACAATTTCAGACCTTCTAGCAATACTCCTCCTATTCCCCCA
ACCCAATCAGGTGGAGGTTACGGTGGAAACGGCGGTTATGGTTCCCAAAATCGTGGATCTGCTCAAACTCCGCCACCTAG
CCAAGAAAAAGGCCTGCTGGAAGAATTTGGTATTAATGTAACTGAAATTGCCCGTCGTGGAGACATTGACCCCGTTATTG
GGCGCGACGATGAGATTATCCGTGTCATCGAGATTCTCAATCGTAGAACCAAGAATAATCCTGTCCTTATCGGTGAACCT
GGTGTCGGAAAAACGGCCGTTGTCGAAGGTCTAGCTCAGAAAATTGTCGATGGCGATGTGCCACATAAACTCCAAGGTAA
ACAAGTCATCCGTCTGGATGTGGTTAGCTTAGTTCAAGGAACGGGGATTCGAGGACAATTTGAAGAACGCATGCAAAAAC
TCATGGAAGAAATTCGCAAACGTGAAGACATCATCCTCTTTATCGATGAAATCCATGAAATTGTTGGTGCTGGTTCTGCG
AGTGATGGTAATATGGACGCAGGAAATATCCTCAAGCCAGCCCTTGCTCGTGGAGAACTGCAACTAGTCGGTGCTACTAC
CCTCAATGAATACCGTATCATTGAAAAGGATGCTGCCCTCGAGCGTCGTATGCAGCCTGTTAAAGTCGATGAACCAACGG
TGGATGAAACAATCACTATTCTCAAAGGGATTCAAAAGAAATACGAAGATTACCACCACGTTCAATATACCGATGCTGCG
ATTGAAGCAGCTGCAACTCTTTCCAATCGCTACATCCAAGATCGCTTCTTTCCTGACAAGGCCATTGACCTCCTAGATGA
AGCTGGTTCTAAGATGAACTTGACCTTGAATTTTGTGGATCCTAAAGTAATTGATCAGCGCTTGATTGAGGCTGAAAATC
TCAAGTCTCAAGCTACACGAGAAGAAGATTTTGAGAAGGCGGCCTACTTCCGCGACCAGATTGCCAAGTATAAGGAAATG
CAAAAGAAAAAGATCACAGACCAGGATACTCCTATCATCAGCGAGAAAACTATTGAGCACATTATCGAGCAGAAAACCAA
TATCCCTGTTGGTGATTTGAAAGAGAAAGAACAATCTCAACTCATCCATCTAGCCGAAGATCTCAAGTCTCATGTTATTG
GCCAAGATGATGCAGTCGATAAGATTGCCAAGGCTATTCGCCGTAATCGTGTCGGACTTGGTACCCCTAACCGCCCAATC
GGAAGCTTCCTCTTCGTTGGGCCAACTGGTGTCGGTAAGACAGAACTTTCCAAACAACTGGCTATCGAACTTTTTGGTTC
TGCTGATAGTATGATTCGCTTTGATATGAGTGAATACATGGAAAAACATAGTGTAGCTAAGTTGGTCGGCGCTCCTCCAG
GTTATGTTGGCTATGATGAGGCTGGTCAATTAACTGAAAAAGTTCGCCACAATCCATATTCTCTCATCCTTCTCGATGAA
GTGGAAAAAGCTCACCCAGATGTTATGCACATGTTTCTTCAAGTCTTGGACGATGGTCGTTTGACAGACGGGCAAGGACG
CACCGTTAGCTTCAAGGATGCCATCATTATCATGACCTCAAATGCAGGTACAGGAAAGACCGAAGCTAGCGTTGGATTTG
GTGCTGCTAGAGAAGGACGTACCAATTCTGTCCTCGGTGAACTCGGTAACTTCTTTAGCCCAGAGTTTATGAACCGTTTT
GATGGCATTATCGAATTTAAGGCTCTCAGCAAGGATAACCTCCTTCAGATTGTCGAGCTCATGCTAGCAGATGTTAACAA
GCGCCTCTCTAGCAACAACATTCGTTTGGATGTAACTGACAAGGTCAAGGAAAAGTTGGTTGACCTAGGTTATGATCCAA
AAATGGGAGCACGCCCACTTCGTCGGACTATTCAAGACTATATTGAGGACACAATCACTGACTACTACCTTGAAAATCCA
AGCGAAAAAGATCTCAAAGCAGTTATGACTAGCAAGGGAAACATTCAGATTAAATCTGCCAAAAAAGCTGAAGTTAAAAG
TTCTGAAAAAGAAAAATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| clpE | Streptococcus pneumoniae Rx1 |
99.867 |
100 |
0.999 |
| clpE | Streptococcus pneumoniae D39 |
99.867 |
100 |
0.999 |
| clpE | Streptococcus pneumoniae R6 |
99.867 |
100 |
0.999 |
| clpE | Streptococcus pneumoniae TIGR4 |
99.734 |
100 |
0.997 |
| clpE | Streptococcus mutans UA159 |
82.597 |
99.335 |
0.82 |
| clpC | Lactococcus lactis subsp. cremoris KW2 |
76.703 |
97.606 |
0.749 |
| clpC | Bacillus subtilis subsp. subtilis str. 168 |
52.154 |
86.436 |
0.451 |
| clpC | Lactococcus lactis subsp. lactis strain DGCC12653 |
47.528 |
83.378 |
0.396 |
| clpC | Streptococcus pneumoniae TIGR4 |
46.751 |
83.91 |
0.392 |
| clpC | Streptococcus pneumoniae Rx1 |
46.751 |
83.91 |
0.392 |
| clpC | Streptococcus pneumoniae D39 |
46.751 |
83.91 |
0.392 |
| clpC | Streptococcus thermophilus LMD-9 |
46.635 |
82.979 |
0.387 |
| clpC | Streptococcus mutans UA159 |
46.411 |
83.378 |
0.387 |
| clpC | Streptococcus thermophilus LMG 18311 |
46.24 |
83.112 |
0.384 |