Detailed information
Overview
| Name | clpE | Type | Regulator |
| Locus tag | GOM46_RS05940 | Genome accession | NZ_CP046525 |
| Coordinates | 1180475..1182724 (+) | Length | 749 a.a. |
| NCBI ID | WP_049530939.1 | Uniprot ID | - |
| Organism | Streptococcus infantis strain SO | ||
| Function | degradation of ComX (predicted from homology) Competence regulation |
||
Genomic Context
Location: 1175475..1187724
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GOM46_RS05910 (GOM46_05900) | - | 1176220..1176405 (+) | 186 | WP_006149832.1 | hypothetical protein | - |
| GOM46_RS05915 (GOM46_05905) | - | 1176429..1177277 (-) | 849 | WP_006149957.1 | NAD(P)H-hydrate dehydratase | - |
| GOM46_RS05920 (GOM46_05910) | - | 1177394..1178251 (-) | 858 | WP_006149835.1 | bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase | - |
| GOM46_RS05925 (GOM46_05915) | - | 1178389..1179123 (-) | 735 | WP_006145885.1 | amino acid ABC transporter ATP-binding protein | - |
| GOM46_RS05930 (GOM46_05920) | - | 1179123..1179806 (-) | 684 | WP_006149889.1 | amino acid ABC transporter permease | - |
| GOM46_RS05935 (GOM46_05925) | - | 1180005..1180235 (+) | 231 | WP_004253825.1 | DUF1797 family protein | - |
| GOM46_RS05940 (GOM46_05930) | clpE | 1180475..1182724 (+) | 2250 | WP_049530939.1 | ATP-dependent Clp protease ATP-binding subunit | Regulator |
| GOM46_RS05945 (GOM46_05935) | - | 1182814..1183263 (+) | 450 | WP_235083329.1 | NUDIX hydrolase | - |
| GOM46_RS05950 (GOM46_05940) | - | 1183327..1183626 (+) | 300 | WP_006149955.1 | DUF1827 family protein | - |
| GOM46_RS05955 (GOM46_05945) | - | 1183815..1184447 (+) | 633 | WP_006149873.1 | GNAT family N-acetyltransferase | - |
| GOM46_RS05960 (GOM46_05950) | metK | 1184493..1185683 (-) | 1191 | WP_006149959.1 | methionine adenosyltransferase | - |
| GOM46_RS05965 (GOM46_05955) | - | 1185922..1186620 (-) | 699 | WP_006149854.1 | GntR family transcriptional regulator | - |
Sequence
Protein
Download Length: 749 a.a. Molecular weight: 83393.62 Da Isoelectric Point: 5.1677
>NTDB_id=405078 GOM46_RS05940 WP_049530939.1 1180475..1182724(+) (clpE) [Streptococcus infantis strain SO]
MLCQNCKINDSTIHLYTNLNGQQKQIDLCQNCYKIIKTDPNNSLFKGITDLNNRDFDPFGDFFNDLNNFRPSSNSNTPPT
QSGGGYGGNGGYGSQNRGPAQTPPPSQEKGLLDEYGINVTEIARRGNIDPVIGRDEEIIRVIEILNRRTKNNPVLIGEPG
VGKTAVVEGLAQKIVDGDVPHKLQGKEVIRLDVVSLVQGTGIRGQFEERMQKLMEEIRKRKDVILFIDEIHEIVGAGSAG
EGNMDAGNILKPALARGELQLVGATTLNEYRIIEKDAALERRMQPVKVDEPTVEETIIILKGIQKKYEDYHHVHYTDGAI
EAAATLSNRYIQDRFLPDKAIDLLDEAGSKMNLTLNFVDPKVIDQRLIEAENLKAQATRDEDFEKAAYFRDQIAKYKEMQ
KTKVTDQDTPIISEKTIEHIIEQKTNIPVGDLKEKEQSQLINLADDLKAHVIGQDDAVDKIAKAIRRNRVGLGTPNRPIG
SFLFVGPTGVGKTELSKQLAIELFGSADSMIRFDMSEYMEKHSVAKLVGAPPGYVGYDEAGQLTEKVRRNPYSLILLDEV
EKAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDAIIIMTSNAGTGKAEASVGFGAAREGRTNSVLGELGNFFSPEFMNRFD
GIIEFKALSKENLLQIVDLMLDDVNKRLSSNNIHLDVTEKVKEKLVDLGYDPKMGARPLRRTIQDYIEDAITDYYLENPS
EKDLKAVMTSKGKIVIKSKNKTETVESND
MLCQNCKINDSTIHLYTNLNGQQKQIDLCQNCYKIIKTDPNNSLFKGITDLNNRDFDPFGDFFNDLNNFRPSSNSNTPPT
QSGGGYGGNGGYGSQNRGPAQTPPPSQEKGLLDEYGINVTEIARRGNIDPVIGRDEEIIRVIEILNRRTKNNPVLIGEPG
VGKTAVVEGLAQKIVDGDVPHKLQGKEVIRLDVVSLVQGTGIRGQFEERMQKLMEEIRKRKDVILFIDEIHEIVGAGSAG
EGNMDAGNILKPALARGELQLVGATTLNEYRIIEKDAALERRMQPVKVDEPTVEETIIILKGIQKKYEDYHHVHYTDGAI
EAAATLSNRYIQDRFLPDKAIDLLDEAGSKMNLTLNFVDPKVIDQRLIEAENLKAQATRDEDFEKAAYFRDQIAKYKEMQ
KTKVTDQDTPIISEKTIEHIIEQKTNIPVGDLKEKEQSQLINLADDLKAHVIGQDDAVDKIAKAIRRNRVGLGTPNRPIG
SFLFVGPTGVGKTELSKQLAIELFGSADSMIRFDMSEYMEKHSVAKLVGAPPGYVGYDEAGQLTEKVRRNPYSLILLDEV
EKAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDAIIIMTSNAGTGKAEASVGFGAAREGRTNSVLGELGNFFSPEFMNRFD
GIIEFKALSKENLLQIVDLMLDDVNKRLSSNNIHLDVTEKVKEKLVDLGYDPKMGARPLRRTIQDYIEDAITDYYLENPS
EKDLKAVMTSKGKIVIKSKNKTETVESND
Nucleotide
Download Length: 2250 bp
>NTDB_id=405078 GOM46_RS05940 WP_049530939.1 1180475..1182724(+) (clpE) [Streptococcus infantis strain SO]
ATGCTCTGTCAAAATTGTAAAATCAATGACTCAACAATCCATCTTTACACTAATCTGAATGGGCAACAAAAGCAAATCGA
TCTTTGCCAAAATTGCTACAAAATTATCAAAACAGATCCTAACAATAGCTTATTTAAAGGAATTACAGATTTAAACAATC
GTGATTTCGATCCATTTGGAGATTTCTTTAACGACCTGAACAACTTTAGACCTTCTTCTAATAGCAATACTCCTCCTACC
CAGTCAGGTGGTGGATACGGCGGTAATGGTGGCTATGGTTCCCAAAATCGTGGACCGGCTCAAACTCCTCCTCCTAGTCA
GGAAAAAGGACTCTTGGATGAATATGGTATCAATGTTACTGAGATTGCCCGTCGTGGAAATATCGACCCTGTGATTGGTC
GTGACGAAGAAATCATCCGCGTTATTGAAATTCTCAACCGTAGAACCAAAAACAACCCTGTCCTTATCGGTGAGCCTGGT
GTTGGTAAGACAGCCGTTGTCGAGGGCTTAGCTCAGAAAATTGTTGATGGTGATGTTCCCCATAAACTTCAAGGGAAAGA
AGTCATTCGCCTAGATGTTGTCAGCCTTGTCCAAGGTACAGGAATTCGTGGACAGTTCGAAGAACGCATGCAAAAACTCA
TGGAAGAAATTCGTAAGCGTAAAGATGTCATCCTCTTCATTGACGAAATTCACGAAATCGTCGGTGCTGGTTCTGCTGGA
GAAGGAAATATGGATGCAGGAAATATCCTCAAACCAGCTCTTGCTCGCGGCGAACTCCAACTAGTTGGTGCTACTACCCT
CAATGAATATCGCATCATCGAAAAAGATGCAGCCCTTGAGCGCCGTATGCAACCTGTCAAAGTTGATGAACCAACTGTGG
AAGAGACTATTATTATTCTTAAAGGTATTCAAAAGAAATACGAAGATTACCACCACGTTCATTATACTGACGGAGCTATC
GAAGCAGCTGCTACTCTCTCTAACCGCTATATCCAAGATCGTTTCTTGCCGGATAAGGCCATTGACCTCCTAGATGAAGC
TGGTTCTAAAATGAACTTAACCTTAAATTTTGTGGATCCAAAGGTCATCGACCAACGTTTAATCGAAGCAGAAAATCTCA
AGGCACAAGCAACCCGCGATGAAGACTTTGAAAAAGCAGCCTACTTCCGTGACCAGATTGCTAAATATAAGGAAATGCAA
AAGACCAAGGTAACGGATCAGGATACCCCAATCATCAGTGAAAAAACAATCGAACACATCATCGAACAAAAGACCAATAT
CCCAGTTGGTGATTTGAAAGAAAAAGAACAGTCTCAACTGATCAACCTAGCAGATGACCTCAAGGCTCATGTTATTGGTC
AAGATGATGCTGTCGATAAGATTGCTAAGGCTATCCGTCGTAACCGTGTTGGGCTTGGAACACCAAACCGTCCAATTGGT
AGCTTCCTCTTTGTTGGTCCTACAGGTGTCGGTAAAACAGAACTCTCTAAACAACTGGCCATCGAACTCTTTGGTTCTGC
TGATAGCATGATTCGCTTTGATATGAGTGAATACATGGAAAAACACAGCGTTGCTAAACTCGTCGGTGCTCCTCCAGGAT
ACGTCGGCTATGATGAGGCTGGACAATTGACTGAAAAGGTTCGTCGTAACCCCTACTCACTGATCCTCTTGGATGAAGTT
GAAAAAGCCCATCCAGATGTTATGCATATGTTCCTTCAAGTCTTGGATGATGGTCGTTTAACAGATGGTCAAGGTCGGAC
TGTTAGCTTTAAGGATGCCATCATTATCATGACCTCAAATGCAGGTACAGGTAAGGCAGAAGCTAGCGTTGGTTTTGGTG
CTGCTAGAGAAGGTCGTACCAACTCTGTTCTTGGAGAATTAGGCAACTTCTTTAGTCCAGAATTCATGAACCGTTTTGAC
GGCATTATCGAATTCAAGGCTCTCAGCAAGGAAAATCTCCTTCAAATCGTTGACCTCATGCTAGATGATGTGAATAAGCG
ACTTTCCAGCAATAATATCCATCTCGATGTCACAGAAAAGGTCAAAGAAAAACTTGTAGACCTTGGTTACGATCCAAAAA
TGGGAGCTCGTCCACTCCGTCGTACCATCCAAGACTATATCGAAGATGCCATTACAGACTACTACCTTGAAAATCCTAGT
GAAAAAGACCTCAAGGCAGTCATGACAAGTAAAGGTAAAATCGTGATTAAGTCCAAAAATAAAACGGAAACGGTCGAGTC
AAACGACTAG
ATGCTCTGTCAAAATTGTAAAATCAATGACTCAACAATCCATCTTTACACTAATCTGAATGGGCAACAAAAGCAAATCGA
TCTTTGCCAAAATTGCTACAAAATTATCAAAACAGATCCTAACAATAGCTTATTTAAAGGAATTACAGATTTAAACAATC
GTGATTTCGATCCATTTGGAGATTTCTTTAACGACCTGAACAACTTTAGACCTTCTTCTAATAGCAATACTCCTCCTACC
CAGTCAGGTGGTGGATACGGCGGTAATGGTGGCTATGGTTCCCAAAATCGTGGACCGGCTCAAACTCCTCCTCCTAGTCA
GGAAAAAGGACTCTTGGATGAATATGGTATCAATGTTACTGAGATTGCCCGTCGTGGAAATATCGACCCTGTGATTGGTC
GTGACGAAGAAATCATCCGCGTTATTGAAATTCTCAACCGTAGAACCAAAAACAACCCTGTCCTTATCGGTGAGCCTGGT
GTTGGTAAGACAGCCGTTGTCGAGGGCTTAGCTCAGAAAATTGTTGATGGTGATGTTCCCCATAAACTTCAAGGGAAAGA
AGTCATTCGCCTAGATGTTGTCAGCCTTGTCCAAGGTACAGGAATTCGTGGACAGTTCGAAGAACGCATGCAAAAACTCA
TGGAAGAAATTCGTAAGCGTAAAGATGTCATCCTCTTCATTGACGAAATTCACGAAATCGTCGGTGCTGGTTCTGCTGGA
GAAGGAAATATGGATGCAGGAAATATCCTCAAACCAGCTCTTGCTCGCGGCGAACTCCAACTAGTTGGTGCTACTACCCT
CAATGAATATCGCATCATCGAAAAAGATGCAGCCCTTGAGCGCCGTATGCAACCTGTCAAAGTTGATGAACCAACTGTGG
AAGAGACTATTATTATTCTTAAAGGTATTCAAAAGAAATACGAAGATTACCACCACGTTCATTATACTGACGGAGCTATC
GAAGCAGCTGCTACTCTCTCTAACCGCTATATCCAAGATCGTTTCTTGCCGGATAAGGCCATTGACCTCCTAGATGAAGC
TGGTTCTAAAATGAACTTAACCTTAAATTTTGTGGATCCAAAGGTCATCGACCAACGTTTAATCGAAGCAGAAAATCTCA
AGGCACAAGCAACCCGCGATGAAGACTTTGAAAAAGCAGCCTACTTCCGTGACCAGATTGCTAAATATAAGGAAATGCAA
AAGACCAAGGTAACGGATCAGGATACCCCAATCATCAGTGAAAAAACAATCGAACACATCATCGAACAAAAGACCAATAT
CCCAGTTGGTGATTTGAAAGAAAAAGAACAGTCTCAACTGATCAACCTAGCAGATGACCTCAAGGCTCATGTTATTGGTC
AAGATGATGCTGTCGATAAGATTGCTAAGGCTATCCGTCGTAACCGTGTTGGGCTTGGAACACCAAACCGTCCAATTGGT
AGCTTCCTCTTTGTTGGTCCTACAGGTGTCGGTAAAACAGAACTCTCTAAACAACTGGCCATCGAACTCTTTGGTTCTGC
TGATAGCATGATTCGCTTTGATATGAGTGAATACATGGAAAAACACAGCGTTGCTAAACTCGTCGGTGCTCCTCCAGGAT
ACGTCGGCTATGATGAGGCTGGACAATTGACTGAAAAGGTTCGTCGTAACCCCTACTCACTGATCCTCTTGGATGAAGTT
GAAAAAGCCCATCCAGATGTTATGCATATGTTCCTTCAAGTCTTGGATGATGGTCGTTTAACAGATGGTCAAGGTCGGAC
TGTTAGCTTTAAGGATGCCATCATTATCATGACCTCAAATGCAGGTACAGGTAAGGCAGAAGCTAGCGTTGGTTTTGGTG
CTGCTAGAGAAGGTCGTACCAACTCTGTTCTTGGAGAATTAGGCAACTTCTTTAGTCCAGAATTCATGAACCGTTTTGAC
GGCATTATCGAATTCAAGGCTCTCAGCAAGGAAAATCTCCTTCAAATCGTTGACCTCATGCTAGATGATGTGAATAAGCG
ACTTTCCAGCAATAATATCCATCTCGATGTCACAGAAAAGGTCAAAGAAAAACTTGTAGACCTTGGTTACGATCCAAAAA
TGGGAGCTCGTCCACTCCGTCGTACCATCCAAGACTATATCGAAGATGCCATTACAGACTACTACCTTGAAAATCCTAGT
GAAAAAGACCTCAAGGCAGTCATGACAAGTAAAGGTAAAATCGTGATTAAGTCCAAAAATAAAACGGAAACGGTCGAGTC
AAACGACTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| clpE | Streptococcus pneumoniae Rx1 |
94.251 |
99.866 |
0.941 |
| clpE | Streptococcus pneumoniae D39 |
94.251 |
99.866 |
0.941 |
| clpE | Streptococcus pneumoniae R6 |
94.251 |
99.866 |
0.941 |
| clpE | Streptococcus pneumoniae TIGR4 |
94.118 |
99.866 |
0.94 |
| clpE | Streptococcus mutans UA159 |
83.845 |
100 |
0.838 |
| clpC | Lactococcus lactis subsp. cremoris KW2 |
76.72 |
100 |
0.774 |
| clpC | Bacillus subtilis subsp. subtilis str. 168 |
50.718 |
92.924 |
0.471 |
| clpC | Lactococcus lactis subsp. lactis strain DGCC12653 |
48.166 |
83.712 |
0.403 |
| clpC | Streptococcus pneumoniae TIGR4 |
47.134 |
83.845 |
0.395 |
| clpC | Streptococcus pneumoniae Rx1 |
47.134 |
83.845 |
0.395 |
| clpC | Streptococcus pneumoniae D39 |
47.134 |
83.845 |
0.395 |
| clpC | Streptococcus thermophilus LMD-9 |
46.955 |
83.311 |
0.391 |
| clpC | Streptococcus thermophilus LMG 18311 |
46.635 |
83.311 |
0.389 |
| clpC | Streptococcus mutans UA159 |
46.411 |
83.712 |
0.389 |