Detailed information
Overview
| Name | clpC | Type | Regulator |
| Locus tag | H1W98_RS00525 | Genome accession | NZ_LR822026 |
| Coordinates | 87253..89703 (+) | Length | 816 a.a. |
| NCBI ID | WP_179973245.1 | Uniprot ID | - |
| Organism | Streptococcus thermophilus isolate STH_CIRM_967 | | |
| Function | Degradation of ComX (predicted from homology); competence regulation | | |
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| IS/Tn | 89783..91039 | 87253..89703 | flank | 80 |
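The distance in the summary table can be reproduced from the two coordinate pairs. The sketch below is a minimal illustration, not VRprofile2's own method: it assumes 1-based inclusive coordinates and that the reported distance is the start of the downstream feature minus the end of the upstream one. The helper name `gap_between` is hypothetical.

```python
from typing import Tuple

# 1-based, inclusive (start, end) coordinates, as listed in the table above.
Interval = Tuple[int, int]


def gap_between(a: Interval, b: Interval) -> int:
    """Return downstream start minus upstream end, or 0 if the features overlap.

    Assumption: this mirrors how the "Distance (bp)" column is derived;
    the page itself does not state the formula.
    """
    # Sort by start coordinate so the order of the arguments does not matter.
    (_, upstream_end), (downstream_start, _) = sorted([a, b])
    return max(0, downstream_start - upstream_end)


gene = (87253, 89703)   # clpC
mge = (89783, 91039)    # IS/Tn element

print(gap_between(gene, mge))  # 80
```

Under this convention the value is a coordinate difference; the number of bases strictly between the two features would be one less.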
Gene organization within MGE regions
Location: 87253..91039
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| H1W98_RS00525 (STHERMO_0089) | clpC | 87253..89703 (+) | 2451 | WP_179973245.1 | ATP-dependent Clp protease ATP-binding subunit | Regulator |
| H1W98_RS00530 (STHERMO_0090) | - | 89790..91039 (-) | 1250 | Protein_67 | ISL3 family transposase | - |
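The "Size (bp)" column and the protein length given in the overview follow from the coordinates by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names `span_length` and `protein_length` are hypothetical and only serve as a consistency check on the values shown here.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino-acid count for a CDS of cds_bp bases that includes one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError(f"CDS length {cds_bp} is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_bp // 3 - 1


clpc_bp = span_length(87253, 89703)
print(clpc_bp)                     # 2451
print(protein_length(clpc_bp))     # 816
print(span_length(89790, 91039))   # 1250
```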
Sequence
Protein
Length: 816 a.a.; molecular weight: 90442.61 Da; isoelectric point: 5.6117
>NTDB_id=1131253 H1W98_RS00525 WP_179973245.1 87253..89703(+) (clpC) [Streptococcus thermophilus isolate STH_CIRM_967]
MTIYSRKMQAIFHRAQLEAERFESPFLETWHVLLAMVEVPGSVAYLTFTDFEDRIHSEEIETAAVLAMEKRPKDLSESDI
IDLRAQSPALEAMLQEAQGIASVTGAVEVGSEHVLMAFLLHKDLMVCRLLEVAGFQYKDDSDKPRIIDLRRSLERNAGLS
KQDLKAIHDLRKPKKSKASGNFANMMQPPQSSTGELADYTKDLTALAESGNLDPVIGRDEEISRMIQVLSRKTKNNPVLV
GEAGVGKTALALGLAQRIASGEVPFELADMRILELDMMSVVAGTRFRGDFEERMNQIIDEIEADGKIILFIDELHTIIGS
GSGIDSTLDAANILKPALARGTLHMVGATTQAEYQKHIEKDAALSRRFAKITIEEPSVSEAIDILNGLRSSYEDYHRVTI
TDEAVETAVKAAHRYLTSKNLPDSAIDLLDEASATVQVRIKKEAKREITPLDEALLSGDIRAAVEQYKANQKAKFPKPAL
VDADQIMQTLSRLSGIPVEKMTQTDSKRYLNLESELHKRVIGQDEAVSVISRAIRRNQSGIRTGKRPIGSFMFLGPTGVG
KTELAKALAEVLFDDESALLRFDMSEYMEKFAASRLNGAPPGYVGYDEGGELTEKVRNKPYSVLLFDEIEKAHPDIFNVL
LQVLDDGVLTDSRGRKVDFSNTIIIMTSNLGATALRDDKTVGFGAQTISHNHQAMQARIMEELKKSYRPEFINRIDEKVV
FHSLEEEQLHDIVKIMVKPLISALADKGISLKFQPAALKHLAKDGYDIEMGARPLRRTIQTQVEDKLSELLLGGQVVSGQ
TLKIGCSKDKLTFTVV
Nucleotide
Length: 2451 bp
>NTDB_id=1131253 H1W98_RS00525 WP_179973245.1 87253..89703(+) (clpC) [Streptococcus thermophilus isolate STH_CIRM_967]
ATGACGATATATTCAAGAAAAATGCAGGCCATTTTCCATCGTGCTCAGCTTGAAGCGGAGCGTTTTGAAAGTCCTTTCTT
GGAGACTTGGCATGTGCTTCTGGCTATGGTTGAGGTTCCGGGATCTGTAGCCTACTTAACATTTACTGATTTTGAGGACC
GTATTCATTCAGAAGAGATTGAGACAGCTGCTGTATTGGCTATGGAGAAGAGGCCAAAAGACTTGTCGGAATCAGATATT
ATCGATTTACGTGCACAGTCACCTGCGCTAGAGGCTATGTTGCAAGAGGCGCAAGGAATCGCTAGTGTGACTGGTGCTGT
AGAGGTAGGGTCTGAACATGTATTGATGGCCTTCCTTCTTCATAAGGATTTAATGGTTTGTCGCCTCCTTGAAGTGGCTG
GTTTTCAATATAAAGATGATAGCGATAAACCTCGCATCATAGATTTACGACGTTCTTTGGAGCGTAATGCTGGTCTTAGC
AAGCAAGATTTGAAGGCAATTCACGATCTTCGTAAACCTAAGAAATCAAAAGCATCTGGGAATTTTGCCAATATGATGCA
ACCTCCTCAATCATCTACTGGTGAACTGGCAGATTATACCAAAGACTTAACTGCATTAGCGGAGTCAGGAAATCTTGATC
CCGTTATTGGACGCGATGAAGAAATTTCACGCATGATCCAGGTTTTGAGTCGTAAAACGAAGAATAATCCCGTCTTGGTA
GGTGAAGCTGGTGTCGGTAAGACAGCACTTGCTCTTGGTTTAGCGCAACGTATTGCTTCAGGCGAAGTACCATTTGAATT
GGCTGATATGCGTATCTTAGAGCTTGACATGATGAGCGTTGTTGCAGGGACACGTTTCCGTGGTGATTTCGAAGAGCGTA
TGAATCAGATTATTGATGAGATTGAAGCTGATGGGAAAATCATTCTCTTTATTGACGAACTACATACGATTATTGGATCT
GGTTCAGGTATTGATAGTACCTTGGACGCGGCTAATATTTTGAAACCGGCCCTTGCGCGCGGGACACTTCACATGGTTGG
AGCAACCACGCAAGCTGAATACCAAAAGCATATTGAGAAAGATGCGGCTTTATCCCGTCGTTTTGCTAAAATTACAATTG
AAGAACCAAGTGTATCTGAAGCAATCGATATTTTAAACGGTTTGCGTTCGTCTTATGAAGACTATCATCGTGTGACAATT
ACGGACGAGGCAGTTGAGACGGCAGTCAAGGCAGCGCATCGCTATTTGACGAGTAAGAATTTGCCTGATTCGGCAATTGA
TCTTTTAGATGAAGCGAGTGCAACTGTTCAAGTTCGTATCAAAAAAGAGGCCAAACGTGAGATAACACCTTTGGATGAAG
CACTTCTATCTGGGGATATTAGGGCTGCTGTTGAACAGTACAAGGCTAATCAAAAGGCAAAATTTCCTAAACCTGCTTTG
GTAGATGCGGATCAGATTATGCAAACTCTTAGTCGTTTATCAGGTATCCCTGTTGAGAAGATGACGCAGACTGACAGCAA
GCGTTACCTGAATCTTGAATCAGAACTCCACAAACGTGTTATTGGTCAAGATGAGGCAGTTTCGGTTATCAGCCGTGCTA
TTCGTCGTAATCAGTCAGGTATTCGTACTGGAAAACGTCCTATTGGCTCCTTCATGTTCCTTGGACCTACTGGTGTTGGT
AAGACAGAATTGGCCAAGGCTTTGGCGGAAGTTCTCTTTGATGATGAATCAGCTTTGCTTCGCTTTGATATGTCGGAATA
TATGGAAAAATTTGCGGCTAGTCGCCTTAATGGTGCTCCTCCAGGGTATGTCGGATATGATGAGGGTGGAGAGCTGACAG
AGAAAGTTCGAAATAAGCCCTACTCAGTTCTTCTCTTTGACGAGATTGAGAAAGCTCACCCAGATATCTTCAACGTTCTC
TTACAGGTTTTGGATGACGGTGTTTTAACAGATAGCCGTGGTCGTAAGGTTGATTTTTCAAACACTATCATCATTATGAC
CTCAAATTTGGGAGCTACAGCTCTTCGTGATGATAAAACTGTTGGTTTTGGTGCTCAAACTATTTCTCATAATCACCAAG
CCATGCAAGCACGCATTATGGAAGAGCTTAAGAAGTCCTATCGTCCAGAATTTATTAACCGTATTGATGAGAAGGTTGTC
TTCCACAGCTTAGAGGAAGAACAACTACATGACATTGTCAAGATTATGGTTAAACCATTAATTTCAGCTCTAGCCGATAA
AGGTATAAGCTTAAAATTCCAACCAGCTGCTCTTAAGCATTTGGCTAAGGATGGCTATGATATTGAGATGGGAGCTCGTC
CATTACGTCGTACGATTCAAACTCAAGTGGAGGACAAGTTGTCTGAGTTATTACTAGGTGGCCAAGTTGTTAGCGGACAG
ACCCTTAAGATTGGTTGCTCGAAAGATAAATTAACCTTTACAGTAGTGTAA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| clpC | Streptococcus thermophilus LMG 18311 | 99.142 | 100 | 0.991 |
| clpC | Streptococcus thermophilus LMD-9 | 98.775 | 100 | 0.988 |
| clpC | Streptococcus mutans UA159 | 72.304 | 100 | 0.723 |
| clpC | Streptococcus pneumoniae TIGR4 | 66.216 | 99.755 | 0.661 |
| clpC | Streptococcus pneumoniae Rx1 | 66.216 | 99.755 | 0.661 |
| clpC | Streptococcus pneumoniae D39 | 66.216 | 99.755 | 0.661 |
| clpC | Lactococcus lactis subsp. lactis strain DGCC12653 | 50.06 | 100 | 0.51 |
| clpC | Bacillus subtilis subsp. subtilis str. 168 | 47.138 | 100 | 0.474 |
| clpE | Streptococcus mutans UA159 | 47.604 | 76.716 | 0.365 |