Detailed information
Overview
| Name | clpC | Type | Regulator |
| Locus tag | MGCS36044_RS04480 | Genome accession | NZ_CP117287 |
| Coordinates | 873695..875794 (-) | Length | 699 a.a. |
| NCBI ID | WP_084916912.1 | Uniprot ID | - |
| Organism | Streptococcus dysgalactiae subsp. equisimilis strain MGCS36044 | ||
| Function | degradation of ComX (predicted from homology) Competence regulation |
||
Genomic Context
Location: 868695..880794
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| MGCS36044_RS04450 (MGCS36044_01786) | - | 869192..870031 (+) | 840 | WP_003054196.1 | thymidylate synthase | - |
| MGCS36044_RS04455 (MGCS36044_01788) | - | 870112..870609 (+) | 498 | WP_084916910.1 | dihydrofolate reductase | - |
| MGCS36044_RS04460 (MGCS36044_01790) | - | 870629..870799 (+) | 171 | WP_003054200.1 | hypothetical protein | - |
| MGCS36044_RS04465 (MGCS36044_01792) | clpX | 870913..872142 (+) | 1230 | WP_022554539.1 | ATP-dependent Clp protease ATP-binding subunit ClpX | Regulator |
| MGCS36044_RS04470 (MGCS36044_01794) | yihA | 872152..872751 (+) | 600 | WP_003061604.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| MGCS36044_RS04475 (MGCS36044_01796) | - | 872895..873638 (+) | 744 | WP_065358637.1 | hypothetical protein | - |
| MGCS36044_RS04480 (MGCS36044_01798) | clpC | 873695..875794 (-) | 2100 | WP_084916912.1 | AAA family ATPase | Regulator |
| MGCS36044_RS04485 | - | 876292..876681 (-) | 390 | Protein_825 | IS30 family transposase | - |
| MGCS36044_RS04490 | - | 876693..876911 (+) | 219 | Protein_826 | IS30 family transposase | - |
| MGCS36044_RS04495 (MGCS36044_01802) | rplJ | 877198..877698 (+) | 501 | WP_003050354.1 | 50S ribosomal protein L10 | - |
| MGCS36044_RS04500 (MGCS36044_01804) | rplL | 877763..878128 (+) | 366 | WP_003043797.1 | 50S ribosomal protein L7/L12 | - |
| MGCS36044_RS04505 (MGCS36044_01808) | - | 878327..878500 (+) | 174 | WP_001168336.1 | hypothetical protein | - |
| MGCS36044_RS04510 (MGCS36044_01810) | - | 878497..879315 (+) | 819 | WP_084916914.1 | replication initiator protein A | - |
Sequence
Protein
Download Length: 699 a.a. Molecular weight: 77425.08 Da Isoelectric Point: 4.8245
>NTDB_id=783330 MGCS36044_RS04480 WP_084916912.1 873695..875794(-) (clpC) [Streptococcus dysgalactiae subsp. equisimilis strain MGCS36044]
MTNYYDKDTFGNMDDIFNQLMGNMGGYRSENRRYLVNGREMTPEEFQAYRQTGRLSDKGSEQPTSQGHQTKKDGILAKLG
TNLTEEARQGKLDPVIGRNHEIQDTAEILARRTKNNPVLVGDAGVGKTAVVEGLAQAIVNGDVPAAIKNKEIISIDISGL
EAGTQYRGSFEENIQNMIQEVKEAGNIILFFDEIHQILGAGSTGGDSGSKGLADMLKPALSRGELTLIGATTQDEYRNTI
LKNAALARRFNEVKVNAPSAEDTFHILMGIRNLYEQHHNVILPDAVLKAAIDYSIQYIPQRSLPDKAIDLVDMTAAHLAA
QHPVTDLKSLEAEMTTQKNIQEEAVAKEDFERALTAKTRIEELQKQMDNHSEDQKVTATVNDIAESVERLTGVPVSNMGA
SDLERLKEISNRLKGHVIGQDGAVEAVARAIRRNRAGFDDGNRPIGSFLFVGPTGVGKTELAKQLALDLFGSKEAIIRLD
MSEYSDRTAVSKLIGTTAGYVGYDDNNNTLTERVRRNPYAIVLLDEIEKADPQVITLLLQVLDDGRLTDGQGNTINFKNT
VIIATSNAGFGYQEDENTDQPTIMDRIAPYFRPEFLNRFNGVIEFNHLQKEDLEEIVDLMLAEVNQTIAKKGISLAISDD
AKAHLIELGYDHAMGVRPLRRVIEQEIRDRITDFYLDHPEVKNLQASLEQGQLVIHDKN
MTNYYDKDTFGNMDDIFNQLMGNMGGYRSENRRYLVNGREMTPEEFQAYRQTGRLSDKGSEQPTSQGHQTKKDGILAKLG
TNLTEEARQGKLDPVIGRNHEIQDTAEILARRTKNNPVLVGDAGVGKTAVVEGLAQAIVNGDVPAAIKNKEIISIDISGL
EAGTQYRGSFEENIQNMIQEVKEAGNIILFFDEIHQILGAGSTGGDSGSKGLADMLKPALSRGELTLIGATTQDEYRNTI
LKNAALARRFNEVKVNAPSAEDTFHILMGIRNLYEQHHNVILPDAVLKAAIDYSIQYIPQRSLPDKAIDLVDMTAAHLAA
QHPVTDLKSLEAEMTTQKNIQEEAVAKEDFERALTAKTRIEELQKQMDNHSEDQKVTATVNDIAESVERLTGVPVSNMGA
SDLERLKEISNRLKGHVIGQDGAVEAVARAIRRNRAGFDDGNRPIGSFLFVGPTGVGKTELAKQLALDLFGSKEAIIRLD
MSEYSDRTAVSKLIGTTAGYVGYDDNNNTLTERVRRNPYAIVLLDEIEKADPQVITLLLQVLDDGRLTDGQGNTINFKNT
VIIATSNAGFGYQEDENTDQPTIMDRIAPYFRPEFLNRFNGVIEFNHLQKEDLEEIVDLMLAEVNQTIAKKGISLAISDD
AKAHLIELGYDHAMGVRPLRRVIEQEIRDRITDFYLDHPEVKNLQASLEQGQLVIHDKN
Nucleotide
Download Length: 2100 bp
>NTDB_id=783330 MGCS36044_RS04480 WP_084916912.1 873695..875794(-) (clpC) [Streptococcus dysgalactiae subsp. equisimilis strain MGCS36044]
ATGACAAATTATTATGACAAAGATACTTTTGGAAACATGGATGATATCTTTAATCAACTAATGGGCAATATGGGTGGCTA
TCGCAGTGAAAATCGACGCTATTTAGTGAACGGACGCGAAATGACACCTGAAGAATTCCAGGCATATCGTCAAACTGGAC
GTCTGAGCGATAAAGGTAGTGAACAACCTACATCCCAAGGTCATCAGACCAAAAAGGATGGGATTTTAGCTAAACTTGGC
ACCAACTTAACGGAAGAAGCAAGACAAGGAAAACTTGATCCTGTTATTGGTCGCAATCACGAAATTCAAGACACTGCGGA
AATTTTAGCTCGCCGAACAAAGAATAATCCTGTTCTAGTCGGAGATGCTGGTGTTGGTAAAACAGCTGTGGTTGAAGGGC
TAGCTCAAGCTATTGTGAATGGTGACGTCCCTGCTGCTATCAAAAATAAAGAAATCATTTCCATTGATATTTCTGGCCTT
GAAGCAGGCACTCAGTACCGTGGCTCCTTTGAAGAAAATATCCAAAACATGATTCAAGAGGTTAAAGAAGCTGGGAATAT
TATTCTTTTCTTTGATGAAATTCATCAAATCCTTGGTGCAGGTTCTACTGGTGGTGACTCTGGGTCTAAAGGATTAGCTG
ATATGCTAAAACCTGCTCTTTCCCGTGGTGAATTGACACTGATTGGCGCCACAACTCAAGATGAATACCGCAATACCATT
TTAAAAAATGCTGCGCTTGCTCGTCGTTTTAATGAGGTTAAGGTAAATGCACCGTCTGCTGAGGACACCTTCCATATTTT
AATGGGAATCCGAAATCTTTACGAACAACACCATAATGTTATCCTTCCGGATGCTGTTCTTAAAGCTGCGATTGACTATT
CTATTCAGTACATTCCTCAACGTAGTTTACCAGATAAAGCGATTGACTTAGTAGATATGACTGCTGCGCATTTAGCCGCT
CAGCATCCTGTGACTGACCTCAAGTCTCTAGAAGCTGAAATGACCACTCAAAAGAATATCCAAGAAGAGGCTGTGGCTAA
GGAAGATTTTGAGAGAGCTTTAACCGCTAAAACACGAATTGAAGAATTGCAAAAACAAATGGACAACCATAGCGAAGACC
AAAAGGTTACTGCTACTGTCAATGATATTGCAGAATCCGTAGAGCGCCTGACTGGTGTCCCAGTGTCCAATATGGGAGCT
AGTGATTTGGAAAGACTCAAAGAAATTAGCAACCGCTTAAAAGGACACGTGATTGGCCAAGATGGAGCTGTTGAAGCTGT
GGCTCGTGCTATTCGCCGTAATCGAGCTGGTTTTGATGATGGCAATCGTCCTATCGGTTCCTTCCTCTTTGTTGGACCAA
CAGGTGTCGGTAAGACAGAGTTAGCCAAACAGCTAGCACTGGACCTCTTTGGTTCCAAAGAGGCTATTATTCGACTTGAC
ATGTCCGAATACAGTGATCGTACTGCTGTGTCCAAGTTAATCGGGACAACTGCCGGTTATGTCGGTTATGATGATAATAA
CAATACGTTAACGGAACGAGTTCGTCGCAACCCGTATGCGATTGTTCTATTGGATGAGATTGAAAAAGCCGATCCGCAAG
TCATTACCCTTCTTCTCCAAGTACTGGATGATGGCCGATTAACAGATGGTCAGGGAAATACCATCAACTTTAAAAACACT
GTTATCATTGCTACTTCTAATGCTGGATTTGGGTATCAAGAAGATGAAAATACTGATCAACCTACTATTATGGACCGTAT
TGCGCCTTACTTTAGACCAGAGTTTCTCAATCGTTTCAATGGGGTTATCGAATTCAATCATCTGCAAAAAGAAGACTTAG
AAGAAATTGTTGATTTAATGTTAGCGGAAGTTAATCAAACCATAGCTAAAAAAGGAATCAGCTTAGCTATCTCTGATGAT
GCTAAAGCTCACTTGATTGAATTAGGTTATGACCACGCAATGGGAGTTCGACCATTACGCCGTGTCATCGAGCAAGAAAT
CCGAGACCGCATTACAGACTTTTACCTCGATCACCCAGAGGTCAAAAACTTACAAGCTAGTCTGGAACAAGGACAGTTAG
TCATTCACGACAAAAACTAA
ATGACAAATTATTATGACAAAGATACTTTTGGAAACATGGATGATATCTTTAATCAACTAATGGGCAATATGGGTGGCTA
TCGCAGTGAAAATCGACGCTATTTAGTGAACGGACGCGAAATGACACCTGAAGAATTCCAGGCATATCGTCAAACTGGAC
GTCTGAGCGATAAAGGTAGTGAACAACCTACATCCCAAGGTCATCAGACCAAAAAGGATGGGATTTTAGCTAAACTTGGC
ACCAACTTAACGGAAGAAGCAAGACAAGGAAAACTTGATCCTGTTATTGGTCGCAATCACGAAATTCAAGACACTGCGGA
AATTTTAGCTCGCCGAACAAAGAATAATCCTGTTCTAGTCGGAGATGCTGGTGTTGGTAAAACAGCTGTGGTTGAAGGGC
TAGCTCAAGCTATTGTGAATGGTGACGTCCCTGCTGCTATCAAAAATAAAGAAATCATTTCCATTGATATTTCTGGCCTT
GAAGCAGGCACTCAGTACCGTGGCTCCTTTGAAGAAAATATCCAAAACATGATTCAAGAGGTTAAAGAAGCTGGGAATAT
TATTCTTTTCTTTGATGAAATTCATCAAATCCTTGGTGCAGGTTCTACTGGTGGTGACTCTGGGTCTAAAGGATTAGCTG
ATATGCTAAAACCTGCTCTTTCCCGTGGTGAATTGACACTGATTGGCGCCACAACTCAAGATGAATACCGCAATACCATT
TTAAAAAATGCTGCGCTTGCTCGTCGTTTTAATGAGGTTAAGGTAAATGCACCGTCTGCTGAGGACACCTTCCATATTTT
AATGGGAATCCGAAATCTTTACGAACAACACCATAATGTTATCCTTCCGGATGCTGTTCTTAAAGCTGCGATTGACTATT
CTATTCAGTACATTCCTCAACGTAGTTTACCAGATAAAGCGATTGACTTAGTAGATATGACTGCTGCGCATTTAGCCGCT
CAGCATCCTGTGACTGACCTCAAGTCTCTAGAAGCTGAAATGACCACTCAAAAGAATATCCAAGAAGAGGCTGTGGCTAA
GGAAGATTTTGAGAGAGCTTTAACCGCTAAAACACGAATTGAAGAATTGCAAAAACAAATGGACAACCATAGCGAAGACC
AAAAGGTTACTGCTACTGTCAATGATATTGCAGAATCCGTAGAGCGCCTGACTGGTGTCCCAGTGTCCAATATGGGAGCT
AGTGATTTGGAAAGACTCAAAGAAATTAGCAACCGCTTAAAAGGACACGTGATTGGCCAAGATGGAGCTGTTGAAGCTGT
GGCTCGTGCTATTCGCCGTAATCGAGCTGGTTTTGATGATGGCAATCGTCCTATCGGTTCCTTCCTCTTTGTTGGACCAA
CAGGTGTCGGTAAGACAGAGTTAGCCAAACAGCTAGCACTGGACCTCTTTGGTTCCAAAGAGGCTATTATTCGACTTGAC
ATGTCCGAATACAGTGATCGTACTGCTGTGTCCAAGTTAATCGGGACAACTGCCGGTTATGTCGGTTATGATGATAATAA
CAATACGTTAACGGAACGAGTTCGTCGCAACCCGTATGCGATTGTTCTATTGGATGAGATTGAAAAAGCCGATCCGCAAG
TCATTACCCTTCTTCTCCAAGTACTGGATGATGGCCGATTAACAGATGGTCAGGGAAATACCATCAACTTTAAAAACACT
GTTATCATTGCTACTTCTAATGCTGGATTTGGGTATCAAGAAGATGAAAATACTGATCAACCTACTATTATGGACCGTAT
TGCGCCTTACTTTAGACCAGAGTTTCTCAATCGTTTCAATGGGGTTATCGAATTCAATCATCTGCAAAAAGAAGACTTAG
AAGAAATTGTTGATTTAATGTTAGCGGAAGTTAATCAAACCATAGCTAAAAAAGGAATCAGCTTAGCTATCTCTGATGAT
GCTAAAGCTCACTTGATTGAATTAGGTTATGACCACGCAATGGGAGTTCGACCATTACGCCGTGTCATCGAGCAAGAAAT
CCGAGACCGCATTACAGACTTTTACCTCGATCACCCAGAGGTCAAAAACTTACAAGCTAGTCTGGAACAAGGACAGTTAG
TCATTCACGACAAAAACTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| clpC | Lactococcus lactis subsp. cremoris KW2 |
47.391 |
100 |
0.481 |
| clpE | Streptococcus mutans UA159 |
46.544 |
100 |
0.472 |
| clpE | Streptococcus pneumoniae Rx1 |
48.447 |
92.132 |
0.446 |
| clpE | Streptococcus pneumoniae D39 |
48.447 |
92.132 |
0.446 |
| clpE | Streptococcus pneumoniae R6 |
48.447 |
92.132 |
0.446 |
| clpE | Streptococcus pneumoniae TIGR4 |
48.367 |
91.989 |
0.445 |
| clpC | Bacillus subtilis subsp. subtilis str. 168 |
44.496 |
92.275 |
0.411 |
| clpC | Streptococcus pneumoniae Rx1 |
40.943 |
97.139 |
0.398 |
| clpC | Streptococcus pneumoniae D39 |
40.943 |
97.139 |
0.398 |
| clpC | Streptococcus pneumoniae TIGR4 |
40.795 |
97.139 |
0.396 |
| clpC | Streptococcus thermophilus LMG 18311 |
43.065 |
88.698 |
0.382 |
| clpC | Streptococcus thermophilus LMD-9 |
43.065 |
88.698 |
0.382 |
| clpC | Streptococcus mutans UA159 |
42.903 |
88.698 |
0.381 |
| clpC | Lactococcus lactis subsp. lactis strain DGCC12653 |
42.165 |
88.555 |
0.373 |