Detailed information
Overview
| Name | clpC | Type | Regulator |
| Locus tag | K6973_RS04875 | Genome accession | NZ_CP082206 |
| Coordinates | 955443..957542 (-) | Length | 699 a.a. |
| NCBI ID | WP_138125405.1 | Uniprot ID | - |
| Organism | Streptococcus dysgalactiae strain DY107 | ||
| Function | degradation of ComX (predicted from homology) Competence regulation |
||
Genomic Context
Location: 950443..962542
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| K6973_RS04845 (K6973_04845) | - | 950938..951777 (+) | 840 | WP_129555803.1 | thymidylate synthase | - |
| K6973_RS04850 (K6973_04850) | - | 951858..952355 (+) | 498 | WP_065957460.1 | dihydrofolate reductase | - |
| K6973_RS04855 (K6973_04855) | - | 952375..952545 (+) | 171 | WP_003054200.1 | hypothetical protein | - |
| K6973_RS04860 (K6973_04860) | clpX | 952661..953890 (+) | 1230 | WP_003054192.1 | ATP-dependent Clp protease ATP-binding subunit ClpX | Regulator |
| K6973_RS04865 (K6973_04865) | yihA | 953900..954499 (+) | 600 | WP_138125401.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| K6973_RS04870 (K6973_04870) | - | 954643..955386 (+) | 744 | WP_138125403.1 | hypothetical protein | - |
| K6973_RS04875 (K6973_04875) | clpC | 955443..957542 (-) | 2100 | WP_138125405.1 | ATP-dependent Clp protease ATP-binding subunit | Regulator |
| K6973_RS04880 (K6973_04880) | - | 958024..959220 (-) | 1197 | WP_222819635.1 | site-specific integrase | - |
| K6973_RS04885 (K6973_04885) | - | 959399..959992 (-) | 594 | WP_222819636.1 | HIRAN domain-containing protein | - |
| K6973_RS04890 (K6973_04890) | - | 960004..960351 (-) | 348 | WP_222819637.1 | ImmA/IrrE family metallo-endopeptidase | - |
| K6973_RS04895 (K6973_04895) | - | 960356..960712 (-) | 357 | WP_222819714.1 | helix-turn-helix transcriptional regulator | - |
| K6973_RS04900 (K6973_04900) | - | 961000..961164 (+) | 165 | WP_015984783.1 | hypothetical protein | - |
| K6973_RS04905 (K6973_04905) | - | 961206..961409 (+) | 204 | WP_015984784.1 | hypothetical protein | - |
| K6973_RS11285 | - | 961411..961545 (+) | 135 | WP_261987967.1 | hypothetical protein | - |
| K6973_RS04910 (K6973_04910) | - | 961542..962297 (+) | 756 | WP_222819638.1 | ORF6C domain-containing protein | - |
Sequence
Protein
Download Length: 699 a.a. Molecular weight: 77438.12 Da Isoelectric Point: 4.8927
>NTDB_id=600106 K6973_RS04875 WP_138125405.1 955443..957542(-) (clpC) [Streptococcus dysgalactiae strain DY107]
MTNYYDKDPFGNMDDIFNQLMGNMGGYRSENRRYLVNGREMTPEEFQAYRQTGRLSDKGTEQPTSQGHQTKKDGILAKLG
TNLTEEARQGKLDPVIGRNHEIQDTAEILARRTKNNPVLVGDAGVGKTAVVEGLAQAIVNGDVPAAIKNKEIISIDISGL
EAGTQYRGSFEENIQNMIQEVKEAGNIILFFDEIHQILGAGSTGGDSGSKGLADMLKPALSRGELTLIGATTQDEYRNTI
LKNAALARRFNEVKVNAPSAEDTFHILMGIRNLYEQHHNVILPDTVLKAAIDYSIQYIPQRSLPDKAIDLVDMTAAHLAA
QHPVTDLKSLEAEMTTQKNIQEEAVAKEDFERALTAKTRIEELQKQMNNHSEDQKVTATVNDIAESVERLTGVPVSNMGA
SDLERLKEISNRLKGHVIGQDGAVEAVARAIRRNRAGFDDGNRPIGSFLFVGPTGVGKTELAKQLALDLFGSKEAIIRLD
MSEYSDRTAVSKLIGTTAGYVGYDDNNNTLTERVRRNPYAIVLLDEIEKADPQVITLLLQVLDDGRLTDGQGNTINFKNT
VIIATSNAGFGHQEDENTDQPAIMDRIAPYFRPEFLNRFNGVIEFNHLQKEDLEEIVDLMLAEVNQTIAKKGISLTISDD
AKAHLIELGYDHAMGVRPLRRVIEQEIRDRITDFYLDHPEVKNLQASLEQGQLVIHDKN
MTNYYDKDPFGNMDDIFNQLMGNMGGYRSENRRYLVNGREMTPEEFQAYRQTGRLSDKGTEQPTSQGHQTKKDGILAKLG
TNLTEEARQGKLDPVIGRNHEIQDTAEILARRTKNNPVLVGDAGVGKTAVVEGLAQAIVNGDVPAAIKNKEIISIDISGL
EAGTQYRGSFEENIQNMIQEVKEAGNIILFFDEIHQILGAGSTGGDSGSKGLADMLKPALSRGELTLIGATTQDEYRNTI
LKNAALARRFNEVKVNAPSAEDTFHILMGIRNLYEQHHNVILPDTVLKAAIDYSIQYIPQRSLPDKAIDLVDMTAAHLAA
QHPVTDLKSLEAEMTTQKNIQEEAVAKEDFERALTAKTRIEELQKQMNNHSEDQKVTATVNDIAESVERLTGVPVSNMGA
SDLERLKEISNRLKGHVIGQDGAVEAVARAIRRNRAGFDDGNRPIGSFLFVGPTGVGKTELAKQLALDLFGSKEAIIRLD
MSEYSDRTAVSKLIGTTAGYVGYDDNNNTLTERVRRNPYAIVLLDEIEKADPQVITLLLQVLDDGRLTDGQGNTINFKNT
VIIATSNAGFGHQEDENTDQPAIMDRIAPYFRPEFLNRFNGVIEFNHLQKEDLEEIVDLMLAEVNQTIAKKGISLTISDD
AKAHLIELGYDHAMGVRPLRRVIEQEIRDRITDFYLDHPEVKNLQASLEQGQLVIHDKN
Nucleotide
Download Length: 2100 bp
>NTDB_id=600106 K6973_RS04875 WP_138125405.1 955443..957542(-) (clpC) [Streptococcus dysgalactiae strain DY107]
ATGACAAATTATTATGACAAAGATCCTTTTGGAAACATGGATGATATCTTTAATCAACTAATGGGCAATATGGGTGGCTA
TCGCAGTGAAAATCGACGCTATTTAGTGAACGGACGCGAAATGACACCTGAAGAATTCCAGGCATATCGTCAAACTGGAC
GTCTGAGCGATAAAGGTACTGAACAACCTACATCCCAAGGTCATCAGACCAAAAAGGATGGGATTTTAGCTAAACTTGGC
ACCAACTTAACGGAAGAAGCAAGACAAGGAAAACTTGATCCTGTTATTGGTCGCAATCACGAAATTCAAGACACTGCGGA
AATTTTAGCTCGCCGAACAAAGAATAATCCTGTTCTAGTCGGAGATGCTGGTGTTGGTAAAACAGCTGTGGTTGAAGGGC
TAGCTCAAGCTATTGTGAATGGTGACGTCCCTGCTGCTATCAAAAATAAAGAAATCATTTCCATTGATATTTCTGGCCTT
GAAGCAGGCACTCAGTACCGTGGCTCCTTTGAAGAAAATATCCAAAACATGATTCAAGAGGTTAAAGAAGCTGGGAATAT
TATTCTCTTCTTTGACGAAATCCATCAAATCCTTGGTGCAGGTTCTACTGGTGGTGACTCTGGGTCTAAAGGATTAGCTG
ATATGCTAAAACCTGCTCTTTCCCGTGGTGAATTGACACTGATTGGTGCCACAACTCAAGATGAATACCGCAATACCATT
TTAAAAAATGCTGCGCTTGCTCGTCGTTTTAATGAGGTTAAGGTAAACGCACCGTCTGCTGAGGACACCTTCCATATTTT
AATGGGAATCCGAAATCTTTACGAACAACACCATAATGTTATCCTTCCGGATACTGTTCTTAAAGCTGCGATTGACTATT
CTATTCAGTACATTCCTCAACGTAGTTTACCAGATAAAGCGATTGACTTAGTAGATATGACTGCTGCGCATTTAGCTGCT
CAGCATCCTGTGACTGACCTCAAGTCTCTAGAAGCTGAAATGACCACTCAAAAGAATATCCAAGAAGAGGCTGTGGCTAA
GGAAGATTTTGAGAGAGCTTTAACCGCTAAAACTCGAATTGAAGAATTGCAAAAACAAATGAACAACCATAGCGAAGACC
AAAAGGTTACTGCTACTGTCAATGACATTGCAGAATCTGTAGAGCGCCTGACTGGTGTCCCAGTGTCCAATATGGGAGCT
AGTGATTTGGAAAGACTCAAAGAAATTAGCAACCGCTTAAAAGGACACGTGATTGGCCAAGATGGAGCTGTTGAAGCTGT
GGCTCGTGCTATTCGCCGTAATCGAGCTGGTTTTGATGACGGCAATCGTCCTATCGGTTCCTTCCTCTTTGTTGGACCAA
CAGGTGTTGGTAAGACAGAATTAGCCAAACAGCTAGCACTGGACCTCTTTGGTTCCAAAGAGGCTATTATTCGACTTGAC
ATGTCCGAATACAGTGATCGTACCGCTGTGTCCAAGCTAATAGGGACAACTGCCGGTTATGTCGGTTATGATGATAATAA
CAATACGTTAACGGAACGTGTTCGTCGCAACCCGTATGCGATTGTTCTATTGGATGAGATTGAAAAAGCCGATCCGCAAG
TCATTACCCTTCTTCTCCAAGTACTAGATGATGGCCGATTAACAGATGGTCAGGGAAATACCATCAACTTTAAAAACACT
GTTATCATTGCTACTTCTAATGCTGGATTTGGGCATCAAGAAGATGAAAATACTGATCAACCTGCTATTATGGACCGTAT
TGCGCCTTACTTTAGACCAGAGTTTCTCAATCGTTTCAATGGGGTTATCGAATTCAATCATCTGCAAAAAGAAGACTTAG
AAGAAATTGTTGATTTAATGTTAGCGGAAGTTAATCAAACTATAGCTAAAAAAGGGATCAGCTTAACTATCTCTGATGAT
GCTAAAGCTCACTTGATTGAATTAGGTTATGACCACGCAATGGGAGTTCGACCATTACGCCGTGTCATCGAGCAAGAAAT
CCGAGACCGCATTACAGACTTTTACCTCGATCACCCAGAGGTCAAAAACCTACAAGCTAGTCTGGAACAAGGACAGTTAG
TCATTCACGACAAAAACTAA
ATGACAAATTATTATGACAAAGATCCTTTTGGAAACATGGATGATATCTTTAATCAACTAATGGGCAATATGGGTGGCTA
TCGCAGTGAAAATCGACGCTATTTAGTGAACGGACGCGAAATGACACCTGAAGAATTCCAGGCATATCGTCAAACTGGAC
GTCTGAGCGATAAAGGTACTGAACAACCTACATCCCAAGGTCATCAGACCAAAAAGGATGGGATTTTAGCTAAACTTGGC
ACCAACTTAACGGAAGAAGCAAGACAAGGAAAACTTGATCCTGTTATTGGTCGCAATCACGAAATTCAAGACACTGCGGA
AATTTTAGCTCGCCGAACAAAGAATAATCCTGTTCTAGTCGGAGATGCTGGTGTTGGTAAAACAGCTGTGGTTGAAGGGC
TAGCTCAAGCTATTGTGAATGGTGACGTCCCTGCTGCTATCAAAAATAAAGAAATCATTTCCATTGATATTTCTGGCCTT
GAAGCAGGCACTCAGTACCGTGGCTCCTTTGAAGAAAATATCCAAAACATGATTCAAGAGGTTAAAGAAGCTGGGAATAT
TATTCTCTTCTTTGACGAAATCCATCAAATCCTTGGTGCAGGTTCTACTGGTGGTGACTCTGGGTCTAAAGGATTAGCTG
ATATGCTAAAACCTGCTCTTTCCCGTGGTGAATTGACACTGATTGGTGCCACAACTCAAGATGAATACCGCAATACCATT
TTAAAAAATGCTGCGCTTGCTCGTCGTTTTAATGAGGTTAAGGTAAACGCACCGTCTGCTGAGGACACCTTCCATATTTT
AATGGGAATCCGAAATCTTTACGAACAACACCATAATGTTATCCTTCCGGATACTGTTCTTAAAGCTGCGATTGACTATT
CTATTCAGTACATTCCTCAACGTAGTTTACCAGATAAAGCGATTGACTTAGTAGATATGACTGCTGCGCATTTAGCTGCT
CAGCATCCTGTGACTGACCTCAAGTCTCTAGAAGCTGAAATGACCACTCAAAAGAATATCCAAGAAGAGGCTGTGGCTAA
GGAAGATTTTGAGAGAGCTTTAACCGCTAAAACTCGAATTGAAGAATTGCAAAAACAAATGAACAACCATAGCGAAGACC
AAAAGGTTACTGCTACTGTCAATGACATTGCAGAATCTGTAGAGCGCCTGACTGGTGTCCCAGTGTCCAATATGGGAGCT
AGTGATTTGGAAAGACTCAAAGAAATTAGCAACCGCTTAAAAGGACACGTGATTGGCCAAGATGGAGCTGTTGAAGCTGT
GGCTCGTGCTATTCGCCGTAATCGAGCTGGTTTTGATGACGGCAATCGTCCTATCGGTTCCTTCCTCTTTGTTGGACCAA
CAGGTGTTGGTAAGACAGAATTAGCCAAACAGCTAGCACTGGACCTCTTTGGTTCCAAAGAGGCTATTATTCGACTTGAC
ATGTCCGAATACAGTGATCGTACCGCTGTGTCCAAGCTAATAGGGACAACTGCCGGTTATGTCGGTTATGATGATAATAA
CAATACGTTAACGGAACGTGTTCGTCGCAACCCGTATGCGATTGTTCTATTGGATGAGATTGAAAAAGCCGATCCGCAAG
TCATTACCCTTCTTCTCCAAGTACTAGATGATGGCCGATTAACAGATGGTCAGGGAAATACCATCAACTTTAAAAACACT
GTTATCATTGCTACTTCTAATGCTGGATTTGGGCATCAAGAAGATGAAAATACTGATCAACCTGCTATTATGGACCGTAT
TGCGCCTTACTTTAGACCAGAGTTTCTCAATCGTTTCAATGGGGTTATCGAATTCAATCATCTGCAAAAAGAAGACTTAG
AAGAAATTGTTGATTTAATGTTAGCGGAAGTTAATCAAACTATAGCTAAAAAAGGGATCAGCTTAACTATCTCTGATGAT
GCTAAAGCTCACTTGATTGAATTAGGTTATGACCACGCAATGGGAGTTCGACCATTACGCCGTGTCATCGAGCAAGAAAT
CCGAGACCGCATTACAGACTTTTACCTCGATCACCCAGAGGTCAAAAACCTACAAGCTAGTCTGGAACAAGGACAGTTAG
TCATTCACGACAAAAACTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| clpC | Lactococcus lactis subsp. cremoris KW2 |
47.532 |
100 |
0.482 |
| clpE | Streptococcus mutans UA159 |
46.544 |
100 |
0.472 |
| clpE | Streptococcus pneumoniae Rx1 |
45.768 |
99.714 |
0.456 |
| clpE | Streptococcus pneumoniae D39 |
45.768 |
99.714 |
0.456 |
| clpE | Streptococcus pneumoniae R6 |
45.768 |
99.714 |
0.456 |
| clpE | Streptococcus pneumoniae TIGR4 |
45.69 |
99.571 |
0.455 |
| clpC | Bacillus subtilis subsp. subtilis str. 168 |
44.341 |
92.275 |
0.409 |
| clpC | Streptococcus pneumoniae D39 |
40.943 |
97.139 |
0.398 |
| clpC | Streptococcus pneumoniae Rx1 |
40.943 |
97.139 |
0.398 |
| clpC | Streptococcus pneumoniae TIGR4 |
40.795 |
97.139 |
0.396 |
| clpC | Streptococcus thermophilus LMG 18311 |
42.903 |
88.698 |
0.381 |
| clpC | Streptococcus mutans UA159 |
42.742 |
88.698 |
0.379 |
| clpC | Streptococcus thermophilus LMD-9 |
42.718 |
88.412 |
0.378 |
| clpC | Lactococcus lactis subsp. lactis strain DGCC12653 |
42.165 |
88.555 |
0.373 |