Detailed information
Overview
| Name | clpC | Type | Regulator |
| Locus tag | NSQ52_RS06700 | Genome accession | NZ_CP151973 |
| Coordinates | 1322710..1324815 (-) | Length | 701 a.a. |
| NCBI ID | WP_326242483.1 | Uniprot ID | - |
| Organism | Bacillus sp. FSL W8-1141 | ||
| Function | degradation of ComX (predicted from homology) Competence regulation |
||
Genomic Context
Location: 1317710..1329815
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| NSQ52_RS06675 (NSQ52_06675) | - | 1317952..1318194 (+) | 243 | WP_008346101.1 | aspartyl-phosphate phosphatase Spo0E family protein | - |
| NSQ52_RS06680 (NSQ52_06680) | - | 1318246..1319745 (-) | 1500 | WP_007498739.1 | ATP-binding protein | - |
| NSQ52_RS06685 (NSQ52_06685) | - | 1319950..1320405 (+) | 456 | WP_003211701.1 | MarR family transcriptional regulator | - |
| NSQ52_RS06690 (NSQ52_06690) | motB | 1320592..1321359 (-) | 768 | WP_024719648.1 | flagellar motor protein MotB | - |
| NSQ52_RS06695 (NSQ52_06695) | motA | 1321352..1322146 (-) | 795 | WP_008346088.1 | flagellar motor stator protein MotA | - |
| NSQ52_RS06700 (NSQ52_06700) | clpC | 1322710..1324815 (-) | 2106 | WP_326242483.1 | ATP-dependent Clp protease ATP-binding subunit | Regulator |
| NSQ52_RS06705 (NSQ52_06705) | - | 1325061..1326104 (+) | 1044 | WP_268356697.1 | hypothetical protein | - |
| NSQ52_RS06710 (NSQ52_06710) | queC | 1326380..1327036 (+) | 657 | WP_008348090.1 | 7-cyano-7-deazaguanine synthase QueC | - |
| NSQ52_RS06715 (NSQ52_06715) | queD | 1327037..1327477 (+) | 441 | WP_178897126.1 | 6-carboxytetrahydropterin synthase QueD | - |
| NSQ52_RS06720 (NSQ52_06720) | queE | 1327470..1328201 (+) | 732 | WP_144666412.1 | 7-carboxy-7-deazaguanine synthase QueE | - |
| NSQ52_RS06725 (NSQ52_06725) | queF | 1328217..1328714 (+) | 498 | WP_003211403.1 | preQ(1) synthase | - |
| NSQ52_RS06730 (NSQ52_06730) | - | 1328973..1329191 (+) | 219 | WP_019743915.1 | hypothetical protein | - |
| NSQ52_RS06735 (NSQ52_06735) | - | 1329316..1329504 (-) | 189 | WP_003211747.1 | DUF2187 family protein | - |
Sequence
Protein
Download Length: 701 a.a. Molecular weight: 78190.80 Da Isoelectric Point: 5.1807
>NTDB_id=982778 NSQ52_RS06700 WP_326242483.1 1322710..1324815(-) (clpC) [Bacillus sp. FSL W8-1141]
MRCQHCQVNEATIRLNMQVNSSRSQMVLCEDCYTSLMEQSKMKMGPNLFGGGSFFSEQAGHATSQERPKQKGLLDELGRN
LTDGAHAGLIDPVIGRDEEVTRVIEILNRRNKNNPVLIGEPGVGKTAIAEGLALKIANGDVPNKLKNKQVYLLDVSSLVA
NTGVRGQFEERMKQLIKELQSRKNIILFVDEIHLLVGAGSAEGSMDAGNILKPALARGELQLVGATTLKEYRQIEKDAAL
ERRFQPVIVDEPTQDEAFEILKGIQDKYEAYHGVTYSDEAIKACVQLSSRYIQDRHLPDKAIDLMDEAGSKANLSIDAAS
EDELTNRLAQIAAEKQAALNEERYEKAAKLRDEEEAIEARLQNKTNDKEHVVTAEDIQSIVEQKTGIPVGKLQADEQTKM
KEIDVRLKARVIGQEHAVEKVAKAVKRSRAGLKSKHRPTGSFLFVGPTGVGKTELSKTLAEELFGSKEAIIRLDMSEYME
KHSVSKIIGSPPGYVGHDEAGQLTEKVRRKPYSIILLDEIEKAHPDVQHMFLQIMEDGRLTDSQGRTISFKDTVIIMTSN
AGSTDKKAVKVGFQSEQEEAIEEQSLIDSLSAYFKPEFLNRFDSIIQFDSLDKDDLVKIVDLLLKDLSEQLKEQNLTVHV
TNEAKEKIAELGYHPAFGARPLRRTIQEHVEDQMTDILLEEEQLTGFTVDVEHDEIVVKKS
MRCQHCQVNEATIRLNMQVNSSRSQMVLCEDCYTSLMEQSKMKMGPNLFGGGSFFSEQAGHATSQERPKQKGLLDELGRN
LTDGAHAGLIDPVIGRDEEVTRVIEILNRRNKNNPVLIGEPGVGKTAIAEGLALKIANGDVPNKLKNKQVYLLDVSSLVA
NTGVRGQFEERMKQLIKELQSRKNIILFVDEIHLLVGAGSAEGSMDAGNILKPALARGELQLVGATTLKEYRQIEKDAAL
ERRFQPVIVDEPTQDEAFEILKGIQDKYEAYHGVTYSDEAIKACVQLSSRYIQDRHLPDKAIDLMDEAGSKANLSIDAAS
EDELTNRLAQIAAEKQAALNEERYEKAAKLRDEEEAIEARLQNKTNDKEHVVTAEDIQSIVEQKTGIPVGKLQADEQTKM
KEIDVRLKARVIGQEHAVEKVAKAVKRSRAGLKSKHRPTGSFLFVGPTGVGKTELSKTLAEELFGSKEAIIRLDMSEYME
KHSVSKIIGSPPGYVGHDEAGQLTEKVRRKPYSIILLDEIEKAHPDVQHMFLQIMEDGRLTDSQGRTISFKDTVIIMTSN
AGSTDKKAVKVGFQSEQEEAIEEQSLIDSLSAYFKPEFLNRFDSIIQFDSLDKDDLVKIVDLLLKDLSEQLKEQNLTVHV
TNEAKEKIAELGYHPAFGARPLRRTIQEHVEDQMTDILLEEEQLTGFTVDVEHDEIVVKKS
Nucleotide
Download Length: 2106 bp
>NTDB_id=982778 NSQ52_RS06700 WP_326242483.1 1322710..1324815(-) (clpC) [Bacillus sp. FSL W8-1141]
ATGCGTTGTCAACATTGTCAAGTAAATGAAGCAACTATTCGCCTGAATATGCAAGTAAATTCGTCCCGTAGCCAAATGGT
TTTATGTGAAGACTGCTATACCTCTTTGATGGAACAGTCAAAAATGAAAATGGGTCCTAACCTTTTCGGAGGCGGTTCAT
TCTTCTCTGAGCAAGCAGGACATGCGACGAGCCAAGAGCGGCCTAAACAAAAAGGCTTACTCGATGAGCTTGGCCGGAAC
TTAACGGATGGCGCACATGCTGGTTTAATCGATCCTGTCATTGGCCGTGATGAAGAAGTCACAAGAGTCATTGAGATTTT
AAATAGAAGAAATAAAAACAATCCTGTCCTCATTGGTGAACCTGGTGTTGGTAAAACGGCCATAGCCGAAGGACTAGCAT
TAAAAATTGCAAATGGAGATGTACCAAATAAATTAAAGAACAAACAAGTTTATTTATTAGATGTCTCTTCACTTGTGGCA
AACACAGGTGTACGTGGTCAATTTGAAGAACGGATGAAGCAATTAATCAAAGAATTGCAAAGCCGTAAAAATATCATCTT
ATTCGTTGATGAAATTCATCTTCTTGTAGGCGCAGGCTCTGCCGAAGGGTCAATGGATGCCGGAAACATTTTGAAACCAG
CTCTAGCTCGAGGGGAACTGCAACTAGTAGGTGCGACGACATTAAAAGAGTATCGTCAGATTGAAAAAGATGCTGCCCTT
GAACGCAGGTTCCAGCCTGTCATCGTTGATGAGCCAACACAGGATGAAGCATTCGAGATCTTAAAAGGCATTCAGGATAA
GTATGAAGCCTATCATGGCGTCACCTATTCTGACGAAGCCATTAAAGCGTGTGTTCAATTATCTTCGCGGTATATTCAAG
ACCGCCACTTGCCAGATAAAGCCATTGATTTAATGGATGAAGCAGGTTCAAAAGCGAACCTTTCCATTGATGCAGCAAGT
GAAGATGAATTAACAAACCGTCTTGCGCAAATTGCCGCTGAAAAACAAGCTGCTTTAAATGAAGAACGATATGAAAAAGC
AGCGAAGCTTCGAGACGAAGAAGAAGCGATTGAAGCGAGATTGCAAAACAAAACAAATGACAAAGAACATGTCGTCACAG
CTGAGGACATTCAGTCGATTGTAGAACAAAAAACAGGTATCCCTGTCGGCAAACTACAAGCAGACGAACAAACAAAAATG
AAAGAAATTGATGTCCGCTTAAAAGCCCGCGTCATTGGTCAGGAACATGCGGTTGAAAAAGTGGCGAAAGCCGTAAAAAG
AAGCAGAGCCGGCTTAAAATCAAAACACAGACCAACTGGCTCCTTCCTATTCGTTGGACCAACAGGCGTCGGGAAAACCG
AATTGTCTAAAACATTAGCGGAAGAATTATTTGGCTCAAAAGAAGCGATCATTCGTTTAGATATGAGTGAATACATGGAG
AAACACTCCGTGTCCAAAATCATCGGTTCTCCTCCTGGTTACGTTGGACATGATGAAGCCGGACAGCTCACGGAAAAGGT
TCGCAGAAAACCATACAGCATCATTTTGCTCGATGAAATCGAAAAAGCGCACCCTGATGTCCAGCATATGTTCCTTCAAA
TCATGGAAGATGGCCGGCTAACAGACAGCCAAGGCAGAACCATCAGCTTTAAAGACACAGTCATCATTATGACAAGTAAC
GCTGGCAGCACAGATAAAAAAGCGGTCAAAGTCGGATTCCAGTCTGAACAAGAAGAAGCCATCGAAGAACAATCTTTGAT
TGATTCACTGAGCGCTTATTTCAAACCGGAATTCTTGAACCGTTTTGACAGCATCATTCAGTTTGATTCATTAGATAAAG
ACGATTTAGTCAAAATTGTGGATCTTCTGCTCAAAGATTTGTCAGAGCAGTTAAAAGAACAAAATCTAACTGTTCACGTC
ACAAACGAAGCGAAAGAAAAAATCGCTGAACTTGGTTATCATCCAGCATTTGGGGCTCGTCCGTTACGAAGAACGATTCA
AGAGCATGTTGAGGATCAAATGACGGACATCTTGCTTGAAGAAGAACAACTTACTGGATTTACAGTCGATGTTGAACATG
ACGAAATTGTTGTGAAAAAGAGCTAA
ATGCGTTGTCAACATTGTCAAGTAAATGAAGCAACTATTCGCCTGAATATGCAAGTAAATTCGTCCCGTAGCCAAATGGT
TTTATGTGAAGACTGCTATACCTCTTTGATGGAACAGTCAAAAATGAAAATGGGTCCTAACCTTTTCGGAGGCGGTTCAT
TCTTCTCTGAGCAAGCAGGACATGCGACGAGCCAAGAGCGGCCTAAACAAAAAGGCTTACTCGATGAGCTTGGCCGGAAC
TTAACGGATGGCGCACATGCTGGTTTAATCGATCCTGTCATTGGCCGTGATGAAGAAGTCACAAGAGTCATTGAGATTTT
AAATAGAAGAAATAAAAACAATCCTGTCCTCATTGGTGAACCTGGTGTTGGTAAAACGGCCATAGCCGAAGGACTAGCAT
TAAAAATTGCAAATGGAGATGTACCAAATAAATTAAAGAACAAACAAGTTTATTTATTAGATGTCTCTTCACTTGTGGCA
AACACAGGTGTACGTGGTCAATTTGAAGAACGGATGAAGCAATTAATCAAAGAATTGCAAAGCCGTAAAAATATCATCTT
ATTCGTTGATGAAATTCATCTTCTTGTAGGCGCAGGCTCTGCCGAAGGGTCAATGGATGCCGGAAACATTTTGAAACCAG
CTCTAGCTCGAGGGGAACTGCAACTAGTAGGTGCGACGACATTAAAAGAGTATCGTCAGATTGAAAAAGATGCTGCCCTT
GAACGCAGGTTCCAGCCTGTCATCGTTGATGAGCCAACACAGGATGAAGCATTCGAGATCTTAAAAGGCATTCAGGATAA
GTATGAAGCCTATCATGGCGTCACCTATTCTGACGAAGCCATTAAAGCGTGTGTTCAATTATCTTCGCGGTATATTCAAG
ACCGCCACTTGCCAGATAAAGCCATTGATTTAATGGATGAAGCAGGTTCAAAAGCGAACCTTTCCATTGATGCAGCAAGT
GAAGATGAATTAACAAACCGTCTTGCGCAAATTGCCGCTGAAAAACAAGCTGCTTTAAATGAAGAACGATATGAAAAAGC
AGCGAAGCTTCGAGACGAAGAAGAAGCGATTGAAGCGAGATTGCAAAACAAAACAAATGACAAAGAACATGTCGTCACAG
CTGAGGACATTCAGTCGATTGTAGAACAAAAAACAGGTATCCCTGTCGGCAAACTACAAGCAGACGAACAAACAAAAATG
AAAGAAATTGATGTCCGCTTAAAAGCCCGCGTCATTGGTCAGGAACATGCGGTTGAAAAAGTGGCGAAAGCCGTAAAAAG
AAGCAGAGCCGGCTTAAAATCAAAACACAGACCAACTGGCTCCTTCCTATTCGTTGGACCAACAGGCGTCGGGAAAACCG
AATTGTCTAAAACATTAGCGGAAGAATTATTTGGCTCAAAAGAAGCGATCATTCGTTTAGATATGAGTGAATACATGGAG
AAACACTCCGTGTCCAAAATCATCGGTTCTCCTCCTGGTTACGTTGGACATGATGAAGCCGGACAGCTCACGGAAAAGGT
TCGCAGAAAACCATACAGCATCATTTTGCTCGATGAAATCGAAAAAGCGCACCCTGATGTCCAGCATATGTTCCTTCAAA
TCATGGAAGATGGCCGGCTAACAGACAGCCAAGGCAGAACCATCAGCTTTAAAGACACAGTCATCATTATGACAAGTAAC
GCTGGCAGCACAGATAAAAAAGCGGTCAAAGTCGGATTCCAGTCTGAACAAGAAGAAGCCATCGAAGAACAATCTTTGAT
TGATTCACTGAGCGCTTATTTCAAACCGGAATTCTTGAACCGTTTTGACAGCATCATTCAGTTTGATTCATTAGATAAAG
ACGATTTAGTCAAAATTGTGGATCTTCTGCTCAAAGATTTGTCAGAGCAGTTAAAAGAACAAAATCTAACTGTTCACGTC
ACAAACGAAGCGAAAGAAAAAATCGCTGAACTTGGTTATCATCCAGCATTTGGGGCTCGTCCGTTACGAAGAACGATTCA
AGAGCATGTTGAGGATCAAATGACGGACATCTTGCTTGAAGAAGAACAACTTACTGGATTTACAGTCGATGTTGAACATG
ACGAAATTGTTGTGAAAAAGAGCTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| clpC | Lactococcus lactis subsp. cremoris KW2 |
56.834 |
100 |
0.599 |
| clpE | Streptococcus mutans UA159 |
56.376 |
100 |
0.599 |
| clpE | Streptococcus pneumoniae Rx1 |
55.708 |
100 |
0.578 |
| clpE | Streptococcus pneumoniae D39 |
55.708 |
100 |
0.578 |
| clpE | Streptococcus pneumoniae R6 |
55.708 |
100 |
0.578 |
| clpE | Streptococcus pneumoniae TIGR4 |
55.708 |
100 |
0.578 |
| clpC | Bacillus subtilis subsp. subtilis str. 168 |
53.502 |
95.72 |
0.512 |
| clpC | Lactococcus lactis subsp. lactis strain DGCC12653 |
46.698 |
90.728 |
0.424 |
| clpC | Streptococcus thermophilus LMD-9 |
44.272 |
92.154 |
0.408 |
| clpC | Streptococcus thermophilus LMG 18311 |
44.118 |
92.154 |
0.407 |
| clpC | Streptococcus pneumoniae TIGR4 |
44.322 |
90.442 |
0.401 |
| clpC | Streptococcus pneumoniae Rx1 |
44.515 |
89.729 |
0.399 |
| clpC | Streptococcus pneumoniae D39 |
44.515 |
89.729 |
0.399 |
| clpC | Streptococcus mutans UA159 |
43.968 |
89.872 |
0.395 |