Detailed information
Overview
| Name | clpC | Type | Regulator |
| Locus tag | NQZ96_RS09095 | Genome accession | NZ_CP102137 |
| Coordinates | 1857361..1859814 (-) | Length | 817 a.a. |
| NCBI ID | WP_012027858.1 | Uniprot ID | A0A2K1SY42 |
| Organism | Streptococcus suis strain M104300_S20 | | |
| Function | degradation of ComW (predicted from homology); competence regulation | | |
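The coordinates and the protein length in the overview can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence includes its stop codon; both assumptions are consistent with the 2454 bp size listed for this gene, but the function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


gene_bp = span_length(1857361, 1859814)
print(gene_bp)                  # 2454
print(protein_length(gene_bp))  # 817
```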
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 1853463..1896690 | 1857361..1859814 | within | 0 |
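The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. The following is a minimal sketch, not VRprofile2's actual method: only the label `within` appears in this record, so the other labels (`before`, `after`, `overlapping`) and the function name are hypothetical.

```python
from typing import Tuple

Interval = Tuple[int, int]  # (start, end), 1-based inclusive (assumed)


def relate(gene: Interval, mge: Interval) -> Tuple[str, int]:
    """Return (relative position, distance in bp) of a gene with respect to an MGE."""
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return ("within", 0)
    if g_end < m_start:
        return ("before", m_start - g_end)   # hypothetical label
    if g_start > m_end:
        return ("after", g_start - m_end)    # hypothetical label
    return ("overlapping", 0)                # hypothetical label


# clpC versus the predicted genomic island from the table above
print(relate((1857361, 1859814), (1853463, 1896690)))  # ('within', 0)
```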
Gene organization within MGE regions
Location: 1853463..1896690
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| NQZ96_RS09075 (NQZ96_09075) | dusB | 1853463..1854467 (+) | 1005 | WP_012775355.1 | tRNA dihydrouridine synthase DusB | - |
| NQZ96_RS09080 (NQZ96_09080) | - | 1854613..1855386 (-) | 774 | WP_012027854.1 | NUDIX domain-containing protein | - |
| NQZ96_RS09085 (NQZ96_09085) | pnuC | 1855379..1856200 (-) | 822 | WP_012027855.1 | nicotinamide riboside transporter PnuC | - |
| NQZ96_RS09090 (NQZ96_09090) | - | 1856210..1857247 (-) | 1038 | WP_012028524.1 | AAA family ATPase | - |
| NQZ96_RS09095 (NQZ96_09095) | clpC | 1857361..1859814 (-) | 2454 | WP_012027858.1 | ATP-dependent Clp protease ATP-binding subunit | Regulator |
| NQZ96_RS09100 (NQZ96_09100) | - | 1859820..1860278 (-) | 459 | WP_012027859.1 | CtsR family transcriptional regulator | - |
| NQZ96_RS09105 (NQZ96_09105) | - | 1860428..1860766 (+) | 339 | WP_012027860.1 | thioredoxin domain-containing protein | - |
| NQZ96_RS09110 (NQZ96_09110) | - | 1860766..1861467 (+) | 702 | WP_012027861.1 | hypothetical protein | - |
| NQZ96_RS09115 (NQZ96_09115) | tsf | 1861730..1862770 (-) | 1041 | WP_012027862.1 | translation elongation factor Ts | - |
| NQZ96_RS09120 (NQZ96_09120) | rpsB | 1862926..1863702 (-) | 777 | WP_012775356.1 | 30S ribosomal protein S2 | - |
| NQZ96_RS09125 (NQZ96_09125) | - | 1864049..1864561 (-) | 513 | WP_012775452.1 | adenylate kinase | - |
| NQZ96_RS09135 (NQZ96_09135) | - | 1865215..1870293 (-) | 5079 | WP_012775357.1 | S8 family serine peptidase | - |
| NQZ96_RS09140 (NQZ96_09140) | nusG | 1870437..1870976 (-) | 540 | WP_002940254.1 | transcription termination/antitermination protein NusG | - |
| NQZ96_RS09145 (NQZ96_09145) | secE | 1871086..1871262 (-) | 177 | WP_002940255.1 | preprotein translocase subunit SecE | - |
| NQZ96_RS09150 (NQZ96_09150) | rpmG | 1871272..1871424 (-) | 153 | WP_002940258.1 | 50S ribosomal protein L33 | - |
| NQZ96_RS09155 (NQZ96_09155) | pbp2a | 1871458..1873671 (-) | 2214 | WP_012027868.1 | penicillin-binding protein PBP2A | - |
| NQZ96_RS09160 (NQZ96_09160) | - | 1873824..1874792 (+) | 969 | WP_012027869.1 | NAD(P)/FAD-dependent oxidoreductase | - |
| NQZ96_RS09165 (NQZ96_09165) | - | 1874794..1875660 (+) | 867 | WP_012027870.1 | RluA family pseudouridine synthase | - |
| NQZ96_RS09170 (NQZ96_09170) | purR | 1875687..1876499 (-) | 813 | WP_002938438.1 | pur operon repressor | - |
| NQZ96_RS09175 (NQZ96_09175) | - | 1876601..1877467 (-) | 867 | WP_012027872.1 | aminoglycoside phosphotransferase family protein | - |
| NQZ96_RS09180 (NQZ96_09180) | - | 1877467..1878408 (-) | 942 | WP_012775359.1 | 3'-5' exoribonuclease YhaM family protein | - |
| NQZ96_RS09185 (NQZ96_09185) | - | 1878398..1879636 (-) | 1239 | WP_012027874.1 | DNA recombination protein RmuC | - |
| NQZ96_RS09190 (NQZ96_09190) | - | 1879637..1880269 (-) | 633 | WP_012027875.1 | thiamine diphosphokinase | - |
| NQZ96_RS09195 (NQZ96_09195) | rpe | 1880262..1880921 (-) | 660 | WP_012027876.1 | ribulose-phosphate 3-epimerase | - |
| NQZ96_RS09200 (NQZ96_09200) | rsgA | 1880936..1881811 (-) | 876 | WP_012027877.1 | ribosome small subunit-dependent GTPase A | - |
| NQZ96_RS09210 (NQZ96_09210) | - | 1882708..1884285 (-) | 1578 | WP_012027878.1 | ABC transporter ATP-binding protein | - |
| NQZ96_RS09215 (NQZ96_09215) | - | 1884282..1885523 (-) | 1242 | WP_014636518.1 | radical SAM protein | - |
| NQZ96_RS09220 (NQZ96_09220) | - | 1885892..1886755 (-) | 864 | WP_012027880.1 | helix-turn-helix domain-containing protein | - |
| NQZ96_RS09225 (NQZ96_09225) | - | 1887245..1889689 (-) | 2445 | WP_012027881.1 | LTA synthase family protein | - |
| NQZ96_RS09230 (NQZ96_09230) | - | 1889702..1891432 (-) | 1731 | WP_012775361.1 | membrane protein | - |
| NQZ96_RS09235 (NQZ96_09235) | rsmA | 1891797..1892669 (-) | 873 | WP_012775362.1 | 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA | - |
| NQZ96_RS09240 (NQZ96_09240) | - | 1892689..1893219 (-) | 531 | WP_009910909.1 | DUF1697 domain-containing protein | - |
| NQZ96_RS09245 (NQZ96_09245) | - | 1893233..1893490 (-) | 258 | WP_012775363.1 | Txe/YoeB family addiction module toxin | - |
| NQZ96_RS09250 (NQZ96_09250) | - | 1893492..1893752 (-) | 261 | WP_002939010.1 | type II toxin-antitoxin system Phd/YefM family antitoxin | - |
| NQZ96_RS09255 (NQZ96_09255) | - | 1893828..1894283 (-) | 456 | WP_012028534.1 | 8-oxo-dGTP diphosphatase | - |
| NQZ96_RS09260 (NQZ96_09260) | - | 1894290..1894619 (-) | 330 | WP_012775364.1 | type II toxin-antitoxin system RelE/ParE family toxin | - |
| NQZ96_RS09265 (NQZ96_09265) | - | 1894609..1894896 (-) | 288 | WP_012028535.1 | hypothetical protein | - |
| NQZ96_RS09270 (NQZ96_09270) | - | 1894994..1895362 (-) | 369 | WP_012775453.1 | hypothetical protein | - |
| NQZ96_RS09275 (NQZ96_09275) | rnmV | 1895355..1895924 (-) | 570 | WP_012775366.1 | ribonuclease M5 | - |
| NQZ96_RS09280 (NQZ96_09280) | - | 1895908..1896690 (-) | 783 | WP_012028536.1 | TatD family hydrolase | - |
Sequence
Protein
Length: 817 a.a.; Molecular weight: 90132.53 Da; Isoelectric point: 6.3138
>NTDB_id=714704 NQZ96_RS09095 WP_012027858.1 1857361..1859814(-) (clpC) [Streptococcus suis strain M104300_S20]
MKISRGLQGVYEDAQLIAQRYSSDYLETWHLLLAFVINPDTVAGAILAEYPADVLDYERAVYMVMGRRYHEELESFFFLP
SSKRVKELQVFAEKIAEIVKSKGLGTEHIFMGMLLDKRSTASQILDQVGFHFEDSDDKVRFLDLRKNLEAKAGFTKEHLK
AIRTMTKGGKPKQATVGNMMGMTQSQSGGLEDYTRDLTALARSGQLEPVIGRDEEISRMLQILSRKTKNNPVLVGDAGVG
KTALALGLAQRIANGEVPASLVNMRILELDLMNVIAGTRFRGDFEERMNNIINDIEEDGRVILFIDELHTIMGSGSGIDS
ILDAANILKPALSRGTLRTVGATTQDEYQKHIEKDAALVRRFAKVTIEEPSVADSVAILQGLKPAYEAHHKVTISDQAVV
TAVAYAKRYLTSKNLPDSAIDLLDEASATVQNRAKGQVEEGGLTALDQALMAGKYKTVTQLLLKAQEAENQATSYSLEVT
EEDILATLSRLSGIPVTKLSQTDAKKYLNLEQELHKRVIGQEEAISAVSRAIRRNQSGIRTGHRPIGSFMFLGPTGVGKT
ELAKALAEILFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGVLTDRKGRKVDFSNTVIIMTSNLGATALRDDKTVGFGALDLSKSQEHVEKRIFEALKKAYRPEFINRIDEKVVFH
SLTEADMQDVVKVMVKPLIAVAASKGITLKLQASALKLLAKEGYDPEMGARPLRRLLQTKLEDPLAEMLLRGELPAGVTL
KVGVKAEQLKFDSVKAG
Nucleotide
Length: 2454 bp
>NTDB_id=714704 NQZ96_RS09095 WP_012027858.1 1857361..1859814(-) (clpC) [Streptococcus suis strain M104300_S20]
ATGAAGATTTCAAGAGGGTTACAGGGTGTCTATGAAGATGCTCAATTGATTGCACAGCGTTATAGTAGTGACTATTTGGA
GACCTGGCACTTGTTGTTAGCCTTTGTCATCAATCCAGATACCGTTGCGGGAGCTATTTTAGCAGAATATCCTGCGGATG
TATTGGACTATGAACGTGCAGTTTATATGGTGATGGGGCGGCGTTACCATGAAGAGTTAGAGAGCTTTTTCTTTCTTCCA
TCGTCCAAGCGGGTGAAGGAATTGCAGGTCTTTGCCGAGAAGATTGCGGAGATTGTCAAGAGTAAGGGGCTAGGAACGGA
GCATATTTTCATGGGAATGCTCTTGGACAAGCGTTCGACTGCCTCACAAATTCTGGATCAGGTCGGTTTTCACTTTGAGG
ATTCGGATGATAAGGTTCGTTTTCTGGATTTGCGGAAAAATCTGGAAGCCAAGGCTGGCTTTACCAAGGAGCATCTGAAG
GCTATCCGCACCATGACGAAAGGTGGCAAGCCCAAGCAGGCAACGGTTGGCAATATGATGGGCATGACCCAGTCACAAAG
TGGTGGCTTGGAAGACTATACACGTGATTTGACGGCTTTGGCCCGCTCAGGTCAGTTGGAGCCAGTCATCGGACGGGATG
AGGAAATTTCCCGTATGCTTCAGATTTTGTCGCGGAAAACCAAGAACAATCCTGTCTTGGTTGGAGATGCGGGTGTTGGG
AAAACAGCTCTGGCACTGGGTCTAGCCCAGCGGATTGCTAATGGAGAGGTGCCAGCTAGTCTTGTCAATATGCGGATCTT
GGAATTGGACTTGATGAATGTCATTGCGGGAACGCGTTTCCGTGGGGATTTTGAGGAGCGGATGAACAATATCATCAACG
ATATTGAAGAAGATGGTCGAGTGATTCTCTTCATTGATGAACTCCATACCATTATGGGATCGGGGTCAGGGATTGACTCG
ATCCTGGATGCTGCCAATATTTTGAAGCCTGCTCTGTCCCGTGGGACTTTGCGGACAGTTGGAGCAACGACTCAGGATGA
ATACCAGAAGCATATTGAAAAAGATGCTGCCTTAGTACGTCGATTTGCCAAGGTGACCATTGAGGAACCGAGTGTAGCAG
ACAGCGTAGCAATTTTGCAGGGGTTGAAGCCAGCCTATGAGGCTCACCACAAGGTGACCATTTCGGATCAGGCGGTGGTA
ACGGCGGTAGCCTATGCCAAACGCTATCTGACCAGTAAGAATTTGCCAGATTCGGCTATTGATTTGCTGGATGAAGCCAG
TGCGACGGTTCAAAATCGTGCCAAGGGACAGGTAGAAGAAGGTGGATTGACCGCTTTAGACCAAGCCTTGATGGCTGGGA
AATACAAGACGGTAACGCAGCTCTTGCTTAAGGCTCAAGAGGCGGAAAATCAGGCGACTAGCTATAGCTTGGAAGTCACA
GAAGAAGACATTTTGGCAACCCTCAGTCGCTTGTCAGGTATTCCTGTCACCAAACTGAGTCAGACAGATGCCAAGAAGTA
CCTTAATCTTGAACAGGAATTGCACAAGCGTGTTATCGGGCAGGAAGAGGCGATTTCAGCTGTCAGCCGGGCAATTCGCC
GCAACCAGTCAGGCATTCGCACTGGTCACAGACCGATTGGTTCCTTTATGTTCTTGGGGCCAACAGGTGTCGGTAAGACA
GAATTGGCCAAGGCCTTGGCGGAGATCCTCTTTGATGACGAATCTGCCTTGATTCGTTTTGATATGAGTGAGTATATGGA
GAAATTTGCGGCTAGTCGCCTCAACGGTGCTCCTCCAGGCTATGTTGGCTATGAAGAAGGGGGCGAGCTGACAGAAAAAG
TTCGCAACAAGCCATACTCTGTCCTACTTTTTGATGAGGTGGAGAAAGCACATCCAGATATTTTCAATGTTCTTTTGCAG
GTCTTGGATGACGGTGTCTTGACGGACAGAAAAGGTCGCAAGGTTGATTTCTCTAATACGGTCATCATTATGACGTCTAA
CTTAGGGGCAACCGCTTTACGTGATGATAAGACAGTTGGGTTTGGGGCTCTTGATTTGTCTAAGAGTCAGGAACACGTTG
AAAAACGGATTTTTGAGGCGTTGAAGAAGGCCTATCGTCCTGAATTTATTAACCGGATTGATGAAAAAGTGGTCTTCCAT
AGCCTGACAGAAGCAGATATGCAGGATGTGGTCAAGGTCATGGTCAAACCATTGATTGCCGTGGCGGCCAGCAAGGGTAT
TACCCTCAAATTGCAGGCTTCTGCTCTTAAACTCTTGGCCAAAGAAGGCTACGATCCAGAAATGGGTGCCCGCCCACTTC
GTCGCCTCCTCCAAACCAAGTTGGAAGATCCATTGGCAGAAATGCTCTTACGTGGAGAACTGCCAGCTGGTGTGACCTTA
AAAGTAGGGGTCAAGGCCGAGCAGTTGAAGTTTGATAGTGTGAAAGCAGGTTAG
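The FASTA headers in this record pack the database ID, locus tag, protein ID, coordinates, strand, gene name and organism into one line. A minimal parsing sketch follows; the regular expression is inferred from the two headers shown here only, and the field names (`ntdb_id`, `locus_tag`, and so on) are illustrative rather than part of any published schema.

```python
import re

# Pattern inferred from the headers in this record; other records may differ.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split one FASTA header line into named fields (hypothetical field names)."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=714704 NQZ96_RS09095 WP_012027858.1 "
          "1857361..1859814(-) (clpC) [Streptococcus suis strain M104300_S20]")
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```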
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| clpC | Streptococcus pneumoniae TIGR4 | 71.464 | 99.51 | 0.711 |
| clpC | Streptococcus pneumoniae Rx1 | 71.464 | 99.51 | 0.711 |
| clpC | Streptococcus pneumoniae D39 | 71.464 | 99.51 | 0.711 |
| clpC | Streptococcus mutans UA159 | 66.173 | 99.143 | 0.656 |
| clpC | Streptococcus thermophilus LMG 18311 | 64.828 | 99.878 | 0.647 |
| clpC | Streptococcus thermophilus LMD-9 | 64.706 | 99.878 | 0.646 |
| clpC | Lactococcus lactis subsp. lactis strain DGCC12653 | 48.086 | 100 | 0.492 |
| clpC | Bacillus subtilis subsp. subtilis str. 168 | 44.403 | 99.51 | 0.442 |