Detailed information
Overview
| Name | clpC | Type | Regulator |
| Locus tag | CR541_RS09785 | Genome accession | NZ_CP024050 |
| Coordinates | 1920640..1923093 (-) | Length | 817 a.a. |
| NCBI ID | WP_014636516.1 | Uniprot ID | - |
| Organism | Streptococcus suis strain CS100322 | ||
| Function | degradation of ComW (predicted from homology) Competence regulation |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 1916742..1959968 | 1920640..1923093 | within | 0 |
Gene organization within MGE regions
Location: 1916742..1959968
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| CR541_RS09765 (CR541_09705) | dusB | 1916742..1917746 (+) | 1005 | WP_012775355.1 | tRNA dihydrouridine synthase DusB | - |
| CR541_RS09770 (CR541_09710) | - | 1917892..1918665 (-) | 774 | WP_012027854.1 | NUDIX domain-containing protein | - |
| CR541_RS09775 (CR541_09715) | pnuC | 1918658..1919479 (-) | 822 | WP_012027855.1 | nicotinamide riboside transporter PnuC | - |
| CR541_RS09780 (CR541_09720) | - | 1919489..1920526 (-) | 1038 | WP_012028524.1 | AAA family ATPase | - |
| CR541_RS09785 (CR541_09725) | clpC | 1920640..1923093 (-) | 2454 | WP_014636516.1 | ATP-dependent Clp protease ATP-binding subunit | Regulator |
| CR541_RS09790 (CR541_09730) | - | 1923099..1923557 (-) | 459 | WP_012027859.1 | CtsR family transcriptional regulator | - |
| CR541_RS09795 (CR541_09735) | - | 1923707..1924045 (+) | 339 | WP_012027860.1 | thioredoxin domain-containing protein | - |
| CR541_RS09800 (CR541_09740) | - | 1924045..1924746 (+) | 702 | WP_012027861.1 | hypothetical protein | - |
| CR541_RS09805 (CR541_09745) | tsf | 1925009..1926049 (-) | 1041 | WP_012027862.1 | translation elongation factor Ts | - |
| CR541_RS09810 (CR541_09750) | rpsB | 1926205..1926981 (-) | 777 | WP_012775356.1 | 30S ribosomal protein S2 | - |
| CR541_RS09815 (CR541_09755) | - | 1927328..1927840 (-) | 513 | WP_012775452.1 | adenylate kinase | - |
| CR541_RS09825 (CR541_09765) | - | 1928494..1933572 (-) | 5079 | WP_012775357.1 | S8 family serine peptidase | - |
| CR541_RS09830 (CR541_09770) | nusG | 1933715..1934254 (-) | 540 | WP_002940254.1 | transcription termination/antitermination protein NusG | - |
| CR541_RS09835 (CR541_09775) | secE | 1934364..1934540 (-) | 177 | WP_002940255.1 | preprotein translocase subunit SecE | - |
| CR541_RS09840 (CR541_09780) | rpmG | 1934550..1934702 (-) | 153 | WP_002940258.1 | 50S ribosomal protein L33 | - |
| CR541_RS09845 (CR541_09785) | pbp2a | 1934736..1936949 (-) | 2214 | WP_012027868.1 | penicillin-binding protein PBP2A | - |
| CR541_RS09850 (CR541_09790) | - | 1937102..1938070 (+) | 969 | WP_012027869.1 | NAD(P)/FAD-dependent oxidoreductase | - |
| CR541_RS09855 (CR541_09795) | - | 1938072..1938938 (+) | 867 | WP_012027870.1 | RluA family pseudouridine synthase | - |
| CR541_RS09860 (CR541_09800) | purR | 1938965..1939777 (-) | 813 | WP_002938438.1 | pur operon repressor | - |
| CR541_RS09865 (CR541_09805) | - | 1939879..1940745 (-) | 867 | WP_012027872.1 | aminoglycoside phosphotransferase family protein | - |
| CR541_RS09870 (CR541_09810) | - | 1940745..1941686 (-) | 942 | WP_012775359.1 | 3'-5' exoribonuclease YhaM family protein | - |
| CR541_RS09875 (CR541_09815) | - | 1941676..1942914 (-) | 1239 | WP_012027874.1 | DNA recombination protein RmuC | - |
| CR541_RS09880 (CR541_09820) | - | 1942915..1943547 (-) | 633 | WP_012027875.1 | thiamine diphosphokinase | - |
| CR541_RS09885 (CR541_09825) | rpe | 1943540..1944199 (-) | 660 | WP_012027876.1 | ribulose-phosphate 3-epimerase | - |
| CR541_RS09890 (CR541_09830) | rsgA | 1944214..1945089 (-) | 876 | WP_012027877.1 | ribosome small subunit-dependent GTPase A | - |
| CR541_RS09900 (CR541_09840) | - | 1945986..1947563 (-) | 1578 | WP_012027878.1 | ATP-binding cassette domain-containing protein | - |
| CR541_RS09905 (CR541_09845) | - | 1947560..1948801 (-) | 1242 | WP_014636518.1 | radical SAM protein | - |
| CR541_RS09910 (CR541_09850) | - | 1949170..1950033 (-) | 864 | WP_012027880.1 | helix-turn-helix domain-containing protein | - |
| CR541_RS09920 (CR541_09860) | - | 1950523..1952967 (-) | 2445 | WP_012027881.1 | LTA synthase family protein | - |
| CR541_RS09925 (CR541_09865) | - | 1952980..1954710 (-) | 1731 | WP_012775361.1 | membrane protein | - |
| CR541_RS09930 (CR541_09870) | rsmA | 1955075..1955947 (-) | 873 | WP_012775362.1 | 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))- dimethyltransferase RsmA | - |
| CR541_RS09935 (CR541_09875) | - | 1955967..1956497 (-) | 531 | WP_009910909.1 | DUF1697 domain-containing protein | - |
| CR541_RS09940 (CR541_09880) | - | 1956511..1956768 (-) | 258 | WP_012775363.1 | Txe/YoeB family addiction module toxin | - |
| CR541_RS09945 (CR541_09885) | - | 1956770..1957030 (-) | 261 | WP_002939010.1 | type II toxin-antitoxin system Phd/YefM family antitoxin | - |
| CR541_RS09950 (CR541_09890) | - | 1957106..1957561 (-) | 456 | WP_012028534.1 | 8-oxo-dGTP diphosphatase | - |
| CR541_RS09955 (CR541_09895) | - | 1957568..1957897 (-) | 330 | WP_012775364.1 | type II toxin-antitoxin system RelE/ParE family toxin | - |
| CR541_RS09960 (CR541_09900) | - | 1957887..1958174 (-) | 288 | WP_012028535.1 | hypothetical protein | - |
| CR541_RS09965 (CR541_09905) | - | 1958272..1958640 (-) | 369 | WP_012775453.1 | hypothetical protein | - |
| CR541_RS09970 (CR541_09910) | rnmV | 1958633..1959202 (-) | 570 | WP_012775366.1 | ribonuclease M5 | - |
| CR541_RS09975 (CR541_09915) | - | 1959186..1959968 (-) | 783 | WP_012028536.1 | TatD family hydrolase | - |
Sequence
Protein
Download Length: 817 a.a. Molecular weight: 90162.56 Da Isoelectric Point: 6.3138
>NTDB_id=251449 CR541_RS09785 WP_014636516.1 1920640..1923093(-) (clpC) [Streptococcus suis strain CS100322]
MKISRGLQGVYEDAQLIAQRYSSDYLETWHLLLAFVINPDTVAGAILAEYPADVLDYERAVYMVMGRRYHEELESFFFLP
SSKRVKELQVFAEKIAEIVKSKGLGTEHIFMGMLLDKRSTASQILDQVGFHFEDSDDKVRFLDLRKNLEAKAGFTKEHLK
AIRTMTKGGKPKQATVGNMMGMTQSQSGGLEDYTRDLTALARSGQLEPVIGRDEEISRMLQILSRKTKNNPVLVGDAGVG
KTALALGLAQRIANGEVPASLVNMRILELDLMNVIAGTRFRGDFEERMNNIINDIEEDGRVILFIDELHTIMGSGSGIDS
ILDAANILKPALSRGTLRTVGATTQDEYQKHIEKDAALVRRFAKVTIEEPSVADSVAILQGLKPAYEAHHKVTISDQAVV
TAVAYAKRYLTSKNLPDSAIDLLDEASATVQNRAKGQVEEGGLTALDQALMAGKYKTVTQLLLKAQEAENQATSYSLEVT
EEDILATLSRLSGIPVTKLSQTDAKKYLNLEQELHKRVIGQEEAISAVSRAIRRNQSGIRTGHRPIGSFMFLGPTGVGKT
ELAKALAEILFDDESALIRFDMSEYMEKFATSRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGVLTDRKGRKVDFSNTVIIMTSNLGATALRDDKTVGFGALDLSKSQEHVEKRIFEALKKAYRPEFINRIDEKVVFH
SLTEADMQDVVKVMVKPLIAVAASKGITLKLQASALKLLAKEGYDPEMGARPLRRLLQTKLEDPLAEMLLRGELPAGVTL
KVGVKAEQLKFDSVKAG
MKISRGLQGVYEDAQLIAQRYSSDYLETWHLLLAFVINPDTVAGAILAEYPADVLDYERAVYMVMGRRYHEELESFFFLP
SSKRVKELQVFAEKIAEIVKSKGLGTEHIFMGMLLDKRSTASQILDQVGFHFEDSDDKVRFLDLRKNLEAKAGFTKEHLK
AIRTMTKGGKPKQATVGNMMGMTQSQSGGLEDYTRDLTALARSGQLEPVIGRDEEISRMLQILSRKTKNNPVLVGDAGVG
KTALALGLAQRIANGEVPASLVNMRILELDLMNVIAGTRFRGDFEERMNNIINDIEEDGRVILFIDELHTIMGSGSGIDS
ILDAANILKPALSRGTLRTVGATTQDEYQKHIEKDAALVRRFAKVTIEEPSVADSVAILQGLKPAYEAHHKVTISDQAVV
TAVAYAKRYLTSKNLPDSAIDLLDEASATVQNRAKGQVEEGGLTALDQALMAGKYKTVTQLLLKAQEAENQATSYSLEVT
EEDILATLSRLSGIPVTKLSQTDAKKYLNLEQELHKRVIGQEEAISAVSRAIRRNQSGIRTGHRPIGSFMFLGPTGVGKT
ELAKALAEILFDDESALIRFDMSEYMEKFATSRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGVLTDRKGRKVDFSNTVIIMTSNLGATALRDDKTVGFGALDLSKSQEHVEKRIFEALKKAYRPEFINRIDEKVVFH
SLTEADMQDVVKVMVKPLIAVAASKGITLKLQASALKLLAKEGYDPEMGARPLRRLLQTKLEDPLAEMLLRGELPAGVTL
KVGVKAEQLKFDSVKAG
Nucleotide
Download Length: 2454 bp
>NTDB_id=251449 CR541_RS09785 WP_014636516.1 1920640..1923093(-) (clpC) [Streptococcus suis strain CS100322]
ATGAAGATTTCAAGAGGGTTACAGGGTGTCTATGAAGATGCTCAATTGATTGCACAGCGTTATAGTAGTGACTATTTGGA
GACCTGGCACTTGTTGTTAGCCTTTGTCATCAATCCAGATACCGTTGCGGGAGCTATTTTAGCAGAATATCCTGCGGATG
TATTGGACTATGAACGTGCAGTTTATATGGTGATGGGGCGGCGTTACCATGAAGAGTTAGAGAGCTTTTTCTTTCTTCCA
TCGTCCAAGCGGGTGAAGGAATTGCAGGTCTTTGCCGAGAAGATTGCGGAGATTGTCAAGAGTAAGGGGCTAGGAACGGA
GCATATTTTCATGGGAATGCTCTTGGACAAGCGTTCGACTGCCTCACAAATTCTGGATCAGGTCGGTTTTCACTTTGAGG
ATTCGGATGATAAGGTTCGTTTTCTGGATTTGCGGAAAAATCTGGAAGCCAAGGCTGGCTTTACCAAGGAGCATCTGAAG
GCTATCCGCACCATGACGAAAGGTGGCAAGCCCAAGCAGGCAACGGTTGGCAATATGATGGGCATGACCCAGTCACAAAG
TGGTGGCTTGGAAGACTATACACGTGATTTGACGGCTTTGGCCCGCTCAGGTCAGTTGGAGCCAGTCATCGGACGGGATG
AGGAAATTTCCCGTATGCTTCAGATTTTGTCGCGGAAAACCAAGAACAATCCTGTCTTGGTTGGAGATGCGGGTGTTGGG
AAAACAGCTCTGGCACTGGGTCTAGCCCAGCGGATTGCTAATGGAGAGGTGCCAGCTAGTCTTGTCAATATGCGGATCTT
GGAATTGGACTTGATGAATGTCATTGCGGGAACGCGTTTCCGTGGGGATTTTGAGGAGCGGATGAACAATATCATCAACG
ATATTGAAGAAGATGGTCGAGTGATTCTCTTCATTGATGAACTCCATACCATTATGGGATCGGGGTCAGGGATTGACTCG
ATCCTGGATGCTGCCAATATTTTGAAGCCTGCTCTGTCCCGTGGGACTTTGCGGACAGTTGGAGCAACGACTCAGGATGA
ATACCAGAAGCATATTGAAAAAGATGCTGCCTTAGTACGTCGATTTGCCAAGGTGACCATTGAGGAACCGAGTGTAGCAG
ACAGCGTAGCAATTTTGCAGGGGTTGAAGCCAGCCTATGAGGCTCACCACAAGGTGACCATTTCGGATCAGGCGGTGGTA
ACGGCGGTAGCCTATGCCAAACGCTATCTGACCAGTAAGAATTTGCCAGATTCGGCTATTGATTTGCTGGATGAAGCCAG
TGCGACGGTTCAAAATCGTGCCAAGGGACAGGTAGAAGAAGGTGGATTGACCGCTTTAGACCAAGCCTTGATGGCTGGGA
AATACAAGACGGTAACGCAGCTCTTGCTTAAGGCTCAAGAGGCGGAAAATCAGGCGACTAGCTATAGCTTGGAAGTCACA
GAAGAAGACATTTTGGCAACCCTCAGTCGCTTGTCAGGTATTCCTGTCACCAAACTGAGTCAGACAGATGCCAAGAAGTA
CCTTAATCTTGAACAGGAATTGCACAAGCGTGTTATCGGGCAGGAAGAGGCGATTTCAGCTGTCAGCCGGGCAATTCGCC
GCAACCAGTCAGGCATTCGCACTGGTCACAGACCGATTGGTTCCTTTATGTTCTTGGGGCCAACAGGTGTCGGTAAGACA
GAATTGGCCAAGGCCTTGGCGGAGATCCTCTTTGATGACGAATCTGCCTTGATTCGTTTTGATATGAGTGAGTATATGGA
GAAATTTGCGACTAGTCGCCTCAACGGTGCTCCTCCAGGCTATGTTGGCTATGAAGAAGGGGGCGAGCTGACAGAAAAAG
TTCGCAACAAGCCATACTCTGTCCTACTTTTTGATGAGGTGGAGAAAGCACATCCAGATATTTTCAATGTTCTTTTGCAG
GTCTTGGATGACGGTGTCTTGACGGACAGAAAAGGTCGCAAGGTTGATTTCTCTAATACGGTCATCATTATGACGTCTAA
CTTAGGGGCAACCGCTTTACGTGATGATAAGACAGTTGGGTTTGGGGCTCTTGATTTGTCTAAGAGTCAGGAACACGTTG
AAAAACGGATTTTTGAGGCGTTGAAGAAGGCCTATCGTCCTGAATTTATTAACCGGATTGATGAAAAAGTGGTCTTCCAT
AGCCTGACAGAAGCAGATATGCAGGATGTGGTCAAGGTCATGGTCAAACCATTGATTGCCGTGGCGGCCAGCAAGGGTAT
TACCCTCAAATTGCAGGCTTCTGCTCTTAAACTCTTGGCCAAAGAAGGCTACGATCCAGAAATGGGTGCCCGCCCACTTC
GTCGCCTCCTCCAAACCAAGTTGGAAGATCCATTGGCAGAAATGCTCTTACGTGGAGAACTGCCAGCTGGTGTGACCTTA
AAAGTAGGGGTCAAGGCCGAGCAGTTGAAGTTTGATAGTGTGAAAGCAGGTTAG
ATGAAGATTTCAAGAGGGTTACAGGGTGTCTATGAAGATGCTCAATTGATTGCACAGCGTTATAGTAGTGACTATTTGGA
GACCTGGCACTTGTTGTTAGCCTTTGTCATCAATCCAGATACCGTTGCGGGAGCTATTTTAGCAGAATATCCTGCGGATG
TATTGGACTATGAACGTGCAGTTTATATGGTGATGGGGCGGCGTTACCATGAAGAGTTAGAGAGCTTTTTCTTTCTTCCA
TCGTCCAAGCGGGTGAAGGAATTGCAGGTCTTTGCCGAGAAGATTGCGGAGATTGTCAAGAGTAAGGGGCTAGGAACGGA
GCATATTTTCATGGGAATGCTCTTGGACAAGCGTTCGACTGCCTCACAAATTCTGGATCAGGTCGGTTTTCACTTTGAGG
ATTCGGATGATAAGGTTCGTTTTCTGGATTTGCGGAAAAATCTGGAAGCCAAGGCTGGCTTTACCAAGGAGCATCTGAAG
GCTATCCGCACCATGACGAAAGGTGGCAAGCCCAAGCAGGCAACGGTTGGCAATATGATGGGCATGACCCAGTCACAAAG
TGGTGGCTTGGAAGACTATACACGTGATTTGACGGCTTTGGCCCGCTCAGGTCAGTTGGAGCCAGTCATCGGACGGGATG
AGGAAATTTCCCGTATGCTTCAGATTTTGTCGCGGAAAACCAAGAACAATCCTGTCTTGGTTGGAGATGCGGGTGTTGGG
AAAACAGCTCTGGCACTGGGTCTAGCCCAGCGGATTGCTAATGGAGAGGTGCCAGCTAGTCTTGTCAATATGCGGATCTT
GGAATTGGACTTGATGAATGTCATTGCGGGAACGCGTTTCCGTGGGGATTTTGAGGAGCGGATGAACAATATCATCAACG
ATATTGAAGAAGATGGTCGAGTGATTCTCTTCATTGATGAACTCCATACCATTATGGGATCGGGGTCAGGGATTGACTCG
ATCCTGGATGCTGCCAATATTTTGAAGCCTGCTCTGTCCCGTGGGACTTTGCGGACAGTTGGAGCAACGACTCAGGATGA
ATACCAGAAGCATATTGAAAAAGATGCTGCCTTAGTACGTCGATTTGCCAAGGTGACCATTGAGGAACCGAGTGTAGCAG
ACAGCGTAGCAATTTTGCAGGGGTTGAAGCCAGCCTATGAGGCTCACCACAAGGTGACCATTTCGGATCAGGCGGTGGTA
ACGGCGGTAGCCTATGCCAAACGCTATCTGACCAGTAAGAATTTGCCAGATTCGGCTATTGATTTGCTGGATGAAGCCAG
TGCGACGGTTCAAAATCGTGCCAAGGGACAGGTAGAAGAAGGTGGATTGACCGCTTTAGACCAAGCCTTGATGGCTGGGA
AATACAAGACGGTAACGCAGCTCTTGCTTAAGGCTCAAGAGGCGGAAAATCAGGCGACTAGCTATAGCTTGGAAGTCACA
GAAGAAGACATTTTGGCAACCCTCAGTCGCTTGTCAGGTATTCCTGTCACCAAACTGAGTCAGACAGATGCCAAGAAGTA
CCTTAATCTTGAACAGGAATTGCACAAGCGTGTTATCGGGCAGGAAGAGGCGATTTCAGCTGTCAGCCGGGCAATTCGCC
GCAACCAGTCAGGCATTCGCACTGGTCACAGACCGATTGGTTCCTTTATGTTCTTGGGGCCAACAGGTGTCGGTAAGACA
GAATTGGCCAAGGCCTTGGCGGAGATCCTCTTTGATGACGAATCTGCCTTGATTCGTTTTGATATGAGTGAGTATATGGA
GAAATTTGCGACTAGTCGCCTCAACGGTGCTCCTCCAGGCTATGTTGGCTATGAAGAAGGGGGCGAGCTGACAGAAAAAG
TTCGCAACAAGCCATACTCTGTCCTACTTTTTGATGAGGTGGAGAAAGCACATCCAGATATTTTCAATGTTCTTTTGCAG
GTCTTGGATGACGGTGTCTTGACGGACAGAAAAGGTCGCAAGGTTGATTTCTCTAATACGGTCATCATTATGACGTCTAA
CTTAGGGGCAACCGCTTTACGTGATGATAAGACAGTTGGGTTTGGGGCTCTTGATTTGTCTAAGAGTCAGGAACACGTTG
AAAAACGGATTTTTGAGGCGTTGAAGAAGGCCTATCGTCCTGAATTTATTAACCGGATTGATGAAAAAGTGGTCTTCCAT
AGCCTGACAGAAGCAGATATGCAGGATGTGGTCAAGGTCATGGTCAAACCATTGATTGCCGTGGCGGCCAGCAAGGGTAT
TACCCTCAAATTGCAGGCTTCTGCTCTTAAACTCTTGGCCAAAGAAGGCTACGATCCAGAAATGGGTGCCCGCCCACTTC
GTCGCCTCCTCCAAACCAAGTTGGAAGATCCATTGGCAGAAATGCTCTTACGTGGAGAACTGCCAGCTGGTGTGACCTTA
AAAGTAGGGGTCAAGGCCGAGCAGTTGAAGTTTGATAGTGTGAAAGCAGGTTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| clpC | Streptococcus pneumoniae TIGR4 |
71.341 |
99.51 |
0.71 |
| clpC | Streptococcus pneumoniae Rx1 |
71.341 |
99.51 |
0.71 |
| clpC | Streptococcus pneumoniae D39 |
71.341 |
99.51 |
0.71 |
| clpC | Streptococcus mutans UA159 |
66.049 |
99.143 |
0.655 |
| clpC | Streptococcus thermophilus LMG 18311 |
64.706 |
99.878 |
0.646 |
| clpC | Streptococcus thermophilus LMD-9 |
64.583 |
99.878 |
0.645 |
| clpC | Lactococcus lactis subsp. lactis strain DGCC12653 |
48.206 |
100 |
0.493 |
| clpC | Bacillus subtilis subsp. subtilis str. 168 |
44.526 |
99.51 |
0.443 |