Detailed information
Overview
| Name | clpC | Type | Regulator |
| Locus tag | SSGZ1_RS09015 | Genome accession | NC_017617 |
| Coordinates | 1823582..1826035 (-) | Length | 817 a.a. |
| NCBI ID | WP_012027858.1 | UniProt ID | A0A2K1SY42 |
| Organism | Streptococcus suis GZ1 | | |
| Function | degradation of ComW (predicted from homology); competence regulation | | |
Related MGE
Note: This gene co-localizes with a putative mobile genetic element (MGE) that VRprofile2 predicted in the genome, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 1819684..1862910 | 1823582..1826035 | within | 0 |
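The "within" / 0 bp entry in the summary table follows from comparing the two coordinate ranges. A minimal sketch of that comparison, assuming 1-based inclusive coordinates; the function name `relative_position` and the labels other than "within" are hypothetical and are not taken from VRprofile2 or this database:

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both intervals are (start, end) tuples, assumed 1-based and inclusive.
    Only the "within" label appears in the record; the other labels are
    illustrative placeholders.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "before", m_start - g_end
    if g_start > m_end:
        return "after", g_start - m_end
    return "overlapping", 0


# Coordinates copied from the summary table above.
gene = (1823582, 1826035)   # clpC
mge = (1819684, 1862910)    # genomic island

position, distance = relative_position(gene, mge)
print(position, distance)
```

With the coordinates from the table, the gene range falls entirely inside the genomic island, which matches the reported relative position and distance.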
Gene organization within MGE regions
Location: 1819684..1862910
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SSGZ1_RS08995 (SSGZ1_1786) | dusB | 1819684..1820688 (+) | 1005 | WP_012775355.1 | tRNA dihydrouridine synthase DusB | - |
| SSGZ1_RS09000 (SSGZ1_1787) | - | 1820834..1821607 (-) | 774 | WP_012027854.1 | NUDIX domain-containing protein | - |
| SSGZ1_RS09005 (SSGZ1_1788) | pnuC | 1821600..1822421 (-) | 822 | WP_012027855.1 | nicotinamide riboside transporter PnuC | - |
| SSGZ1_RS09010 (SSGZ1_1789) | - | 1822431..1823468 (-) | 1038 | WP_012028524.1 | AAA family ATPase | - |
| SSGZ1_RS09015 (SSGZ1_1790) | clpC | 1823582..1826035 (-) | 2454 | WP_012027858.1 | ATP-dependent Clp protease ATP-binding subunit | Regulator |
| SSGZ1_RS09020 (SSGZ1_1791) | - | 1826041..1826499 (-) | 459 | WP_012027859.1 | CtsR family transcriptional regulator | - |
| SSGZ1_RS09025 (SSGZ1_1792) | - | 1826649..1826987 (+) | 339 | WP_012027860.1 | thioredoxin domain-containing protein | - |
| SSGZ1_RS09030 (SSGZ1_1793) | - | 1826987..1827688 (+) | 702 | WP_012027861.1 | hypothetical protein | - |
| SSGZ1_RS09035 (SSGZ1_1794) | tsf | 1827951..1828991 (-) | 1041 | WP_012027862.1 | translation elongation factor Ts | - |
| SSGZ1_RS09040 (SSGZ1_1795) | rpsB | 1829147..1829923 (-) | 777 | WP_012775356.1 | 30S ribosomal protein S2 | - |
| SSGZ1_RS09045 (SSGZ1_1796) | - | 1830270..1830782 (-) | 513 | WP_012775452.1 | adenylate kinase | - |
| SSGZ1_RS09055 (SSGZ1_1797) | - | 1831436..1836514 (-) | 5079 | WP_012775357.1 | S8 family serine peptidase | - |
| SSGZ1_RS09060 (SSGZ1_1798) | nusG | 1836658..1837196 (-) | 539 | Protein_1738 | transcription termination/antitermination protein NusG | - |
| SSGZ1_RS09065 (SSGZ1_1799) | secE | 1837306..1837482 (-) | 177 | WP_002940255.1 | preprotein translocase subunit SecE | - |
| SSGZ1_RS09070 (SSGZ1_1800) | rpmG | 1837492..1837644 (-) | 153 | WP_002940258.1 | 50S ribosomal protein L33 | - |
| SSGZ1_RS09075 (SSGZ1_1801) | pbp2a | 1837678..1839891 (-) | 2214 | WP_012027868.1 | penicillin-binding protein PBP2A | - |
| SSGZ1_RS09080 (SSGZ1_1802) | - | 1840044..1841012 (+) | 969 | WP_012027869.1 | NAD(P)/FAD-dependent oxidoreductase | - |
| SSGZ1_RS09085 (SSGZ1_1803) | - | 1841014..1841880 (+) | 867 | WP_012027870.1 | RluA family pseudouridine synthase | - |
| SSGZ1_RS09090 (SSGZ1_1804) | purR | 1841907..1842719 (-) | 813 | WP_002938438.1 | pur operon repressor | - |
| SSGZ1_RS09095 (SSGZ1_1805) | - | 1842821..1843687 (-) | 867 | WP_012027872.1 | aminoglycoside phosphotransferase family protein | - |
| SSGZ1_RS09100 (SSGZ1_1806) | - | 1843687..1844628 (-) | 942 | WP_012775359.1 | 3'-5' exoribonuclease YhaM family protein | - |
| SSGZ1_RS09105 (SSGZ1_1807) | - | 1844618..1845856 (-) | 1239 | WP_012027874.1 | DNA recombination protein RmuC | - |
| SSGZ1_RS09110 (SSGZ1_1808) | - | 1845857..1846489 (-) | 633 | WP_012027875.1 | thiamine diphosphokinase | - |
| SSGZ1_RS09115 (SSGZ1_1809) | rpe | 1846482..1847141 (-) | 660 | WP_012027876.1 | ribulose-phosphate 3-epimerase | - |
| SSGZ1_RS09120 (SSGZ1_1810) | rsgA | 1847156..1848031 (-) | 876 | WP_012027877.1 | ribosome small subunit-dependent GTPase A | - |
| SSGZ1_RS09130 (SSGZ1_1811) | - | 1848928..1850505 (-) | 1578 | WP_012027878.1 | ATP-binding cassette domain-containing protein | - |
| SSGZ1_RS09135 (SSGZ1_1812) | - | 1850502..1851743 (-) | 1242 | WP_014636518.1 | radical SAM protein | - |
| SSGZ1_RS09140 (SSGZ1_1813) | - | 1852112..1852975 (-) | 864 | WP_012027880.1 | helix-turn-helix domain-containing protein | - |
| SSGZ1_RS09150 (SSGZ1_1814) | - | 1853465..1855909 (-) | 2445 | WP_012027881.1 | LTA synthase family protein | - |
| SSGZ1_RS10660 (SSGZ1_1815) | - | 1855922..1856104 (-) | 183 | WP_014636156.1 | hypothetical protein | - |
| SSGZ1_RS09155 (SSGZ1_1816) | - | 1856139..1857653 (-) | 1515 | WP_014636157.1 | membrane protein | - |
| SSGZ1_RS09160 (SSGZ1_1817) | rsmA | 1858018..1858890 (-) | 873 | WP_012775362.1 | 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA | - |
| SSGZ1_RS09165 (SSGZ1_1818) | - | 1858910..1859439 (-) | 530 | Protein_1758 | DUF1697 domain-containing protein | - |
| SSGZ1_RS09170 | - | 1859453..1859704 (-) | 252 | WP_041179125.1 | Txe/YoeB family addiction module toxin | - |
| SSGZ1_RS09175 (SSGZ1_1820) | - | 1859713..1859972 (-) | 260 | Protein_1760 | type II toxin-antitoxin system Phd/YefM family antitoxin | - |
| SSGZ1_RS09180 (SSGZ1_1821) | - | 1860048..1860503 (-) | 456 | WP_012028534.1 | 8-oxo-dGTP diphosphatase | - |
| SSGZ1_RS09185 (SSGZ1_1822) | - | 1860510..1860839 (-) | 330 | WP_012775364.1 | type II toxin-antitoxin system RelE/ParE family toxin | - |
| SSGZ1_RS09190 (SSGZ1_1823) | - | 1860829..1861116 (-) | 288 | WP_012028535.1 | hypothetical protein | - |
| SSGZ1_RS09195 (SSGZ1_1824) | - | 1861214..1861582 (-) | 369 | WP_012775453.1 | hypothetical protein | - |
| SSGZ1_RS09200 (SSGZ1_1825) | rnmV | 1861575..1862144 (-) | 570 | WP_012775366.1 | ribonuclease M5 | - |
| SSGZ1_RS09205 (SSGZ1_1826) | - | 1862128..1862910 (-) | 783 | WP_012028536.1 | TatD family hydrolase | - |
Sequence
Protein
Length: 817 a.a.; Molecular weight: 90132.53 Da; Isoelectric point: 6.3138
>NTDB_id=49601 SSGZ1_RS09015 WP_012027858.1 1823582..1826035(-) (clpC) [Streptococcus suis GZ1]
MKISRGLQGVYEDAQLIAQRYSSDYLETWHLLLAFVINPDTVAGAILAEYPADVLDYERAVYMVMGRRYHEELESFFFLP
SSKRVKELQVFAEKIAEIVKSKGLGTEHIFMGMLLDKRSTASQILDQVGFHFEDSDDKVRFLDLRKNLEAKAGFTKEHLK
AIRTMTKGGKPKQATVGNMMGMTQSQSGGLEDYTRDLTALARSGQLEPVIGRDEEISRMLQILSRKTKNNPVLVGDAGVG
KTALALGLAQRIANGEVPASLVNMRILELDLMNVIAGTRFRGDFEERMNNIINDIEEDGRVILFIDELHTIMGSGSGIDS
ILDAANILKPALSRGTLRTVGATTQDEYQKHIEKDAALVRRFAKVTIEEPSVADSVAILQGLKPAYEAHHKVTISDQAVV
TAVAYAKRYLTSKNLPDSAIDLLDEASATVQNRAKGQVEEGGLTALDQALMAGKYKTVTQLLLKAQEAENQATSYSLEVT
EEDILATLSRLSGIPVTKLSQTDAKKYLNLEQELHKRVIGQEEAISAVSRAIRRNQSGIRTGHRPIGSFMFLGPTGVGKT
ELAKALAEILFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGVLTDRKGRKVDFSNTVIIMTSNLGATALRDDKTVGFGALDLSKSQEHVEKRIFEALKKAYRPEFINRIDEKVVFH
SLTEADMQDVVKVMVKPLIAVAASKGITLKLQASALKLLAKEGYDPEMGARPLRRLLQTKLEDPLAEMLLRGELPAGVTL
KVGVKAEQLKFDSVKAG
Nucleotide
Length: 2454 bp
>NTDB_id=49601 SSGZ1_RS09015 WP_012027858.1 1823582..1826035(-) (clpC) [Streptococcus suis GZ1]
ATGAAGATTTCAAGAGGGTTACAGGGTGTCTATGAAGATGCTCAATTGATTGCACAGCGTTATAGTAGTGACTATTTGGA
GACCTGGCACTTGTTGTTAGCCTTTGTCATCAATCCAGATACCGTTGCGGGAGCTATTTTAGCAGAATATCCTGCGGATG
TATTGGACTATGAACGTGCAGTTTATATGGTGATGGGGCGGCGTTACCATGAAGAGTTAGAGAGCTTTTTCTTTCTTCCA
TCGTCCAAGCGGGTGAAGGAATTGCAGGTCTTTGCCGAGAAGATTGCGGAGATTGTCAAGAGTAAGGGGCTAGGAACGGA
GCATATTTTCATGGGAATGCTCTTGGACAAGCGTTCGACTGCCTCACAAATTCTGGATCAGGTCGGTTTTCACTTTGAGG
ATTCGGATGATAAGGTTCGTTTTCTGGATTTGCGGAAAAATCTGGAAGCCAAGGCTGGCTTTACCAAGGAGCATCTGAAG
GCTATCCGCACCATGACGAAAGGTGGCAAGCCCAAGCAGGCAACGGTTGGCAATATGATGGGCATGACCCAGTCACAAAG
TGGTGGCTTGGAAGACTATACACGTGATTTGACGGCTTTGGCCCGCTCAGGTCAGTTGGAGCCAGTCATCGGACGGGATG
AGGAAATTTCCCGTATGCTTCAGATTTTGTCGCGGAAAACCAAGAACAATCCTGTCTTGGTTGGAGATGCGGGTGTTGGG
AAAACAGCTCTGGCACTGGGTCTAGCCCAGCGGATTGCTAATGGAGAGGTGCCAGCTAGTCTTGTCAATATGCGGATCTT
GGAATTGGACTTGATGAATGTCATTGCGGGAACGCGTTTCCGTGGGGATTTTGAGGAGCGGATGAACAATATCATCAACG
ATATTGAAGAAGATGGTCGAGTGATTCTCTTCATTGATGAACTCCATACCATTATGGGATCGGGGTCAGGGATTGACTCG
ATCCTGGATGCTGCCAATATTTTGAAGCCTGCTCTGTCCCGTGGGACTTTGCGGACAGTTGGAGCAACGACTCAGGATGA
ATACCAGAAGCATATTGAAAAAGATGCTGCCTTAGTACGTCGATTTGCCAAGGTGACCATTGAGGAACCGAGTGTAGCAG
ACAGCGTAGCAATTTTGCAGGGGTTGAAGCCAGCCTATGAGGCTCACCACAAGGTGACCATTTCGGATCAGGCGGTGGTA
ACGGCGGTAGCCTATGCCAAACGCTATCTGACCAGTAAGAATTTGCCAGATTCGGCTATTGATTTGCTGGATGAAGCCAG
TGCGACGGTTCAAAATCGTGCCAAGGGACAGGTAGAAGAAGGTGGATTGACCGCTTTAGACCAAGCCTTGATGGCTGGGA
AATACAAGACGGTAACGCAGCTCTTGCTTAAGGCTCAAGAGGCGGAAAATCAGGCGACTAGCTATAGCTTGGAAGTCACA
GAAGAAGACATTTTGGCAACCCTCAGTCGCTTGTCAGGTATTCCTGTCACCAAACTGAGTCAGACAGATGCCAAGAAGTA
CCTTAATCTTGAACAGGAATTGCACAAGCGTGTTATCGGGCAGGAAGAGGCGATTTCAGCTGTCAGCCGGGCAATTCGCC
GCAACCAGTCAGGCATTCGCACTGGTCACAGACCGATTGGTTCCTTTATGTTCTTGGGGCCAACAGGTGTCGGTAAGACA
GAATTGGCCAAGGCCTTGGCGGAGATCCTCTTTGATGACGAATCTGCCTTGATTCGTTTTGATATGAGTGAGTATATGGA
GAAATTTGCGGCTAGTCGCCTCAACGGTGCTCCTCCAGGCTATGTTGGCTATGAAGAAGGGGGCGAGCTGACAGAAAAAG
TTCGCAACAAGCCATACTCTGTCCTACTTTTTGATGAGGTGGAGAAAGCACATCCAGATATTTTCAATGTTCTTTTGCAG
GTCTTGGATGACGGTGTCTTGACGGACAGAAAAGGTCGCAAGGTTGATTTCTCTAATACGGTCATCATTATGACGTCTAA
CTTAGGGGCAACCGCTTTACGTGATGATAAGACAGTTGGGTTTGGGGCTCTTGATTTGTCTAAGAGTCAGGAACACGTTG
AAAAACGGATTTTTGAGGCGTTGAAGAAGGCCTATCGTCCTGAATTTATTAACCGGATTGATGAAAAAGTGGTCTTCCAT
AGCCTGACAGAAGCAGATATGCAGGATGTGGTCAAGGTCATGGTCAAACCATTGATTGCCGTGGCGGCCAGCAAGGGTAT
TACCCTCAAATTGCAGGCTTCTGCTCTTAAACTCTTGGCCAAAGAAGGCTACGATCCAGAAATGGGTGCCCGCCCACTTC
GTCGCCTCCTCCAAACCAAGTTGGAAGATCCATTGGCAGAAATGCTCTTACGTGGAGAACTGCCAGCTGGTGTGACCTTA
AAAGTAGGGGTCAAGGCCGAGCAGTTGAAGTTTGATAGTGTGAAAGCAGGTTAG
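The coordinates, nucleotide length, and protein length reported in this record can be cross-checked with simple arithmetic. A minimal sketch, assuming inclusive coordinates and a coding sequence that ends with a single stop codon (the sequence above ends in TAG); the helper names are hypothetical:

```python
def feature_length(start, end):
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end - start + 1


def protein_length_from_cds(cds_bp):
    """Amino-acid count for a CDS that includes one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# Coordinates copied from the Overview table.
cds_bp = feature_length(1823582, 1826035)
protein_aa = protein_length_from_cds(cds_bp)
print(cds_bp, protein_aa)
```

The results agree with the 2454 bp and 817 a.a. lengths stated in the record.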
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| clpC | Streptococcus pneumoniae TIGR4 | 71.464 | 99.51 | 0.711 |
| clpC | Streptococcus pneumoniae Rx1 | 71.464 | 99.51 | 0.711 |
| clpC | Streptococcus pneumoniae D39 | 71.464 | 99.51 | 0.711 |
| clpC | Streptococcus mutans UA159 | 66.173 | 99.143 | 0.656 |
| clpC | Streptococcus thermophilus LMG 18311 | 64.828 | 99.878 | 0.647 |
| clpC | Streptococcus thermophilus LMD-9 | 64.706 | 99.878 | 0.646 |
| clpC | Lactococcus lactis subsp. lactis strain DGCC12653 | 48.086 | 100 | 0.492 |
| clpC | Bacillus subtilis subsp. subtilis str. 168 | 44.403 | 99.51 | 0.442 |