Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   LIO33_RS08605 Genome accession   NZ_CP085088
Coordinates   1748436..1750889 (-) Length   817 a.a.
NCBI ID   WP_012027858.1    Uniprot ID   A0A2K1SY42
Organism   Streptococcus suis strain Ssuis_MA1     
Function   degradation of ComW (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1744538..1787765 1748436..1750889 within 0


Gene organization within MGE regions


Location: 1744538..1787765
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LIO33_RS08585 dusB 1744538..1745542 (+) 1005 WP_012775355.1 tRNA dihydrouridine synthase DusB -
  LIO33_RS08590 - 1745688..1746461 (-) 774 WP_012027854.1 NUDIX domain-containing protein -
  LIO33_RS08595 pnuC 1746454..1747275 (-) 822 WP_012027855.1 nicotinamide riboside transporter PnuC -
  LIO33_RS08600 - 1747285..1748322 (-) 1038 WP_012028524.1 AAA family ATPase -
  LIO33_RS08605 clpC 1748436..1750889 (-) 2454 WP_012027858.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  LIO33_RS08610 - 1750895..1751353 (-) 459 WP_012027859.1 CtsR family transcriptional regulator -
  LIO33_RS08615 - 1751503..1751841 (+) 339 WP_012027860.1 thioredoxin domain-containing protein -
  LIO33_RS08620 - 1751841..1752542 (+) 702 WP_012027861.1 hypothetical protein -
  LIO33_RS08625 tsf 1752805..1753845 (-) 1041 WP_012027862.1 translation elongation factor Ts -
  LIO33_RS08630 rpsB 1754001..1754777 (-) 777 WP_012775356.1 30S ribosomal protein S2 -
  LIO33_RS08635 - 1755124..1755636 (-) 513 WP_012775452.1 adenylate kinase -
  LIO33_RS08645 - 1756290..1761368 (-) 5079 WP_012775357.1 S8 family serine peptidase -
  LIO33_RS08650 nusG 1761512..1762051 (-) 540 WP_002940254.1 transcription termination/antitermination protein NusG -
  LIO33_RS08655 secE 1762161..1762337 (-) 177 WP_002940255.1 preprotein translocase subunit SecE -
  LIO33_RS08660 rpmG 1762347..1762499 (-) 153 WP_002940258.1 50S ribosomal protein L33 -
  LIO33_RS08665 pbp2a 1762533..1764746 (-) 2214 WP_012027868.1 penicillin-binding protein PBP2A -
  LIO33_RS08670 - 1764899..1765867 (+) 969 WP_012027869.1 NAD(P)/FAD-dependent oxidoreductase -
  LIO33_RS08675 - 1765869..1766735 (+) 867 WP_012027870.1 RluA family pseudouridine synthase -
  LIO33_RS08680 purR 1766762..1767574 (-) 813 WP_002938438.1 pur operon repressor -
  LIO33_RS08685 - 1767676..1768542 (-) 867 WP_012027872.1 aminoglycoside phosphotransferase family protein -
  LIO33_RS08690 - 1768542..1769483 (-) 942 WP_012775359.1 3'-5' exoribonuclease YhaM family protein -
  LIO33_RS08695 - 1769473..1770711 (-) 1239 WP_012027874.1 DNA recombination protein RmuC -
  LIO33_RS08700 - 1770712..1771344 (-) 633 WP_012027875.1 thiamine diphosphokinase -
  LIO33_RS08705 rpe 1771337..1771996 (-) 660 WP_012027876.1 ribulose-phosphate 3-epimerase -
  LIO33_RS08710 rsgA 1772011..1772886 (-) 876 WP_012027877.1 ribosome small subunit-dependent GTPase A -
  LIO33_RS08720 - 1773783..1775360 (-) 1578 WP_012027878.1 ABC transporter ATP-binding protein -
  LIO33_RS08725 - 1775357..1776598 (-) 1242 WP_014636518.1 radical SAM protein -
  LIO33_RS08730 - 1776967..1777830 (-) 864 WP_012027880.1 helix-turn-helix domain-containing protein -
  LIO33_RS08735 - 1778320..1780764 (-) 2445 WP_012027881.1 LTA synthase family protein -
  LIO33_RS08740 - 1780777..1782507 (-) 1731 WP_012775361.1 membrane protein -
  LIO33_RS08745 rsmA 1782872..1783744 (-) 873 WP_012775362.1 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))- dimethyltransferase RsmA -
  LIO33_RS08750 - 1783764..1784294 (-) 531 WP_009910909.1 DUF1697 domain-containing protein -
  LIO33_RS08755 - 1784308..1784565 (-) 258 WP_012775363.1 Txe/YoeB family addiction module toxin -
  LIO33_RS08760 - 1784567..1784827 (-) 261 WP_002939010.1 type II toxin-antitoxin system Phd/YefM family antitoxin -
  LIO33_RS08765 - 1784903..1785358 (-) 456 WP_012028534.1 8-oxo-dGTP diphosphatase -
  LIO33_RS08770 - 1785365..1785694 (-) 330 WP_012775364.1 type II toxin-antitoxin system RelE/ParE family toxin -
  LIO33_RS08775 - 1785684..1785971 (-) 288 WP_012028535.1 hypothetical protein -
  LIO33_RS08780 - 1786069..1786437 (-) 369 WP_012775453.1 hypothetical protein -
  LIO33_RS08785 rnmV 1786430..1786999 (-) 570 WP_012775366.1 ribonuclease M5 -
  LIO33_RS08790 - 1786983..1787765 (-) 783 WP_012028536.1 TatD family hydrolase -

Sequence


Protein


Download         Length: 817 a.a.        Molecular weight: 90132.53 Da        Isoelectric Point: 6.3138

>NTDB_id=615342 LIO33_RS08605 WP_012027858.1 1748436..1750889(-) (clpC) [Streptococcus suis strain Ssuis_MA1]
MKISRGLQGVYEDAQLIAQRYSSDYLETWHLLLAFVINPDTVAGAILAEYPADVLDYERAVYMVMGRRYHEELESFFFLP
SSKRVKELQVFAEKIAEIVKSKGLGTEHIFMGMLLDKRSTASQILDQVGFHFEDSDDKVRFLDLRKNLEAKAGFTKEHLK
AIRTMTKGGKPKQATVGNMMGMTQSQSGGLEDYTRDLTALARSGQLEPVIGRDEEISRMLQILSRKTKNNPVLVGDAGVG
KTALALGLAQRIANGEVPASLVNMRILELDLMNVIAGTRFRGDFEERMNNIINDIEEDGRVILFIDELHTIMGSGSGIDS
ILDAANILKPALSRGTLRTVGATTQDEYQKHIEKDAALVRRFAKVTIEEPSVADSVAILQGLKPAYEAHHKVTISDQAVV
TAVAYAKRYLTSKNLPDSAIDLLDEASATVQNRAKGQVEEGGLTALDQALMAGKYKTVTQLLLKAQEAENQATSYSLEVT
EEDILATLSRLSGIPVTKLSQTDAKKYLNLEQELHKRVIGQEEAISAVSRAIRRNQSGIRTGHRPIGSFMFLGPTGVGKT
ELAKALAEILFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGVLTDRKGRKVDFSNTVIIMTSNLGATALRDDKTVGFGALDLSKSQEHVEKRIFEALKKAYRPEFINRIDEKVVFH
SLTEADMQDVVKVMVKPLIAVAASKGITLKLQASALKLLAKEGYDPEMGARPLRRLLQTKLEDPLAEMLLRGELPAGVTL
KVGVKAEQLKFDSVKAG

Nucleotide


Download         Length: 2454 bp        

>NTDB_id=615342 LIO33_RS08605 WP_012027858.1 1748436..1750889(-) (clpC) [Streptococcus suis strain Ssuis_MA1]
ATGAAGATTTCAAGAGGGTTACAGGGTGTCTATGAAGATGCTCAATTGATTGCACAGCGTTATAGTAGTGACTATTTGGA
GACCTGGCACTTGTTGTTAGCCTTTGTCATCAATCCAGATACCGTTGCGGGAGCTATTTTAGCAGAATATCCTGCGGATG
TATTGGACTATGAACGTGCAGTTTATATGGTGATGGGGCGGCGTTACCATGAAGAGTTAGAGAGCTTTTTCTTTCTTCCA
TCGTCCAAGCGGGTGAAGGAATTGCAGGTCTTTGCCGAGAAGATTGCGGAGATTGTCAAGAGTAAGGGGCTAGGAACGGA
GCATATTTTCATGGGAATGCTCTTGGACAAGCGTTCGACTGCCTCACAAATTCTGGATCAGGTCGGTTTTCACTTTGAGG
ATTCGGATGATAAGGTTCGTTTTCTGGATTTGCGGAAAAATCTGGAAGCCAAGGCTGGCTTTACCAAGGAGCATCTGAAG
GCTATCCGCACCATGACGAAAGGTGGCAAGCCCAAGCAGGCAACGGTTGGCAATATGATGGGCATGACCCAGTCACAAAG
TGGTGGCTTGGAAGACTATACACGTGATTTGACGGCTTTGGCCCGCTCAGGTCAGTTGGAGCCAGTCATCGGACGGGATG
AGGAAATTTCCCGTATGCTTCAGATTTTGTCGCGGAAAACCAAGAACAATCCTGTCTTGGTTGGAGATGCGGGTGTTGGG
AAAACAGCTCTGGCACTGGGTCTAGCCCAGCGGATTGCTAATGGAGAGGTGCCAGCTAGTCTTGTCAATATGCGGATCTT
GGAATTGGACTTGATGAATGTCATTGCGGGAACGCGTTTCCGTGGGGATTTTGAGGAGCGGATGAACAATATCATCAACG
ATATTGAAGAAGATGGTCGAGTGATTCTCTTCATTGATGAACTCCATACCATTATGGGATCGGGGTCAGGGATTGACTCG
ATCCTGGATGCTGCCAATATTTTGAAGCCTGCTCTGTCCCGTGGGACTTTGCGGACAGTTGGAGCAACGACTCAGGATGA
ATACCAGAAGCATATTGAAAAAGATGCTGCCTTAGTACGTCGATTTGCCAAGGTGACCATTGAGGAACCGAGTGTAGCAG
ACAGCGTAGCAATTTTGCAGGGGTTGAAGCCAGCCTATGAGGCTCACCACAAGGTGACCATTTCGGATCAGGCGGTGGTA
ACGGCGGTAGCCTATGCCAAACGCTATCTGACCAGTAAGAATTTGCCAGATTCGGCTATTGATTTGCTGGATGAAGCCAG
TGCGACGGTTCAAAATCGTGCCAAGGGACAGGTAGAAGAAGGTGGATTGACCGCTTTAGACCAAGCCTTGATGGCTGGGA
AATACAAGACGGTAACGCAGCTCTTGCTTAAGGCTCAAGAGGCGGAAAATCAGGCGACTAGCTATAGCTTGGAAGTCACA
GAAGAAGACATTTTGGCAACCCTCAGTCGCTTGTCAGGTATTCCTGTCACCAAACTGAGTCAGACAGATGCCAAGAAGTA
CCTTAATCTTGAACAGGAATTGCACAAGCGTGTTATCGGGCAGGAAGAGGCGATTTCAGCTGTCAGCCGGGCAATTCGCC
GCAACCAGTCAGGCATTCGCACTGGTCACAGACCGATTGGTTCCTTTATGTTCTTGGGGCCAACAGGTGTCGGTAAGACA
GAATTGGCCAAGGCCTTGGCGGAGATCCTCTTTGATGACGAATCTGCCTTGATTCGTTTTGATATGAGTGAGTATATGGA
GAAATTTGCGGCTAGTCGCCTCAACGGTGCTCCTCCAGGCTATGTTGGCTATGAAGAAGGGGGCGAGCTGACAGAAAAAG
TTCGCAACAAGCCATACTCTGTCCTACTTTTTGATGAGGTGGAGAAAGCACATCCAGATATTTTCAATGTTCTTTTGCAG
GTCTTGGATGACGGTGTCTTGACGGACAGAAAAGGTCGCAAGGTTGATTTCTCTAATACGGTCATCATTATGACGTCTAA
CTTAGGGGCAACCGCTTTACGTGATGATAAGACAGTTGGGTTTGGGGCTCTTGATTTGTCTAAGAGTCAGGAACACGTTG
AAAAACGGATTTTTGAGGCGTTGAAGAAGGCCTATCGTCCTGAATTTATTAACCGGATTGATGAAAAAGTGGTCTTCCAT
AGCCTGACAGAAGCAGATATGCAGGATGTGGTCAAGGTCATGGTCAAACCATTGATTGCCGTGGCGGCCAGCAAGGGTAT
TACCCTCAAATTGCAGGCTTCTGCTCTTAAACTCTTGGCCAAAGAAGGCTACGATCCAGAAATGGGTGCCCGCCCACTTC
GTCGCCTCCTCCAAACCAAGTTGGAAGATCCATTGGCAGAAATGCTCTTACGTGGAGAACTGCCAGCTGGTGTGACCTTA
AAAGTAGGGGTCAAGGCCGAGCAGTTGAAGTTTGATAGTGTGAAAGCAGGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2K1SY42

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus pneumoniae TIGR4

71.464

99.51

0.711

  clpC Streptococcus pneumoniae Rx1

71.464

99.51

0.711

  clpC Streptococcus pneumoniae D39

71.464

99.51

0.711

  clpC Streptococcus mutans UA159

66.173

99.143

0.656

  clpC Streptococcus thermophilus LMG 18311

64.828

99.878

0.647

  clpC Streptococcus thermophilus LMD-9

64.706

99.878

0.646

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

48.086

100

0.492

  clpC Bacillus subtilis subsp. subtilis str. 168

44.403

99.51

0.442