Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   QBE57_RS09825 Genome accession   NZ_CP121670
Coordinates   1999033..2001486 (+) Length   817 a.a.
NCBI ID   WP_014636516.1    Uniprot ID   -
Organism   Streptococcus suis strain HBXY-SL     
Function   degradation of ComW (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1962927..2006243 1999033..2001486 within 0


Gene organization within MGE regions


Location: 1962927..2006243
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QBE57_RS09645 rnmV 1962927..1963496 (+) 570 WP_012775366.1 ribonuclease M5 -
  QBE57_RS09650 - 1963489..1963857 (+) 369 WP_012775453.1 hypothetical protein -
  QBE57_RS09655 - 1963955..1964242 (+) 288 WP_012028535.1 hypothetical protein -
  QBE57_RS09660 - 1964232..1964561 (+) 330 WP_012775364.1 type II toxin-antitoxin system RelE/ParE family toxin -
  QBE57_RS09665 - 1964568..1965023 (+) 456 WP_012028534.1 8-oxo-dGTP diphosphatase -
  QBE57_RS09670 - 1965099..1965359 (+) 261 WP_002939010.1 type II toxin-antitoxin system Phd/YefM family antitoxin -
  QBE57_RS09675 - 1965361..1965618 (+) 258 WP_012775363.1 Txe/YoeB family addiction module toxin -
  QBE57_RS09680 - 1965632..1966162 (+) 531 WP_009910909.1 DUF1697 domain-containing protein -
  QBE57_RS09685 rsmA 1966182..1967054 (+) 873 WP_012775362.1 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))- dimethyltransferase RsmA -
  QBE57_RS09690 - 1967418..1969148 (+) 1731 WP_012775361.1 membrane protein -
  QBE57_RS09695 - 1969161..1971605 (+) 2445 WP_012027881.1 LTA synthase family protein -
  QBE57_RS09700 - 1972095..1972958 (+) 864 WP_012027880.1 helix-turn-helix domain-containing protein -
  QBE57_RS09705 - 1973327..1974568 (+) 1242 WP_014636518.1 radical SAM protein -
  QBE57_RS09710 - 1974565..1976142 (+) 1578 WP_012027878.1 ATP-binding cassette domain-containing protein -
  QBE57_RS09720 rsgA 1977037..1977912 (+) 876 WP_012027877.1 ribosome small subunit-dependent GTPase A -
  QBE57_RS09725 rpe 1977927..1978586 (+) 660 WP_012027876.1 ribulose-phosphate 3-epimerase -
  QBE57_RS09730 - 1978579..1979211 (+) 633 WP_012027875.1 thiamine diphosphokinase -
  QBE57_RS09735 - 1979212..1980450 (+) 1239 WP_012027874.1 DNA recombination protein RmuC -
  QBE57_RS09740 - 1980440..1981381 (+) 942 WP_012775359.1 3'-5' exoribonuclease YhaM family protein -
  QBE57_RS09745 - 1981381..1982247 (+) 867 WP_012027872.1 aminoglycoside phosphotransferase family protein -
  QBE57_RS09750 purR 1982349..1983161 (+) 813 WP_002938438.1 pur operon repressor -
  QBE57_RS09755 - 1983188..1984054 (-) 867 WP_012027870.1 RluA family pseudouridine synthase -
  QBE57_RS09760 - 1984056..1985024 (-) 969 WP_012027869.1 NAD(P)/FAD-dependent oxidoreductase -
  QBE57_RS09765 pbp2a 1985177..1987390 (+) 2214 WP_012027868.1 penicillin-binding protein PBP2A -
  QBE57_RS09770 rpmG 1987424..1987576 (+) 153 WP_002940258.1 50S ribosomal protein L33 -
  QBE57_RS09775 secE 1987586..1987762 (+) 177 WP_002940255.1 preprotein translocase subunit SecE -
  QBE57_RS09780 nusG 1987872..1988411 (+) 540 WP_002940254.1 transcription termination/antitermination protein NusG -
  QBE57_RS09785 - 1988555..1993633 (+) 5079 WP_012775357.1 S8 family serine peptidase -
  QBE57_RS09795 - 1994287..1994799 (+) 513 WP_012775452.1 adenylate kinase -
  QBE57_RS09800 rpsB 1995146..1995922 (+) 777 WP_012775356.1 30S ribosomal protein S2 -
  QBE57_RS09805 tsf 1996078..1997118 (+) 1041 WP_012027862.1 translation elongation factor Ts -
  QBE57_RS09810 - 1997381..1998082 (-) 702 WP_012027861.1 hypothetical protein -
  QBE57_RS09815 - 1998082..1998420 (-) 339 WP_012027860.1 thioredoxin domain-containing protein -
  QBE57_RS09820 - 1998570..1999027 (+) 458 Protein_1891 CtsR family transcriptional regulator -
  QBE57_RS09825 clpC 1999033..2001486 (+) 2454 WP_014636516.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  QBE57_RS09830 - 2001600..2002637 (+) 1038 WP_012028524.1 AAA family ATPase -
  QBE57_RS09835 pnuC 2002647..2003468 (+) 822 WP_012027855.1 nicotinamide riboside transporter PnuC -
  QBE57_RS09840 - 2003461..2004234 (+) 774 WP_012027854.1 NUDIX domain-containing protein -
  QBE57_RS09845 dusB 2004380..2005384 (-) 1005 WP_012775355.1 tRNA dihydrouridine synthase DusB -
  QBE57_RS09850 hslO 2005377..2006243 (-) 867 WP_012027852.1 Hsp33 family molecular chaperone HslO -

Sequence


Protein


Download         Length: 817 a.a.        Molecular weight: 90162.56 Da        Isoelectric Point: 6.3138

>NTDB_id=813809 QBE57_RS09825 WP_014636516.1 1999033..2001486(+) (clpC) [Streptococcus suis strain HBXY-SL]
MKISRGLQGVYEDAQLIAQRYSSDYLETWHLLLAFVINPDTVAGAILAEYPADVLDYERAVYMVMGRRYHEELESFFFLP
SSKRVKELQVFAEKIAEIVKSKGLGTEHIFMGMLLDKRSTASQILDQVGFHFEDSDDKVRFLDLRKNLEAKAGFTKEHLK
AIRTMTKGGKPKQATVGNMMGMTQSQSGGLEDYTRDLTALARSGQLEPVIGRDEEISRMLQILSRKTKNNPVLVGDAGVG
KTALALGLAQRIANGEVPASLVNMRILELDLMNVIAGTRFRGDFEERMNNIINDIEEDGRVILFIDELHTIMGSGSGIDS
ILDAANILKPALSRGTLRTVGATTQDEYQKHIEKDAALVRRFAKVTIEEPSVADSVAILQGLKPAYEAHHKVTISDQAVV
TAVAYAKRYLTSKNLPDSAIDLLDEASATVQNRAKGQVEEGGLTALDQALMAGKYKTVTQLLLKAQEAENQATSYSLEVT
EEDILATLSRLSGIPVTKLSQTDAKKYLNLEQELHKRVIGQEEAISAVSRAIRRNQSGIRTGHRPIGSFMFLGPTGVGKT
ELAKALAEILFDDESALIRFDMSEYMEKFATSRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGVLTDRKGRKVDFSNTVIIMTSNLGATALRDDKTVGFGALDLSKSQEHVEKRIFEALKKAYRPEFINRIDEKVVFH
SLTEADMQDVVKVMVKPLIAVAASKGITLKLQASALKLLAKEGYDPEMGARPLRRLLQTKLEDPLAEMLLRGELPAGVTL
KVGVKAEQLKFDSVKAG

Nucleotide


Download         Length: 2454 bp        

>NTDB_id=813809 QBE57_RS09825 WP_014636516.1 1999033..2001486(+) (clpC) [Streptococcus suis strain HBXY-SL]
ATGAAGATTTCAAGAGGGTTACAGGGTGTCTATGAAGATGCTCAATTGATTGCACAGCGTTATAGTAGTGACTATTTGGA
GACCTGGCACTTGTTGTTAGCCTTTGTCATCAATCCAGATACCGTTGCGGGAGCTATTTTAGCAGAATATCCTGCGGATG
TATTGGACTATGAACGTGCAGTTTATATGGTGATGGGGCGGCGTTACCATGAAGAGTTAGAGAGCTTTTTCTTTCTTCCA
TCGTCCAAGCGGGTGAAGGAATTGCAGGTCTTTGCCGAGAAGATTGCGGAGATTGTCAAGAGTAAGGGGCTAGGAACGGA
GCATATTTTCATGGGAATGCTCTTGGACAAGCGTTCGACTGCCTCACAAATTCTGGATCAGGTCGGTTTTCACTTTGAGG
ATTCGGATGATAAGGTTCGTTTTCTGGATTTGCGGAAAAATCTGGAAGCCAAGGCTGGCTTTACCAAGGAGCATCTGAAG
GCTATCCGCACCATGACGAAAGGTGGCAAGCCCAAGCAGGCAACGGTTGGCAATATGATGGGCATGACCCAGTCACAAAG
TGGTGGCTTGGAAGACTATACACGTGATTTGACGGCTTTGGCCCGCTCAGGTCAGTTGGAGCCAGTCATCGGACGGGATG
AGGAAATTTCCCGTATGCTTCAGATTTTGTCGCGGAAAACCAAGAACAATCCTGTCTTGGTTGGAGATGCGGGTGTTGGG
AAAACAGCTCTGGCACTGGGTCTAGCCCAGCGGATTGCTAATGGAGAGGTGCCAGCTAGTCTTGTCAATATGCGGATCTT
GGAATTGGACTTGATGAATGTCATTGCGGGAACGCGTTTCCGTGGGGATTTTGAGGAGCGGATGAACAATATCATCAACG
ATATTGAAGAAGATGGTCGAGTGATTCTCTTCATTGATGAACTCCATACCATTATGGGATCGGGGTCAGGGATTGACTCG
ATCCTGGATGCTGCCAATATTTTGAAGCCTGCTCTGTCCCGTGGGACTTTGCGGACAGTTGGAGCAACGACTCAGGATGA
ATACCAGAAGCATATTGAAAAAGATGCTGCCTTAGTACGTCGATTTGCCAAGGTGACCATTGAGGAACCGAGTGTAGCAG
ACAGCGTAGCAATTTTGCAGGGGTTGAAGCCAGCCTATGAGGCTCACCACAAGGTGACCATTTCGGATCAGGCGGTGGTA
ACGGCGGTAGCCTATGCCAAACGCTATCTGACCAGTAAGAATTTGCCAGATTCGGCTATTGATTTGCTGGATGAAGCCAG
TGCGACGGTTCAAAATCGTGCCAAGGGACAGGTAGAAGAAGGTGGATTGACCGCTTTAGACCAAGCCTTGATGGCTGGGA
AATACAAGACGGTAACGCAGCTCTTGCTTAAGGCTCAAGAGGCGGAAAATCAGGCGACTAGCTATAGCTTGGAAGTCACA
GAAGAAGACATTTTGGCAACCCTCAGTCGCTTGTCAGGTATTCCTGTCACCAAACTGAGTCAGACAGATGCCAAGAAGTA
CCTTAATCTTGAACAGGAATTGCACAAGCGTGTTATCGGGCAGGAAGAGGCGATTTCAGCTGTCAGCCGGGCAATTCGCC
GCAACCAGTCAGGCATTCGCACTGGTCACAGACCGATTGGTTCCTTTATGTTCTTGGGGCCAACAGGTGTCGGTAAGACA
GAATTGGCCAAGGCCTTGGCGGAGATCCTCTTTGATGACGAATCTGCCTTGATTCGTTTTGATATGAGTGAGTATATGGA
GAAATTTGCGACTAGTCGCCTCAACGGTGCTCCTCCAGGCTATGTTGGCTATGAAGAAGGGGGCGAGCTGACAGAAAAAG
TTCGCAACAAGCCATACTCTGTCCTACTTTTTGATGAGGTGGAGAAAGCACATCCAGATATTTTCAATGTTCTTTTGCAG
GTCTTGGATGACGGTGTCTTGACGGACAGAAAAGGTCGCAAGGTTGATTTCTCTAATACGGTCATCATTATGACGTCTAA
CTTAGGGGCAACCGCTTTACGTGATGATAAGACAGTTGGGTTTGGGGCTCTTGATTTGTCTAAGAGTCAGGAACACGTTG
AAAAACGGATTTTTGAGGCGTTGAAGAAGGCCTATCGTCCTGAATTTATTAACCGGATTGATGAAAAAGTGGTCTTCCAT
AGCCTGACAGAAGCAGATATGCAGGATGTGGTCAAGGTCATGGTCAAACCATTGATTGCCGTGGCGGCCAGCAAGGGTAT
TACCCTCAAATTGCAGGCTTCTGCTCTTAAACTCTTGGCCAAAGAAGGCTACGATCCAGAAATGGGTGCCCGCCCACTTC
GTCGCCTCCTCCAAACCAAGTTGGAAGATCCATTGGCAGAAATGCTCTTACGTGGAGAACTGCCAGCTGGTGTGACCTTA
AAAGTAGGGGTCAAGGCCGAGCAGTTGAAGTTTGATAGTGTGAAAGCAGGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus pneumoniae TIGR4

71.341

99.51

0.71

  clpC Streptococcus pneumoniae Rx1

71.341

99.51

0.71

  clpC Streptococcus pneumoniae D39

71.341

99.51

0.71

  clpC Streptococcus mutans UA159

66.049

99.143

0.655

  clpC Streptococcus thermophilus LMG 18311

64.706

99.878

0.646

  clpC Streptococcus thermophilus LMD-9

64.583

99.878

0.645

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

48.206

100

0.493

  clpC Bacillus subtilis subsp. subtilis str. 168

44.526

99.51

0.443