Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   MU520_RS01085 Genome accession   NZ_CP095463
Coordinates   232591..235044 (+) Length   817 a.a.
NCBI ID   WP_012027858.1    Uniprot ID   A0A2K1SY42
Organism   Streptococcus suis strain TJS56     
Function   degradation of ComW (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 196481..239801 232591..235044 within 0


Gene organization within MGE regions


Location: 196481..239801
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MU520_RS00905 rnmV 196481..197050 (+) 570 WP_012775366.1 ribonuclease M5 -
  MU520_RS00910 - 197043..197411 (+) 369 WP_012775453.1 hypothetical protein -
  MU520_RS00915 - 197509..197796 (+) 288 WP_012028535.1 hypothetical protein -
  MU520_RS00920 - 197786..198115 (+) 330 WP_012775364.1 type II toxin-antitoxin system RelE/ParE family toxin -
  MU520_RS00925 - 198122..198577 (+) 456 WP_012028534.1 8-oxo-dGTP diphosphatase -
  MU520_RS00930 - 198653..198913 (+) 261 WP_002939010.1 type II toxin-antitoxin system Phd/YefM family antitoxin -
  MU520_RS00935 - 198915..199172 (+) 258 WP_012775363.1 Txe/YoeB family addiction module toxin -
  MU520_RS00940 - 199186..199716 (+) 531 WP_009910909.1 DUF1697 domain-containing protein -
  MU520_RS00945 rsmA 199736..200608 (+) 873 WP_012775362.1 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))- dimethyltransferase RsmA -
  MU520_RS00950 - 200973..202703 (+) 1731 WP_012775361.1 membrane protein -
  MU520_RS00955 - 202716..205160 (+) 2445 WP_012027881.1 alkaline phosphatase family protein -
  MU520_RS00960 - 205650..206513 (+) 864 WP_012027880.1 Rgg/GadR/MutR family transcriptional regulator -
  MU520_RS00965 - 206882..208123 (+) 1242 WP_014636518.1 radical SAM protein -
  MU520_RS00970 - 208120..209697 (+) 1578 WP_172074192.1 ABC transporter ATP-binding protein -
  MU520_RS00980 rsgA 210594..211469 (+) 876 WP_012027877.1 ribosome small subunit-dependent GTPase A -
  MU520_RS00985 rpe 211484..212143 (+) 660 WP_012027876.1 ribulose-phosphate 3-epimerase -
  MU520_RS00990 - 212136..212768 (+) 633 WP_012027875.1 thiamine diphosphokinase -
  MU520_RS00995 rmuC 212769..214007 (+) 1239 WP_012027874.1 DNA recombination protein RmuC -
  MU520_RS01000 - 213997..214938 (+) 942 WP_012775359.1 3'-5' exoribonuclease YhaM family protein -
  MU520_RS01005 - 214938..215804 (+) 867 WP_012027872.1 phosphotransferase -
  MU520_RS01010 purR 215906..216718 (+) 813 WP_002938438.1 pur operon repressor -
  MU520_RS01015 - 216745..217611 (-) 867 WP_012027870.1 RluA family pseudouridine synthase -
  MU520_RS01020 - 217613..218581 (-) 969 WP_012027869.1 NAD(P)/FAD-dependent oxidoreductase -
  MU520_RS01025 pbp2a 218734..220947 (+) 2214 WP_012027868.1 penicillin-binding protein PBP2A -
  MU520_RS01030 rpmG 220981..221133 (+) 153 WP_002940258.1 50S ribosomal protein L33 -
  MU520_RS01035 secE 221143..221319 (+) 177 WP_002940255.1 preprotein translocase subunit SecE -
  MU520_RS01040 nusG 221429..221968 (+) 540 WP_002940254.1 transcription termination/antitermination protein NusG -
  MU520_RS01045 - 222112..227190 (+) 5079 WP_012775357.1 S8 family serine peptidase -
  MU520_RS01055 - 227844..228356 (+) 513 WP_012775452.1 adenylate kinase -
  MU520_RS01060 rpsB 228703..229479 (+) 777 WP_012775356.1 30S ribosomal protein S2 -
  MU520_RS01065 tsf 229635..230675 (+) 1041 WP_012027862.1 translation elongation factor Ts -
  MU520_RS01070 - 230938..231639 (-) 702 WP_012027861.1 hypothetical protein -
  MU520_RS01075 - 231639..231977 (-) 339 WP_012027860.1 thioredoxin domain-containing protein -
  MU520_RS01080 - 232127..232585 (+) 459 WP_012027859.1 CtsR family transcriptional regulator -
  MU520_RS01085 clpC 232591..235044 (+) 2454 WP_012027858.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  MU520_RS01090 - 235158..236195 (+) 1038 WP_012028524.1 AAA family ATPase -
  MU520_RS01095 pnuC 236205..237026 (+) 822 WP_012027855.1 nicotinamide riboside transporter PnuC -
  MU520_RS01100 - 237019..237792 (+) 774 WP_012027854.1 NUDIX domain-containing protein -
  MU520_RS01105 dusB 237938..238942 (-) 1005 WP_012775355.1 tRNA dihydrouridine synthase DusB -
  MU520_RS01110 hslO 238935..239801 (-) 867 WP_012027852.1 Hsp33 family molecular chaperone HslO -

Sequence


Protein


Download         Length: 817 a.a.        Molecular weight: 90132.53 Da        Isoelectric Point: 6.3138

>NTDB_id=676989 MU520_RS01085 WP_012027858.1 232591..235044(+) (clpC) [Streptococcus suis strain TJS56]
MKISRGLQGVYEDAQLIAQRYSSDYLETWHLLLAFVINPDTVAGAILAEYPADVLDYERAVYMVMGRRYHEELESFFFLP
SSKRVKELQVFAEKIAEIVKSKGLGTEHIFMGMLLDKRSTASQILDQVGFHFEDSDDKVRFLDLRKNLEAKAGFTKEHLK
AIRTMTKGGKPKQATVGNMMGMTQSQSGGLEDYTRDLTALARSGQLEPVIGRDEEISRMLQILSRKTKNNPVLVGDAGVG
KTALALGLAQRIANGEVPASLVNMRILELDLMNVIAGTRFRGDFEERMNNIINDIEEDGRVILFIDELHTIMGSGSGIDS
ILDAANILKPALSRGTLRTVGATTQDEYQKHIEKDAALVRRFAKVTIEEPSVADSVAILQGLKPAYEAHHKVTISDQAVV
TAVAYAKRYLTSKNLPDSAIDLLDEASATVQNRAKGQVEEGGLTALDQALMAGKYKTVTQLLLKAQEAENQATSYSLEVT
EEDILATLSRLSGIPVTKLSQTDAKKYLNLEQELHKRVIGQEEAISAVSRAIRRNQSGIRTGHRPIGSFMFLGPTGVGKT
ELAKALAEILFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGVLTDRKGRKVDFSNTVIIMTSNLGATALRDDKTVGFGALDLSKSQEHVEKRIFEALKKAYRPEFINRIDEKVVFH
SLTEADMQDVVKVMVKPLIAVAASKGITLKLQASALKLLAKEGYDPEMGARPLRRLLQTKLEDPLAEMLLRGELPAGVTL
KVGVKAEQLKFDSVKAG

Nucleotide


Download         Length: 2454 bp        

>NTDB_id=676989 MU520_RS01085 WP_012027858.1 232591..235044(+) (clpC) [Streptococcus suis strain TJS56]
ATGAAGATTTCAAGAGGGTTACAGGGTGTCTATGAAGATGCTCAATTGATTGCACAGCGTTATAGTAGTGACTATTTGGA
GACCTGGCACTTGTTGTTAGCCTTTGTCATCAATCCAGATACCGTTGCGGGAGCTATTTTAGCAGAATATCCTGCGGATG
TATTGGACTATGAACGTGCAGTTTATATGGTGATGGGGCGGCGTTACCATGAAGAGTTAGAGAGCTTTTTCTTTCTTCCA
TCGTCCAAGCGGGTGAAGGAATTGCAGGTCTTTGCCGAGAAGATTGCGGAGATTGTCAAGAGTAAGGGGCTAGGAACGGA
GCATATTTTCATGGGAATGCTCTTGGACAAGCGTTCGACTGCCTCACAAATTCTGGATCAGGTCGGTTTTCACTTTGAGG
ATTCGGATGATAAGGTTCGTTTTCTGGATTTGCGGAAAAATCTGGAAGCCAAGGCTGGCTTTACCAAGGAGCATCTGAAG
GCTATCCGCACCATGACGAAAGGTGGCAAGCCCAAGCAGGCAACGGTTGGCAATATGATGGGCATGACCCAGTCACAAAG
TGGTGGCTTGGAAGACTATACACGTGATTTGACGGCTTTGGCCCGCTCAGGTCAGTTGGAGCCAGTCATCGGACGGGATG
AGGAAATTTCCCGTATGCTTCAGATTTTGTCGCGGAAAACCAAGAACAATCCTGTCTTGGTTGGAGATGCGGGTGTTGGG
AAAACAGCTCTGGCACTGGGTCTAGCCCAGCGGATTGCTAATGGAGAGGTGCCAGCTAGTCTTGTCAATATGCGGATCTT
GGAATTGGACTTGATGAATGTCATTGCGGGAACGCGTTTCCGTGGGGATTTTGAGGAGCGGATGAACAATATCATCAACG
ATATTGAAGAAGATGGTCGAGTGATTCTCTTCATTGATGAACTCCATACCATTATGGGATCGGGGTCAGGGATTGACTCG
ATCCTGGATGCTGCCAATATTTTGAAGCCTGCTCTGTCCCGTGGGACTTTGCGGACAGTTGGAGCAACGACTCAGGATGA
ATACCAGAAGCATATTGAAAAAGATGCTGCCTTAGTACGTCGATTTGCCAAGGTGACCATTGAGGAACCGAGTGTAGCAG
ACAGCGTAGCAATTTTGCAGGGGTTGAAGCCAGCCTATGAGGCTCACCACAAGGTGACCATTTCGGATCAGGCGGTGGTA
ACGGCGGTAGCCTATGCCAAACGCTATCTGACCAGTAAGAATTTGCCAGATTCGGCTATTGATTTGCTGGATGAAGCCAG
TGCGACGGTTCAAAATCGTGCCAAGGGACAGGTAGAAGAAGGTGGATTGACCGCTTTAGACCAAGCCTTGATGGCTGGGA
AATACAAGACGGTAACGCAGCTCTTGCTTAAGGCTCAAGAGGCGGAAAATCAGGCGACTAGCTATAGCTTGGAAGTCACA
GAAGAAGACATTTTGGCAACCCTCAGTCGCTTGTCAGGTATTCCTGTCACCAAACTGAGTCAGACAGATGCCAAGAAGTA
CCTTAATCTTGAACAGGAATTGCACAAGCGTGTTATCGGGCAGGAAGAGGCGATTTCAGCTGTCAGCCGGGCAATTCGCC
GCAACCAGTCAGGCATTCGCACTGGTCACAGACCGATTGGTTCCTTTATGTTCTTGGGGCCAACAGGTGTCGGTAAGACA
GAATTGGCCAAGGCCTTGGCGGAGATCCTCTTTGATGACGAATCTGCCTTGATTCGTTTTGATATGAGTGAGTATATGGA
GAAATTTGCGGCTAGTCGCCTCAACGGTGCTCCTCCAGGCTATGTTGGCTATGAAGAAGGGGGCGAGCTGACAGAAAAAG
TTCGCAACAAGCCATACTCTGTCCTACTTTTTGATGAGGTGGAGAAAGCACATCCAGATATTTTCAATGTTCTTTTGCAG
GTCTTGGATGACGGTGTCTTGACGGACAGAAAAGGTCGCAAGGTTGATTTCTCTAATACGGTCATCATTATGACGTCTAA
CTTAGGGGCAACCGCTTTACGTGATGATAAGACAGTTGGGTTTGGGGCTCTTGATTTGTCTAAGAGTCAGGAACACGTTG
AAAAACGGATTTTTGAGGCGTTGAAGAAGGCCTATCGTCCTGAATTTATTAACCGGATTGATGAAAAAGTGGTCTTCCAT
AGCCTGACAGAAGCAGATATGCAGGATGTGGTCAAGGTCATGGTCAAACCATTGATTGCCGTGGCGGCCAGCAAGGGTAT
TACCCTCAAATTGCAGGCTTCTGCTCTTAAACTCTTGGCCAAAGAAGGCTACGATCCAGAAATGGGTGCCCGCCCACTTC
GTCGCCTCCTCCAAACCAAGTTGGAAGATCCATTGGCAGAAATGCTCTTACGTGGAGAACTGCCAGCTGGTGTGACCTTA
AAAGTAGGGGTCAAGGCCGAGCAGTTGAAGTTTGATAGTGTGAAAGCAGGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2K1SY42

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus pneumoniae TIGR4

71.464

99.51

0.711

  clpC Streptococcus pneumoniae Rx1

71.464

99.51

0.711

  clpC Streptococcus pneumoniae D39

71.464

99.51

0.711

  clpC Streptococcus mutans UA159

66.173

99.143

0.656

  clpC Streptococcus thermophilus LMG 18311

64.828

99.878

0.647

  clpC Streptococcus thermophilus LMD-9

64.706

99.878

0.646

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

48.086

100

0.492

  clpC Bacillus subtilis subsp. subtilis str. 168

44.403

99.51

0.442