Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   SKB18_RS06495 Genome accession   NZ_CP139163
Coordinates   1316323..1318776 (+) Length   817 a.a.
NCBI ID   WP_012027858.1    Uniprot ID   A0A2K1SY42
Organism   Streptococcus suis strain cnzyss2-311     
Function   degradation of ComW (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1280233..1323533 1316323..1318776 within 0


Gene organization within MGE regions


Location: 1280233..1323533
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SKB18_RS06310 rnmV 1280233..1280802 (+) 570 WP_012775366.1 ribonuclease M5 -
  SKB18_RS06315 - 1280795..1281163 (+) 369 WP_012775453.1 hypothetical protein -
  SKB18_RS06320 - 1281261..1281548 (+) 288 WP_012028535.1 hypothetical protein -
  SKB18_RS06325 - 1281538..1281867 (+) 330 WP_012775364.1 type II toxin-antitoxin system RelE/ParE family toxin -
  SKB18_RS06330 - 1281874..1282329 (+) 456 WP_012028534.1 8-oxo-dGTP diphosphatase -
  SKB18_RS06335 - 1282405..1282665 (+) 261 WP_002939010.1 type II toxin-antitoxin system Phd/YefM family antitoxin -
  SKB18_RS06340 - 1282667..1282924 (+) 258 WP_012775363.1 Txe/YoeB family addiction module toxin -
  SKB18_RS06345 - 1282938..1283468 (+) 531 WP_009910909.1 DUF1697 domain-containing protein -
  SKB18_RS06350 rsmA 1283488..1284360 (+) 873 WP_012775362.1 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))- dimethyltransferase RsmA -
  SKB18_RS06355 - 1284725..1286455 (+) 1731 WP_012775361.1 membrane protein -
  SKB18_RS06360 - 1286468..1288912 (+) 2445 WP_012027881.1 LTA synthase family protein -
  SKB18_RS06365 - 1289402..1290265 (+) 864 WP_012027880.1 helix-turn-helix domain-containing protein -
  SKB18_RS06370 - 1290634..1291875 (+) 1242 WP_014636518.1 radical SAM protein -
  SKB18_RS06375 - 1291872..1293449 (+) 1578 WP_012027878.1 ABC transporter ATP-binding protein -
  SKB18_RS06385 rsgA 1294346..1295221 (+) 876 WP_012027877.1 ribosome small subunit-dependent GTPase A -
  SKB18_RS06390 rpe 1295236..1295895 (+) 660 WP_012027876.1 ribulose-phosphate 3-epimerase -
  SKB18_RS06395 - 1295888..1296520 (+) 633 WP_012027875.1 thiamine diphosphokinase -
  SKB18_RS06400 - 1296521..1297759 (+) 1239 WP_012027874.1 DNA recombination protein RmuC -
  SKB18_RS06405 - 1297749..1298690 (+) 942 WP_012775359.1 3'-5' exoribonuclease YhaM family protein -
  SKB18_RS06410 - 1298690..1299556 (+) 867 WP_012027872.1 aminoglycoside phosphotransferase family protein -
  SKB18_RS06415 purR 1299658..1300470 (+) 813 WP_002938438.1 pur operon repressor -
  SKB18_RS06420 - 1300497..1301345 (-) 849 WP_014636718.1 RluA family pseudouridine synthase -
  SKB18_RS06425 - 1301347..1302315 (-) 969 WP_012027869.1 NAD(P)/FAD-dependent oxidoreductase -
  SKB18_RS06430 pbp2a 1302468..1304681 (+) 2214 WP_012027868.1 penicillin-binding protein PBP2A -
  SKB18_RS06435 rpmG 1304715..1304867 (+) 153 WP_002940258.1 50S ribosomal protein L33 -
  SKB18_RS06440 secE 1304877..1305053 (+) 177 WP_002940255.1 preprotein translocase subunit SecE -
  SKB18_RS06445 nusG 1305163..1305702 (+) 540 WP_002940254.1 transcription termination/antitermination protein NusG -
  SKB18_RS06450 - 1305846..1306748 (+) 903 WP_320531342.1 YSIRK-type signal peptide-containing protein -
  SKB18_RS06455 - 1306720..1310922 (+) 4203 WP_320531343.1 S8 family serine peptidase -
  SKB18_RS06465 - 1311576..1312088 (+) 513 WP_012775452.1 adenylate kinase -
  SKB18_RS06470 rpsB 1312435..1313211 (+) 777 WP_012775356.1 30S ribosomal protein S2 -
  SKB18_RS06475 tsf 1313367..1314407 (+) 1041 WP_012027862.1 translation elongation factor Ts -
  SKB18_RS06480 - 1314670..1315371 (-) 702 WP_012027861.1 hypothetical protein -
  SKB18_RS06485 - 1315371..1315709 (-) 339 WP_012027860.1 thioredoxin domain-containing protein -
  SKB18_RS06490 - 1315859..1316317 (+) 459 WP_012027859.1 CtsR family transcriptional regulator -
  SKB18_RS06495 clpC 1316323..1318776 (+) 2454 WP_012027858.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  SKB18_RS06500 - 1318890..1319927 (+) 1038 WP_012028524.1 AAA family ATPase -
  SKB18_RS06505 pnuC 1319937..1320758 (+) 822 WP_012027855.1 nicotinamide riboside transporter PnuC -
  SKB18_RS06510 - 1320751..1321524 (+) 774 WP_012027854.1 NUDIX domain-containing protein -
  SKB18_RS06515 dusB 1321670..1322674 (-) 1005 WP_012775355.1 tRNA dihydrouridine synthase DusB -
  SKB18_RS06520 hslO 1322667..1323533 (-) 867 WP_012027852.1 Hsp33 family molecular chaperone HslO -

Sequence


Protein


Download         Length: 817 a.a.        Molecular weight: 90132.53 Da        Isoelectric Point: 6.3138

>NTDB_id=907463 SKB18_RS06495 WP_012027858.1 1316323..1318776(+) (clpC) [Streptococcus suis strain cnzyss2-311]
MKISRGLQGVYEDAQLIAQRYSSDYLETWHLLLAFVINPDTVAGAILAEYPADVLDYERAVYMVMGRRYHEELESFFFLP
SSKRVKELQVFAEKIAEIVKSKGLGTEHIFMGMLLDKRSTASQILDQVGFHFEDSDDKVRFLDLRKNLEAKAGFTKEHLK
AIRTMTKGGKPKQATVGNMMGMTQSQSGGLEDYTRDLTALARSGQLEPVIGRDEEISRMLQILSRKTKNNPVLVGDAGVG
KTALALGLAQRIANGEVPASLVNMRILELDLMNVIAGTRFRGDFEERMNNIINDIEEDGRVILFIDELHTIMGSGSGIDS
ILDAANILKPALSRGTLRTVGATTQDEYQKHIEKDAALVRRFAKVTIEEPSVADSVAILQGLKPAYEAHHKVTISDQAVV
TAVAYAKRYLTSKNLPDSAIDLLDEASATVQNRAKGQVEEGGLTALDQALMAGKYKTVTQLLLKAQEAENQATSYSLEVT
EEDILATLSRLSGIPVTKLSQTDAKKYLNLEQELHKRVIGQEEAISAVSRAIRRNQSGIRTGHRPIGSFMFLGPTGVGKT
ELAKALAEILFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGVLTDRKGRKVDFSNTVIIMTSNLGATALRDDKTVGFGALDLSKSQEHVEKRIFEALKKAYRPEFINRIDEKVVFH
SLTEADMQDVVKVMVKPLIAVAASKGITLKLQASALKLLAKEGYDPEMGARPLRRLLQTKLEDPLAEMLLRGELPAGVTL
KVGVKAEQLKFDSVKAG

Nucleotide


Download         Length: 2454 bp        

>NTDB_id=907463 SKB18_RS06495 WP_012027858.1 1316323..1318776(+) (clpC) [Streptococcus suis strain cnzyss2-311]
ATGAAGATTTCAAGAGGGTTACAGGGTGTCTATGAAGATGCTCAATTGATTGCACAGCGTTATAGTAGTGACTATTTGGA
GACCTGGCACTTGTTGTTAGCCTTTGTCATCAATCCAGATACCGTTGCGGGAGCTATTTTAGCAGAATATCCTGCGGATG
TATTGGACTATGAACGTGCAGTTTATATGGTGATGGGGCGGCGTTACCATGAAGAGTTAGAGAGCTTTTTCTTTCTTCCA
TCGTCCAAGCGGGTGAAGGAATTGCAGGTCTTTGCCGAGAAGATTGCGGAGATTGTCAAGAGTAAGGGGCTAGGAACGGA
GCATATTTTCATGGGAATGCTCTTGGACAAGCGTTCGACTGCCTCACAAATTCTGGATCAGGTCGGTTTTCACTTTGAGG
ATTCGGATGATAAGGTTCGTTTTCTGGATTTGCGGAAAAATCTGGAAGCCAAGGCTGGCTTTACCAAGGAGCATCTGAAG
GCTATCCGCACCATGACGAAAGGTGGCAAGCCCAAGCAGGCAACGGTTGGCAATATGATGGGCATGACCCAGTCACAAAG
TGGTGGCTTGGAAGACTATACACGTGATTTGACGGCTTTGGCCCGCTCAGGTCAGTTGGAGCCAGTCATCGGACGGGATG
AGGAAATTTCCCGTATGCTTCAGATTTTGTCGCGGAAAACCAAGAACAATCCTGTCTTGGTTGGAGATGCGGGTGTTGGG
AAAACAGCTCTGGCACTGGGTCTAGCCCAGCGGATTGCTAATGGAGAGGTGCCAGCTAGTCTTGTCAATATGCGGATCTT
GGAATTGGACTTGATGAATGTCATTGCGGGAACGCGTTTCCGTGGGGATTTTGAGGAGCGGATGAACAATATCATCAACG
ATATTGAAGAAGATGGTCGAGTGATTCTCTTCATTGATGAACTCCATACCATTATGGGATCGGGGTCAGGGATTGACTCG
ATCCTGGATGCTGCCAATATTTTGAAGCCTGCTCTGTCCCGTGGGACTTTGCGGACAGTTGGAGCAACGACTCAGGATGA
ATACCAGAAGCATATTGAAAAAGATGCTGCCTTAGTACGTCGATTTGCCAAGGTGACCATTGAGGAACCGAGTGTAGCAG
ACAGCGTAGCAATTTTGCAGGGGTTGAAGCCAGCCTATGAGGCTCACCACAAGGTGACCATTTCGGATCAGGCGGTGGTA
ACGGCGGTAGCCTATGCCAAACGCTATCTGACCAGTAAGAATTTGCCAGATTCGGCTATTGATTTGCTGGATGAAGCCAG
TGCGACGGTTCAAAATCGTGCCAAGGGACAGGTAGAAGAAGGTGGATTGACCGCTTTAGACCAAGCCTTGATGGCTGGGA
AATACAAGACGGTAACGCAGCTCTTGCTTAAGGCTCAAGAGGCGGAAAATCAGGCGACTAGCTATAGCTTGGAAGTCACA
GAAGAAGACATTTTGGCAACCCTCAGTCGCTTGTCAGGTATTCCTGTCACCAAACTGAGTCAGACAGATGCCAAGAAGTA
CCTTAATCTTGAACAGGAATTGCACAAGCGTGTTATCGGGCAGGAAGAGGCGATTTCAGCTGTCAGCCGGGCAATTCGCC
GCAACCAGTCAGGCATTCGCACTGGTCACAGACCGATTGGTTCCTTTATGTTCTTGGGGCCAACAGGTGTCGGTAAGACA
GAATTGGCCAAGGCCTTGGCGGAGATCCTCTTTGATGACGAATCTGCCTTGATTCGTTTTGATATGAGTGAGTATATGGA
GAAATTTGCGGCTAGTCGCCTCAACGGTGCTCCTCCAGGCTATGTTGGCTATGAAGAAGGGGGCGAGCTGACAGAAAAAG
TTCGCAACAAGCCATACTCTGTCCTACTTTTTGATGAGGTGGAGAAAGCACATCCAGATATTTTCAATGTTCTTTTGCAG
GTCTTGGATGACGGTGTCTTGACGGACAGAAAAGGTCGCAAGGTTGATTTCTCTAATACGGTCATCATTATGACGTCTAA
CTTAGGGGCAACCGCTTTACGTGATGATAAGACAGTTGGGTTTGGGGCTCTTGATTTGTCTAAGAGTCAGGAACACGTTG
AAAAACGGATTTTTGAGGCGTTGAAGAAGGCCTATCGTCCTGAATTTATTAACCGGATTGATGAAAAAGTGGTCTTCCAT
AGCCTGACAGAAGCAGATATGCAGGATGTGGTCAAGGTCATGGTCAAACCATTGATTGCCGTGGCGGCCAGCAAGGGTAT
TACCCTCAAATTGCAGGCTTCTGCTCTTAAACTCTTGGCCAAAGAAGGCTACGATCCAGAAATGGGTGCCCGCCCACTTC
GTCGCCTCCTCCAAACCAAGTTGGAAGATCCATTGGCAGAAATGCTCTTACGTGGAGAACTGCCAGCTGGTGTGACCTTA
AAAGTAGGGGTCAAGGCCGAGCAGTTGAAGTTTGATAGTGTGAAAGCAGGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2K1SY42

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus pneumoniae TIGR4

71.464

99.51

0.711

  clpC Streptococcus pneumoniae Rx1

71.464

99.51

0.711

  clpC Streptococcus pneumoniae D39

71.464

99.51

0.711

  clpC Streptococcus mutans UA159

66.173

99.143

0.656

  clpC Streptococcus thermophilus LMG 18311

64.828

99.878

0.647

  clpC Streptococcus thermophilus LMD-9

64.706

99.878

0.646

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

48.086

100

0.492

  clpC Bacillus subtilis subsp. subtilis str. 168

44.403

99.51

0.442