Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   PFZ59_RS04420 Genome accession   NZ_CP116393
Coordinates   826707..829154 (+) Length   815 a.a.
NCBI ID   WP_277697588.1    Uniprot ID   -
Organism   Streptococcus suis strain SS/UPM/MY/F001     
Function   degradation of ComW (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 780112..830940 826707..829154 within 0
IScluster/Tn 818565..829754 826707..829154 within 0


Gene organization within MGE regions


Location: 780112..830940
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PFZ59_RS04055 (PFZ59_04055) - 780112..781182 (-) 1071 WP_105125704.1 site-specific integrase -
  PFZ59_RS04060 (PFZ59_04060) - 781425..782441 (-) 1017 WP_277697535.1 Abi family protein -
  PFZ59_RS04065 (PFZ59_04065) - 782735..783073 (-) 339 WP_277697536.1 ImmA/IrrE family metallo-endopeptidase -
  PFZ59_RS04070 (PFZ59_04070) - 783080..783445 (-) 366 WP_277697537.1 helix-turn-helix transcriptional regulator -
  PFZ59_RS04075 (PFZ59_04075) - 783835..783924 (-) 90 WP_277697538.1 type I toxin-antitoxin system Fst family toxin -
  PFZ59_RS04080 (PFZ59_04080) - 784129..784218 (-) 90 WP_099780080.1 type I addiction module toxin, Fst family -
  PFZ59_RS04085 (PFZ59_04085) - 784331..784489 (+) 159 WP_165437029.1 hypothetical protein -
  PFZ59_RS04090 (PFZ59_04090) - 784520..784732 (+) 213 WP_277697539.1 helix-turn-helix transcriptional regulator -
  PFZ59_RS04095 (PFZ59_04095) - 784737..784916 (+) 180 WP_277697540.1 hypothetical protein -
  PFZ59_RS04100 (PFZ59_04100) - 784909..785082 (+) 174 WP_277697541.1 hypothetical protein -
  PFZ59_RS04105 (PFZ59_04105) - 785229..786017 (-) 789 WP_277697542.1 DUF4393 domain-containing protein -
  PFZ59_RS04110 (PFZ59_04110) - 786066..786794 (+) 729 WP_277697543.1 phage antirepressor KilAC domain-containing protein -
  PFZ59_RS04115 (PFZ59_04115) - 786837..787121 (+) 285 WP_044671772.1 hypothetical protein -
  PFZ59_RS04120 (PFZ59_04120) - 787118..787306 (+) 189 WP_277697544.1 hypothetical protein -
  PFZ59_RS04125 (PFZ59_04125) - 787308..787562 (+) 255 WP_277697545.1 hypothetical protein -
  PFZ59_RS04130 (PFZ59_04130) - 787577..787741 (+) 165 WP_277697546.1 hypothetical protein -
  PFZ59_RS04135 (PFZ59_04135) - 787725..788519 (+) 795 WP_277697547.1 phage replisome organizer N-terminal domain-containing protein -
  PFZ59_RS04140 (PFZ59_04140) - 788507..788725 (+) 219 WP_044758388.1 hypothetical protein -
  PFZ59_RS04145 (PFZ59_04145) - 788725..788892 (+) 168 WP_277697548.1 hypothetical protein -
  PFZ59_RS04150 (PFZ59_04150) - 788923..789153 (+) 231 WP_277697549.1 DUF3310 domain-containing protein -
  PFZ59_RS04155 (PFZ59_04155) - 789150..789452 (+) 303 WP_277697090.1 DUF1372 family protein -
  PFZ59_RS04160 (PFZ59_04160) - 789428..790708 (+) 1281 WP_277697550.1 N-6 DNA methylase -
  PFZ59_RS04165 (PFZ59_04165) - 790708..791130 (+) 423 WP_277697087.1 restriction endonuclease subunit S -
  PFZ59_RS04170 (PFZ59_04170) - 791147..791395 (+) 249 WP_277697551.1 hypothetical protein -
  PFZ59_RS04175 (PFZ59_04175) - 791388..791954 (+) 567 WP_277697552.1 hypothetical protein -
  PFZ59_RS04180 (PFZ59_04180) - 791954..792145 (+) 192 WP_277697553.1 hypothetical protein -
  PFZ59_RS04185 (PFZ59_04185) - 792138..792671 (+) 534 WP_277697554.1 DUF1642 domain-containing protein -
  PFZ59_RS04190 (PFZ59_04190) - 792671..792898 (+) 228 WP_277697555.1 hypothetical protein -
  PFZ59_RS04195 (PFZ59_04195) - 792882..793265 (+) 384 WP_277697556.1 YopX family protein -
  PFZ59_RS04200 (PFZ59_04200) - 793863..794381 (+) 519 WP_277697557.1 hypothetical protein -
  PFZ59_RS04205 (PFZ59_04205) - 794378..794641 (+) 264 WP_277697558.1 hypothetical protein -
  PFZ59_RS04210 (PFZ59_04210) - 794631..795023 (+) 393 WP_277697559.1 hypothetical protein -
  PFZ59_RS04215 (PFZ59_04215) - 795037..795186 (+) 150 WP_153309172.1 hypothetical protein -
  PFZ59_RS04220 (PFZ59_04220) - 795155..795424 (+) 270 WP_277697560.1 hypothetical protein -
  PFZ59_RS04225 (PFZ59_04225) - 795435..795824 (+) 390 WP_277697561.1 ArpU family phage packaging/lysis transcriptional regulator -
  PFZ59_RS04230 (PFZ59_04230) - 795925..796491 (+) 567 WP_024378300.1 site-specific integrase -
  PFZ59_RS04235 (PFZ59_04235) - 796633..796971 (+) 339 WP_277697562.1 HNH endonuclease signature motif containing protein -
  PFZ59_RS04240 (PFZ59_04240) - 797069..797701 (+) 633 WP_277697563.1 HNH endonuclease -
  PFZ59_RS04245 (PFZ59_04245) - 797703..798965 (+) 1263 WP_277697564.1 hypothetical protein -
  PFZ59_RS04250 (PFZ59_04250) - 798955..800205 (+) 1251 WP_277697565.1 hypothetical protein -
  PFZ59_RS04255 (PFZ59_04255) - 800192..800443 (+) 252 WP_238595115.1 hypothetical protein -
  PFZ59_RS04260 (PFZ59_04260) - 800549..800782 (+) 234 WP_277697566.1 hypothetical protein -
  PFZ59_RS04265 (PFZ59_04265) - 800800..801144 (+) 345 WP_024378297.1 YjcQ family protein -
  PFZ59_RS04270 (PFZ59_04270) - 801358..801621 (+) 264 WP_277697567.1 hypothetical protein -
  PFZ59_RS04275 (PFZ59_04275) - 801621..801851 (+) 231 WP_277697568.1 hypothetical protein -
  PFZ59_RS04280 (PFZ59_04280) - 801934..803346 (+) 1413 WP_277697569.1 terminase large subunit -
  PFZ59_RS04285 (PFZ59_04285) - 803429..803887 (+) 459 WP_277697570.1 DUF4355 domain-containing protein -
  PFZ59_RS04290 (PFZ59_04290) - 803891..804790 (+) 900 WP_277697571.1 phage major capsid protein -
  PFZ59_RS04295 (PFZ59_04295) - 804919..805314 (+) 396 WP_277697572.1 phage Gp19/Gp15/Gp42 family protein -
  PFZ59_RS04300 (PFZ59_04300) - 805301..805639 (+) 339 WP_277697573.1 hypothetical protein -
  PFZ59_RS04305 (PFZ59_04305) - 805632..805871 (+) 240 WP_277697574.1 hypothetical protein -
  PFZ59_RS04310 (PFZ59_04310) - 805868..806203 (+) 336 WP_277697575.1 hypothetical protein -
  PFZ59_RS04315 (PFZ59_04315) - 806219..806806 (+) 588 WP_277697576.1 phage tail protein -
  PFZ59_RS04320 (PFZ59_04320) - 806817..807095 (+) 279 WP_002937863.1 hypothetical protein -
  PFZ59_RS04325 (PFZ59_04325) - 807092..807475 (+) 384 WP_277697577.1 DUF5361 domain-containing protein -
  PFZ59_RS04330 (PFZ59_04330) - 807465..809816 (+) 2352 WP_277697578.1 phage tail protein -
  PFZ59_RS04335 (PFZ59_04335) - 809816..810511 (+) 696 WP_277697579.1 phage tail protein -
  PFZ59_RS04340 (PFZ59_04340) - 810508..813648 (+) 3141 WP_277697580.1 phage tail spike protein -
  PFZ59_RS04345 (PFZ59_04345) - 813578..815059 (+) 1482 WP_277697581.1 gp58-like family protein -
  PFZ59_RS04350 (PFZ59_04350) - 815071..815310 (+) 240 WP_277697582.1 hypothetical protein -
  PFZ59_RS04355 (PFZ59_04355) - 815313..815657 (+) 345 WP_277697058.1 hypothetical protein -
  PFZ59_RS04360 (PFZ59_04360) - 815703..816074 (+) 372 Protein_802 phage holin family protein -
  PFZ59_RS04365 (PFZ59_04365) - 816303..817142 (+) 840 WP_277697583.1 N-acetylmuramoyl-L-alanine amidase -
  PFZ59_RS04370 (PFZ59_04370) - 817249..817473 (-) 225 WP_277697584.1 helix-turn-helix transcriptional regulator -
  PFZ59_RS04380 (PFZ59_04380) - 818565..819728 (-) 1164 WP_277697376.1 IS30 family transposase -
  PFZ59_RS04385 (PFZ59_04385) rpsB 820013..820789 (+) 777 WP_014637198.1 30S ribosomal protein S2 -
  PFZ59_RS04390 (PFZ59_04390) tsf 821109..822149 (+) 1041 WP_277697585.1 translation elongation factor Ts -
  PFZ59_RS04395 (PFZ59_04395) - 822329..823585 (-) 1257 WP_277697779.1 ISL3 family transposase -
  PFZ59_RS04400 (PFZ59_04400) - 824018..824864 (+) 847 WP_277697586.1 IS630 family transposase -
  PFZ59_RS04405 (PFZ59_04405) - 824936..825637 (-) 702 WP_208580666.1 GNAT family acetyltransferase -
  PFZ59_RS04410 (PFZ59_04410) - 825637..825975 (-) 339 WP_277697587.1 thioredoxin domain-containing protein -
  PFZ59_RS04415 (PFZ59_04415) - 826245..826703 (+) 459 WP_002937810.1 CtsR family transcriptional regulator -
  PFZ59_RS04420 (PFZ59_04420) clpC 826707..829154 (+) 2448 WP_277697588.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  PFZ59_RS04425 (PFZ59_04425) tnpA 829329..829754 (+) 426 WP_277697774.1 IS200/IS605 family transposase -
  PFZ59_RS04430 (PFZ59_04430) - 829922..830395 (+) 474 WP_277697589.1 GNAT family N-acetyltransferase -
  PFZ59_RS04435 (PFZ59_04435) - 830449..830940 (+) 492 WP_277697590.1 adenylyltransferase/cytidyltransferase family protein -

Sequence


Protein


Download         Length: 815 a.a.        Molecular weight: 90229.76 Da        Isoelectric Point: 6.1676

>NTDB_id=777090 PFZ59_RS04420 WP_277697588.1 826707..829154(+) (clpC) [Streptococcus suis strain SS/UPM/MY/F001]
MKISTGLQRVFEDAQLVAQRYACDYLETWHVLLSFVINHDTVAGAVLAEYPISISDYEHATFVVTDRVYREELDSFRILP
SSKRLDETASFAKKIAEVVKAKELGTEHLFMAMLLDKRSTASQILDKVGFCFEDSDDKFRFLDLRKNLEARAGFTKEDLK
AIRSVMKGGKAKPANMGQMMGMPPAPQSGGLEDYTRDLTALAREGKIEPVIGRDAEIARMIQILSRKTKNNPVLVGDAGV
GKTALALGLAQRVAAGDVPVSLAKMRVLELDLMNVIAGTRFRGDFEERMNNIINDIEEDGQVILFIDELHTIMGSGSGID
STLDAANILKPALARGTLRTVGATTQTEYQKHIEKDAALVRRFAKVTIEEPTAEDSIAILTGLKGTFEKYHRVRIGQAAV
ETAVTYAKRYLTSKNLPDSAIDLLDEASATVQNRVKGQAEETGLTSIDKALMDKKYKTVSKLLIKTKEDAEASQNYDLEV
TEEDVLETLSRLSGIPVAKLSQSDTKKYLNLEAELHKRVIGQEEAISAVSRAIRRNQSGIRTGRRPIGSFMFLGPTGVGK
TELAKALAEVLFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLL
QVLDDGVLTDSKGRKVDFSNTIIIMTSNLGATALRDDKTVGFGALDLSKDHKEVEKRIFEELKKAYRPEFINRIDEKVVF
HSLTESHMQDVVKVMIKPLLAVTAEKGITLKLQPLALKLLAKQGYDPEMGARPLRRLLQTKLEDPLAEMLLRGDLATGST
LKVGVKGEELKLCDF

Nucleotide


Download         Length: 2448 bp        

>NTDB_id=777090 PFZ59_RS04420 WP_277697588.1 826707..829154(+) (clpC) [Streptococcus suis strain SS/UPM/MY/F001]
ATGAAAATTTCAACTGGTTTACAGCGGGTTTTTGAAGATGCCCAGTTAGTGGCCCAACGTTATGCCTGTGATTACTTGGA
AACCTGGCATGTTCTCTTGTCTTTCGTTATCAACCATGATACAGTCGCCGGCGCTGTTTTGGCAGAATATCCTATTTCAA
TTTCTGACTATGAACATGCAACTTTTGTTGTGACCGACAGGGTCTATCGTGAAGAGTTGGATAGTTTTCGTATTTTACCA
TCTTCTAAACGTTTGGATGAAACAGCTAGTTTTGCCAAGAAAATTGCAGAAGTCGTCAAGGCTAAGGAGCTGGGAACGGA
ACATTTGTTTATGGCTATGTTGTTGGACAAGCGGTCAACAGCTAGTCAGATTTTGGACAAGGTAGGTTTTTGTTTTGAAG
ACTCAGATGATAAGTTTCGTTTCTTGGATTTGCGTAAAAACTTGGAGGCACGTGCGGGATTTACTAAAGAGGACCTTAAA
GCTATCCGCAGTGTGATGAAGGGTGGCAAAGCCAAACCTGCCAATATGGGACAGATGATGGGAATGCCCCCAGCACCTCA
AAGTGGTGGATTAGAAGACTACACTCGTGATTTGACAGCTTTGGCGCGTGAAGGGAAGATTGAACCTGTCATCGGTCGAG
ACGCTGAGATTGCACGGATGATCCAAATTCTATCACGTAAAACTAAGAACAATCCAGTTCTAGTTGGTGATGCTGGTGTC
GGAAAAACTGCGCTTGCTTTGGGTCTTGCTCAACGTGTTGCTGCAGGTGATGTCCCTGTCAGCTTGGCTAAGATGCGGGT
TTTGGAACTTGACTTGATGAACGTTATCGCTGGAACTCGTTTCCGTGGTGATTTCGAAGAGCGGATGAATAATATCATCA
ACGACATTGAGGAAGATGGTCAAGTCATTCTTTTCATCGATGAGTTGCATACCATTATGGGTTCTGGCTCGGGGATTGAT
TCGACCTTGGATGCAGCCAATATCCTCAAACCAGCTCTTGCTCGTGGCACCCTCCGTACGGTCGGAGCGACGACTCAGAC
AGAGTACCAAAAGCATATTGAGAAGGACGCAGCCCTGGTTCGTCGTTTTGCCAAAGTGACCATTGAAGAGCCGACAGCGG
AAGATAGCATTGCCATTTTGACTGGTTTAAAAGGTACTTTTGAGAAATACCACAGGGTCCGTATTGGACAAGCAGCTGTA
GAGACTGCTGTGACCTACGCTAAGCGGTATTTGACCAGTAAAAACCTGCCAGATTCAGCCATTGATTTGCTGGATGAAGC
CAGTGCAACGGTACAAAATCGAGTGAAAGGGCAGGCTGAGGAAACAGGACTGACGAGCATAGACAAGGCCTTGATGGATA
AAAAATATAAGACTGTCAGCAAGCTTTTGATTAAAACGAAAGAAGATGCCGAGGCTAGTCAAAACTATGACCTTGAAGTA
ACAGAAGAAGATGTTTTAGAAACACTTAGCCGCTTGTCAGGTATCCCAGTAGCCAAGCTTAGCCAATCAGATACTAAGAA
GTATCTGAATCTGGAAGCAGAATTGCACAAGCGTGTGATTGGGCAGGAAGAAGCTATTTCTGCAGTTAGCAGAGCTATTC
GTCGTAATCAGTCAGGGATTCGTACAGGTCGCAGACCGATTGGCTCCTTTATGTTCTTGGGACCAACTGGTGTCGGTAAG
ACAGAGCTTGCTAAGGCCTTGGCAGAAGTCCTCTTTGATGACGAATCAGCCCTTATCCGTTTTGATATGTCCGAATACAT
GGAGAAATTCGCAGCCAGCCGTTTAAACGGTGCCCCTCCAGGCTACGTGGGCTATGAAGAAGGTGGCGAATTGACAGAAA
AAGTTCGTAACAAACCTTATTCTGTTCTTCTTTTTGATGAGGTTGAAAAGGCTCATCCAGATATTTTCAATGTTCTCTTG
CAGGTCTTGGATGATGGTGTTCTGACAGACAGTAAGGGACGTAAGGTTGATTTCTCAAATACAATTATTATCATGACGTC
AAACCTTGGAGCAACAGCCCTTCGCGATGATAAGACTGTTGGTTTTGGTGCCCTTGATTTGTCTAAGGACCATAAGGAAG
TGGAAAAACGCATTTTTGAGGAATTGAAAAAAGCCTATCGTCCAGAATTTATCAACCGTATTGATGAGAAAGTTGTCTTC
CATAGCTTGACAGAGTCCCATATGCAGGATGTTGTCAAGGTCATGATTAAACCTTTGTTGGCTGTGACAGCTGAGAAAGG
AATTACCCTTAAACTGCAACCGTTAGCCCTTAAGTTATTAGCTAAACAAGGTTACGACCCAGAAATGGGGGCTCGTCCTC
TTCGTAGACTCTTACAGACCAAATTGGAAGACCCATTGGCAGAAATGTTGCTCCGTGGTGACTTGGCTACTGGTTCGACC
TTGAAGGTTGGGGTCAAGGGCGAAGAGCTCAAATTATGTGACTTTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus pneumoniae TIGR4

71.587

99.755

0.714

  clpC Streptococcus pneumoniae Rx1

71.587

99.755

0.714

  clpC Streptococcus pneumoniae D39

71.587

99.755

0.714

  clpC Streptococcus mutans UA159

66.461

99.509

0.661

  clpC Streptococcus thermophilus LMD-9

63.876

100

0.655

  clpC Streptococcus thermophilus LMG 18311

64.294

100

0.654

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

48.684

100

0.499

  clpC Bacillus subtilis subsp. subtilis str. 168

43.099

100

0.437