Detailed information    

In silico: bioinformatically predicted

Overview


Name: clpC
Type: Regulator
Locus tag: DK237_RS09670
Genome accession: NZ_CP031377
Coordinates: 1922220..1924673 (-)
Length: 817 a.a.
NCBI ID: WP_009910884.1
UniProt ID: A0A7Y6RRF0
Organism: Streptococcus suis strain ISU2614
Function: degradation of ComW (predicted from homology)
Competence regulation
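
The coordinates, gene size, and protein length given in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GenBank-style `start..end` notation) and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a CDS span.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute a residue to the protein.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_len, nt_len // 3 - 1


# clpC coordinates from this record
nt_len, aa_len = cds_lengths(1922220, 1924673)
print(nt_len, aa_len)  # 2454 817
```

Under these assumptions the result agrees with the 2454 bp and 817 a.a. reported for clpC.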

Genomic Context


Location: 1917220..1929673
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DK237_RS09655 (DK237_09950) hslO 1919647..1920513 (+) 867 WP_013730589.1 Hsp33 family molecular chaperone HslO -
  DK237_RS09660 (DK237_09955) dusB 1920506..1921510 (+) 1005 WP_009910879.1 tRNA dihydrouridine synthase DusB -
  DK237_RS09665 (DK237_09960) - 1921611..1922102 (-) 492 WP_009910880.1 adenylyltransferase/cytidyltransferase family protein -
  DK237_RS09670 (DK237_09965) clpC 1922220..1924673 (-) 2454 WP_009910884.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  DK237_RS09675 (DK237_09970) - 1924677..1925129 (-) 453 WP_013730590.1 CtsR family transcriptional regulator -
  DK237_RS09680 (DK237_09975) - 1925297..1925656 (+) 360 WP_013730591.1 bacteriocin transporter -
  DK237_RS09685 (DK237_09980) - 1925728..1927596 (-) 1869 WP_013730592.1 Ig-like domain-containing protein -
  DK237_RS09690 (DK237_09985) - 1928141..1929343 (+) 1203 WP_009909264.1 IS110-like element ISSsu7 family transposase -
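
The Location window appears to be the clpC span extended by 5,000 bp on each side (1922220 - 5000 = 1917220 and 1924673 + 5000 = 1929673). The sketch below reproduces that window and computes the spacing between neighbouring genes from the table. The flank size is an assumption inferred from the numbers above, not a documented parameter, and the names `context_window` and `gaps` are hypothetical.

```python
FLANK = 5000  # assumption: inferred from the Location line, not documented

def context_window(start, end, flank=FLANK):
    """Extend a gene span by `flank` bp on both sides (1-based, inclusive)."""
    return max(1, start - flank), end + flank

# (locus tag, start, end, strand) copied from the table above
GENES = [
    ("DK237_RS09655", 1919647, 1920513, "+"),
    ("DK237_RS09660", 1920506, 1921510, "+"),
    ("DK237_RS09665", 1921611, 1922102, "-"),
    ("DK237_RS09670", 1922220, 1924673, "-"),
    ("DK237_RS09675", 1924677, 1925129, "-"),
    ("DK237_RS09680", 1925297, 1925656, "+"),
    ("DK237_RS09685", 1925728, 1927596, "-"),
    ("DK237_RS09690", 1928141, 1929343, "+"),
]

def gaps(genes):
    """Bases between consecutive genes; a negative value means overlap."""
    ordered = sorted(genes, key=lambda g: g[1])
    return [(a[0], b[0], b[1] - a[2] - 1) for a, b in zip(ordered, ordered[1:])]

print(context_window(1922220, 1924673))  # (1917220, 1929673)
for left, right, gap in gaps(GENES):
    print(left, right, gap)
```

With the table values, clpC (DK237_RS09670) and the CtsR family regulator gene (DK237_RS09675) are separated by 3 bp, while hslO and dusB overlap by 8 bp.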

Sequence


Protein


Length: 817 a.a.        Molecular weight: 90461.93 Da        Isoelectric point: 6.0456

>NTDB_id=306364 DK237_RS09670 WP_009910884.1 1922220..1924673(-) (clpC) [Streptococcus suis strain ISU2614]
MKISTGLQRVFEDAQLVAQRYACDYLETWHVLLSFVINHDTVAGAVLAEYPISISDYEHATFVVTDKVYREELDSFRILP
SSKRLDETASFAKKIAEVVKAKELGTEHLFMAMLLDKRSTASQILDKVGFCFEDSDDKFRFLDLRKNLEARAGFTKEDLK
AIRSVMKGGKAKPTNMGQMMGMPPAPQSGGLEDYTRDLTALAREGKIEPVIGRDAEIARMIQILSRKTKNNPVLVGDAGV
GKTALALGLAQRVAAGDVPVSLAKMRVLELDLMNVIAGTRFRGDFEERMNNIINDIEEDGQVILFIDELHTIMGSGSGID
STLDAANILKPALARGTLRTVGATTQTEYQKHIEKDAALVRRFAKVTIEEPTVEDSIAILTGLKGTFEKYHRVRIGQAAV
ETAVTYAKRYLTSKNLPDSAIDLLDEASATVQNRVKGQGEETGLTSIDKALMDKKYKTVSKLLIETKEDAEASQNYDLEV
TEEDVLETLSRLSGIPVAKLSQSDTKKYLNLEAELHKRVIGQEEAISAVSRAIRRNQSGIRTGRRPIGSFMFLGPTGVGK
TELAKALAEVLFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLL
QVLDDGVLTDSKGRKVDFSNTIIIMTSNLGATALRDDKTVGFGALDLSKDHKEVEKRIFEELKKAYRPEFINRIDEKVVF
HSLTETHMQDVVKVMIKPLLAVTAEKGITLKLQPSALKLLAKQGYDPEMGARPLRRLLQTKLEDPLAEMLLRGDLTTSST
LKVGVKGEELKFDVVKG
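
The FASTA header of this record packs several fields into one line: the database ID, locus tag, protein accession, coordinates with strand, gene name, and organism. The sketch below splits such a header with a regular expression. The pattern is inferred from this single record and is an assumption, not a documented format; `HEADER_RE` and `parse_header` are hypothetical names.

```python
import re

# Pattern inferred from the header in this record; other records may differ.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) \[(?P<organism>[^\]]+)\]$"
)

def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (">NTDB_id=306364 DK237_RS09670 WP_009910884.1 "
          "1922220..1924673(-) (clpC) [Streptococcus suis strain ISU2614]")
fields = parse_header(header)
print(fields["gene"], fields["strand"], fields["end"] - fields["start"] + 1)
```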

Nucleotide


Length: 2454 bp

>NTDB_id=306364 DK237_RS09670 WP_009910884.1 1922220..1924673(-) (clpC) [Streptococcus suis strain ISU2614]
ATGAAAATTTCAACTGGTTTACAGCGGGTTTTTGAAGATGCCCAGTTAGTGGCTCAACGTTATGCCTGTGATTACTTGGA
AACCTGGCATGTTCTCTTGTCTTTCGTTATCAACCATGATACAGTCGCTGGCGCTGTTTTGGCAGAATATCCTATTTCAA
TTTCTGACTATGAACATGCAACTTTTGTTGTGACCGACAAGGTCTATCGTGAAGAGTTGGATAGTTTTCGTATTTTACCA
TCTTCTAAACGTTTGGATGAAACGGCTAGTTTTGCCAAGAAAATTGCAGAAGTCGTCAAGGCTAAGGAGCTGGGAACGGA
ACATTTGTTTATGGCTATGTTGTTGGACAAGCGGTCAACAGCTAGTCAGATTTTGGACAAGGTAGGTTTTTGTTTTGAAG
ACTCAGATGATAAGTTTCGTTTCTTGGATTTGCGTAAAAACTTGGAGGCACGTGCGGGATTTACCAAAGAGGACCTTAAA
GCTATCCGCAGTGTGATGAAGGGTGGCAAAGCCAAACCTACTAATATGGGACAGATGATGGGAATGCCCCCAGCACCTCA
AAGTGGTGGATTAGAAGACTACACTCGTGATTTGACAGCTTTGGCGCGTGAAGGGAAGATTGAACCTGTCATCGGTCGAG
ACGCTGAGATTGCACGGATGATCCAAATTCTATCACGTAAAACCAAGAATAACCCAGTTCTAGTTGGTGATGCTGGTGTC
GGAAAAACTGCGCTTGCTTTGGGTCTTGCTCAACGTGTTGCTGCAGGTGATGTCCCTGTCAGCTTGGCTAAGATGCGGGT
TTTGGAACTTGACTTGATGAACGTTATCGCTGGAACTCGTTTCCGTGGTGATTTCGAAGAGCGGATGAATAATATCATCA
ACGACATCGAGGAAGATGGTCAGGTCATCCTTTTCATCGATGAGTTGCATACCATTATGGGTTCTGGCTCAGGGATTGAT
TCGACCTTGGATGCAGCCAATATCCTCAAACCAGCTCTTGCTCGGGGCACCCTCCGTACGGTCGGAGCGACGACTCAGAC
AGAGTACCAAAAGCATATTGAGAAAGACGCAGCCCTGGTTCGTCGTTTTGCCAAAGTGACCATTGAAGAGCCGACAGTGG
AAGATAGCATTGCCATTTTGACTGGTTTAAAAGGTACTTTTGAAAAATACCACAGGGTCCGTATTGGACAAGCAGCTGTA
GAGACTGCTGTGACCTACGCTAAGCGGTATTTGACCAGTAAAAACCTGCCAGATTCTGCCATTGACTTGCTGGATGAAGC
CAGTGCAACGGTACAAAATCGAGTGAAAGGGCAGGGTGAGGAAACAGGACTGACGAGCATAGACAAGGCCTTGATGGATA
AAAAATATAAGACTGTCAGCAAGCTTTTGATTGAAACGAAAGAAGATGCCGAGGCTAGTCAAAACTATGACCTTGAAGTA
ACAGAAGAAGATGTTTTAGAAACACTTAGCCGCTTGTCAGGTATCCCAGTAGCCAAGCTTAGCCAATCAGATACTAAGAA
GTATCTGAATCTGGAAGCAGAATTGCACAAGCGTGTGATTGGGCAGGAAGAAGCTATTTCTGCAGTTAGCAGAGCTATTC
GCCGTAATCAGTCAGGGATTCGTACAGGTCGCAGACCGATTGGTTCCTTTATGTTCTTGGGACCAACTGGTGTCGGTAAG
ACAGAGCTTGCTAAAGCCTTGGCAGAGGTTCTCTTTGATGATGAATCTGCTCTTATCCGTTTTGATATGTCCGAGTATAT
GGAGAAATTTGCGGCCAGCCGTCTCAATGGAGCGCCTCCAGGTTACGTGGGCTATGAAGAAGGTGGCGAATTGACAGAAA
AAGTTCGTAACAAACCTTATTCTGTTCTTCTTTTTGATGAGGTTGAAAAGGCTCATCCAGATATTTTCAATGTTCTCTTG
CAGGTCTTGGATGATGGTGTTCTGACGGACAGTAAGGGACGTAAGGTTGATTTCTCAAATACTATTATTATCATGACGTC
AAACCTTGGAGCAACAGCTCTTCGTGATGATAAGACAGTTGGTTTTGGTGCCCTTGATTTATCTAAGGACCACAAGGAAG
TGGAGAAACGCATTTTTGAGGAATTGAAAAAAGCCTATCGTCCAGAATTTATCAACCGTATTGACGAGAAAGTTGTCTTC
CATAGCCTAACGGAAACTCATATGCAGGATGTTGTCAAGGTCATGATTAAACCTTTGTTGGCTGTAACAGCTGAGAAAGG
AATTACTCTTAAACTGCAACCGTCAGCCCTTAAGTTATTGGCTAAACAAGGTTATGACCCAGAAATGGGGGCTCGTCCGC
TACGTAGGCTCTTACAAACCAAATTGGAAGATCCGTTGGCAGAAATGTTACTCCGTGGTGACTTGACTACTAGTTCGACC
TTGAAGGTTGGGGTCAAGGGTGAAGAGCTCAAGTTTGATGTGGTAAAGGGATAA
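
Although clpC lies on the minus strand, the nucleotide record is already given in coding orientation: it begins with ATG and ends with the stop codon TAA. Translating it directly should therefore reproduce the protein sequence. The sketch below builds the standard codon table (the bacterial table 11 uses the same codon-to-amino-acid assignments and differs only in permitted start codons) and translates the first and last codons of the record. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...)
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGAAAATTTCAACTGGT"))  # MKISTG
print(translate("GTGGTAAAGGGATAA"))     # VVKG*
```

The first output matches the start of the protein record (MKISTG...) and the second matches its end (...VVKG) followed by the stop codon.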


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7Y6RRF0

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                           Identities (%)  Coverage (%)  Ha-value
  clpC   Streptococcus pneumoniae TIGR4                     71.902          99.755        0.717
  clpC   Streptococcus pneumoniae Rx1                       71.902          99.755        0.717
  clpC   Streptococcus pneumoniae D39                       71.902          99.755        0.717
  clpC   Streptococcus mutans UA159                         66.339          99.633        0.661
  clpC   Streptococcus thermophilus LMD-9                   64.14           100           0.652
  clpC   Streptococcus thermophilus LMG 18311               64.14           100           0.652
  clpC   Lactococcus lactis subsp. lactis strain DGCC12653  48.982          100           0.501
  clpC   Bacillus subtilis subsp. subtilis str. 168         43.349          100           0.447


Multiple sequence alignment