Detailed information    

In silico   Bioinformatically predicted

Overview


Name   clpC
Type   Regulator
Locus tag   SSUST3_RS08880
Genome accession   NC_015433
Coordinates   1806160..1808613 (-)
Length   817 a.a.
NCBI ID   WP_009910884.1
UniProt ID   A0A7Y6RRF0
Organism   Streptococcus suis ST3
Function   degradation of ComW (predicted from homology)
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type         MGE coordinates    Gene coordinates    Relative position   Distance (bp)
Genomic island   1804446..1825921   1806160..1808613    within              0
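
The "within" position and the distance of 0 bp in the summary row can be reproduced with a plain interval comparison. The following is a minimal sketch, not VRprofile2's actual procedure: the function name `relative_position`, the labels other than "within", and the definition of distance as the gap between the nearest ends are all assumptions made for illustration. Coordinates are treated as 1-based and inclusive.

```python
def relative_position(gene_start, gene_end, mge_start, mge_end):
    """Classify a gene interval against an MGE interval.

    Hypothetical helper: only the "within" label comes from the record;
    the other labels and the distance definition are assumptions.
    Coordinates are 1-based and inclusive.
    """
    # Gene lies entirely inside the MGE region
    if mge_start <= gene_start and gene_end <= mge_end:
        return "within", 0
    # Gene ends before the MGE starts
    if gene_end < mge_start:
        return "outside", mge_start - gene_end
    # Gene starts after the MGE ends
    if gene_start > mge_end:
        return "outside", gene_start - mge_end
    # Remaining case: the intervals overlap only partially
    return "partial overlap", 0


# clpC (SSUST3_RS08880) against the genomic island listed in the summary
print(relative_position(1806160, 1808613, 1804446, 1825921))
```

For the clpC coordinates and the genomic island coordinates given in the summary, the gene start and end both fall between the island boundaries, so the sketch returns the "within" label with a distance of 0.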


Gene organization within MGE regions


Location: 1804446..1825921
Locus tag                      Gene name   Coordinates (strand)   Size (bp)   Protein ID       Product   Description
SSUST3_RS08870 (SSUST3_1810)   dusB        1804446..1805450 (+)   1005        WP_009910879.1   tRNA dihydrouridine synthase DusB   -
SSUST3_RS08875 (SSUST3_1811)   -           1805551..1806042 (-)   492         WP_009910880.1   adenylyltransferase/cytidyltransferase family protein   -
SSUST3_RS08880 (SSUST3_1812)   clpC        1806160..1808613 (-)   2454        WP_009910884.1   ATP-dependent Clp protease ATP-binding subunit   Regulator
SSUST3_RS08885 (SSUST3_1813)   -           1808617..1809069 (-)   453         WP_013730590.1   CtsR family transcriptional regulator   -
SSUST3_RS08890 (SSUST3_1814)   -           1809237..1809596 (+)   360         WP_013730591.1   bacteriocin transporter   -
SSUST3_RS08895 (SSUST3_1815)   -           1809668..1811536 (-)   1869        WP_013730592.1   Ig-like domain-containing protein   -
SSUST3_RS08900 (SSUST3_1816)   tsf         1811891..1812931 (-)   1041        WP_002937816.1   translation elongation factor Ts   -
SSUST3_RS08905 (SSUST3_1817)   rpsB        1813184..1813960 (-)   777         WP_012775356.1   30S ribosomal protein S2   -
SSUST3_RS08910 (SSUST3_1818)   -           1814307..1814819 (-)   513         WP_013730593.1   adenylate kinase   -
SSUST3_RS08920 (SSUST3_1819)   -           1815473..1820551 (-)   5079        WP_013730594.1   S8 family serine peptidase   -
SSUST3_RS08925 (SSUST3_1820)   nusG        1820695..1821234 (-)   540         WP_002940254.1   transcription termination/antitermination protein NusG   -
SSUST3_RS08930 (SSUST3_1821)   secE        1821344..1821520 (-)   177         WP_002940255.1   preprotein translocase subunit SecE   -
SSUST3_RS08935 (SSUST3_1822)   rpmG        1821530..1821682 (-)   153         WP_002940258.1   50S ribosomal protein L33   -
SSUST3_RS08940 (SSUST3_1823)   pbp2a       1821716..1823929 (-)   2214        WP_009910891.1   penicillin-binding protein PBP2A   -
SSUST3_RS08945 (SSUST3_1824)   -           1824082..1825050 (+)   969         WP_013730595.1   NAD(P)/FAD-dependent oxidoreductase   -
SSUST3_RS08950 (SSUST3_1825)   -           1825052..1825921 (+)   870         WP_013730596.1   RluA family pseudouridine synthase   -

Sequence


Protein


Length: 817 a.a.   Molecular weight: 90461.93 Da   Isoelectric point: 6.0456

>NTDB_id=40563 SSUST3_RS08880 WP_009910884.1 1806160..1808613(-) (clpC) [Streptococcus suis ST3]
MKISTGLQRVFEDAQLVAQRYACDYLETWHVLLSFVINHDTVAGAVLAEYPISISDYEHATFVVTDKVYREELDSFRILP
SSKRLDETASFAKKIAEVVKAKELGTEHLFMAMLLDKRSTASQILDKVGFCFEDSDDKFRFLDLRKNLEARAGFTKEDLK
AIRSVMKGGKAKPTNMGQMMGMPPAPQSGGLEDYTRDLTALAREGKIEPVIGRDAEIARMIQILSRKTKNNPVLVGDAGV
GKTALALGLAQRVAAGDVPVSLAKMRVLELDLMNVIAGTRFRGDFEERMNNIINDIEEDGQVILFIDELHTIMGSGSGID
STLDAANILKPALARGTLRTVGATTQTEYQKHIEKDAALVRRFAKVTIEEPTVEDSIAILTGLKGTFEKYHRVRIGQAAV
ETAVTYAKRYLTSKNLPDSAIDLLDEASATVQNRVKGQGEETGLTSIDKALMDKKYKTVSKLLIETKEDAEASQNYDLEV
TEEDVLETLSRLSGIPVAKLSQSDTKKYLNLEAELHKRVIGQEEAISAVSRAIRRNQSGIRTGRRPIGSFMFLGPTGVGK
TELAKALAEVLFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLL
QVLDDGVLTDSKGRKVDFSNTIIIMTSNLGATALRDDKTVGFGALDLSKDHKEVEKRIFEELKKAYRPEFINRIDEKVVF
HSLTETHMQDVVKVMIKPLLAVTAEKGITLKLQPSALKLLAKQGYDPEMGARPLRRLLQTKLEDPLAEMLLRGDLTTSST
LKVGVKGEELKFDVVKG

Nucleotide


Length: 2454 bp

>NTDB_id=40563 SSUST3_RS08880 WP_009910884.1 1806160..1808613(-) (clpC) [Streptococcus suis ST3]
ATGAAAATTTCAACTGGTTTACAGCGGGTTTTTGAAGATGCCCAGTTAGTGGCTCAACGTTATGCCTGTGATTACTTGGA
AACCTGGCATGTTCTCTTGTCTTTCGTTATCAACCATGATACAGTCGCTGGCGCTGTTTTGGCAGAATATCCTATTTCAA
TTTCTGACTATGAACATGCAACTTTTGTTGTGACCGACAAGGTCTATCGTGAAGAGTTGGATAGTTTTCGTATTTTACCA
TCTTCTAAACGTTTGGATGAAACGGCTAGTTTTGCCAAGAAAATTGCAGAAGTCGTCAAGGCTAAGGAGCTGGGAACGGA
ACATTTGTTTATGGCTATGTTGTTGGACAAGCGGTCAACAGCTAGTCAGATTTTGGACAAGGTAGGTTTTTGTTTTGAAG
ACTCAGATGATAAGTTTCGTTTCTTGGATTTGCGTAAAAACTTGGAGGCACGTGCGGGATTTACCAAAGAGGACCTTAAA
GCTATCCGCAGTGTGATGAAGGGTGGCAAAGCCAAACCTACTAATATGGGACAGATGATGGGAATGCCCCCAGCACCTCA
AAGTGGTGGATTAGAAGACTACACTCGTGATTTGACAGCTTTGGCGCGTGAAGGGAAGATTGAACCTGTCATCGGTCGAG
ACGCTGAGATTGCACGGATGATCCAAATTCTATCACGTAAAACCAAGAATAACCCAGTTCTAGTTGGTGATGCTGGTGTC
GGAAAAACTGCGCTTGCTTTGGGTCTTGCTCAACGTGTTGCTGCAGGTGATGTCCCTGTCAGCTTGGCTAAGATGCGGGT
TTTGGAACTTGACTTGATGAACGTTATCGCTGGAACTCGTTTCCGTGGTGATTTCGAAGAGCGGATGAATAATATCATCA
ACGACATCGAGGAAGATGGTCAGGTCATCCTTTTCATCGATGAGTTGCATACCATTATGGGTTCTGGCTCAGGGATTGAT
TCGACCTTGGATGCAGCCAATATCCTCAAACCAGCTCTTGCTCGGGGCACCCTCCGTACGGTCGGAGCGACGACTCAGAC
AGAGTACCAAAAGCATATTGAGAAAGACGCAGCCCTGGTTCGTCGTTTTGCCAAAGTGACCATTGAAGAGCCGACAGTGG
AAGATAGCATTGCCATTTTGACTGGTTTAAAAGGTACTTTTGAAAAATACCACAGGGTCCGTATTGGACAAGCAGCTGTA
GAGACTGCTGTGACCTACGCTAAGCGGTATTTGACCAGTAAAAACCTGCCAGATTCTGCCATTGACTTGCTGGATGAAGC
CAGTGCAACGGTACAAAATCGAGTGAAAGGGCAGGGTGAGGAAACAGGACTGACGAGCATAGACAAGGCCTTGATGGATA
AAAAATATAAGACTGTCAGCAAGCTTTTGATTGAAACGAAAGAAGATGCCGAGGCTAGTCAAAACTATGACCTTGAAGTA
ACAGAAGAAGATGTTTTAGAAACACTTAGCCGCTTGTCAGGTATCCCAGTAGCCAAGCTTAGCCAATCAGATACTAAGAA
GTATCTGAATCTGGAAGCAGAATTGCACAAGCGTGTGATTGGGCAGGAAGAAGCTATTTCTGCAGTTAGCAGAGCTATTC
GCCGTAATCAGTCAGGGATTCGTACAGGTCGCAGACCGATTGGTTCCTTTATGTTCTTGGGACCAACTGGTGTCGGTAAG
ACAGAGCTTGCTAAAGCCTTGGCAGAGGTTCTCTTTGATGATGAATCTGCTCTTATCCGTTTTGATATGTCCGAGTATAT
GGAGAAATTTGCGGCCAGCCGTCTCAATGGAGCGCCTCCAGGTTACGTGGGCTATGAAGAAGGTGGCGAATTGACAGAAA
AAGTTCGTAACAAACCTTATTCTGTTCTTCTTTTTGATGAGGTTGAAAAGGCTCATCCAGATATTTTCAATGTTCTCTTG
CAGGTCTTGGATGATGGTGTTCTGACGGACAGTAAGGGACGTAAGGTTGATTTCTCAAATACTATTATTATCATGACGTC
AAACCTTGGAGCAACAGCTCTTCGTGATGATAAGACAGTTGGTTTTGGTGCCCTTGATTTATCTAAGGACCACAAGGAAG
TGGAGAAACGCATTTTTGAGGAATTGAAAAAAGCCTATCGTCCAGAATTTATCAACCGTATTGACGAGAAAGTTGTCTTC
CATAGCCTAACGGAAACTCATATGCAGGATGTTGTCAAGGTCATGATTAAACCTTTGTTGGCTGTAACAGCTGAGAAAGG
AATTACTCTTAAACTGCAACCGTCAGCCCTTAAGTTATTGGCTAAACAAGGTTATGACCCAGAAATGGGGGCTCGTCCGC
TACGTAGGCTCTTACAAACCAAATTGGAAGATCCGTTGGCAGAAATGTTACTCCGTGGTGACTTGACTACTAGTTCGACC
TTGAAGGTTGGGGTCAAGGGTGAAGAGCTCAAGTTTGATGTGGTAAAGGGATAA
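
The coordinate span, the nucleotide length, and the protein length reported in this record are related by simple arithmetic, which can serve as a quick consistency check. The sketch below assumes 1-based inclusive coordinates and a coding sequence that includes exactly one terminal stop codon (the nucleotide sequence above ends in TAA); the helper names `span_length` and `protein_length_from_cds` are hypothetical and not part of any database tooling.

```python
def span_length(start, end):
    """Length in bp of a 1-based, inclusive coordinate span."""
    return end - start + 1


def protein_length_from_cds(cds_bp):
    """Residue count for a CDS that includes one terminal stop codon.

    Assumption: the CDS is a whole number of codons and the last codon
    is a stop, so it contributes no residue.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values taken from this record: coordinates 1806160..1808613
cds_bp = span_length(1806160, 1808613)
protein_aa = protein_length_from_cds(cds_bp)
print(cds_bp, protein_aa)
```

Under these assumptions the span evaluates to 2454 bp and 817 residues, matching the lengths listed for the nucleotide and protein sequences.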


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source         ID
AlphaFold DB   A0A7Y6RRF0

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                                            Identities (%)   Coverage (%)   Ha-value
clpC      Streptococcus pneumoniae TIGR4                      71.902           99.755         0.717
clpC      Streptococcus pneumoniae Rx1                        71.902           99.755         0.717
clpC      Streptococcus pneumoniae D39                        71.902           99.755         0.717
clpC      Streptococcus mutans UA159                          66.339           99.633         0.661
clpC      Streptococcus thermophilus LMD-9                    64.14            100            0.652
clpC      Streptococcus thermophilus LMG 18311                64.14            100            0.652
clpC      Lactococcus lactis subsp. lactis strain DGCC12653   48.982           100            0.501
clpC      Bacillus subtilis subsp. subtilis str. 168          43.349           100            0.447


Multiple sequence alignment