Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   SSUST1_RS09100 Genome accession   NC_017950
Coordinates   1820681..1823134 (-) Length   817 a.a.
NCBI ID   WP_012027858.1    Uniprot ID   A0A2K1SY42
Organism   Streptococcus suis ST1     
Function   degradation of ComW (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1816783..1837993 1820681..1823134 within 0


Gene organization within MGE regions


Location: 1816783..1837993
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SSUST1_RS09080 (SSUST1_1865) dusB 1816783..1817787 (+) 1005 WP_002937787.1 tRNA dihydrouridine synthase DusB -
  SSUST1_RS09085 (SSUST1_1866) - 1817933..1818706 (-) 774 WP_012027854.1 NUDIX domain-containing protein -
  SSUST1_RS09090 (SSUST1_1867) pnuC 1818699..1819520 (-) 822 WP_012027855.1 nicotinamide riboside transporter PnuC -
  SSUST1_RS09095 (SSUST1_1868) - 1819530..1820567 (-) 1038 WP_014736445.1 AAA family ATPase -
  SSUST1_RS09100 (SSUST1_1869) clpC 1820681..1823134 (-) 2454 WP_012027858.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  SSUST1_RS09105 (SSUST1_1870) - 1823140..1823598 (-) 459 WP_014736446.1 CtsR family transcriptional regulator -
  SSUST1_RS09110 (SSUST1_1871) - 1823748..1824086 (+) 339 WP_012027860.1 thioredoxin domain-containing protein -
  SSUST1_RS09115 (SSUST1_1872) - 1824086..1824787 (+) 702 WP_012027861.1 hypothetical protein -
  SSUST1_RS09120 (SSUST1_1873) tsf 1824941..1825981 (-) 1041 WP_014736447.1 translation elongation factor Ts -
  SSUST1_RS09125 (SSUST1_1874) rpsB 1826135..1826911 (-) 777 WP_012775356.1 30S ribosomal protein S2 -
  SSUST1_RS09130 (SSUST1_1875) - 1827259..1827771 (-) 513 WP_014736448.1 adenylate kinase -
  SSUST1_RS09140 (SSUST1_1876) - 1828425..1833494 (-) 5070 WP_014736449.1 S8 family serine peptidase -
  SSUST1_RS09145 (SSUST1_1877) nusG 1833638..1834177 (-) 540 WP_014736450.1 transcription termination/antitermination protein NusG -
  SSUST1_RS09150 (SSUST1_1878) secE 1834287..1834463 (-) 177 WP_002940255.1 preprotein translocase subunit SecE -
  SSUST1_RS09155 (SSUST1_1879) rpmG 1834473..1834625 (-) 153 WP_002940258.1 50S ribosomal protein L33 -
  SSUST1_RS09160 (SSUST1_1880) pbp2a 1834659..1836872 (-) 2214 WP_014736451.1 penicillin-binding protein PBP2A -
  SSUST1_RS09165 (SSUST1_1881) - 1837025..1837993 (+) 969 WP_014736452.1 NAD(P)/FAD-dependent oxidoreductase -

Sequence


Protein


Download         Length: 817 a.a.        Molecular weight: 90132.53 Da        Isoelectric Point: 6.3138

>NTDB_id=51490 SSUST1_RS09100 WP_012027858.1 1820681..1823134(-) (clpC) [Streptococcus suis ST1]
MKISRGLQGVYEDAQLIAQRYSSDYLETWHLLLAFVINPDTVAGAILAEYPADVLDYERAVYMVMGRRYHEELESFFFLP
SSKRVKELQVFAEKIAEIVKSKGLGTEHIFMGMLLDKRSTASQILDQVGFHFEDSDDKVRFLDLRKNLEAKAGFTKEHLK
AIRTMTKGGKPKQATVGNMMGMTQSQSGGLEDYTRDLTALARSGQLEPVIGRDEEISRMLQILSRKTKNNPVLVGDAGVG
KTALALGLAQRIANGEVPASLVNMRILELDLMNVIAGTRFRGDFEERMNNIINDIEEDGRVILFIDELHTIMGSGSGIDS
ILDAANILKPALSRGTLRTVGATTQDEYQKHIEKDAALVRRFAKVTIEEPSVADSVAILQGLKPAYEAHHKVTISDQAVV
TAVAYAKRYLTSKNLPDSAIDLLDEASATVQNRAKGQVEEGGLTALDQALMAGKYKTVTQLLLKAQEAENQATSYSLEVT
EEDILATLSRLSGIPVTKLSQTDAKKYLNLEQELHKRVIGQEEAISAVSRAIRRNQSGIRTGHRPIGSFMFLGPTGVGKT
ELAKALAEILFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGVLTDRKGRKVDFSNTVIIMTSNLGATALRDDKTVGFGALDLSKSQEHVEKRIFEALKKAYRPEFINRIDEKVVFH
SLTEADMQDVVKVMVKPLIAVAASKGITLKLQASALKLLAKEGYDPEMGARPLRRLLQTKLEDPLAEMLLRGELPAGVTL
KVGVKAEQLKFDSVKAG

Nucleotide


Download         Length: 2454 bp        

>NTDB_id=51490 SSUST1_RS09100 WP_012027858.1 1820681..1823134(-) (clpC) [Streptococcus suis ST1]
ATGAAGATTTCAAGAGGGTTACAGGGTGTCTATGAAGATGCTCAATTGATTGCACAGCGTTATAGTAGTGACTATTTGGA
GACCTGGCACTTGTTGTTAGCCTTTGTCATCAATCCAGATACCGTTGCGGGAGCTATTTTAGCAGAATATCCTGCGGATG
TATTGGACTATGAACGTGCAGTTTATATGGTGATGGGGCGGCGTTACCATGAAGAGTTAGAGAGCTTTTTCTTTCTTCCA
TCGTCCAAGCGGGTGAAGGAATTGCAGGTCTTTGCCGAGAAGATTGCGGAGATTGTCAAGAGTAAGGGGCTAGGAACGGA
GCATATTTTCATGGGAATGCTCTTGGACAAGCGTTCGACTGCCTCACAAATTCTGGATCAGGTCGGTTTTCACTTTGAGG
ATTCGGATGATAAGGTTCGTTTTCTGGATTTGCGGAAAAATCTGGAAGCCAAGGCTGGCTTTACCAAGGAGCATCTGAAG
GCTATCCGCACCATGACGAAAGGTGGCAAGCCCAAGCAGGCAACGGTTGGCAATATGATGGGCATGACCCAGTCACAAAG
TGGTGGCTTGGAAGACTATACACGTGATTTGACGGCTTTGGCCCGCTCAGGTCAGTTGGAGCCAGTCATCGGACGGGATG
AGGAAATTTCCCGTATGCTTCAGATTTTGTCGCGGAAAACCAAGAACAATCCTGTCTTGGTTGGAGATGCGGGTGTTGGG
AAAACAGCTCTGGCACTGGGTCTAGCCCAGCGGATTGCTAATGGAGAGGTGCCAGCTAGTCTTGTCAATATGCGGATCTT
GGAATTGGACTTGATGAATGTCATTGCGGGAACGCGTTTCCGTGGGGATTTTGAGGAGCGGATGAACAATATCATCAACG
ATATTGAAGAAGATGGTCGAGTGATTCTCTTCATTGATGAACTCCATACCATTATGGGATCGGGGTCAGGGATTGACTCG
ATCCTGGATGCTGCCAATATTTTGAAGCCTGCTCTGTCCCGTGGGACTTTGCGGACAGTTGGAGCAACGACTCAGGATGA
ATACCAGAAGCATATTGAAAAAGATGCTGCCTTGGTACGTCGATTTGCCAAGGTGACCATTGAGGAACCGAGTGTAGCAG
ACAGCGTAGCAATTTTGCAGGGGTTGAAGCCAGCCTATGAGGCTCACCACAAGGTGACCATTTCGGATCAGGCGGTGGTA
ACGGCGGTAGCCTATGCCAAACGCTATCTGACCAGTAAGAATTTGCCAGATTCGGCTATTGATTTGCTGGATGAAGCCAG
TGCGACGGTTCAAAATCGTGCCAAGGGACAGGTAGAAGAAGGTGGATTGACCGCTTTAGACCAAGCCTTGATGGCTGGGA
AATACAAGACGGTAACGCAGCTCTTGCTTAAGGCTCAAGAGGCGGAAAATCAGGCGACTAGCTATAGCTTGGAAGTCACA
GAAGAAGACATTTTGGCAACCCTCAGTCGCTTGTCAGGTATTCCTGTCACCAAACTGAGTCAGACAGATGCCAAGAAGTA
CCTTAATCTTGAACAGGAATTGCACAAGCGTGTTATCGGGCAGGAAGAGGCGATTTCAGCTGTCAGCCGGGCAATTCGCC
GCAACCAGTCAGGCATTCGCACTGGTCACAGACCGATTGGTTCCTTTATGTTCTTGGGGCCAACAGGTGTCGGTAAGACA
GAATTGGCCAAGGCCTTGGCGGAGATCCTCTTTGATGACGAATCTGCCTTGATTCGTTTTGATATGAGTGAGTATATGGA
GAAATTTGCGGCTAGTCGCCTCAACGGTGCTCCTCCAGGCTATGTTGGCTATGAAGAAGGGGGCGAGCTGACAGAAAAAG
TTCGCAACAAGCCATACTCTGTCCTACTTTTTGATGAGGTGGAGAAAGCACATCCAGATATTTTCAATGTTCTTTTGCAG
GTCTTGGATGACGGTGTCTTGACGGACAGAAAAGGTCGCAAGGTTGATTTCTCTAATACGGTCATCATTATGACGTCTAA
CTTAGGGGCAACCGCTTTACGTGATGATAAGACAGTTGGGTTTGGGGCTCTTGATTTGTCTAAGAGTCAGGAACACGTTG
AAAAACGGATTTTTGAGGCGTTGAAGAAGGCCTATCGTCCTGAATTTATTAACCGGATTGATGAAAAAGTGGTCTTCCAT
AGCCTGACAGAAGCAGATATGCAGGATGTGGTCAAGGTCATGGTCAAACCATTGATTGCCGTGGCGGCCAGCAAGGGTAT
TACCCTCAAATTGCAGGCTTCTGCTCTTAAACTCTTGGCCAAAGAAGGCTACGATCCAGAAATGGGTGCCCGCCCACTTC
GTCGCCTCCTCCAAACCAAGTTGGAAGATCCATTGGCAGAAATGCTCTTACGTGGAGAACTGCCAGCTGGTGTGACCTTA
AAAGTAGGGGTCAAGGCCGAGCAGTTGAAGTTTGATAGTGTGAAAGCAGGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2K1SY42

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus pneumoniae TIGR4

71.464

99.51

0.711

  clpC Streptococcus pneumoniae Rx1

71.464

99.51

0.711

  clpC Streptococcus pneumoniae D39

71.464

99.51

0.711

  clpC Streptococcus mutans UA159

66.173

99.143

0.656

  clpC Streptococcus thermophilus LMG 18311

64.828

99.878

0.647

  clpC Streptococcus thermophilus LMD-9

64.706

99.878

0.646

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

48.086

100

0.492

  clpC Bacillus subtilis subsp. subtilis str. 168

44.403

99.51

0.442


Multiple sequence alignment