Detailed information    

in silico: bioinformatically predicted

Overview


Name: clpC
Type: Regulator
Locus tag: H1W81_RS00750
Genome accession: NZ_LR824002
Coordinates: 150818..153268 (-)
Length: 816 a.a.
NCBI ID: WP_011225298.1
UniProt ID: Q5M6G1
Organism: Streptococcus thermophilus isolate STH_CIRM_67
Function: degradation of ComX (predicted from homology)
Competence regulation
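
The coordinates, nucleotide length, and protein length given for this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(150818, 153268)
print(cds_bp, protein_length(cds_bp))  # 2451 816
```

Both values agree with the lengths reported in the Sequence section (2451 bp and 816 a.a.).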

Genomic Context


Location: 145818..158268
Locus tag                     Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                         Description
H1W81_RS00740 (STHERMO_0165)  pbp3       148040..149287 (+)    1248       WP_041828096.1  D-alanyl-D-alanine carboxypeptidase PBP3        -
H1W81_RS00745 (STHERMO_0168)  pbp3       149457..150690 (+)    1234       Protein_141     D-alanyl-D-alanine carboxypeptidase PBP3        -
H1W81_RS00750 (STHERMO_0169)  clpC       150818..153268 (-)    2451       WP_011225298.1  ATP-dependent Clp protease ATP-binding subunit  Regulator
H1W81_RS00755 (STHERMO_0170)  -          153269..153724 (-)    456        WP_011225297.1  CtsR family transcriptional regulator           -
H1W81_RS00760 (STHERMO_0171)  -          153968..154438 (+)    471        WP_011225296.1  COG2426 family protein                          -
H1W81_RS00765 (STHERMO_0172)  tsf        154549..155589 (-)    1041       WP_002949105.1  translation elongation factor Ts                -
H1W81_RS00770 (STHERMO_0173)  rpsB       155700..156467 (-)    768        WP_002944541.1  30S ribosomal protein S2                        -
H1W81_RS00780                 -          156990..157942 (+)    953        Protein_147     S66 family peptidase                            -
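
The Location range above matches the clpC coordinates extended by 5,000 bp on each side (150818 - 5000 = 145818 and 153268 + 5000 = 158268). The flank size is inferred from these numbers and is not stated on the page, so treat it as an assumption. A minimal sketch of parsing a coordinate string and deriving such a window, with hypothetical helper names:

```python
import re

# Matches strings such as "150818..153268 (-)" or "150818..153268(-)".
COORD_RE = re.compile(r"^(\d+)\.\.(\d+)\s*\(([+-])\)$")


def parse_coords(text: str) -> tuple[int, int, str]:
    """Return (start, end, strand) from a 'start..end (strand)' string."""
    match = COORD_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a range by `flank` bp on each side (flank value is assumed).

    The lower bound is clipped at position 1; the genome end is not known
    here, so the upper bound is not clipped.
    """
    return max(1, start - flank), end + flank


start, end, strand = parse_coords("150818..153268 (-)")
print(context_window(start, end))  # (145818, 158268)
```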

Sequence


Protein


Length: 816 a.a.        Molecular weight: 90284.49 Da        Isoelectric point: 5.7956

>NTDB_id=1132310 H1W81_RS00750 WP_011225298.1 150818..153268(-) (clpC) [Streptococcus thermophilus isolate STH_CIRM_67]
MTIYSRKMQAIFHRAQLEAERFESPFLETWHVLLAMVEVPGSVAYLTFTDFEDRIHSEEIETAAVLAMEKRPKDLSESDI
IDLRAQSPALEAMLQEAQGIASVTGAVEVGSEHVLMAFLLHKDLMVCRLLEVAGFQYKDDSDKPRIIDLRRSLERNAGLS
KQDLKAIHDLRKPKKSKASANFANMMQPPQSSTGELADYTKDLTALAESGNLDPIIGRDEEISRMIQVLSRKTKNNPVLV
GEAGVGKTALALGLAQRIASGEVPFELADMRILELDMMSVVAGTRFRGDFEERMNQIIDEIEADGKIILFIDELHTIIGS
GSGIDSTLDAANILKPALARGTLHMVGATTQAEYQKHIEKDAALSRRFAKITIEEPSVSEAIDILNGLRSSYEDYHRVTI
TDAAVETAVKAAHRYLTSKNLPDSAIDLLDEASATVQVRIKKEAKREITPLDEALISGDIGAAVKQYKANQKAKFPKPAL
VDADQIMQTLSRLSGIPVEKMTQTDSKRYLNLESELHKRVIGQDEAVSAISRAIRRNQSGIRTGKRPIGSFMFLGPTGVG
KTELAKALAEVLFDDESALLRFDMSEYMEKFAASRLNGAPPGYVGYDEGGELTEKVRNKPYSVLLFDEIEKAHPDIFNVL
LQVLDDGVLTDSRGRKVDFSNTIIIMTSNLGATALRDDKTVGFGAQTISHNHQAMQARIMEELKKSYRPEFINRIDEKVV
FHSLEEEQLHDIVKIMVKPLISALADKGISLKFQPAALKHLAKDGYDIEMGARPLRRTIQTQVEDKLSELLLGGQVVSGQ
TLKIGCSKDKLTFTVV
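
The FASTA header used for the sequence records on this page packs several fields into one line. The following sketch splits it with a regular expression; the pattern is derived only from the header shown here, so other records in the database may need a looser pattern, and the group names are illustrative.

```python
import re

HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line: str) -> dict[str, str]:
    """Split a header line of the form shown on this page into named fields."""
    match = HEADER_RE.match(line)
    if match is None:
        raise ValueError("header does not match the expected layout")
    return match.groupdict()


header = (
    ">NTDB_id=1132310 H1W81_RS00750 WP_011225298.1 150818..153268(-) "
    "(clpC) [Streptococcus thermophilus isolate STH_CIRM_67]"
)
fields = parse_header(header)
print(fields["gene"], fields["start"], fields["end"], fields["strand"])
```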

Nucleotide


Length: 2451 bp

>NTDB_id=1132310 H1W81_RS00750 WP_011225298.1 150818..153268(-) (clpC) [Streptococcus thermophilus isolate STH_CIRM_67]
ATGACGATATATTCAAGAAAAATGCAGGCCATTTTCCATCGTGCTCAGCTTGAAGCGGAGCGTTTTGAAAGTCCTTTCTT
GGAGACTTGGCATGTGCTTCTAGCTATGGTTGAGGTTCCGGGATCTGTAGCCTACTTAACATTTACTGATTTTGAGGACC
GTATTCATTCGGAAGAGATTGAGACAGCTGCTGTATTGGCTATGGAGAAGAGGCCAAAAGACTTGTCGGAATCAGATATT
ATCGATTTACGTGCACAGTCACCTGCGCTAGAGGCTATGTTGCAAGAGGCGCAAGGAATCGCTAGTGTGACTGGTGCTGT
AGAGGTGGGGTCTGAACATGTATTGATGGCCTTCCTTCTTCATAAGGATTTAATGGTTTGTCGCCTCCTTGAAGTGGCTG
GTTTTCAATATAAAGATGATAGCGATAAACCTCGCATCATAGATTTACGACGTTCTTTGGAGCGTAATGCTGGTCTTAGC
AAGCAAGATTTGAAGGCAATTCACGATCTTCGTAAACCTAAGAAATCAAAAGCATCTGCGAATTTTGCCAATATGATGCA
ACCTCCTCAATCATCTACTGGTGAACTGGCAGATTATACCAAAGACTTAACTGCATTAGCGGAGTCAGGAAATCTTGATC
CCATTATTGGACGCGATGAAGAAATTTCACGCATGATTCAGGTTTTGAGTCGTAAAACGAAGAATAATCCTGTCTTGGTA
GGTGAAGCTGGTGTCGGTAAGACAGCACTTGCTCTTGGTTTAGCGCAACGTATTGCTTCAGGCGAAGTACCATTTGAATT
GGCTGATATGCGTATCTTAGAGCTTGACATGATGAGCGTTGTTGCAGGGACACGTTTCCGTGGTGATTTCGAAGAGCGTA
TGAATCAGATCATTGATGAGATTGAAGCTGATGGGAAAATCATTCTCTTTATTGACGAACTACATACGATTATTGGATCT
GGTTCAGGTATTGATAGTACCTTGGATGCGGCTAATATTTTGAAACCGGCCCTTGCGCGCGGGACACTTCACATGGTTGG
AGCAACCACGCAAGCTGAATACCAAAAGCATATTGAGAAAGATGCAGCTTTATCCCGTCGTTTTGCTAAAATTACAATTG
AAGAACCAAGTGTATCTGAAGCAATCGATATTTTAAACGGTTTGCGTTCGTCTTATGAAGACTATCATCGTGTGACTATT
ACGGACGCGGCAGTTGAGACGGCAGTCAAGGCAGCGCATCGCTATTTGACGAGTAAGAATTTGCCTGATTCGGCAATTGA
CCTTTTAGATGAAGCGAGTGCAACTGTTCAAGTTCGTATCAAAAAAGAGGCCAAACGTGAGATAACGCCTTTGGATGAAG
CACTTATATCTGGGGATATTGGGGCTGCTGTTAAACAGTATAAGGCTAACCAAAAGGCAAAATTTCCTAAACCTGCTTTG
GTAGATGCGGATCAGATTATGCAAACTCTTAGTCGTTTATCAGGTATCCCTGTTGAGAAGATGACGCAGACTGACAGCAA
GCGTTACCTGAATCTTGAATCAGAACTCCACAAACGTGTTATTGGTCAAGATGAGGCGGTTTCGGCTATCAGCCGTGCTA
TTCGTCGTAATCAGTCAGGTATTCGTACTGGAAAACGTCCTATTGGCTCCTTCATGTTCCTTGGACCTACTGGTGTTGGT
AAGACAGAATTGGCCAAGGCTTTGGCGGAAGTTCTCTTTGATGATGAATCAGCTTTGCTTCGCTTTGATATGTCGGAGTA
TATGGAAAAATTTGCGGCTAGTCGCCTTAATGGTGCTCCTCCAGGATATGTCGGATATGATGAGGGTGGAGAGTTGACAG
AGAAAGTTCGAAATAAGCCCTACTCAGTTCTTCTCTTTGACGAGATTGAGAAAGCTCATCCAGATATCTTCAACGTTCTC
TTACAGGTTTTGGATGACGGTGTTTTAACAGATAGCCGTGGTCGTAAGGTTGATTTTTCAAACACTATCATCATTATGAC
CTCAAATTTGGGAGCTACAGCTCTTCGTGATGATAAAACTGTTGGTTTTGGTGCTCAAACTATTTCTCATAATCACCAAG
CCATGCAAGCACGCATTATGGAAGAGCTTAAGAAGTCCTATCGTCCAGAATTTATTAACCGTATTGATGAGAAGGTTGTC
TTCCACAGCTTAGAGGAAGAACAACTACATGACATTGTCAAGATTATGGTTAAACCATTAATTTCAGCTCTAGCCGATAA
AGGTATAAGCTTAAAATTCCAACCAGCTGCTCTTAAGCATTTGGCTAAGGATGGCTATGATATTGAGATGGGAGCTCGTC
CATTACGTCGTACGATTCAAACTCAAGTGGAGGACAAGTTGTCTGAGTTATTACTAGGTGGCCAAGTTGTTAGCGGACAG
ACCCTTAAGATTGGTTGCTCGAAAGATAAATTAACCTTTACAGTAGTGTAA
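
The nucleotide record is given in coding orientation (it starts with ATG and ends with TAA), so it can be spot-checked against the protein record by translating it directly. The sketch below builds the standard codon table, which assigns the same amino acids as the bacterial table, and translates the first 30 and last 12 nucleotides of the record above; it is an illustration, not the method used by the database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ('*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1; trailing bases are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGACGATATATTCAAGAAAAATGCAGGCC"))  # MTIYSRKMQA
print(translate("ACAGTAGTGTAA"))                    # TVV*
```

The two outputs match the start (MTIYSRKMQA) and the end (TVV, followed by the stop codon) of the protein sequence listed above.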


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source        ID
AlphaFold DB  Q5M6G1

Transmembrane helices


Transmembrane helices of the protein were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                           Identities (%)  Coverage (%)  Ha-value
clpC     Streptococcus thermophilus LMG 18311               100             100           1
clpC     Streptococcus thermophilus LMD-9                   99.387          100           0.994
clpC     Streptococcus mutans UA159                         72.426          100           0.724
clpC     Streptococcus pneumoniae TIGR4                     65.971          99.755        0.658
clpC     Streptococcus pneumoniae Rx1                       65.971          99.755        0.658
clpC     Streptococcus pneumoniae D39                       65.971          99.755        0.658
clpC     Lactococcus lactis subsp. lactis strain DGCC12653  50.421          100           0.513
clpC     Bacillus subtilis subsp. subtilis str. 168         46.407          100           0.467
clpE     Streptococcus mutans UA159                         47.444          76.716        0.364


Multiple sequence alignment