Detailed information    

In silico (bioinformatically predicted)

Overview


Name   clpC   Type   Regulator
Locus tag   ESP48_RS05625 Genome accession   NZ_CP035306
Coordinates   1036989..1039439 (+) Length   816 a.a.
NCBI ID   WP_023909319.1    Uniprot ID   -
Organism   Streptococcus thermophilus strain IDCC2201     
Function   degradation of ComX (predicted from homology)   
Competence regulation
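
The coordinates and the two lengths in this overview are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and do not come from the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# Coordinates of clpC (ESP48_RS05625) as listed in the overview.
gene_bp = span_length(1036989, 1039439)
protein_aa = protein_length(gene_bp)
print(gene_bp, protein_aa)
```

Under these assumptions the result agrees with the 2451 bp nucleotide length and the 816 a.a. protein length given on this page.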

Genomic Context


Location: 1031989..1044439
Locus tag                     Gene name   Coordinates (strand)   Size (bp)   Protein ID       Product                                          Description
ESP48_RS05595 (ESP48_05595)   -           1032313..1033268 (-)   956         Protein_1031     S66 family peptidase                             -
ESP48_RS05605 (ESP48_05605)   rpsB        1033791..1034558 (+)   768         WP_002944541.1   30S ribosomal protein S2                         -
ESP48_RS05610 (ESP48_05610)   tsf         1034669..1035709 (+)   1041        WP_002949105.1   translation elongation factor Ts                 -
ESP48_RS05615 (ESP48_05615)   -           1035819..1036289 (-)   471         WP_011225296.1   COG2426 family protein                           -
ESP48_RS05620 (ESP48_05620)   -           1036533..1036988 (+)   456         WP_011225297.1   CtsR family transcriptional regulator            -
ESP48_RS05625 (ESP48_05625)   clpC        1036989..1039439 (+)   2451        WP_023909319.1   ATP-dependent Clp protease ATP-binding subunit   Regulator
ESP48_RS10425 (ESP48_05630)   pbp3        1039567..1040824 (-)   1258        Protein_1037     D-alanyl-D-alanine carboxypeptidase PBP3         -
ESP48_RS05640 (ESP48_05640)   pbp3        1042657..1043904 (-)   1248        WP_041826942.1   D-alanyl-D-alanine carboxypeptidase PBP3         -
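
The listed location (1031989..1044439) equals the clpC coordinates extended by 5000 bp on each side, and each Size (bp) value equals the inclusive span of its coordinates. The sketch below reproduces both checks. The 5000 bp flank is inferred from the numbers on this page rather than from any documented setting, and the names `FLANK_BP`, `context_window`, and `NEIGHBOURS` are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the listed location, not documented


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend an inclusive 1-based range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


# (start, end, listed size in bp) for each row of the genomic context table.
NEIGHBOURS = [
    (1032313, 1033268, 956),
    (1033791, 1034558, 768),
    (1034669, 1035709, 1041),
    (1035819, 1036289, 471),
    (1036533, 1036988, 456),
    (1036989, 1039439, 2451),  # clpC
    (1039567, 1040824, 1258),
    (1042657, 1043904, 1248),
]

window = context_window(1036989, 1039439)
sizes_consistent = all(end - start + 1 == size for start, end, size in NEIGHBOURS)
print(window, sizes_consistent)
```

Every listed gene lies inside the computed window, so the table can be read as the annotated features within that flanking region.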

Sequence


Protein


Length: 816 a.a.   Molecular weight: 90283.51 Da   Isoelectric point: 5.8921

>NTDB_id=339182 ESP48_RS05625 WP_023909319.1 1036989..1039439(+) (clpC) [Streptococcus thermophilus strain IDCC2201]
MTIYSRKMQAIFHRAQLEAERFESPFLETWHVLLAMVEVPGSVAYLTFTDFEDRIHSEEIETAAVLAMEKRPKDLSESDI
IDLRAQSPALEAMLQEAQGIASVTGAVEVGSEHVLMAFLLHKDLMVCRLLEVAGFQYKDDSDKPRIIDLRRSLERNAGLS
KQDLKAIHDLRKPKKSKASANFANMMQPPQSSTGELADYTKDLTALAESGNLDPVIGRDEEISRMIQVLSRKTKNNPVLV
GEAGVGKTALALGLAQRIASGEVPFELADMRILELDMMSVVAGTRFRGDFEERMNQIIDEIEADGKIILFIDELHTIIGS
GSGIDSTLDAANILKPALARGTLHMVGATTQAEYQKHIEKDAALSRRFAKITIEEPSVSEAIDILNGLRSSYEDYHRVTI
TDAAVETAVKAAHRYLTSKNLPDSAIDLLDEASATVQVRIKKEAKREITPLDEALISGDIGAAVKQYKANQKAKFPKPAL
VDADQIMQTLSRLSGIPVEKMTQTDSKRYLNLESELHKRVIGQDEAVSAISRAIRRNQSGIRTGKRPIGSFMFLGPTGVG
KTELAKALAEVLFDDESALLRFDMSEYMEKFAASRLNGAPPGYVGYDEGGELTEKVRNKPYSVLLFDEIEKAHPDIFNIL
LQVLDDGVLTDSRGRKVDFSNTIIIMTSNLGATALRDDKTVGFGAQTISHNHQAMQARIMEELKKSYRPEFINRIDEKVV
FHSLEEEQLHDIVKIMVKPLISALADKGISLKFQPAALKHLAKNGYDIEMGARPLRRTIQTQVEDKLSELLLGGQVVSGQ
TLKIGCSKDKLTFTVV

Nucleotide


Length: 2451 bp

>NTDB_id=339182 ESP48_RS05625 WP_023909319.1 1036989..1039439(+) (clpC) [Streptococcus thermophilus strain IDCC2201]
ATGACGATATATTCAAGAAAAATGCAGGCCATTTTCCATCGTGCTCAGCTTGAAGCGGAGCGTTTTGAAAGTCCTTTCTT
GGAGACTTGGCATGTGCTTCTAGCTATGGTTGAGGTTCCGGGATCTGTAGCCTACTTAACATTTACTGATTTTGAGGACC
GTATTCATTCGGAAGAGATTGAGACAGCTGCTGTATTGGCTATGGAGAAGAGGCCAAAAGACTTGTCGGAATCAGATATT
ATCGATTTACGTGCACAGTCACCTGCGCTAGAGGCTATGTTGCAAGAGGCGCAAGGAATCGCTAGTGTGACTGGTGCTGT
AGAGGTGGGGTCTGAACATGTATTGATGGCCTTCCTTCTTCATAAGGATTTAATGGTTTGTCGCCTCCTTGAAGTGGCTG
GTTTTCAATATAAAGATGATAGCGATAAACCTCGCATCATAGATTTACGACGTTCTTTGGAGCGTAATGCTGGTCTTAGC
AAGCAAGATTTGAAGGCAATTCACGATCTTCGTAAACCTAAGAAATCAAAAGCATCTGCGAATTTTGCCAATATGATGCA
ACCTCCTCAATCATCTACTGGTGAACTGGCAGATTATACCAAAGACTTAACTGCATTAGCGGAGTCAGGAAATCTTGATC
CCGTTATTGGACGCGATGAAGAAATTTCACGTATGATTCAGGTTTTGAGTCGTAAAACGAAGAATAATCCTGTCTTGGTA
GGTGAAGCTGGTGTCGGTAAGACAGCACTTGCTCTTGGTTTAGCGCAACGTATTGCTTCAGGCGAAGTACCATTTGAATT
GGCTGATATGCGTATCTTAGAGCTTGACATGATGAGCGTTGTTGCAGGGACACGTTTCCGTGGTGATTTTGAAGAGCGTA
TGAATCAGATCATTGATGAGATTGAAGCTGATGGGAAAATCATTCTCTTTATTGACGAACTACATACGATTATTGGATCT
GGTTCAGGTATTGATAGTACCTTGGATGCGGCTAATATTTTGAAACCGGCCCTTGCGCGCGGGACACTTCACATGGTTGG
AGCAACCACGCAAGCTGAATACCAAAAGCATATTGAGAAAGATGCAGCTTTATCCCGTCGTTTTGCTAAAATTACAATTG
AAGAACCAAGTGTATCTGAAGCAATCGATATTTTAAATGGTTTGCGTTCGTCTTATGAAGACTATCATCGTGTGACTATT
ACGGACGCGGCAGTTGAGACGGCAGTCAAGGCAGCGCATCGCTATTTGACGAGTAAGAATTTGCCTGATTCGGCAATTGA
CCTTTTAGATGAAGCGAGTGCAACTGTTCAAGTTCGTATCAAAAAAGAGGCCAAACGTGAGATAACGCCTTTGGATGAAG
CACTTATATCTGGGGATATTGGGGCTGCTGTTAAACAGTATAAGGCTAACCAAAAGGCAAAATTTCCTAAACCTGCTTTG
GTAGATGCGGATCAGATTATGCAAACTCTTAGTCGTTTATCAGGTATCCCTGTTGAGAAGATGACGCAGACTGACAGCAA
GCGTTACCTGAATCTTGAATCAGAACTCCACAAACGTGTTATTGGTCAAGATGAGGCGGTTTCGGCTATCAGCCGTGCTA
TTCGTCGTAATCAGTCAGGTATTCGTACTGGAAAACGTCCTATTGGCTCCTTCATGTTCCTTGGACCTACTGGTGTTGGT
AAGACAGAATTGGCTAAGGCTTTGGCGGAAGTTCTCTTTGATGATGAATCAGCTTTGCTTCGCTTTGATATGTCGGAGTA
TATGGAAAAATTTGCGGCTAGTCGCCTTAATGGTGCTCCTCCAGGGTATGTCGGATATGATGAGGGTGGAGAGTTGACAG
AGAAAGTTCGAAATAAGCCCTACTCAGTTCTTCTCTTTGACGAGATTGAGAAAGCTCATCCAGATATCTTCAACATTCTC
TTACAGGTTTTGGATGACGGTGTTTTAACAGATAGCCGTGGTCGTAAGGTTGATTTTTCAAACACTATCATCATTATGAC
CTCAAATTTGGGAGCTACAGCTCTTCGTGATGATAAAACTGTTGGTTTTGGTGCTCAAACTATTTCTCATAATCACCAAG
CCATGCAAGCACGCATTATGGAAGAGCTTAAGAAGTCCTATCGTCCAGAATTTATTAACCGTATTGATGAGAAGGTTGTC
TTCCACAGCTTAGAGGAAGAACAACTACATGACATTGTCAAGATTATGGTTAAACCATTAATTTCAGCTCTAGCCGATAA
AGGTATAAGCTTAAAATTCCAACCAGCTGCTCTTAAGCATTTGGCTAAGAATGGCTATGATATTGAGATGGGAGCTCGTC
CATTACGTCGTACGATTCAAACTCAAGTGGAGGACAAGTTGTCTGAGTTATTACTAGGTGGCCAAGTTGTTAGCGGACAG
ACCCTTAAGATTGGTTGCTCGAAAGATAAATTAACCTTTACAGTAGTGTAA
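
The nucleotide and protein records above describe the same gene, so translating the coding sequence should reproduce the protein. The following is a minimal sketch that translates the first 30 nt and the final 51 nt of the sequence with a hand-built codon table. It assumes the standard amino acid assignments, which the bacterial genetic code shares for internal codons; `translate` and the constant names are illustrative, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 51 nt of the clpC nucleotide sequence above.
cds_start = "ATGACGATATATTCAAGAAAAATGCAGGCC"
cds_end = "ACCCTTAAGATTGGTTGCTCGAAAGATAAATTAACCTTTACAGTAGTGTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments correspond to the first ten and last sixteen residues of the protein sequence; the final codon, TAA, is the stop codon and contributes no residue.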


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                                            Identities (%)   Coverage (%)   Ha-value
clpC      Streptococcus thermophilus LMG 18311                99.632           100            0.996
clpC      Streptococcus thermophilus LMD-9                    99.51            100            0.995
clpC      Streptococcus mutans UA159                          72.672           100            0.727
clpC      Streptococcus pneumoniae TIGR4                      65.971           99.755         0.658
clpC      Streptococcus pneumoniae Rx1                        65.971           99.755         0.658
clpC      Streptococcus pneumoniae D39                        65.971           99.755         0.658
clpC      Lactococcus lactis subsp. lactis strain DGCC12653   50.542           100            0.515
clpC      Bacillus subtilis subsp. subtilis str. 168          46.65            100            0.469
clpE      Streptococcus mutans UA159                          47.604           76.716         0.365


Multiple sequence alignment