Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   ST4067_RS00520 Genome accession   NZ_CP065496
Coordinates   84763..87213 (+) Length   816 a.a.
NCBI ID   WP_011226754.1    Uniprot ID   -
Organism   Streptococcus thermophilus strain 4067     
Function   degradation of ComX (predicted from homology)   
Competence regulation

Genomic Context


Location: 79763..92213
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ST4067_RS00490 (ST4067_00485) - 80087..81042 (-) 956 Protein_59 S66 peptidase family protein -
  ST4067_RS00500 (ST4067_00495) rpsB 81565..82332 (+) 768 WP_002944541.1 30S ribosomal protein S2 -
  ST4067_RS00505 (ST4067_00500) tsf 82443..83483 (+) 1041 WP_002949105.1 translation elongation factor Ts -
  ST4067_RS00510 (ST4067_00505) - 83593..84063 (-) 471 WP_011225296.1 COG2426 family protein -
  ST4067_RS00515 (ST4067_00510) - 84307..84762 (+) 456 WP_011225297.1 CtsR family transcriptional regulator -
  ST4067_RS00520 (ST4067_00515) clpC 84763..87213 (+) 2451 WP_011226754.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  ST4067_RS00525 (ST4067_00520) pbp3 87341..88574 (-) 1234 Protein_65 D-alanyl-D-alanine carboxypeptidase PBP3 -
  ST4067_RS00530 (ST4067_00525) pbp3 88744..89991 (-) 1248 WP_002948441.1 D-alanyl-D-alanine carboxypeptidase PBP3 -

Sequence


Protein


Download         Length: 816 a.a.        Molecular weight: 90270.47 Da        Isoelectric Point: 5.7956

>NTDB_id=511054 ST4067_RS00520 WP_011226754.1 84763..87213(+) (clpC) [Streptococcus thermophilus strain 4067]
MTIYSRKMQAIFHRAQLEAERFESPFLETWHVLLAMVEVPGSVAYLTFTDFEDRIHSEEIETAAVLAMEKRPKDLSESDI
IDLRAQSPALEAMLQEAQGIASVTGAVEVGSEHVLMAFLLHKDLMVCRLLEVAGFQYKDDSDKPRIIDLRRSLERNAGLS
KQDLKAIHDLRKPKKSKASANFANMMQPPQSSTGELADYTKDLTALAESGNLDPVIGRDEEISRMIQVLSRKTKNNPVLV
GEAGVGKTALALGLAQRIASGEVPFELADMRILELDMMSVVAGTRFRGDFEERMNQIIDEIEADGKIILFIDELHTIIGS
GSGIDSTLDAANILKPALARGTLHMVGATTQAEYQKHIEKDAALSRRFAKITIEEPSVSEAIDILNGLRSSYEDYHRVTI
TDAAVETAVKAAHRYLTSKNLPDSAIDLLDEASATVQVRIKKEAKREITPLDEALISGDIGAAVKQYKANQKAKFPKPAL
VDADQIMQTLSRLSGIPVEKMTQTDSKRYLNLESELHKRVIGQDEAVSAISRAIRRNQSGIRTGKRPIGSFMFLGPTGVG
KTELAKALAEVLFDDESALLRFDMSEYMEKFAASRLNGAPPGYVGYDEGGELTEKVRNKPYSVLLFDEIEKAHPDIFNVL
LQVLDDGVLTDSRGRKVDFSNTIIIMTSNLGATALRDDKTVGFGAQTISHNHQAMQARIMEELKKSYRPEFINRIDEKVV
FHSLEEEQLHDIVKIMVKPLISALADKGISLKFQPAALKHLAKDGYDIEMGARPLRRTIQTQVEDKLSELLLGGQVVSGQ
TLKIGCSKDKLTFTVV

Nucleotide


Download         Length: 2451 bp        

>NTDB_id=511054 ST4067_RS00520 WP_011226754.1 84763..87213(+) (clpC) [Streptococcus thermophilus strain 4067]
ATGACGATATATTCAAGAAAAATGCAGGCCATTTTCCATCGTGCTCAGCTTGAAGCGGAGCGTTTTGAAAGTCCTTTCTT
GGAGACTTGGCATGTGCTTCTAGCTATGGTTGAGGTTCCGGGATCTGTAGCCTACTTAACATTTACTGATTTTGAGGACC
GTATTCATTCGGAAGAGATTGAGACAGCTGCTGTATTGGCTATGGAGAAGAGGCCAAAAGACTTGTCGGAATCAGATATT
ATCGATTTACGTGCACAGTCACCTGCGCTAGAGGCTATGTTGCAAGAGGCGCAAGGAATCGCTAGTGTGACTGGTGCTGT
AGAGGTGGGGTCTGAACATGTATTGATGGCCTTCCTTCTTCATAAGGATTTAATGGTTTGTCGCCTCCTTGAAGTGGCTG
GTTTTCAATATAAAGATGATAGCGATAAACCTCGCATCATAGATTTACGACGTTCTTTGGAGCGTAATGCTGGTCTTAGC
AAGCAAGATTTGAAGGCAATTCACGATCTTCGTAAACCTAAGAAATCAAAAGCATCTGCGAATTTTGCCAATATGATGCA
ACCTCCTCAATCATCTACTGGTGAACTGGCAGATTATACCAAAGACTTAACTGCATTAGCGGAGTCAGGAAATCTTGATC
CCGTTATTGGACGCGATGAAGAAATTTCACGTATGATTCAGGTTTTGAGTCGTAAAACGAAGAATAATCCTGTCTTGGTA
GGTGAAGCTGGTGTCGGTAAGACAGCACTTGCTCTTGGTTTAGCGCAACGTATTGCTTCAGGCGAAGTACCATTTGAATT
GGCTGATATGCGTATCTTAGAGCTTGACATGATGAGCGTTGTTGCAGGGACACGTTTCCGTGGTGATTTTGAAGAGCGTA
TGAATCAGATCATTGATGAGATTGAAGCTGATGGGAAAATCATTCTCTTTATTGACGAACTACATACGATTATTGGATCT
GGTTCAGGTATTGATAGTACCTTGGATGCGGCTAATATTTTGAAACCGGCCCTTGCGCGCGGGACACTTCACATGGTTGG
AGCAACCACGCAAGCTGAATACCAAAAGCATATTGAGAAAGATGCAGCTTTATCCCGTCGTTTTGCTAAAATTACAATTG
AAGAACCAAGTGTATCTGAAGCAATCGATATTTTAAATGGTTTGCGTTCGTCTTATGAAGACTATCATCGTGTGACTATT
ACGGACGCGGCAGTTGAGACGGCAGTCAAGGCAGCGCATCGCTATTTGACGAGTAAGAATTTGCCTGATTCGGCAATTGA
CCTTTTAGATGAAGCGAGTGCAACTGTTCAAGTTCGTATCAAAAAAGAGGCCAAACGTGAGATAACGCCTTTGGATGAAG
CACTTATATCTGGGGATATTGGGGCTGCTGTTAAACAGTATAAGGCTAACCAAAAGGCAAAATTTCCTAAACCTGCTTTG
GTAGATGCGGATCAGATTATGCAAACTCTTAGTCGTTTATCAGGTATCCCTGTTGAGAAGATGACGCAGACTGACAGCAA
GCGTTACCTGAATCTTGAATCAGAACTCCACAAACGTGTTATTGGTCAAGATGAGGCGGTTTCGGCTATCAGCCGTGCTA
TTCGTCGTAATCAGTCAGGTATTCGTACTGGAAAACGTCCTATTGGCTCCTTCATGTTCCTTGGACCTACTGGTGTTGGT
AAGACAGAATTGGCCAAGGCTTTGGCGGAAGTTCTCTTTGATGATGAATCAGCTTTGCTTCGCTTTGATATGTCGGAGTA
TATGGAAAAATTTGCGGCTAGTCGCCTTAATGGTGCTCCTCCAGGATATGTCGGATATGATGAGGGTGGAGAGTTGACAG
AGAAAGTTCGAAATAAGCCCTACTCAGTTCTTCTCTTTGACGAGATTGAGAAAGCTCATCCAGATATCTTCAATGTTCTC
TTACAGGTTTTGGATGACGGTGTTTTAACAGATAGCCGTGGTCGTAAGGTTGATTTTTCAAACACTATCATCATTATGAC
CTCAAATTTGGGAGCTACAGCTCTTCGTGATGATAAAACTGTTGGTTTTGGTGCTCAAACTATTTCTCATAATCACCAAG
CCATGCAAGCACGCATTATGGAAGAGCTTAAGAAGTCCTATCGTCCAGAATTTATTAACCGTATTGATGAGAAGGTTGTC
TTCCACAGCTTAGAGGAAGAACAACTACATGACATTGTCAAGATTATGGTTAAACCATTAATTTCAGCTCTAGCCGATAA
AGGTATAAGCTTAAAATTCCAACCAGCTGCTCTTAAGCATTTGGCTAAGGATGGCTATGATATTGAGATGGGAGCTCGTC
CATTACGTCGTACGATTCAAACTCAAGTGGAGGACAAGTTGTCTGAGTTATTACTAGGTGGCCAAGTTGTTAGCGGACAG
ACCCTTAAGATTGGTTGCTCGAAAGATAAATTAACCTTTACAGTAGTGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus thermophilus LMG 18311

99.877

100

0.999

  clpC Streptococcus thermophilus LMD-9

99.51

100

0.995

  clpC Streptococcus mutans UA159

72.549

100

0.725

  clpC Streptococcus pneumoniae TIGR4

66.093

99.755

0.659

  clpC Streptococcus pneumoniae Rx1

66.093

99.755

0.659

  clpC Streptococcus pneumoniae D39

66.093

99.755

0.659

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

50.421

100

0.513

  clpC Bacillus subtilis subsp. subtilis str. 168

46.529

100

0.468

  clpE Streptococcus mutans UA159

47.604

76.716

0.365