Detailed information    

In silico: bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   B1761_RS06145 Genome accession   NZ_CP019935
Coordinates   1135576..1138026 (+) Length   816 a.a.
NCBI ID   WP_023909319.1    UniProt ID   -
Organism   Streptococcus thermophilus strain APC151     
Function   degradation of ComX (predicted from homology)   
Competence regulation
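
The coordinates and the protein length listed above are consistent with each other. The sketch below checks the arithmetic, assuming the coordinates are 1-based and inclusive and that the coding region ends in a single stop codon; the function name `feature_lengths` is hypothetical and not part of the database.

```python
def feature_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding region.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("coding region length is not a multiple of 3")
    aa_len = nt_len // 3 - 1  # the stop codon does not encode a residue
    return nt_len, aa_len


nt_len, aa_len = feature_lengths(1135576, 1138026)
print(nt_len, aa_len)  # 2451 816
```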

Genomic Context


Location: 1130576..1143026
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  B1761_RS06115 (B1761_06200)   -   1130900..1131855 (-)   956   Protein_1127   S66 peptidase family protein   -
  B1761_RS06125 (B1761_06210)   rpsB   1132378..1133145 (+)   768   WP_002944541.1   30S ribosomal protein S2   -
  B1761_RS06130 (B1761_06215)   tsf   1133256..1134296 (+)   1041   WP_002949105.1   translation elongation factor Ts   -
  B1761_RS06135 (B1761_06220)   -   1134406..1134876 (-)   471   WP_011225296.1   COG2426 family protein   -
  B1761_RS06140 (B1761_06225)   -   1135120..1135575 (+)   456   WP_011225297.1   CtsR family transcriptional regulator   -
  B1761_RS06145 (B1761_06230)   clpC   1135576..1138026 (+)   2451   WP_023909319.1   ATP-dependent Clp protease ATP-binding subunit   Regulator
  B1761_RS10100 (B1761_06235)   pbp3   1138154..1139411 (-)   1258   Protein_1133   D-alanyl-D-alanine carboxypeptidase PBP3   -
  B1761_RS06160 (B1761_06245)   pbp3   1139556..1140803 (-)   1248   WP_014621034.1   D-alanyl-D-alanine carboxypeptidase PBP3   -
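
The listed location (1130576..1143026) matches the clpC coordinates extended by 5000 bp on each side. The sketch below reproduces that window and tests whether a neighbouring feature lies inside it. The 5000 bp flank is inferred from the numbers on this page rather than documented, and the helper names are hypothetical.

```python
def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a 1-based inclusive interval by `flank` bp on each side.

    The default flank of 5000 bp is an assumption inferred from this record.
    """
    return max(1, start - flank), end + flank


def in_window(feature: tuple[int, int], window: tuple[int, int]) -> bool:
    """Return True if the feature lies entirely inside the window."""
    f_start, f_end = feature
    w_start, w_end = window
    return w_start <= f_start and f_end <= w_end


window = context_window(1135576, 1138026)
print(window)                                 # (1130576, 1143026)
print(in_window((1132378, 1133145), window))  # rpsB -> True
```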

Sequence


Protein


Length: 816 a.a.        Molecular weight: 90283.51 Da        Isoelectric point: 5.8921

>NTDB_id=218936 B1761_RS06145 WP_023909319.1 1135576..1138026(+) (clpC) [Streptococcus thermophilus strain APC151]
MTIYSRKMQAIFHRAQLEAERFESPFLETWHVLLAMVEVPGSVAYLTFTDFEDRIHSEEIETAAVLAMEKRPKDLSESDI
IDLRAQSPALEAMLQEAQGIASVTGAVEVGSEHVLMAFLLHKDLMVCRLLEVAGFQYKDDSDKPRIIDLRRSLERNAGLS
KQDLKAIHDLRKPKKSKASANFANMMQPPQSSTGELADYTKDLTALAESGNLDPVIGRDEEISRMIQVLSRKTKNNPVLV
GEAGVGKTALALGLAQRIASGEVPFELADMRILELDMMSVVAGTRFRGDFEERMNQIIDEIEADGKIILFIDELHTIIGS
GSGIDSTLDAANILKPALARGTLHMVGATTQAEYQKHIEKDAALSRRFAKITIEEPSVSEAIDILNGLRSSYEDYHRVTI
TDAAVETAVKAAHRYLTSKNLPDSAIDLLDEASATVQVRIKKEAKREITPLDEALISGDIGAAVKQYKANQKAKFPKPAL
VDADQIMQTLSRLSGIPVEKMTQTDSKRYLNLESELHKRVIGQDEAVSAISRAIRRNQSGIRTGKRPIGSFMFLGPTGVG
KTELAKALAEVLFDDESALLRFDMSEYMEKFAASRLNGAPPGYVGYDEGGELTEKVRNKPYSVLLFDEIEKAHPDIFNIL
LQVLDDGVLTDSRGRKVDFSNTIIIMTSNLGATALRDDKTVGFGAQTISHNHQAMQARIMEELKKSYRPEFINRIDEKVV
FHSLEEEQLHDIVKIMVKPLISALADKGISLKFQPAALKHLAKNGYDIEMGARPLRRTIQTQVEDKLSELLLGGQVVSGQ
TLKIGCSKDKLTFTVV

Nucleotide


Length: 2451 bp

>NTDB_id=218936 B1761_RS06145 WP_023909319.1 1135576..1138026(+) (clpC) [Streptococcus thermophilus strain APC151]
ATGACGATATATTCAAGAAAAATGCAGGCCATTTTCCATCGTGCTCAGCTTGAAGCGGAGCGTTTTGAAAGTCCTTTCTT
GGAGACTTGGCATGTGCTTCTAGCTATGGTTGAGGTTCCGGGATCTGTAGCCTACTTAACATTTACTGATTTTGAGGACC
GTATTCATTCGGAAGAGATTGAGACAGCTGCTGTATTGGCTATGGAGAAGAGGCCAAAAGACTTGTCGGAATCAGATATT
ATCGATTTACGTGCACAGTCACCTGCGCTAGAGGCTATGTTGCAAGAGGCGCAAGGAATCGCTAGTGTGACTGGTGCTGT
AGAGGTGGGGTCTGAACATGTATTGATGGCCTTCCTTCTTCATAAGGATTTAATGGTTTGTCGCCTCCTTGAAGTGGCTG
GTTTTCAATATAAAGATGATAGCGATAAACCTCGCATCATAGATTTACGACGTTCTTTGGAGCGTAATGCTGGTCTTAGC
AAGCAAGATTTGAAGGCAATTCACGATCTTCGTAAACCTAAGAAATCAAAAGCATCTGCGAATTTTGCCAATATGATGCA
ACCTCCTCAATCATCTACTGGTGAACTGGCAGATTATACCAAAGACTTAACTGCATTAGCGGAGTCAGGAAATCTTGATC
CCGTTATTGGACGCGATGAAGAAATTTCACGTATGATTCAGGTTTTGAGTCGTAAAACGAAGAATAATCCTGTCTTGGTA
GGTGAAGCTGGTGTCGGTAAGACAGCACTTGCTCTTGGTTTAGCGCAACGTATTGCTTCAGGCGAAGTACCATTTGAATT
GGCTGATATGCGTATCTTAGAGCTTGACATGATGAGCGTTGTTGCAGGGACACGTTTCCGTGGTGATTTTGAAGAGCGTA
TGAATCAGATCATTGATGAGATTGAAGCTGATGGGAAAATCATTCTCTTTATTGACGAACTACATACGATTATTGGATCT
GGTTCAGGTATTGATAGTACCTTGGATGCGGCTAATATTTTGAAACCGGCCCTTGCGCGCGGGACACTTCACATGGTTGG
AGCAACCACGCAAGCTGAATACCAAAAGCATATTGAGAAAGATGCAGCTTTATCCCGTCGTTTTGCTAAAATTACAATTG
AAGAACCAAGTGTATCTGAAGCAATCGATATTTTAAATGGTTTGCGTTCGTCTTATGAAGACTATCATCGTGTGACTATT
ACGGACGCGGCAGTTGAGACGGCAGTCAAGGCAGCGCATCGCTATTTGACGAGTAAGAATTTGCCTGATTCGGCAATTGA
CCTTTTAGATGAAGCGAGTGCAACTGTTCAAGTTCGTATCAAAAAAGAGGCCAAACGTGAGATAACGCCTTTGGATGAAG
CACTTATATCTGGGGATATTGGGGCTGCTGTTAAACAGTATAAGGCTAACCAAAAGGCAAAATTTCCTAAACCTGCTTTG
GTAGATGCGGATCAGATTATGCAAACTCTTAGTCGTTTATCAGGTATCCCTGTTGAGAAGATGACGCAGACTGACAGCAA
GCGTTACCTGAATCTTGAATCAGAACTCCACAAACGTGTTATTGGTCAAGATGAGGCGGTTTCGGCTATCAGCCGTGCTA
TTCGTCGTAATCAGTCAGGTATTCGTACTGGAAAACGTCCTATTGGCTCCTTCATGTTCCTTGGACCTACTGGTGTTGGT
AAGACAGAATTGGCTAAGGCTTTGGCGGAAGTTCTCTTTGATGATGAATCAGCTTTGCTTCGCTTTGATATGTCGGAGTA
TATGGAAAAATTTGCGGCTAGTCGCCTTAATGGTGCTCCTCCAGGGTATGTCGGATATGATGAGGGTGGAGAGTTGACAG
AGAAAGTTCGAAATAAGCCCTACTCAGTTCTTCTCTTTGACGAGATTGAGAAAGCTCATCCAGATATCTTCAACATTCTC
TTACAGGTTTTGGATGACGGTGTTTTAACAGATAGCCGTGGTCGTAAGGTTGATTTTTCAAACACTATCATCATTATGAC
CTCAAATTTGGGAGCTACAGCTCTTCGTGATGATAAAACTGTTGGTTTTGGTGCTCAAACTATTTCTCATAATCACCAAG
CCATGCAAGCACGCATTATGGAAGAGCTTAAGAAGTCCTATCGTCCAGAATTTATTAACCGTATTGATGAGAAGGTTGTC
TTCCACAGCTTAGAGGAAGAACAACTACATGACATTGTCAAGATTATGGTTAAACCATTAATTTCAGCTCTAGCCGATAA
AGGTATAAGCTTAAAATTCCAACCAGCTGCTCTTAAGCATTTGGCTAAGAATGGCTATGATATTGAGATGGGAGCTCGTC
CATTACGTCGTACGATTCAAACTCAAGTGGAGGACAAGTTGTCTGAGTTATTACTAGGTGGCCAAGTTGTTAGCGGACAG
ACCCTTAAGATTGGTTGCTCGAAAGATAAATTAACCTTTACAGTAGTGTAA
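
Both FASTA records above share one header layout: database ID, locus tag, protein accession, coordinates with strand, gene name in parentheses, and organism in square brackets. The sketch below is one possible parser for that layout. The regular expression is derived only from the two headers shown here, so other records in the database may need adjustments; `parse_header` and the field names are hypothetical.

```python
import re

# Pattern inferred from the headers on this page, not from a published spec.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a FASTA header of the layout shown above into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (
    ">NTDB_id=218936 B1761_RS06145 WP_023909319.1 "
    "1135576..1138026(+) (clpC) [Streptococcus thermophilus strain APC151]"
)
fields = parse_header(header)
print(fields["gene"], fields["start"], fields["end"], fields["strand"])
```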


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  clpC   Streptococcus thermophilus LMG 18311   99.632   100   0.996
  clpC   Streptococcus thermophilus LMD-9   99.51   100   0.995
  clpC   Streptococcus mutans UA159   72.672   100   0.727
  clpC   Streptococcus pneumoniae TIGR4   65.971   99.755   0.658
  clpC   Streptococcus pneumoniae Rx1   65.971   99.755   0.658
  clpC   Streptococcus pneumoniae D39   65.971   99.755   0.658
  clpC   Lactococcus lactis subsp. lactis strain DGCC12653   50.542   100   0.515
  clpC   Bacillus subtilis subsp. subtilis str. 168   46.65   100   0.469
  clpE   Streptococcus mutans UA159   47.604   76.716   0.365


Multiple sequence alignment