Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   U5640_RS19300 Genome accession   NZ_CP141223
Coordinates   4183107..4185632 (-) Length   841 a.a.
NCBI ID   WP_340377004.1    Uniprot ID   -
Organism   Streptomyces sp. SS7     
Function   degradation of ComK; degradation of DegU (predicted from homology)   
Competence regulation

Genomic Context


Location: 4178107..4190632
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  - clpC 4183107..4185632 (-) 2526 - - Regulator

Sequence


Protein


Download         Length: 841 a.a.        Molecular weight: 93048.22 Da        Isoelectric Point: 5.9363

>NTDB_id=836615 U5640_RS19300 WP_340377004.1 4183107..4185632(-) (clpC) [Streptomyces sp. SS7]
MFERFTDRARRVVVLAQEEARMLNHNYIGTEHILLGLIHEGEGVAAKALESLGISLEAVRQQVEEIIGQGQQAPSGHIPF
TPRAKKVLELSLREALQLGHNYIGTEHILLGLIREGEGVAAQVLVKLGADLNRVRQQVIQLLSGYQGKETATAGGPAEGT
PSTSLVLDQFGRNLTQAARESKLDPVIGREKEIERVMQVLSRRTKNNPVLIGEPGVGKTAVVEGLAQAIVKGEVPETLKD
KHLYTLDLGALVAGSRYRGDFEERLKKVLKEIRTRGDIILFIDELHTLVGAGAAEGAIDAASILKPMLARGELQTIGATT
LDEYRKHLEKDAALERRFQPIQVAEPSLPHTIEILKGLRDRYEAHHRVSITDEALVQAATLADRYISDRFLPDKAIDLID
EAGSRMRIRRMTAPPDLREFDEKIAGVRRDKESAIDSQDFEKAASLRDKEKQLLAAKAKREKEWKAGDMDVVAEVDGELI
AEVLATATGIPVFKLTEEESSRLLRMEDELHKRVIGQVDAVKALSKAIRRTRAGLKDPKRPGGSFIFAGPSGVGKTELSK
ALAEFLFGDEDALISLDMSEFSEKHTVSRLFGSPPGYVGYEEGGQLTEKVRRKPFSVVLFDEVEKAHPDIFNSLLQILED
GRLTDSQGRVVDFKNTVIIMTTNLGTRDISKGFNLGFAASGDTKSNYERMKNKVSDELKQHFRPEFLNRVDDVVVFPQLT
QEDILRIVDLMISKVDERLKDRDMGIELSQSAKELLSKKGYDPVLGARPLRRTIQREIEDTLSEKILFGELRPGHIVVVD
TEGEGETKTFTFRGEEKSSLPDVPPIEQAAGGAGPNLSKDA

Nucleotide


Download         Length: 2526 bp        

>NTDB_id=836615 U5640_RS19300 WP_340377004.1 4183107..4185632(-) (clpC) [Streptomyces sp. SS7]
ATGTTCGAGAGGTTCACCGACCGCGCGCGGCGGGTTGTCGTCCTGGCTCAGGAAGAAGCCCGGATGCTCAACCACAACTA
CATCGGCACCGAGCACATCCTCCTGGGCCTGATCCACGAGGGTGAGGGTGTCGCCGCCAAGGCCCTTGAGAGCCTCGGGA
TTTCGCTCGAGGCGGTCCGCCAGCAGGTGGAGGAGATCATCGGCCAGGGCCAGCAGGCCCCGTCCGGGCACATCCCCTTC
ACCCCCCGTGCCAAGAAGGTCCTGGAGCTGTCGCTCCGCGAGGCCCTTCAGCTGGGCCACAACTACATCGGCACGGAGCA
CATCCTGCTCGGCCTGATCCGCGAGGGCGAGGGCGTCGCCGCCCAGGTCCTCGTCAAGCTGGGCGCTGATCTCAACCGGG
TGCGGCAGCAGGTCATCCAGCTGCTCTCCGGCTACCAGGGCAAGGAGACCGCCACCGCCGGCGGGCCTGCCGAGGGCACC
CCCTCGACGTCCCTGGTCCTCGACCAGTTCGGCCGGAACCTCACCCAGGCCGCTCGTGAGTCCAAGCTCGACCCGGTCAT
CGGGCGCGAGAAGGAGATCGAGCGGGTCATGCAGGTGCTGTCCCGCCGTACCAAGAACAACCCGGTCCTGATCGGTGAGC
CCGGCGTCGGCAAGACCGCCGTCGTCGAGGGCCTCGCCCAGGCCATCGTCAAGGGCGAGGTGCCCGAGACCCTCAAGGAC
AAGCACCTCTACACCCTCGACCTCGGCGCGCTGGTCGCCGGCTCCCGCTACCGCGGTGACTTCGAGGAGCGCCTGAAGAA
GGTGCTCAAGGAGATCCGCACCCGCGGCGACATCATCCTGTTCATCGACGAGCTCCACACGCTGGTCGGTGCGGGTGCCG
CCGAGGGCGCCATCGACGCCGCTTCGATCCTGAAGCCGATGCTGGCCCGCGGTGAGCTCCAGACCATCGGTGCCACCACG
CTCGACGAGTACCGCAAGCACCTGGAGAAGGACGCCGCCCTCGAGCGCCGCTTCCAGCCCATCCAGGTCGCCGAGCCGTC
GCTGCCGCACACCATCGAGATCCTCAAGGGTCTCCGCGACCGGTACGAGGCCCACCACCGCGTCTCCATCACCGACGAGG
CGCTGGTCCAGGCCGCCACCCTGGCCGACCGGTACATCTCGGACCGCTTCCTGCCGGACAAGGCGATCGACCTGATCGAC
GAGGCCGGTTCCCGGATGCGCATCCGCCGGATGACCGCTCCGCCGGACCTGCGCGAGTTCGACGAGAAGATCGCCGGCGT
GCGCCGCGACAAGGAGTCCGCGATCGACTCGCAGGACTTCGAGAAGGCCGCCTCCCTGCGCGACAAGGAGAAGCAGCTCC
TCGCCGCCAAGGCGAAGCGCGAGAAGGAGTGGAAGGCCGGCGACATGGACGTCGTCGCCGAGGTCGACGGCGAGCTGATC
GCCGAGGTCCTCGCGACGGCCACCGGCATCCCGGTCTTCAAGCTCACGGAGGAGGAGTCGTCCCGCCTGCTGCGCATGGA
GGACGAGCTCCACAAGCGGGTCATCGGCCAGGTCGACGCCGTCAAGGCGCTGTCCAAGGCGATCCGCCGTACCCGTGCCG
GTCTGAAGGACCCGAAGCGTCCGGGTGGTTCGTTCATCTTCGCCGGCCCGTCCGGTGTCGGTAAGACCGAGCTGTCCAAG
GCGCTCGCCGAGTTCCTCTTCGGCGACGAGGACGCGCTGATCTCCCTCGACATGTCGGAGTTCAGCGAGAAGCACACGGT
CTCGCGTCTCTTCGGCTCCCCGCCCGGATACGTGGGCTACGAAGAGGGCGGCCAGCTGACCGAGAAGGTCCGCCGCAAGC
CGTTCTCCGTCGTCCTCTTCGACGAGGTCGAGAAGGCCCACCCGGACATCTTCAACTCGCTGCTGCAGATCCTGGAGGAC
GGTCGCCTGACCGACTCCCAGGGCCGGGTCGTGGACTTCAAGAACACGGTCATCATCATGACGACCAACCTCGGCACCCG
GGACATCTCCAAGGGCTTCAACCTGGGCTTCGCGGCCTCGGGCGACACGAAGTCCAACTACGAGCGCATGAAGAACAAGG
TCTCGGACGAGCTCAAGCAGCACTTCCGCCCCGAGTTCCTCAACCGCGTCGACGACGTGGTCGTCTTCCCGCAGCTGACG
CAGGAGGACATCCTGCGGATCGTCGACCTGATGATCAGCAAGGTGGACGAGCGCCTGAAGGACCGGGACATGGGCATCGA
GCTCTCCCAGTCCGCCAAGGAACTGCTGTCCAAGAAGGGCTACGACCCGGTGCTGGGCGCGCGTCCGCTGCGTCGCACGA
TCCAGCGCGAGATCGAGGACACGCTCTCCGAGAAGATCCTCTTCGGCGAGCTGCGTCCCGGTCACATCGTGGTCGTGGAC
ACCGAGGGCGAGGGTGAGACCAAGACCTTCACCTTCCGCGGCGAGGAGAAGTCGTCCCTGCCCGACGTCCCGCCGATCGA
GCAGGCCGCCGGCGGCGCCGGCCCGAACCTGAGCAAGGACGCGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Bacillus subtilis subsp. subtilis str. 168

61.358

96.314

0.591

  clpC Streptococcus mutans UA159

44.337

98.692

0.438

  clpC Streptococcus thermophilus LMD-9

42.891

99.524

0.427

  clpC Streptococcus thermophilus LMG 18311

42.721

99.643

0.426

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

43.827

96.314

0.422

  clpC Streptococcus pneumoniae D39

44.209

95.482

0.422

  clpC Streptococcus pneumoniae Rx1

44.209

95.482

0.422

  clpC Streptococcus pneumoniae TIGR4

44.236

94.887

0.42

  clpE Streptococcus mutans UA159

54.153

74.435

0.403

  clpE Streptococcus pneumoniae TIGR4

52.97

74.078

0.392

  clpC Lactococcus lactis subsp. cremoris KW2

53.226

73.722

0.392

  clpE Streptococcus pneumoniae Rx1

52.809

74.078

0.391

  clpE Streptococcus pneumoniae R6

52.809

74.078

0.391

  clpE Streptococcus pneumoniae D39

52.809

74.078

0.391