Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   MP619_RS10855 Genome accession   NZ_CP095081
Coordinates   2181760..2184204 (-) Length   814 a.a.
NCBI ID   WP_129556297.1    Uniprot ID   -
Organism   Streptococcus dysgalactiae strain WJ001     
Function   degradation of ComX (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2179631..2191798 2181760..2184204 within 0


Gene organization within MGE regions


Location: 2179631..2191798
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MP619_RS10845 (MP619_10815) groL 2179631..2181256 (-) 1626 WP_268227775.1 chaperonin GroEL -
  MP619_RS10850 (MP619_10820) groES 2181292..2181582 (-) 291 WP_003054738.1 co-chaperone GroES -
  MP619_RS10855 (MP619_10825) clpC 2181760..2184204 (-) 2445 WP_129556297.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  MP619_RS10860 (MP619_10830) - 2184204..2184665 (-) 462 WP_003054759.1 CtsR family transcriptional regulator -
  MP619_RS10865 (MP619_10835) - 2184859..2185062 (-) 204 WP_003053018.1 cold-shock protein -
  MP619_RS10870 (MP619_10840) - 2185277..2186299 (-) 1023 WP_129555386.1 IS30 family transposase -
  MP619_RS10875 (MP619_10845) - 2186389..2186967 (+) 579 WP_232022241.1 hypothetical protein -
  MP619_RS10880 (MP619_10850) - 2186999..2187157 (-) 159 WP_129556337.1 hypothetical protein -
  MP619_RS10885 (MP619_10855) - 2187518..2188267 (+) 750 WP_129556338.1 S24 family peptidase -
  MP619_RS10890 (MP619_10860) - 2188426..2189448 (+) 1023 WP_129555386.1 IS30 family transposase -
  MP619_RS10895 (MP619_10865) - 2189659..2190603 (+) 945 WP_129556339.1 Abi family protein -
  MP619_RS10900 (MP619_10870) - 2190725..2191798 (+) 1074 WP_129556340.1 site-specific integrase -

Sequence


Protein


Download         Length: 814 a.a.        Molecular weight: 90264.83 Da        Isoelectric Point: 6.4813

>NTDB_id=675118 MP619_RS10855 WP_129556297.1 2181760..2184204(-) (clpC) [Streptococcus dysgalactiae strain WJ001]
MIMYSLKMQEIFRQAQFQAARFDSQYLETWHILLAMARVDHSLAGLVLSEFDAKVAVEEYEAAAILAMGKSPKDQVSHID
FRPQSKTLTNLLQFAQAISQVTKDQEVGSEHVLFAILLNPDIMATRLLEMAGYTIKDKGNGEPRLADLRKAIEIHAGYSK
EIIKAIHELRKPKKTKNQGSFSDMMKPPSTAGDLADFTRDLTEMASQGLLEPVIGRDAEVSRMIQVLSRKTKNNPVLVGD
AGVGKTALAYGLAQRIANGAIPYELQDMRVLELDMMSVVAGTRFRGDFEERMNQIIDDIESDGKIILFVDELHTIMGSGS
GIDSTLDAANILKPALSRGTLHMVGATTQEEYQKHIEKDAALSRRFAKILIEEPNVEDAYQILLGLKGSYETYHNVTIAD
QAVRTAVKMAHRYLTSKNLPDSAIDLLDEASATVQGMVKKSSPEIITPLDQALIDGDMKKASRLLAKDVKGQHRKPTAVT
EEDILTTLSKLSGIPLEKLSQADSKKYLNLEKELHKRVIGQEDAVSAISRAIRRNQSGIRTGKRPIGSFMFLGPTGVGKT
ELAKALAEVLFDDASALIRFDMSEYMEKFAASRLNGAPPGYVGYDEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGMLTDSRGRKVDFSNTIIIMTSNLGATALRDDKTVGFGAKGISHDHQAMEKRILEELKKAYRPEFINRIDEKVVFH
SLTQDNMREVVKIMVQPLMATLAEKGITLKFQPMALKYLSEEGYDVEMGARPLRRTLQTQVEDKLSELILAGELASGHTL
KIGLSHGKLSFNIE

Nucleotide


Download         Length: 2445 bp        

>NTDB_id=675118 MP619_RS10855 WP_129556297.1 2181760..2184204(-) (clpC) [Streptococcus dysgalactiae strain WJ001]
ATGATCATGTATTCATTGAAGATGCAAGAAATTTTCAGGCAAGCGCAGTTTCAAGCAGCCCGCTTTGATAGTCAATACCT
AGAAACTTGGCATATATTGCTAGCTATGGCGAGGGTTGATCACTCCCTAGCGGGCTTGGTGCTAAGCGAATTTGATGCTA
AGGTTGCAGTGGAAGAATATGAGGCTGCAGCTATTTTAGCGATGGGCAAAAGCCCTAAAGACCAAGTGTCTCACATTGAC
TTTAGGCCTCAGTCAAAGACCTTAACGAACCTTTTGCAATTTGCCCAAGCTATCAGTCAAGTCACCAAAGACCAAGAGGT
AGGTTCAGAGCATGTTCTCTTTGCCATTTTGCTTAATCCAGATATTATGGCGACTCGCTTATTAGAGATGGCTGGTTATA
CTATCAAGGATAAAGGAAACGGGGAGCCTCGCTTAGCTGATTTACGAAAAGCTATCGAGATTCATGCAGGCTATAGCAAG
GAAATAATTAAGGCTATCCACGAGTTGCGTAAGCCAAAGAAAACCAAAAATCAAGGTTCTTTTTCAGACATGATGAAGCC
ACCAAGCACGGCTGGGGACTTGGCGGACTTTACACGTGATTTGACCGAGATGGCAAGCCAAGGTCTCTTAGAGCCAGTTA
TTGGGCGTGATGCTGAAGTGTCACGGATGATTCAAGTACTGAGTCGTAAAACCAAGAATAACCCTGTTCTTGTGGGTGAT
GCAGGTGTGGGTAAAACGGCCCTTGCCTACGGCCTTGCTCAACGTATTGCTAATGGTGCTATTCCTTACGAACTGCAAGA
CATGCGTGTTCTAGAGTTAGACATGATGAGTGTTGTTGCGGGAACCCGTTTTCGTGGGGACTTTGAAGAGCGCATGAATC
AAATCATTGACGATATTGAGTCAGATGGCAAGATTATTCTCTTCGTAGATGAATTGCACACCATTATGGGATCAGGAAGT
GGTATTGATAGCACGCTTGATGCTGCCAATATTTTAAAACCAGCCTTATCTCGTGGAACTCTCCATATGGTGGGTGCAAC
AACGCAGGAAGAATACCAAAAACATATTGAAAAAGATGCCGCTCTTTCGCGCCGCTTTGCTAAGATTTTAATTGAAGAAC
CTAATGTAGAAGATGCTTATCAGATTCTGCTAGGACTAAAAGGCTCTTACGAGACTTACCATAATGTGACCATTGCTGAT
CAGGCTGTTAGAACCGCTGTGAAAATGGCACATCGCTATCTGACCAGCAAAAACCTTCCGGATTCTGCCATTGATTTGTT
GGATGAAGCCAGTGCTACAGTGCAAGGTATGGTTAAAAAATCTAGTCCAGAAATCATCACGCCATTAGATCAAGCTTTGA
TTGATGGCGATATGAAGAAAGCCTCTCGTTTGTTGGCAAAAGACGTTAAAGGGCAACATCGCAAGCCAACAGCTGTGACA
GAAGAGGATATCCTGACGACCTTGAGCAAGCTATCAGGTATTCCACTGGAAAAACTCAGCCAAGCTGATAGCAAAAAATA
CCTTAATTTGGAAAAAGAACTGCATAAGCGCGTGATTGGGCAAGAAGATGCTGTCTCAGCTATTTCTAGAGCCATTCGCC
GTAATCAGTCAGGCATTCGTACAGGTAAACGTCCAATCGGTTCTTTCATGTTCCTTGGTCCAACAGGGGTTGGTAAGACC
GAGTTGGCAAAAGCTTTGGCAGAAGTTCTCTTTGATGACGCGTCCGCCCTTATCCGCTTTGATATGTCAGAGTATATGGA
AAAATTTGCGGCTTCTCGCCTTAATGGCGCACCTCCAGGCTATGTCGGTTACGATGAAGGTGGTGAATTAACAGAGAAGG
TCAGAAACAAGCCTTATTCTGTGCTCCTCTTTGACGAGGTGGAAAAAGCTCACCCTGATATTTTCAACGTCCTCTTACAA
GTGCTTGATGATGGCATGTTGACAGATAGCCGTGGGCGTAAAGTGGACTTCTCAAACACCATTATTATCATGACAAGTAA
TCTAGGGGCAACAGCTCTGCGTGATGATAAAACAGTTGGCTTTGGGGCAAAAGGTATCAGCCATGACCACCAAGCCATGG
AAAAACGGATTTTGGAAGAGTTGAAGAAAGCTTACCGACCAGAATTTATCAACCGAATTGATGAAAAGGTTGTCTTCCAC
AGCCTCACTCAGGACAATATGAGAGAAGTGGTCAAGATTATGGTGCAACCGTTGATGGCTACTTTGGCAGAAAAAGGCAT
TACCCTCAAATTCCAGCCAATGGCCCTCAAGTATTTATCAGAAGAAGGGTATGATGTGGAAATGGGTGCTCGTCCATTGC
GCCGCACTTTGCAAACTCAGGTGGAAGATAAATTGTCTGAATTGATTCTTGCTGGTGAATTGGCAAGTGGTCATACCCTG
AAAATTGGCCTTTCTCATGGAAAACTCAGCTTTAACATTGAGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus mutans UA159

76.753

99.877

0.767

  clpC Streptococcus thermophilus LMG 18311

73.039

100

0.732

  clpC Streptococcus thermophilus LMD-9

73.039

100

0.732

  clpC Streptococcus pneumoniae TIGR4

67.282

99.877

0.672

  clpC Streptococcus pneumoniae Rx1

67.282

99.877

0.672

  clpC Streptococcus pneumoniae D39

67.282

99.877

0.672

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

48.801

100

0.5

  clpC Bacillus subtilis subsp. subtilis str. 168

42.96

100

0.439