Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   DQM52_RS09950 Genome accession   NZ_LS483367
Coordinates   1970886..1973330 (-) Length   814 a.a.
NCBI ID   WP_111681919.1    Uniprot ID   -
Organism   Streptococcus dysgalactiae subsp. equisimilis strain NCTC5371     
Function   degradation of ComX (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1968073..1979552 1970886..1973330 within 0


Gene organization within MGE regions


Location: 1968073..1979552
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQM52_RS09935 (NCTC5371_02023) - 1968073..1968540 (+) 468 WP_111681917.1 8-oxo-dGTP diphosphatase -
  DQM52_RS09940 (NCTC5371_02024) groL 1968757..1970382 (-) 1626 WP_111681918.1 chaperonin GroEL -
  DQM52_RS09945 (NCTC5371_02025) groES 1970418..1970708 (-) 291 WP_003054738.1 co-chaperone GroES -
  DQM52_RS09950 (NCTC5371_02026) clpC 1970886..1973330 (-) 2445 WP_111681919.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  DQM52_RS09955 (NCTC5371_02027) - 1973330..1973791 (-) 462 WP_003054759.1 CtsR family transcriptional regulator -
  DQM52_RS09960 - 1973987..1974190 (-) 204 WP_003053018.1 cold-shock protein -
  DQM52_RS09965 (NCTC5371_02029) - 1974772..1975503 (+) 732 WP_231872349.1 amidohydrolase family protein -
  DQM52_RS09970 (NCTC5371_02030) - 1975490..1976985 (-) 1496 Protein_1890 IS1182 family transposase -
  DQM52_RS09975 (NCTC5371_02031) - 1977009..1977269 (-) 261 Protein_1891 DUF4355 domain-containing protein -
  DQM52_RS09980 (NCTC5371_02032) - 1977407..1977631 (-) 225 WP_015058113.1 hypothetical protein -
  DQM52_RS09985 (NCTC5371_02033) - 1977684..1978103 (-) 420 WP_015058114.1 HD domain-containing protein -
  DQM52_RS09990 (NCTC5371_02034) - 1978100..1978306 (-) 207 WP_015058115.1 hypothetical protein -
  DQM52_RS09995 (NCTC5371_02035) - 1978308..1979552 (-) 1245 WP_231872350.1 minor capsid protein -

Sequence


Protein


Download         Length: 814 a.a.        Molecular weight: 90296.97 Da        Isoelectric Point: 6.8331

>NTDB_id=1138866 DQM52_RS09950 WP_111681919.1 1970886..1973330(-) (clpC) [Streptococcus dysgalactiae subsp. equisimilis strain NCTC5371]
MIMYSLKMQEIFRQAQFQAARFDSQYLETWHILLAMARVDHSLAGLVLSEFDAKVAVEEYEAAAILAMGKSPKYQVSHID
FRPQSKTLTNLLQFAQAISQVTKDQEVGSEHVLFAILLNPDIMATRLLEMAGYTIKDKGNGEPRLADLRKAIEIHAGYSK
EIIKAIHELRKPKKTKNQGSFSDMMKPPSTAGDLADFTRDLTEMASQGLLEAVIGRDAEVSRMIQVLSRKTKNNPVLVGD
AGVGKTALAYGLAQRIANGAIPYELQDMRVLELDMMSVVAGTRFRGDFEERMNQIIGDIESDGKIILFVDELHTIMGSGS
GIDSTLDAANILKPALSRGTLHMVGATTQEEYQKHIEKDAALSRRFAKILIEEPNVEDAYQILLGLKGSYETYHNVTIAN
QAVRTAVKMAHRYLTSKNLPDSAIDLLDEASATVQGMVKKSRPEIITPLDQALIDGDMKKASRLLAKDVKGQHRKPTAVT
EEDILTTLSKLSGIPLEKLSQADSKKYLNLEKELHKRVIGQEDAVSAISRAIRRNQSGIRTGKRPIGSFMFLGPTGVGKT
ELAKALAEVLFDDASALIRFDMSEYMEKFAASRLNGAPPGYVGYDEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGMLTDSRGRKVDFSNTIIIMTSNLGATALRDDKTVGFGAKGISHDHQAMEKRILEELKKAYRPEFINRIDEKVVFH
SLTQDNMREVVKIMVQPLMATLAEKGITLKFQPMALKYLSEEGYDVEMGARPLRRTLQTQVEDKLSELILAGELASGHTL
KIGLSHGKLSFNLE

Nucleotide


Download         Length: 2445 bp        

>NTDB_id=1138866 DQM52_RS09950 WP_111681919.1 1970886..1973330(-) (clpC) [Streptococcus dysgalactiae subsp. equisimilis strain NCTC5371]
ATGATCATGTATTCATTGAAGATGCAAGAAATTTTCAGGCAGGCGCAGTTTCAAGCAGCCCGCTTTGATAGCCAATACCT
AGAAACTTGGCATATATTGCTAGCTATGGCGAGGGTTGACCACTCCCTAGCGGGTTTGGTGCTAAGCGAATTTGATGCTA
AGGTTGCAGTGGAAGAATATGAGGCTGCAGCTATTTTAGCGATGGGCAAAAGTCCTAAATACCAAGTGTCTCACATTGAC
TTTAGGCCTCAGTCAAAAACCTTAACGAACCTTTTGCAATTTGCCCAAGCTATCAGTCAAGTCACCAAAGACCAAGAGGT
AGGTTCAGAGCATGTTCTCTTTGCCATTTTGCTTAATCCAGATATTATGGCTACTCGCTTATTAGAGATGGCGGGTTATA
CTATCAAGGATAAAGGAAACGGGGAGCCTCGCTTAGCTGATTTACGAAAAGCTATCGAGATTCATGCAGGCTATAGCAAG
GAAATAATTAAGGCTATCCACGAGTTGCGTAAGCCAAAGAAAACCAAAAATCAAGGTTCTTTTTCAGACATGATGAAGCC
ACCAAGCACGGCTGGGGACTTGGCGGACTTTACACGTGATTTGACCGAGATGGCAAGTCAAGGTCTCTTAGAGGCAGTTA
TTGGCCGTGATGCTGAAGTGTCACGGATGATTCAAGTACTGAGTCGTAAAACAAAGAATAACCCTGTTCTTGTGGGTGAT
GCAGGTGTGGGTAAAACGGCCCTTGCCTACGGCCTAGCCCAACGTATTGCTAATGGTGCTATTCCTTATGAGCTGCAAGA
CATGCGTGTTCTAGAATTAGACATGATGAGTGTTGTTGCGGGGACCCGTTTTCGTGGGGATTTTGAAGAGCGCATGAATC
AAATCATTGGCGATATTGAGTCAGATGGCAAAATTATTCTCTTCGTAGATGAATTGCACACCATTATGGGATCAGGAAGT
GGTATTGATAGCACGCTTGATGCTGCCAATATTTTAAAACCAGCCTTATCTCGTGGAACCCTCCATATGGTGGGTGCAAC
AACGCAGGAAGAATACCAAAAACACATTGAAAAAGATGCCGCTCTTTCGCGCCGCTTTGCTAAGATTTTAATTGAAGAAC
CTAATGTAGAAGATGCTTATCAGATTCTGCTAGGACTAAAAGGCTCTTACGAGACTTACCATAATGTGACCATTGCTAAT
CAGGCTGTTAGAACCGCTGTGAAAATGGCACATCGCTATCTGACCAGCAAAAACCTTCCGGATTCTGCCATTGATTTGTT
GGATGAAGCCAGTGCTACAGTGCAAGGTATGGTTAAAAAATCTAGGCCAGAAATCATCACGCCATTAGATCAAGCTTTGA
TTGATGGCGATATGAAGAAGGCCTCTCGTTTGTTGGCAAAAGACGTTAAAGGGCAACATCGCAAGCCAACAGCTGTGACA
GAAGAGGACATCCTGACGACCTTGAGCAAGCTATCAGGTATTCCACTGGAAAAACTCAGCCAAGCTGATAGCAAAAAATA
CCTTAATTTGGAAAAAGAACTGCATAAGCGCGTGATTGGGCAAGAAGATGCTGTCTCAGCTATTTCTAGAGCCATTCGCC
GTAATCAGTCAGGCATTCGTACAGGTAAACGTCCAATCGGTTCTTTCATGTTCCTTGGTCCAACAGGGGTTGGTAAGACC
GAGTTGGCAAAAGCCTTGGCAGAAGTTCTCTTTGATGACGCGTCCGCCCTTATCCGCTTTGATATGTCAGAGTATATGGA
GAAATTTGCGGCTTCTCGCCTTAATGGCGCACCTCCAGGCTATGTTGGTTACGATGAAGGTGGTGAATTAACAGAGAAGG
TCAGAAACAAGCCTTATTCTGTGCTCCTCTTTGACGAGGTGGAAAAAGCTCACCCTGATATTTTCAACGTCCTCTTACAA
GTGCTTGATGATGGCATGTTGACAGATAGCCGTGGGCGTAAAGTGGACTTCTCAAATACCATTATTATCATGACAAGTAA
TCTAGGGGCAACAGCTCTGCGTGATGATAAAACAGTTGGCTTTGGGGCAAAAGGCATTAGCCATGACCACCAAGCCATGG
AAAAACGGATTTTGGAAGAGTTGAAAAAAGCTTACCGACCAGAATTTATCAACCGAATTGATGAAAAGGTTGTCTTCCAC
AGCCTCACTCAGGACAATATGAGAGAAGTAGTCAAGATTATGGTGCAGCCGTTGATGGCTACTTTGGCAGAAAAAGGCAT
TACCCTCAAATTCCAGCCAATGGCTCTCAAGTATTTATCAGAAGAAGGGTATGATGTGGAAATGGGTGCTCGTCCATTAC
GCCGCACTTTGCAAACTCAGGTGGAAGATAAATTGTCTGAATTGATTCTTGCTGGTGAATTGGCAAGTGGTCATACCCTG
AAAATTGGCCTTTCTCATGGAAAACTCAGCTTTAACCTTGAGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus mutans UA159

76.384

99.877

0.763

  clpC Streptococcus thermophilus LMD-9

72.549

100

0.727

  clpC Streptococcus thermophilus LMG 18311

72.549

100

0.727

  clpC Streptococcus pneumoniae TIGR4

67.036

99.877

0.67

  clpC Streptococcus pneumoniae Rx1

67.036

99.877

0.67

  clpC Streptococcus pneumoniae D39

67.036

99.877

0.67

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

48.561

100

0.498

  clpC Bacillus subtilis subsp. subtilis str. 168

42.599

100

0.435


Multiple sequence alignment