Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   DQL37_RS08895 Genome accession   NZ_LS483352
Coordinates   1712596..1715040 (-) Length   814 a.a.
NCBI ID   WP_111685848.1    Uniprot ID   -
Organism   Streptococcus pyogenes strain NCTC12052     
Function   degradation of ComX (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1710461..1748205 1712596..1715040 within 0


Gene organization within MGE regions


Location: 1710461..1748205
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQL37_RS08885 (NCTC12052_01753) groL 1710461..1712092 (-) 1632 WP_002982320.1 chaperonin GroEL -
  DQL37_RS08890 (NCTC12052_01754) groES 1712128..1712418 (-) 291 WP_002991292.1 co-chaperone GroES -
  DQL37_RS08895 (NCTC12052_01755) clpC 1712596..1715040 (-) 2445 WP_111685848.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  DQL37_RS08900 (NCTC12052_01756) - 1715040..1715501 (-) 462 WP_002982312.1 CtsR family transcriptional regulator -
  DQL37_RS08905 - 1715697..1715900 (-) 204 WP_002991299.1 cold-shock protein -
  DQL37_RS08910 (NCTC12052_01758) - 1716499..1717569 (-) 1071 WP_063629765.1 tyrosine-type recombinase/integrase -
  DQL37_RS08915 (NCTC12052_01759) - 1717697..1717999 (-) 303 WP_111685849.1 hypothetical protein -
  DQL37_RS08920 (NCTC12052_01760) - 1717992..1718378 (-) 387 WP_063629763.1 ImmA/IrrE family metallo-endopeptidase -
  DQL37_RS08925 (NCTC12052_01761) - 1718382..1718732 (-) 351 WP_063629762.1 helix-turn-helix domain-containing protein -
  DQL37_RS08935 (NCTC12052_01762) - 1719129..1719338 (+) 210 WP_111685850.1 helix-turn-helix domain-containing protein -
  DQL37_RS08940 (NCTC12052_01763) - 1719338..1719700 (+) 363 WP_063629760.1 hypothetical protein -
  DQL37_RS08945 (NCTC12052_01764) - 1719722..1720006 (+) 285 WP_063629759.1 hypothetical protein -
  DQL37_RS08950 (NCTC12052_01765) - 1720033..1720251 (+) 219 WP_063629758.1 hypothetical protein -
  DQL37_RS08955 (NCTC12052_01766) dnaB 1720260..1721606 (+) 1347 WP_111685851.1 replicative DNA helicase -
  DQL37_RS08960 (NCTC12052_01767) - 1721593..1722339 (+) 747 WP_111685852.1 conserved phage C-terminal domain-containing protein -
  DQL37_RS08965 (NCTC12052_01768) - 1722340..1723161 (+) 822 WP_111685853.1 ATP-binding protein -
  DQL37_RS08970 (NCTC12052_01769) - 1723161..1723388 (+) 228 WP_063629755.1 hypothetical protein -
  DQL37_RS09770 (NCTC12052_01770) - 1723402..1723572 (+) 171 WP_168388561.1 hypothetical protein -
  DQL37_RS08975 (NCTC12052_01771) - 1723569..1723838 (+) 270 WP_109982045.1 hypothetical protein -
  DQL37_RS08980 (NCTC12052_01772) - 1723841..1724590 (+) 750 WP_111685854.1 DNA-methyltransferase -
  DQL37_RS08985 (NCTC12052_01773) - 1724638..1725384 (+) 747 WP_231907105.1 DUF1642 domain-containing protein -
  DQL37_RS08990 (NCTC12052_01774) - 1725396..1725614 (+) 219 WP_111685855.1 hypothetical protein -
  DQL37_RS08995 (NCTC12052_01775) ssbA 1725611..1726003 (+) 393 WP_032460876.1 single-stranded DNA-binding protein Machinery gene
  DQL37_RS09000 (NCTC12052_01776) - 1726017..1726295 (+) 279 WP_029713970.1 hypothetical protein -
  DQL37_RS09010 (NCTC12052_01778) - 1726390..1726617 (+) 228 WP_063629751.1 hypothetical protein -
  DQL37_RS09015 (NCTC12052_01779) - 1726589..1727053 (+) 465 WP_063629750.1 hypothetical protein -
  DQL37_RS09020 (NCTC12052_01780) - 1727163..1727705 (+) 543 WP_063629749.1 site-specific integrase -
  DQL37_RS09025 (NCTC12052_01781) - 1727875..1728057 (+) 183 WP_063629748.1 hypothetical protein -
  DQL37_RS09920 - 1728446..1728640 (+) 195 WP_227876734.1 HNH endonuclease -
  DQL37_RS09035 (NCTC12052_01782) - 1728758..1729243 (+) 486 WP_000601034.1 hypothetical protein -
  DQL37_RS09040 (NCTC12052_01783) - 1729236..1730948 (+) 1713 WP_111685857.1 terminase TerL endonuclease subunit -
  DQL37_RS09045 (NCTC12052_01784) - 1730957..1732099 (+) 1143 WP_111685858.1 phage portal protein -
  DQL37_RS09050 (NCTC12052_01785) - 1732150..1732692 (+) 543 WP_063629746.1 HK97 family phage prohead protease -
  DQL37_RS09055 (NCTC12052_01786) - 1732703..1733965 (+) 1263 WP_063629745.1 phage major capsid protein -
  DQL37_RS09060 (NCTC12052_01787) - 1733985..1734323 (+) 339 WP_032460894.1 hypothetical protein -
  DQL37_RS09065 (NCTC12052_01788) - 1734320..1734625 (+) 306 WP_032460893.1 head-tail adaptor protein -
  DQL37_RS09070 (NCTC12052_01789) - 1734625..1734972 (+) 348 WP_032460892.1 hypothetical protein -
  DQL37_RS09075 (NCTC12052_01790) - 1734959..1735303 (+) 345 WP_063629744.1 hypothetical protein -
  DQL37_RS09080 (NCTC12052_01791) - 1735318..1735989 (+) 672 WP_063629743.1 phage tail protein -
  DQL37_RS09085 (NCTC12052_01792) - 1735993..1736466 (+) 474 WP_063629742.1 hypothetical protein -
  DQL37_RS09095 (NCTC12052_01793) - 1736656..1739391 (+) 2736 WP_111685860.1 phage tail tape measure protein -
  DQL37_RS09100 (NCTC12052_01794) - 1739388..1740110 (+) 723 WP_063629740.1 hypothetical protein -
  DQL37_RS09105 (NCTC12052_01795) - 1740111..1742054 (+) 1944 WP_231907106.1 phage tail spike protein -
  DQL37_RS09110 (NCTC12052_01796) - 1742051..1743166 (+) 1116 WP_063629043.1 hyaluronoglucosaminidase -
  DQL37_RS09115 (NCTC12052_01797) - 1743181..1744962 (+) 1782 WP_111685861.1 gp58-like family protein -
  DQL37_RS09120 (NCTC12052_01798) - 1744971..1745399 (+) 429 WP_111685862.1 DUF1617 family protein -
  DQL37_RS09125 (NCTC12052_01799) - 1745402..1746040 (+) 639 WP_111685863.1 hypothetical protein -
  DQL37_RS09130 (NCTC12052_01800) - 1746050..1746322 (+) 273 WP_111685864.1 hypothetical protein -
  DQL37_RS09135 (NCTC12052_01801) - 1746319..1746546 (+) 228 WP_003058873.1 phage holin -
  DQL37_RS09140 (NCTC12052_01803) - 1746662..1747879 (+) 1218 WP_111685865.1 peptidoglycan amidohydrolase family protein -
  DQL37_RS10045 (NCTC12052_01804) prx 1748044..1748205 (+) 162 WP_309543630.1 hypothetical protein Regulator

Sequence


Protein


Download         Length: 814 a.a.        Molecular weight: 90621.28 Da        Isoelectric Point: 6.7693

>NTDB_id=1138068 DQL37_RS08895 WP_111685848.1 1712596..1715040(-) (clpC) [Streptococcus pyogenes strain NCTC12052]
MIMYSTKMQDIFRQAQFQAARFDSHCLETWHVLLAMVAVDNSLANMILSEYDAQVAIEEYEAAAILAMGKTPKEQLSRVD
FRPQSKTLTNLLAFAQAISQITRDQEVGSEHVLFAILLNPDIMASRLLEIAGYQIKDNGNGQPRLADLRKAIERHAGYSK
EMIKAIHELRKPKKTKTQGTFSDMMKPPSTAGELSDFTRDLTEMARQGLLESVIGRDQEVSRMIQVLSRKTKNNPVLVGD
AGVGKTALAYGLAQRIANGAIPYELKEMRVLELDMMSVVAGTRFRGDFEERMNQIIDDIEADGQIILFVDELHTIMGSGS
GIDSTLDAANILKPALSRGTLHMVGATTQEEYQKHIEKDAALSRRFAKILIEEPNTEDAYQILMGLKLSYETYHNVSISN
EAVKTAVKMAHRYLTSKNLPDSAIDLLDEASAAVQNMVKKSAPETLTPIDQALINGDMKKVSHLLAKEAKGQMKNPTPVT
EDDILATLSKLSGIPLEKLTQADSKKYLNLEKELHKRVIGQDAAVTAISRAIRRNQSGIRTGKRPIGSFMFLGPTGVGKT
ELAKALAEVLFDDEAALIRFDMSEYMEKFAASRLNGAPPGYVGYDEGGELTQKVRNKPYSVLLFDEVEKAHPDIFNVLLQ
VLDDGILTDSRGRKVDFSNTIIIMTSNLGATALRDDKTVGFGVKGIHQDHQAMEKRILEELRKTYRPEFINRIDEKVVFH
SLTQDNMRDVVKIMVQPLITTLAEKGITLKIQPLALKHLSEVGYDEHMGARPLRRTLQTEIEDKLSELILSRELTSGHTL
KIGLSYGKLTFHIA

Nucleotide


Download         Length: 2445 bp        

>NTDB_id=1138068 DQL37_RS08895 WP_111685848.1 1712596..1715040(-) (clpC) [Streptococcus pyogenes strain NCTC12052]
ATGATTATGTATTCAACGAAGATGCAAGACATTTTTAGACAGGCGCAGTTCCAAGCTGCTCGCTTTGATAGCCATTGCCT
GGAAACTTGGCATGTTTTGTTAGCTATGGTAGCTGTAGATAATTCTTTAGCAAATATGATTTTAAGTGAATATGATGCCC
AAGTCGCCATAGAAGAATATGAAGCTGCAGCTATTTTAGCCATGGGCAAAACCCCTAAAGAACAGTTGTCTCGTGTAGAC
TTCAGACCTCAATCTAAAACCTTGACTAACTTGTTAGCTTTTGCGCAGGCTATTAGCCAAATCACTAGGGATCAAGAAGT
CGGCTCTGAGCATGTCTTATTTGCTATTTTATTGAATCCAGATATTATGGCGAGTCGTTTGTTAGAAATAGCTGGCTATC
AGATAAAAGATAACGGCAATGGGCAGCCGCGATTAGCTGACTTGCGAAAAGCAATAGAACGTCATGCAGGTTACAGTAAG
GAAATGATCAAGGCTATTCACGAACTACGTAAGCCTAAAAAAACGAAAACACAAGGGACCTTTTCAGATATGATGAAGCC
ACCAAGTACAGCTGGTGAGTTGAGTGATTTCACAAGAGATTTGACTGAAATGGCAAGACAAGGTTTGTTAGAATCGGTGA
TTGGACGTGACCAAGAAGTATCTCGTATGATTCAGGTACTAAGTCGTAAAACGAAAAACAATCCTGTCTTGGTAGGTGAT
GCAGGTGTTGGTAAAACTGCGCTTGCTTATGGCCTTGCTCAACGGATTGCAAATGGCGCTATTCCTTATGAACTTAAGGA
GATGCGTGTCCTAGAATTAGACATGATGAGTGTGGTAGCAGGAACCCGTTTTCGTGGGGATTTTGAAGAGCGCATGAATC
AAATCATTGATGATATTGAAGCTGATGGTCAGATTATTCTTTTTGTTGATGAACTACATACTATTATGGGTTCTGGCAGT
GGTATTGACAGTACACTTGATGCGGCTAACATTTTAAAACCAGCATTATCGCGCGGCACTCTTCATATGGTTGGAGCAAC
AACTCAAGAAGAATACCAAAAACATATTGAAAAAGATGCAGCTCTTTCGCGTCGTTTTGCTAAAATATTAATTGAAGAAC
CTAATACAGAAGATGCTTATCAGATTTTGATGGGCCTAAAATTATCTTATGAGACCTACCATAATGTCTCGATATCAAAT
GAGGCAGTTAAAACAGCTGTAAAAATGGCACACCGTTATTTAACCAGTAAAAATCTCCCTGATTCAGCTATCGATTTATT
AGATGAAGCTAGTGCTGCTGTGCAAAACATGGTGAAAAAATCAGCACCTGAGACTTTAACACCAATAGACCAAGCTCTTA
TCAATGGTGATATGAAAAAAGTATCTCACCTCTTAGCTAAAGAAGCAAAAGGTCAGATGAAAAACCCAACACCGGTGACA
GAAGATGATATTTTGGCAACCTTGAGTAAGTTATCGGGAATTCCACTTGAAAAACTGACGCAAGCTGATAGTAAAAAATA
CCTCAATTTAGAAAAAGAACTGCATAAGCGTGTGATTGGTCAGGATGCTGCTGTTACGGCTATTTCAAGAGCCATTCGTC
GCAATCAGTCAGGTATTCGAACAGGAAAACGTCCTATTGGATCATTTATGTTTCTTGGCCCAACAGGAGTAGGTAAGACA
GAACTAGCAAAGGCCCTTGCAGAAGTTCTCTTTGATGATGAAGCAGCGCTTATTCGTTTTGATATGTCTGAGTACATGGA
AAAATTCGCAGCGTCTAGGCTTAATGGAGCACCTCCTGGTTATGTCGGCTATGATGAAGGAGGTGAACTGACACAGAAAG
TTAGAAATAAACCTTATTCAGTCTTGCTTTTTGATGAAGTGGAAAAAGCACATCCTGATATTTTTAACGTTCTCCTTCAA
GTATTAGATGATGGTATATTGACTGATAGTCGTGGGCGTAAGGTCGATTTTTCAAATACTATTATTATCATGACCAGCAA
TCTTGGCGCAACAGCCCTGCGTGATGATAAAACGGTCGGTTTTGGGGTCAAAGGCATTCACCAAGACCATCAAGCTATGG
AGAAACGTATTTTAGAAGAATTAAGAAAAACTTACCGCCCAGAATTTATCAATCGTATTGATGAAAAAGTGGTCTTTCAT
AGTCTGACCCAAGATAACATGCGCGATGTGGTTAAAATCATGGTACAGCCTCTGATTACTACATTGGCAGAAAAAGGTAT
TACCCTTAAAATTCAGCCTTTGGCCTTGAAACATTTGTCCGAGGTCGGCTATGATGAGCATATGGGGGCAAGACCATTAC
GTCGAACGCTGCAAACTGAGATAGAAGATAAGCTATCAGAGCTTATTCTTTCTCGAGAATTGACAAGTGGGCATACGCTA
AAAATTGGATTATCATATGGCAAATTAACGTTTCACATAGCTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus mutans UA159

75.031

99.877

0.749

  clpC Streptococcus thermophilus LMG 18311

71.201

100

0.714

  clpC Streptococcus thermophilus LMD-9

71.078

100

0.713

  clpC Streptococcus pneumoniae TIGR4

65.848

100

0.658

  clpC Streptococcus pneumoniae Rx1

65.848

100

0.658

  clpC Streptococcus pneumoniae D39

65.848

100

0.658

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

47.852

100

0.493

  clpC Bacillus subtilis subsp. subtilis str. 168

42.326

100

0.434


Multiple sequence alignment