Detailed information    

In silico (bioinformatically predicted)

Overview


Name: clpC
Type: Regulator
Locus tag: SNAG_RS09330
Genome accession: NZ_AP017652
Coordinates: 1845358..1847790 (-)
Length: 810 a.a.
NCBI ID: WP_096408680.1
UniProt ID: A0A1E1GD86
Organism: Streptococcus sp. NPS 308
Function: degradation of ComW (predicted from homology)
Competence regulation
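
The coordinates and the protein length in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a coding sequence that ends in a single stop codon.

```python
import re


def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def span_length(start, end):
    """Length in bp of an inclusive, 1-based interval."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1
```

For this entry, `parse_coordinates("1845358..1847790 (-)")` gives a span of 2433 bp, which corresponds to 811 codons, or 810 amino acids once the stop codon is excluded.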

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) predicted in the genome by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1843365..1878052 1845358..1847790 within 0
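
The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. The following is a minimal sketch under stated assumptions: the function name is hypothetical, the labels other than "within" are invented for illustration, and the distance is assumed to be the gap between the nearest ends (0 when the intervals touch or overlap). The database's own definitions may differ.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples with inclusive coordinates.
    Returns (label, distance_bp). Only the "within" label appears in the
    record; "before", "after" and "overlapping" are assumed labels.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "before", m_start - g_end
    if g_start > m_end:
        return "after", g_start - m_end
    return "overlapping", 0
```

Calling `relative_position((1845358, 1847790), (1843365, 1878052))` returns `("within", 0)`, matching the row above.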


Gene organization within MGE regions


Location: 1843365..1878052
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SNAG_RS09320 (SNAG_1824) pnuC 1843365..1844144 (-) 780 WP_096408675.1 nicotinamide riboside transporter PnuC -
  SNAG_RS09325 (SNAG_1825) - 1844154..1845212 (-) 1059 WP_096408678.1 AAA family ATPase -
  SNAG_RS09330 (SNAG_1826) clpC 1845358..1847790 (-) 2433 WP_096408680.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  SNAG_RS09335 (SNAG_1827) - 1847792..1848250 (-) 459 WP_001211257.1 CtsR family transcriptional regulator -
  SNAG_RS09340 (SNAG_1828) - 1848767..1849645 (-) 879 WP_049537994.1 hypothetical protein -
  SNAG_RS09345 (SNAG_1829) - 1849642..1850115 (-) 474 WP_049537991.1 hypothetical protein -
  SNAG_RS09350 (SNAG_1830) - 1850499..1850909 (-) 411 WP_049537989.1 DUF722 domain-containing protein -
  SNAG_RS09355 (SNAG_1831) - 1851287..1851652 (-) 366 WP_231906667.1 hypothetical protein -
  SNAG_RS09360 (SNAG_1832) - 1851844..1852704 (-) 861 WP_049509664.1 phage major capsid protein -
  SNAG_RS09365 (SNAG_1833) - 1852753..1852983 (-) 231 WP_004261935.1 hypothetical protein -
  SNAG_RS09840 (SNAG_1834) - 1853073..1853234 (-) 162 WP_000997172.1 hypothetical protein -
  SNAG_RS09370 (SNAG_1835) - 1853231..1854076 (-) 846 WP_049537987.1 ATP-binding protein -
  SNAG_RS09375 (SNAG_1836) - 1854088..1854978 (-) 891 WP_049537985.1 conserved phage C-terminal domain-containing protein -
  SNAG_RS09380 (SNAG_1837) - 1854965..1855237 (-) 273 WP_049490554.1 hypothetical protein -
  SNAG_RS09385 (SNAG_1838) - 1855239..1855448 (-) 210 WP_061853314.1 hypothetical protein -
  SNAG_RS09390 (SNAG_1839) - 1855450..1855650 (-) 201 WP_049537981.1 hypothetical protein -
  SNAG_RS09400 (SNAG_1840) - 1855987..1856247 (-) 261 WP_049537979.1 hypothetical protein -
  SNAG_RS09405 (SNAG_1841) - 1856259..1856861 (-) 603 WP_049537977.1 Rha family transcriptional regulator -
  SNAG_RS09410 (SNAG_1842) - 1857024..1857209 (-) 186 WP_000909284.1 hypothetical protein -
  SNAG_RS09415 (SNAG_1843) - 1857351..1857911 (+) 561 WP_049487577.1 helix-turn-helix transcriptional regulator -
  SNAG_RS09420 (SNAG_1844) - 1857916..1858185 (+) 270 WP_096408682.1 DUF4177 domain-containing protein -
  SNAG_RS09425 (SNAG_1845) - 1858372..1859136 (+) 765 WP_042900397.1 type II toxin-antitoxin system PemK/MazF family toxin -
  SNAG_RS09435 (SNAG_1846) - 1859448..1860614 (+) 1167 WP_096408685.1 site-specific integrase -
  SNAG_RS09440 (SNAG_1847) - 1860824..1864615 (-) 3792 WP_231906668.1 FctA domain-containing protein -
  SNAG_RS09445 (SNAG_1848) - 1864819..1865547 (-) 729 WP_096408687.1 ABC transporter ATP-binding protein -
  SNAG_RS09450 (SNAG_1849) - 1865547..1866554 (-) 1008 WP_096408690.1 ABC transporter substrate-binding protein -
  SNAG_RS09455 (SNAG_1850) - 1866591..1867349 (-) 759 WP_096408692.1 ABC transporter permease -
  SNAG_RS09460 (SNAG_1851) - 1867312..1867602 (-) 291 WP_096408916.1 thiamine-binding protein -
  SNAG_RS09465 (SNAG_1852) polA 1867881..1870514 (-) 2634 WP_096408695.1 DNA polymerase I -
  SNAG_RS09470 (SNAG_1853) - 1870643..1875118 (-) 4476 WP_096408698.1 SIALI-17 repeat-containing surface protein -
  SNAG_RS09475 (SNAG_1854) - 1875226..1876335 (-) 1110 WP_096408700.1 SH3 domain-containing protein -
  SNAG_RS09480 (SNAG_1855) - 1876426..1876698 (-) 273 WP_001278164.1 Veg family protein -
  SNAG_RS09485 (SNAG_1856) dnaB 1876700..1878052 (-) 1353 WP_096408703.1 replicative DNA helicase -

Sequence


Protein


Length: 810 a.a.   Molecular weight: 90469.30 Da   Isoelectric point: 6.0180

>NTDB_id=68377 SNAG_RS09330 WP_096408680.1 1845358..1847790(-) (clpC) [Streptococcus sp. NPS 308]
MNYSKALNECIESAYMVASHFGARYLESWHLLIAMSNHSYSVAGATLNDYPYEMDRLEEVALELTETDYSQDETFTELPF
SHRLEVLFAEAEYVASVVHAKVLGTEHVLYAILHDGNALATRILERAGFSYEDQKDQVRIAALRRNLEERAGWTREDLKA
LRQRHRTVTDKQNSMANMMGMPQAQSGGLEDYTHDLTEQARSGKLEPVIGRDKEISRMIQILSRKTKNNPVLVGDAGVGK
TALALGLAQRIASGDVPTEMAKMRVLELDLMNVVAGTRFRGDFEERMNNIIKDIEEDGKVILFIDELHTIMGSGSGIDST
LDAANILKPALARGTLRTVGATTQEEYQKHIEKDAALSRRFAKVTIEEPSLADSMTILQGLKATYEKHHRVQITDEAVET
AVKMAHRYLTSRHLPDSAIDLLDEAAATVQNKSKHVRMDESDLSPADKALMDGKWKQAGQLIAKEQEVPVYKDLVTETEI
LTTLSRLSGIPVQKLTQTDAKKYLNLEAELHKRVIGQDQAVSSISRAIRRNQSGIRSHKRPIGSFMFLGPTGVGKTELAK
ALAEVFFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQVLDD
GVLTDSKGRKVDFSNTIIIMTSNLGATALRDDKTVGFGAKDIRFDQENMEKRIFEELKKAYRPEFINRIDEKVVFHSLDS
EHMQEIVKIMVKPLIASLAEKGIDLKLQASALKLLASQGYNPEMGARPLRRTLQTEVEDKLAELLLKGELVAGKTLKIGV
KAGQLKFDIA

Nucleotide


Length: 2433 bp

>NTDB_id=68377 SNAG_RS09330 WP_096408680.1 1845358..1847790(-) (clpC) [Streptococcus sp. NPS 308]
ATGAACTATTCAAAAGCATTGAATGAATGTATCGAAAGTGCCTACATGGTTGCGAGCCATTTTGGAGCTCGATATCTAGA
GTCTTGGCATTTATTGATTGCCATGTCCAATCACAGTTACAGTGTGGCAGGTGCGACTTTAAACGATTATCCTTATGAAA
TGGATCGTTTAGAAGAGGTTGCGTTGGAACTGACTGAAACCGACTATAGCCAAGATGAAACCTTTACGGAATTACCCTTT
TCCCATCGTTTGGAGGTCCTCTTTGCGGAAGCAGAGTATGTAGCCTCAGTGGTCCACGCAAAGGTGCTAGGGACAGAGCA
TGTCCTCTATGCAATCTTGCATGACGGCAATGCCTTGGCAACTCGCATCTTGGAGAGAGCGGGCTTTTCTTATGAAGACC
AGAAAGATCAGGTCAGAATTGCTGCTCTTCGTCGCAATCTAGAAGAACGTGCAGGATGGACAAGAGAAGATCTCAAGGCT
TTGCGTCAACGTCATCGCACAGTAACTGACAAGCAAAATTCCATGGCCAATATGATGGGCATGCCTCAAGCTCAAAGTGG
TGGTCTAGAGGACTACACGCATGACCTAACGGAGCAAGCGCGTTCTGGCAAGTTAGAGCCAGTTATAGGTCGAGACAAGG
AAATCTCACGTATGATTCAGATTTTGAGTCGGAAGACCAAGAACAATCCGGTCTTGGTTGGAGATGCTGGTGTCGGGAAA
ACAGCTCTAGCCCTTGGCCTTGCCCAGCGTATTGCTAGTGGGGATGTACCTACGGAAATGGCCAAGATGCGCGTTTTGGA
GCTTGATTTGATGAATGTTGTTGCGGGGACACGTTTCCGTGGAGATTTTGAAGAGCGCATGAACAATATCATCAAGGATA
TCGAGGAAGATGGTAAAGTAATCCTCTTTATTGATGAACTTCACACCATCATGGGTTCTGGTAGCGGTATTGATTCGACT
CTAGATGCGGCCAATATCTTGAAGCCAGCCTTGGCACGTGGAACTTTGAGAACGGTTGGTGCAACTACTCAGGAAGAATA
CCAAAAACACATCGAAAAAGATGCTGCCCTTTCTCGTCGTTTTGCCAAAGTGACGATTGAAGAGCCAAGTTTAGCTGACA
GCATGACCATTTTGCAAGGTTTGAAGGCTACCTATGAGAAACACCACCGTGTGCAAATCACAGATGAAGCTGTCGAAACA
GCTGTTAAGATGGCACATCGTTACCTGACCAGTCGTCACTTGCCAGACTCTGCTATCGACCTCTTGGATGAGGCAGCAGC
AACTGTGCAAAACAAATCCAAGCATGTGAGAATGGACGAATCCGACTTGAGTCCAGCTGACAAGGCCTTGATGGATGGCA
AGTGGAAACAAGCAGGCCAGCTAATCGCAAAAGAGCAGGAAGTCCCTGTCTATAAAGACTTGGTGACAGAGACTGAAATC
CTGACTACTTTGAGTCGCTTGTCAGGGATTCCAGTCCAAAAACTGACGCAAACGGATGCCAAGAAATATCTGAACTTGGA
AGCTGAACTACACAAACGTGTCATCGGTCAAGATCAAGCTGTTTCAAGTATTAGCCGTGCTATTCGTCGCAATCAGTCAG
GGATTCGCAGTCACAAGCGTCCGATTGGTTCCTTTATGTTCCTAGGGCCGACAGGAGTCGGAAAGACCGAATTGGCAAAG
GCTTTGGCAGAAGTCTTCTTTGATGACGAATCAGCCCTTATCCGCTTTGATATGAGTGAGTATATGGAGAAATTCGCAGC
TAGCCGTCTCAATGGAGCTCCTCCGGGCTATGTAGGCTACGAAGAAGGTGGGGAGTTGACAGAGAAGGTTCGCAACAAAC
CATACTCTGTTCTCCTCTTTGACGAGGTAGAGAAGGCCCACCCAGACATCTTTAATGTTCTCTTGCAGGTTCTGGACGAC
GGTGTCTTGACAGACAGCAAGGGGCGCAAGGTGGACTTTTCAAATACCATCATCATCATGACGTCAAACCTTGGTGCGAC
AGCCCTTCGTGATGACAAGACAGTCGGCTTTGGGGCCAAGGACATTCGTTTTGACCAGGAAAATATGGAAAAACGCATCT
TCGAAGAGTTGAAAAAAGCTTATCGACCAGAGTTTATCAACCGTATTGATGAAAAGGTGGTCTTCCATAGCTTGGATAGT
GAACACATGCAGGAAATTGTCAAGATCATGGTGAAACCATTGATTGCTAGCCTGGCAGAGAAGGGTATCGACTTGAAACT
GCAAGCTTCGGCACTGAAGTTACTAGCTAGTCAAGGTTACAATCCAGAAATGGGAGCCCGTCCACTTCGCAGAACCCTGC
AAACAGAAGTGGAAGACAAGCTAGCAGAACTCCTTCTCAAGGGAGAACTGGTAGCAGGCAAGACCCTCAAAATTGGTGTC
AAAGCTGGACAATTGAAATTTGATATTGCTTAA
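
Both FASTA records above share the same header layout, so the fields can be pulled out with a single regular expression. This is a sketch only: the field names and the `parse_header` helper are hypothetical, and the pattern is inferred from the two headers in this record rather than from any published format description.

```python
import re

# Pattern inferred from the headers in this record (an assumption):
# >NTDB_id=<id> <locus tag> <protein id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line):
    """Split a FASTA header of the form shown above into a dict of fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Convert the numeric fields so they can be used in arithmetic.
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields
```

Applied to the header of either record, this yields the locus tag `SNAG_RS09330`, the protein ID `WP_096408680.1`, the gene name `clpC`, and the minus-strand coordinates 1845358..1847790.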


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1E1GD86

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Streptococcus pneumoniae D39 96.049 100 0.96
  clpC Streptococcus pneumoniae TIGR4 96.049 100 0.96
  clpC Streptococcus pneumoniae Rx1 96.049 100 0.96
  clpC Streptococcus mutans UA159 67.282 100 0.675
  clpC Streptococcus thermophilus LMD-9 66.83 100 0.672
  clpC Streptococcus thermophilus LMG 18311 66.708 100 0.67
  clpC Lactococcus lactis subsp. lactis strain DGCC12653 49.576 100 0.505
  clpC Bacillus subtilis subsp. subtilis str. 168 44.704 100 0.448
  clpE Streptococcus mutans UA159 48.387 76.543 0.37


Multiple sequence alignment