Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpC   Type   Regulator
Locus tag   KVP03_RS07070 Genome accession   NZ_CP077959
Coordinates   1490782..1492875 (-) Length   697 a.a.
NCBI ID   WP_230923008.1    Uniprot ID   -
Organism   Streptococcus equi subsp. zooepidemicus strain SEZ_14-145     
Function   degradation of ComX (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1454732..1500585 1490782..1492875 within 0


Gene organization within MGE regions


Location: 1454732..1500585
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KVP03_RS06920 (KVP03_06905) - 1454732..1455331 (+) 600 WP_230923000.1 glycoside hydrolase family 73 protein -
  KVP03_RS06925 (KVP03_06910) - 1455893..1456102 (+) 210 WP_012515721.1 helix-turn-helix transcriptional regulator -
  KVP03_RS06930 (KVP03_06915) - 1456092..1456484 (+) 393 WP_021320833.1 hypothetical protein -
  KVP03_RS06935 (KVP03_06920) - 1456946..1457176 (+) 231 WP_014622960.1 hypothetical protein -
  KVP03_RS06940 - 1457258..1457470 (+) 213 Protein_1319 conjugal transfer protein TraA -
  KVP03_RS06945 (KVP03_06925) - 1457584..1458942 (+) 1359 WP_230923001.1 SLC13 family permease -
  KVP03_RS06950 (KVP03_06930) - 1459389..1460102 (+) 714 WP_230923002.1 glycoside hydrolase family 73 protein -
  KVP03_RS06955 (KVP03_06935) - 1460330..1460710 (+) 381 WP_012677870.1 DUF1149 family protein -
  KVP03_RS06960 (KVP03_06940) - 1460710..1461558 (+) 849 WP_012679645.1 DegV family protein -
  KVP03_RS06965 (KVP03_06945) - 1461658..1462869 (+) 1212 WP_043029790.1 CCA tRNA nucleotidyltransferase -
  KVP03_RS06970 (KVP03_06950) - 1462862..1464730 (+) 1869 WP_230923003.1 ABC-F family ATP-binding cassette domain-containing protein -
  KVP03_RS06975 (KVP03_06955) - 1464835..1466580 (+) 1746 WP_012677874.1 ABC transporter ATP-binding protein -
  KVP03_RS06980 (KVP03_06960) - 1466580..1468352 (+) 1773 WP_111678181.1 ABC transporter ATP-binding protein -
  KVP03_RS06985 (KVP03_06965) gdhA 1468493..1469842 (+) 1350 WP_230923004.1 NADP-specific glutamate dehydrogenase -
  KVP03_RS06990 (KVP03_06970) - 1469974..1470384 (+) 411 WP_012677877.1 peptide deformylase -
  KVP03_RS06995 (KVP03_06975) - 1470577..1472583 (-) 2007 WP_230923005.1 surface-anchored 5'-nucleotidase -
  KVP03_RS07000 (KVP03_06980) - 1473031..1474752 (+) 1722 WP_230923296.1 ABC transporter ATP-binding protein -
  KVP03_RS07005 (KVP03_06985) - 1474754..1476505 (+) 1752 WP_111678177.1 ABC transporter ATP-binding protein -
  KVP03_RS07010 (KVP03_06990) mvk 1477041..1477919 (+) 879 WP_111692493.1 mevalonate kinase -
  KVP03_RS07015 (KVP03_06995) mvaD 1477901..1478845 (+) 945 WP_111692492.1 diphosphomevalonate decarboxylase -
  KVP03_RS07020 (KVP03_07000) - 1478838..1479836 (+) 999 WP_111690156.1 phosphomevalonate kinase -
  KVP03_RS07025 (KVP03_07005) fni 1479839..1480831 (+) 993 WP_230923006.1 type 2 isopentenyl-diphosphate Delta-isomerase -
  KVP03_RS07030 (KVP03_07010) - 1481106..1482389 (-) 1284 WP_206158030.1 hydroxymethylglutaryl-CoA reductase, degradative -
  KVP03_RS07035 (KVP03_07015) - 1482376..1483551 (-) 1176 WP_230923007.1 hydroxymethylglutaryl-CoA synthase -
  KVP03_RS07040 (KVP03_07020) - 1483736..1484575 (+) 840 WP_043025902.1 thymidylate synthase -
  KVP03_RS07045 (KVP03_07025) - 1484656..1485153 (+) 498 WP_111692489.1 dihydrofolate reductase -
  KVP03_RS07050 (KVP03_07030) - 1485173..1485343 (+) 171 WP_012515696.1 hypothetical protein -
  KVP03_RS07055 (KVP03_07035) clpX 1485379..1486608 (+) 1230 WP_012677889.1 ATP-dependent Clp protease ATP-binding subunit ClpX Regulator
  KVP03_RS07060 (KVP03_07040) yihA 1486618..1487217 (+) 600 WP_111692488.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  KVP03_RS07065 (KVP03_07045) hupY 1487590..1490592 (+) 3003 WP_154299001.1 leucine-rich repeat adhesin HupY/LrrG -
  KVP03_RS07070 (KVP03_07050) clpC 1490782..1492875 (-) 2094 WP_230923008.1 AAA family ATPase Regulator
  KVP03_RS07075 (KVP03_07055) ndk 1493246..1493665 (+) 420 WP_012515690.1 nucleoside-diphosphate kinase -
  KVP03_RS07080 (KVP03_07060) - 1494165..1494971 (+) 807 WP_126436747.1 DNA/RNA non-specific endonuclease -
  KVP03_RS07085 (KVP03_07065) - 1495256..1496221 (-) 966 WP_230923009.1 polysaccharide deacetylase family protein -
  KVP03_RS07090 (KVP03_07070) - 1496393..1497676 (+) 1284 WP_111692482.1 homoserine dehydrogenase -
  KVP03_RS07095 (KVP03_07075) thrB 1497678..1498538 (+) 861 WP_037581040.1 homoserine kinase -
  KVP03_RS07100 (KVP03_07080) - 1498703..1499962 (+) 1260 WP_111692481.1 bifunctional folylpolyglutamate synthase/dihydrofolate synthase -
  KVP03_RS07105 (KVP03_07085) folE 1500019..1500585 (+) 567 WP_308733129.1 GTP cyclohydrolase I FolE -

Sequence


Protein


Download         Length: 697 a.a.        Molecular weight: 76676.67 Da        Isoelectric Point: 5.1735

>NTDB_id=585665 KVP03_RS07070 WP_230923008.1 1490782..1492875(-) (clpC) [Streptococcus equi subsp. zooepidemicus strain SEZ_14-145]
MTTYYGKDPFGNMDDIFNQLMGNIGGYRSENRHYLVNGREMTPEEFQEYRKTGKLAATEGQDKQAGSLKKDGILTKLGTN
LTEEARQGRLDPVIGRNDEIQATAEILARRTKNNPVLVGDAGVGKTAVVEGLAQAIVNGDVPAAIKNKEIISIDISGLEA
GTQYRGSFEENIQNMVKEVKEAGNIILFFDEIHQILGAGSAGSDSGSKGLADILKPALSRGELTLIGATTQDEYRNTILK
NAALARRFNEVKVNAPSAEDTFHILMGIRNLYEQHHNVILPDSVLKAAIDYAIQYIPQRSLPDKAIDLVDMTAAHLAAQH
PVTDLKSLEAEIAKQKELQEKAVAAEDFEKALTAKTRIEELQKQINHHSEGQKVTATVNDIAESVERLTGVPVSNMGAND
LERLKEISNRLKGHVIGQDGAVEAVARAIRRNRAGFDDGNRPIGSFLFVGPTGVGKTELAKQLALDMFGSKDAIIRLDMS
EYSDRTAVSKLIGTTAGYVGYDDNQNTLTERVRRSPYSIILLDEIEKADPQVITLLLQVLDDGRLTDGQGNTINFKNTVI
IATSNAGFGHQAGDSADQEPAIMERIAPYFRPEFLNRFNGVIEFNHLAKKDLQEIVALMLAEVNQTLAKKGISLEVTDDV
KERLIDLGYDHAMGVRPLRRVIEQEIRDRITDYYLDHPTAKHLLASLDQDTITISEK

Nucleotide


Download         Length: 2094 bp        

>NTDB_id=585665 KVP03_RS07070 WP_230923008.1 1490782..1492875(-) (clpC) [Streptococcus equi subsp. zooepidemicus strain SEZ_14-145]
ATGACAACTTATTATGGAAAAGACCCCTTTGGAAATATGGACGATATTTTCAATCAATTAATGGGAAACATTGGTGGCTA
TCGCAGTGAAAATCGGCACTATCTTGTCAATGGACGTGAGATGACACCAGAGGAATTTCAGGAATACCGGAAAACCGGTA
AGCTAGCTGCAACTGAAGGACAAGACAAACAGGCTGGTAGCCTTAAAAAAGACGGCATTCTAACCAAGCTCGGAACAAAT
CTGACTGAAGAAGCTCGTCAAGGAAGGCTAGATCCTGTCATCGGACGAAACGATGAGATTCAAGCAACCGCAGAAATTCT
AGCTAGGCGAACCAAGAATAACCCTGTGCTTGTCGGTGACGCTGGCGTTGGTAAAACTGCTGTTGTCGAAGGTTTAGCCC
AGGCTATCGTCAATGGTGATGTGCCAGCTGCTATCAAAAATAAAGAGATTATCTCAATTGATATTTCTGGCCTTGAGGCT
GGCACACAGTATCGTGGCTCCTTTGAAGAAAACATTCAAAACATGGTTAAAGAGGTTAAAGAGGCTGGTAATATTATCCT
CTTCTTTGACGAAATTCATCAAATTCTTGGTGCAGGGTCAGCTGGTTCAGATTCAGGCTCTAAGGGCTTGGCTGATATTC
TAAAGCCAGCCCTCTCACGCGGTGAGTTGACCTTGATTGGTGCCACAACTCAAGATGAGTATCGCAATACTATTTTAAAA
AATGCTGCCCTTGCTCGTCGTTTCAACGAGGTTAAGGTTAATGCTCCTTCAGCAGAGGATACCTTCCACATTTTGATGGG
CATTCGCAACCTCTACGAGCAGCACCACAATGTGATCCTGCCAGATTCTGTCTTAAAAGCAGCTATTGATTATGCTATCC
AATATATTCCACAACGAAGCCTGCCTGATAAAGCCATTGACCTTGTTGATATGACGGCTGCCCACCTAGCAGCTCAGCAC
CCTGTCACTGATCTAAAGTCACTTGAAGCAGAAATCGCTAAGCAAAAGGAGCTACAGGAAAAGGCTGTTGCAGCAGAAGA
CTTTGAAAAGGCCTTGACTGCTAAGACACGCATTGAGGAGCTCCAAAAGCAAATCAATCATCACAGTGAAGGGCAAAAGG
TCACTGCCACCGTCAATGACATTGCAGAATCTGTTGAACGACTAACTGGTGTCCCAGTCTCAAATATGGGGGCTAATGAT
TTAGAAAGGCTCAAAGAAATCAGTAATCGTCTTAAAGGCCATGTTATCGGACAAGATGGTGCAGTTGAGGCTGTTGCTCG
TGCGATTCGACGCAATAGAGCAGGCTTTGATGATGGCAATCGTCCAATTGGCTCCTTCCTTTTTGTTGGTCCGACTGGTG
TCGGCAAGACCGAGCTGGCTAAGCAGTTAGCCCTTGATATGTTCGGCTCAAAGGACGCTATCATTCGCTTAGACATGTCT
GAATATAGTGACCGCACAGCTGTTTCCAAATTAATCGGAACAACAGCTGGCTATGTTGGCTATGATGATAATCAAAATAC
CCTTACAGAACGTGTCCGTCGCAGTCCTTACTCTATCATCTTGCTTGATGAAATCGAAAAAGCAGACCCACAGGTCATTA
CACTGCTTCTTCAAGTGCTTGATGATGGGCGATTGACAGATGGACAAGGCAACACGATCAATTTCAAAAATACGGTTATC
ATCGCAACCTCTAATGCTGGTTTTGGCCATCAAGCAGGAGATAGTGCTGATCAAGAGCCTGCTATCATGGAGCGTATTGC
ACCTTACTTTAGACCAGAATTCTTGAATCGCTTCAACGGTGTCATCGAATTCAACCATTTAGCGAAAAAGGATCTACAAG
AAATCGTTGCTTTAATGCTAGCAGAGGTGAACCAAACCCTTGCTAAAAAAGGGATTAGCCTTGAGGTGACTGATGATGTT
AAAGAGCGTTTGATCGATCTGGGCTATGATCATGCGATGGGAGTTCGTCCACTTAGACGTGTCATCGAACAAGAAATTCG
TGACCGTATTACTGACTACTACCTGGATCACCCGACTGCTAAGCACCTATTAGCAAGTCTTGATCAAGATACTATTACGA
TAAGTGAGAAGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpC Lactococcus lactis subsp. cremoris KW2

48.207

100

0.482

  clpE Streptococcus mutans UA159

48.17

97.991

0.472

  clpE Streptococcus pneumoniae Rx1

50.794

90.387

0.459

  clpE Streptococcus pneumoniae D39

50.794

90.387

0.459

  clpE Streptococcus pneumoniae R6

50.794

90.387

0.459

  clpE Streptococcus pneumoniae TIGR4

50.794

90.387

0.459

  clpC Bacillus subtilis subsp. subtilis str. 168

45.833

89.527

0.41

  clpC Streptococcus mutans UA159

43.226

88.953

0.385

  clpC Streptococcus pneumoniae Rx1

43.678

87.374

0.382

  clpC Streptococcus pneumoniae D39

43.678

87.374

0.382

  clpC Streptococcus pneumoniae TIGR4

43.514

87.374

0.38

  clpC Streptococcus thermophilus LMD-9

42.718

88.666

0.379

  clpC Streptococcus thermophilus LMG 18311

42.557

88.666

0.377

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

42.326

88.809

0.376