Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpP   Type   Regulator
Locus tag   MHHHKEFG_RS16130 Genome accession   NZ_CP069125
Coordinates   3106653..3107249 (+) Length   198 a.a.
NCBI ID   WP_034660066.1    Uniprot ID   -
Organism   Bacillus pumilus strain D5     
Function   degradation of ComK; degradation of DegU (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 3042056..3107249 3106653..3107249 within 0


Gene organization within MGE regions


Location: 3042056..3107249
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MHHHKEFG_RS15665 (MHHHKEFG_03192) - 3042056..3042844 (-) 789 WP_066030335.1 helix-turn-helix domain-containing protein -
  MHHHKEFG_RS15670 (MHHHKEFG_03193) - 3042985..3043203 (+) 219 Protein_3028 helix-turn-helix domain-containing protein -
  MHHHKEFG_RS15675 (MHHHKEFG_03194) - 3043261..3043512 (+) 252 WP_066030333.1 hypothetical protein -
  MHHHKEFG_RS15680 (MHHHKEFG_03195) - 3043558..3043902 (+) 345 WP_081311367.1 YolD-like family protein -
  MHHHKEFG_RS15685 (MHHHKEFG_03196) - 3043919..3044497 (-) 579 WP_066030332.1 hypothetical protein -
  MHHHKEFG_RS15690 (MHHHKEFG_03197) - 3044566..3044805 (-) 240 WP_066030331.1 phage holin -
  MHHHKEFG_RS15695 (MHHHKEFG_03198) - 3044802..3045641 (-) 840 WP_066030330.1 N-acetylmuramoyl-L-alanine amidase -
  MHHHKEFG_RS15700 (MHHHKEFG_03199) - 3045667..3045933 (-) 267 WP_066030329.1 hemolysin XhlA family protein -
  MHHHKEFG_RS15705 - 3045964..3046104 (-) 141 WP_074041882.1 XkdX family protein -
  MHHHKEFG_RS15710 (MHHHKEFG_03201) - 3046104..3046391 (-) 288 WP_066030328.1 hypothetical protein -
  MHHHKEFG_RS15715 (MHHHKEFG_03202) - 3046403..3047863 (-) 1461 WP_066030327.1 phage baseplate upper protein -
  MHHHKEFG_RS15720 (MHHHKEFG_03203) - 3047878..3049758 (-) 1881 WP_066030326.1 right-handed parallel beta-helix repeat-containing protein -
  MHHHKEFG_RS15725 (MHHHKEFG_03204) - 3049771..3051162 (-) 1392 WP_066030325.1 phage tail protein -
  MHHHKEFG_RS15730 (MHHHKEFG_03205) - 3051174..3052601 (-) 1428 WP_066030324.1 glucosaminidase domain-containing protein -
  MHHHKEFG_RS15735 (MHHHKEFG_03206) - 3052617..3055403 (-) 2787 WP_081311366.1 phage tail tape measure protein -
  MHHHKEFG_RS15740 (MHHHKEFG_03207) - 3055425..3055577 (-) 153 WP_155758990.1 hypothetical protein -
  MHHHKEFG_RS15745 (MHHHKEFG_03208) - 3055739..3056122 (-) 384 WP_066030323.1 hypothetical protein -
  MHHHKEFG_RS15750 (MHHHKEFG_03209) - 3056178..3056708 (-) 531 WP_066030322.1 hypothetical protein -
  MHHHKEFG_RS15755 (MHHHKEFG_03210) - 3056751..3057125 (-) 375 WP_066030321.1 minor capsid protein -
  MHHHKEFG_RS15760 (MHHHKEFG_03211) - 3057125..3057535 (-) 411 WP_066030320.1 hypothetical protein -
  MHHHKEFG_RS15765 (MHHHKEFG_03212) - 3057532..3057870 (-) 339 WP_066030319.1 hypothetical protein -
  MHHHKEFG_RS15770 (MHHHKEFG_03213) - 3057871..3058257 (-) 387 WP_066030318.1 hypothetical protein -
  MHHHKEFG_RS15775 (MHHHKEFG_03214) - 3058273..3058482 (-) 210 WP_066030317.1 hypothetical protein -
  MHHHKEFG_RS15780 (MHHHKEFG_03215) - 3058482..3058766 (-) 285 WP_066030316.1 hypothetical protein -
  MHHHKEFG_RS15785 (MHHHKEFG_03216) - 3058816..3059919 (-) 1104 WP_066030315.1 DUF5309 family protein -
  MHHHKEFG_RS15790 (MHHHKEFG_03217) - 3059931..3060617 (-) 687 WP_066030314.1 Clp protease ClpB -
  MHHHKEFG_RS15795 (MHHHKEFG_03218) - 3060681..3061511 (-) 831 WP_066030313.1 phage minor capsid protein -
  MHHHKEFG_RS15800 (MHHHKEFG_03219) - 3061511..3063115 (-) 1605 WP_066030312.1 phage portal protein -
  MHHHKEFG_RS15805 (MHHHKEFG_03220) - 3063119..3063544 (-) 426 WP_066030311.1 phBC6A51 family helix-turn-helix protein -
  MHHHKEFG_RS15810 (MHHHKEFG_03221) terL 3063562..3065328 (-) 1767 WP_066030310.1 phage terminase large subunit -
  MHHHKEFG_RS15815 (MHHHKEFG_03222) - 3065430..3065975 (+) 546 WP_066030309.1 tyrosine-type recombinase/integrase -
  MHHHKEFG_RS15820 (MHHHKEFG_03223) - 3065990..3066235 (-) 246 WP_066030308.1 hypothetical protein -
  MHHHKEFG_RS15825 (MHHHKEFG_03224) - 3066328..3066549 (-) 222 WP_066030307.1 hypothetical protein -
  MHHHKEFG_RS15830 (MHHHKEFG_03225) - 3066665..3066952 (-) 288 WP_066030306.1 hypothetical protein -
  MHHHKEFG_RS15835 (MHHHKEFG_03226) - 3066949..3067236 (-) 288 WP_066030305.1 hypothetical protein -
  MHHHKEFG_RS15840 (MHHHKEFG_03227) - 3067238..3067519 (-) 282 WP_066030304.1 hypothetical protein -
  MHHHKEFG_RS15845 - 3069036..3069176 (+) 141 WP_186320273.1 hypothetical protein -
  MHHHKEFG_RS15850 (MHHHKEFG_03229) - 3069581..3069766 (+) 186 WP_066030302.1 hypothetical protein -
  MHHHKEFG_RS15855 (MHHHKEFG_03230) - 3069766..3070152 (+) 387 WP_066030301.1 hypothetical protein -
  MHHHKEFG_RS15860 - 3070166..3070297 (+) 132 WP_264188711.1 hypothetical protein -
  MHHHKEFG_RS15865 - 3070341..3070517 (+) 177 WP_373861341.1 hypothetical protein -
  MHHHKEFG_RS15870 (MHHHKEFG_03232) - 3070694..3071155 (+) 462 WP_066030300.1 hypothetical protein -
  MHHHKEFG_RS15875 (MHHHKEFG_03233) - 3071129..3071938 (-) 810 WP_066030299.1 hypothetical protein -
  MHHHKEFG_RS15880 (MHHHKEFG_03234) - 3071980..3072480 (-) 501 WP_066030298.1 hypothetical protein -
  MHHHKEFG_RS15885 (MHHHKEFG_03235) - 3072623..3073009 (-) 387 WP_066030297.1 sigma factor-like helix-turn-helix DNA-binding protein -
  MHHHKEFG_RS15890 (MHHHKEFG_03236) - 3073010..3074233 (-) 1224 WP_066030296.1 class I SAM-dependent DNA methyltransferase -
  MHHHKEFG_RS15895 (MHHHKEFG_03238) - 3074664..3075710 (-) 1047 WP_066030294.1 DNA cytosine methyltransferase -
  MHHHKEFG_RS15900 (MHHHKEFG_03239) - 3075707..3076261 (-) 555 WP_066030293.1 hypothetical protein -
  MHHHKEFG_RS15905 (MHHHKEFG_03240) - 3076258..3077112 (-) 855 WP_066030292.1 hypothetical protein -
  MHHHKEFG_RS15910 (MHHHKEFG_03241) - 3077112..3077393 (-) 282 WP_081311364.1 hypothetical protein -
  MHHHKEFG_RS15915 (MHHHKEFG_03242) - 3077406..3077915 (-) 510 WP_066030291.1 3D domain-containing protein -
  MHHHKEFG_RS15920 (MHHHKEFG_03243) thyX 3077915..3078733 (-) 819 WP_081311363.1 FAD-dependent thymidylate synthase -
  MHHHKEFG_RS15925 (MHHHKEFG_03244) - 3078734..3078946 (-) 213 WP_066030290.1 hypothetical protein -
  MHHHKEFG_RS15930 - 3078950..3079126 (-) 177 WP_196769770.1 hypothetical protein -
  MHHHKEFG_RS15935 (MHHHKEFG_03247) - 3079554..3079748 (-) 195 WP_066030288.1 hypothetical protein -
  MHHHKEFG_RS15940 (MHHHKEFG_03248) - 3079749..3079919 (-) 171 WP_155758989.1 hypothetical protein -
  MHHHKEFG_RS15945 (MHHHKEFG_03249) nrdF 3080043..3081017 (-) 975 WP_066030287.1 class 1b ribonucleoside-diphosphate reductase subunit beta -
  MHHHKEFG_RS15950 (MHHHKEFG_03250) nrdE 3081038..3083143 (-) 2106 WP_081311362.1 class 1b ribonucleoside-diphosphate reductase subunit alpha -
  MHHHKEFG_RS15955 (MHHHKEFG_03251) nrdI 3083140..3083496 (-) 357 WP_066030286.1 class Ib ribonucleoside-diphosphate reductase assembly flavoprotein NrdI -
  MHHHKEFG_RS15960 (MHHHKEFG_03253) - 3083652..3083813 (-) 162 WP_155758988.1 hypothetical protein -
  MHHHKEFG_RS15965 (MHHHKEFG_03255) - 3084016..3084372 (-) 357 WP_066030284.1 hypothetical protein -
  MHHHKEFG_RS15970 (MHHHKEFG_03256) - 3084365..3084628 (-) 264 WP_066030283.1 hypothetical protein -
  MHHHKEFG_RS15975 (MHHHKEFG_03257) - 3084634..3085152 (-) 519 WP_066030282.1 hypothetical protein -
  MHHHKEFG_RS15980 (MHHHKEFG_03258) - 3085153..3085713 (-) 561 WP_066030281.1 hypothetical protein -
  MHHHKEFG_RS15985 (MHHHKEFG_03259) - 3085710..3085889 (-) 180 WP_066030280.1 hypothetical protein -
  MHHHKEFG_RS15990 (MHHHKEFG_03261) - 3086143..3087210 (-) 1068 WP_066030278.1 hypothetical protein -
  MHHHKEFG_RS15995 (MHHHKEFG_03262) - 3087211..3087933 (-) 723 WP_066030277.1 PolC-type DNA polymerase III -
  MHHHKEFG_RS16000 (MHHHKEFG_03263) - 3087933..3090173 (-) 2241 WP_066030276.1 DNA polymerase -
  MHHHKEFG_RS16005 (MHHHKEFG_03264) - 3090213..3090374 (-) 162 WP_155758987.1 hypothetical protein -
  MHHHKEFG_RS16010 (MHHHKEFG_03265) - 3090528..3091271 (-) 744 WP_066030275.1 hypothetical protein -
  MHHHKEFG_RS16015 (MHHHKEFG_03266) - 3091312..3091746 (-) 435 WP_066030274.1 hypothetical protein -
  MHHHKEFG_RS16020 (MHHHKEFG_03267) - 3091747..3092001 (-) 255 WP_066030273.1 hypothetical protein -
  MHHHKEFG_RS16025 (MHHHKEFG_03268) - 3092015..3092794 (-) 780 WP_066030272.1 hypothetical protein -
  MHHHKEFG_RS16030 (MHHHKEFG_03269) - 3093044..3093568 (-) 525 WP_066030271.1 hypothetical protein -
  MHHHKEFG_RS16035 (MHHHKEFG_03271) - 3093843..3094280 (-) 438 WP_155758986.1 hypothetical protein -
  MHHHKEFG_RS16040 (MHHHKEFG_03273) - 3094744..3095910 (-) 1167 WP_066030269.1 AimR family lysis-lysogeny pheromone receptor -
  MHHHKEFG_RS16045 - 3096237..3096527 (+) 291 WP_081311361.1 helix-turn-helix domain-containing protein -
  MHHHKEFG_RS16050 (MHHHKEFG_03275) - 3096524..3097525 (-) 1002 WP_066030267.1 toprim domain-containing protein -
  MHHHKEFG_RS16055 (MHHHKEFG_03276) - 3097637..3099028 (-) 1392 WP_066031718.1 DnaB-like helicase C-terminal domain-containing protein -
  MHHHKEFG_RS16060 (MHHHKEFG_03277) - 3099025..3099657 (-) 633 WP_081311478.1 hypothetical protein -
  MHHHKEFG_RS16065 (MHHHKEFG_03278) - 3099658..3100419 (-) 762 WP_066031717.1 ATP-binding protein -
  MHHHKEFG_RS16070 (MHHHKEFG_03279) - 3100409..3100606 (-) 198 WP_066031716.1 hypothetical protein -
  MHHHKEFG_RS16075 (MHHHKEFG_03280) - 3100607..3101416 (-) 810 WP_066031715.1 hypothetical protein -
  MHHHKEFG_RS16080 (MHHHKEFG_03281) - 3101418..3101786 (-) 369 WP_066031714.1 hypothetical protein -
  MHHHKEFG_RS16085 (MHHHKEFG_03282) - 3101770..3102048 (-) 279 WP_066031713.1 hypothetical protein -
  MHHHKEFG_RS16090 (MHHHKEFG_03283) - 3102048..3102392 (-) 345 WP_066031712.1 hypothetical protein -
  MHHHKEFG_RS16095 (MHHHKEFG_03284) - 3102394..3102672 (-) 279 WP_066031711.1 hypothetical protein -
  MHHHKEFG_RS16100 (MHHHKEFG_03285) - 3102696..3103031 (-) 336 WP_066031710.1 hypothetical protein -
  MHHHKEFG_RS16105 (MHHHKEFG_03286) - 3103034..3103522 (-) 489 WP_066031709.1 hypothetical protein -
  MHHHKEFG_RS16110 (MHHHKEFG_03287) - 3103725..3104102 (-) 378 WP_066031708.1 helix-turn-helix domain-containing protein -
  MHHHKEFG_RS16115 (MHHHKEFG_03288) - 3104099..3105025 (-) 927 WP_066031707.1 hypothetical protein -
  MHHHKEFG_RS16120 (MHHHKEFG_03289) - 3105155..3106141 (-) 987 WP_081311477.1 tyrosine-type recombinase/integrase -
  MHHHKEFG_RS16130 (MHHHKEFG_03291) clpP 3106653..3107249 (+) 597 WP_034660066.1 ATP-dependent Clp endopeptidase proteolytic subunit ClpP Regulator

Sequence


Protein


Download         Length: 198 a.a.        Molecular weight: 21832.99 Da        Isoelectric Point: 4.6141

>NTDB_id=532407 MHHHKEFG_RS16130 WP_034660066.1 3106653..3107249(+) (clpP) [Bacillus pumilus strain D5]
MNLIPTVIEQTNRGERAYDIYSRLLKDRIIMLGSAIDDNVANSIVSQLLFLEAEDPEKDISIYINSPGGSITAGMAIYDT
MQFIKPKVSTICIGMAASMGAFLLAAGEKGKRYALPNSEVMIHQPLGGAQGQATEIEIAAKRILSLRDKLNQVLAERTGQ
PIEVIERDTDRDNFKTAEEALEYGLIDKVLTRNSEEQK

Nucleotide


Download         Length: 597 bp        

>NTDB_id=532407 MHHHKEFG_RS16130 WP_034660066.1 3106653..3107249(+) (clpP) [Bacillus pumilus strain D5]
ATGAATTTAATACCTACAGTCATTGAGCAAACAAATCGTGGGGAAAGAGCTTACGACATTTATTCTCGTCTTTTAAAAGA
CCGTATTATCATGCTTGGTTCTGCGATCGATGACAATGTTGCCAACTCCATCGTGTCACAGCTGCTTTTCTTAGAAGCTG
AAGATCCAGAAAAAGATATTTCTATCTACATTAACAGCCCTGGCGGTTCGATCACAGCTGGTATGGCCATTTACGATACG
ATGCAATTTATTAAACCAAAGGTATCAACCATTTGTATTGGTATGGCTGCATCTATGGGTGCGTTCCTGCTTGCTGCTGG
TGAAAAAGGTAAGCGTTATGCCCTTCCAAACAGTGAAGTCATGATTCACCAACCACTAGGTGGTGCCCAAGGTCAAGCAA
CAGAAATTGAAATTGCGGCAAAACGAATCCTTTCTTTACGCGATAAACTGAACCAAGTACTTGCTGAACGTACTGGTCAG
CCAATTGAAGTGATTGAGCGCGATACAGATCGTGACAACTTCAAAACAGCGGAAGAAGCACTTGAATACGGACTCATTGA
CAAAGTCTTGACCCGTAATTCAGAAGAACAAAAATAA

Domains


Predicted by InterproScan.

(13-192)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpP Bacillus subtilis subsp. subtilis str. 168

92.893

99.495

0.924

  clpP Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819

68.063

96.465

0.657

  clpP Streptococcus thermophilus LMG 18311

59.067

97.475

0.576

  clpP Streptococcus thermophilus LMD-9

59.067

97.475

0.576

  clpP Lactococcus lactis subsp. cremoris KW2

57.216

97.98

0.561

  clpP Streptococcus pneumoniae D39

56.701

97.98

0.556

  clpP Streptococcus pneumoniae Rx1

56.701

97.98

0.556

  clpP Streptococcus pneumoniae R6

56.701

97.98

0.556

  clpP Streptococcus pneumoniae TIGR4

56.701

97.98

0.556

  clpP Lactococcus lactis subsp. lactis strain DGCC12653

55.155

97.98

0.54

  clpP Streptococcus pyogenes JRS4

55.44

97.475

0.54

  clpP Streptococcus pyogenes MGAS315

55.44

97.475

0.54

  clpP Streptococcus mutans UA159

54.639

97.98

0.535