Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpP   Type   Regulator
Locus tag   P3F89_RS13765 Genome accession   NZ_CP119875
Coordinates   2652762..2653343 (+) Length   193 a.a.
NCBI ID   WP_000991609.1    Uniprot ID   A0A853X8N5
Organism   Bacillus tropicus strain T36S-23     
Function   degradation of ComK; degradation of DegU (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2639170..2655624 2652762..2653343 within 0


Gene organization within MGE regions


Location: 2639170..2655624
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  P3F89_RS13675 (P3F89_13655) - 2639170..2639946 (+) 777 WP_097996132.1 serine hydrolase -
  P3F89_RS13680 (P3F89_13660) - 2639943..2640314 (+) 372 WP_042514996.1 hypothetical protein -
  P3F89_RS13685 (P3F89_13665) - 2640347..2640763 (+) 417 WP_033691754.1 GNAT family N-acetyltransferase -
  P3F89_RS13690 (P3F89_13670) - 2641040..2641846 (+) 807 WP_000540642.1 GH25 family lysozyme -
  P3F89_RS13695 (P3F89_13675) - 2641911..2642123 (-) 213 WP_097996131.1 DUF3006 domain-containing protein -
  P3F89_RS13700 (P3F89_13680) - 2642126..2643511 (-) 1386 WP_309573635.1 S-layer homology domain-containing protein -
  P3F89_RS13705 (P3F89_13685) - 2643942..2644358 (+) 417 WP_001981059.1 GNAT family N-acetyltransferase -
  P3F89_RS13710 (P3F89_13690) - 2644361..2644714 (+) 354 WP_000648514.1 MmcQ/YjbR family DNA-binding protein -
  P3F89_RS13715 (P3F89_13695) - 2644736..2645443 (+) 708 WP_033691753.1 class I SAM-dependent methyltransferase -
  P3F89_RS13720 (P3F89_13700) - 2645509..2645895 (+) 387 WP_002194813.1 DUF2809 domain-containing protein -
  P3F89_RS13725 (P3F89_13705) - 2645917..2646420 (+) 504 WP_002194812.1 DNA topology modulation protein -
  P3F89_RS13730 (P3F89_13710) - 2646566..2647246 (+) 681 WP_097996129.1 SUMF1/EgtB/PvdO family nonheme iron enzyme -
  P3F89_RS13735 (P3F89_13715) - 2647332..2648525 (+) 1194 WP_042515001.1 macrolide family glycosyltransferase -
  P3F89_RS13740 (P3F89_13720) - 2648883..2650250 (+) 1368 WP_097996128.1 lytic polysaccharide monooxygenase -
  P3F89_RS13745 (P3F89_13725) - 2650381..2650815 (-) 435 Protein_2722 GNAT family N-acetyltransferase -
  P3F89_RS13750 (P3F89_13730) rpiA 2650922..2651584 (-) 663 WP_042515003.1 ribose 5-phosphate isomerase A -
  P3F89_RS13755 (P3F89_13735) - 2651589..2651909 (-) 321 WP_001125390.1 2Fe-2S iron-sulfur cluster-binding protein -
  P3F89_RS13760 (P3F89_13740) - 2652054..2652740 (+) 687 WP_042515005.1 RNA polymerase subunit sigma-70 -
  P3F89_RS13765 (P3F89_13745) clpP 2652762..2653343 (+) 582 WP_000991609.1 ATP-dependent Clp endopeptidase proteolytic subunit ClpP Regulator
  P3F89_RS13770 (P3F89_13750) - 2653390..2654247 (-) 858 WP_042515006.1 glycine betaine ABC transporter substrate-binding protein -
  P3F89_RS13775 (P3F89_13755) - 2654419..2655624 (+) 1206 WP_000370618.1 glycine betaine/L-proline ABC transporter ATP-binding protein -

Sequence


Protein


Download         Length: 193 a.a.        Molecular weight: 21192.49 Da        Isoelectric Point: 5.3394

>NTDB_id=801213 P3F89_RS13765 WP_000991609.1 2652762..2653343(+) (clpP) [Bacillus tropicus strain T36S-23]
MNAIPYVVEQTKLGERSYDIYSRLLKDRIIIIGSEINDQVASSVVAQLLFLEAEDAEKDIFLYINSPGGSTTAGFAILDT
MNLIKPDVQTLCMGFAASFGALLLLSGAKGKRFALPNSEIMIHQPLGGAKGQATEIEITAKRILKLKHDINKMIAEKTGQ
PIERVAHDTERDYFMTAEEAKAYGIVDDVVTKK

Nucleotide


Download         Length: 582 bp        

>NTDB_id=801213 P3F89_RS13765 WP_000991609.1 2652762..2653343(+) (clpP) [Bacillus tropicus strain T36S-23]
ATGAATGCAATTCCATATGTTGTAGAACAAACGAAATTAGGAGAACGTTCCTATGATATATATTCAAGGTTATTAAAAGA
CCGTATTATTATTATCGGTTCAGAAATAAATGATCAAGTAGCGAGTAGTGTCGTAGCTCAATTATTATTTTTAGAAGCAG
AAGATGCAGAGAAAGATATATTTTTATACATTAATAGCCCCGGCGGATCAACGACAGCGGGATTTGCTATATTAGATACA
ATGAATTTAATTAAACCTGATGTGCAAACGCTATGCATGGGCTTTGCAGCATCATTTGGTGCATTGTTATTATTATCAGG
TGCAAAAGGAAAACGATTTGCACTCCCAAATAGTGAAATTATGATTCATCAGCCGCTTGGTGGTGCAAAAGGGCAGGCAA
CAGAAATTGAAATAACAGCAAAAAGGATATTGAAGTTAAAGCATGATATTAATAAAATGATTGCAGAAAAAACGGGGCAA
CCGATTGAAAGAGTAGCACATGATACAGAAAGAGATTATTTTATGACCGCAGAAGAAGCAAAGGCATATGGGATTGTAGA
TGATGTTGTTACGAAAAAATAA

Domains


Predicted by InterproScan.

(14-192)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A853X8N5

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpP Bacillus subtilis subsp. subtilis str. 168

68.063

98.964

0.674

  clpP Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819

67.196

97.927

0.658

  clpP Streptococcus pyogenes JRS4

56.085

97.927

0.549

  clpP Streptococcus pyogenes MGAS315

56.085

97.927

0.549

  clpP Streptococcus pneumoniae R6

55.208

99.482

0.549

  clpP Streptococcus pneumoniae TIGR4

55.208

99.482

0.549

  clpP Streptococcus pneumoniae Rx1

55.208

99.482

0.549

  clpP Streptococcus thermophilus LMG 18311

55.208

99.482

0.549

  clpP Streptococcus thermophilus LMD-9

55.208

99.482

0.549

  clpP Streptococcus pneumoniae D39

55.208

99.482

0.549

  clpP Streptococcus mutans UA159

53.968

97.927

0.528

  clpP Lactococcus lactis subsp. cremoris KW2

51.852

97.927

0.508

  clpP Lactococcus lactis subsp. lactis strain DGCC12653

50.265

97.927

0.492