Detailed information    

In silico: bioinformatically predicted

Overview


Name: clpP
Type: Regulator
Locus tag: E0F39_RS03835
Genome accession: NZ_LR216060
Coordinates: 707273..707863 (+)
Length: 196 a.a.
NCBI ID: WP_000613477.1
UniProt ID: A0A064BX72
Organism: Streptococcus pneumoniae strain GPSC47 substr. ST315
Function: degradation of ComX; degradation of ComW (predicted from homology)
Competence regulation
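The coordinates and the protein length listed in this overview can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function name `span_length` is hypothetical.

```python
def span_length(coords: str) -> int:
    """Length in bp of a 1-based, inclusive 'start..end' interval."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


gene_bp = span_length("707273..707863")  # gene coordinates from the overview
codons = gene_bp // 3                    # number of complete codons
protein_aa = codons - 1                  # assumption: the last codon is a stop codon

print(gene_bp, protein_aa)
```

Under these assumptions the interval spans 591 bp, which corresponds to 197 codons and a 196 a.a. product.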

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 705338..715899 707273..707863 within 0
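The "within" label and the distance of 0 bp follow directly from comparing the two coordinate ranges. Below is a minimal sketch of one way such a comparison could be made; it is not necessarily how VRprofile2 or this database computes the values. Only the label "within" appears in the record; the other labels and the distance convention are assumptions for illustration.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples with 1-based inclusive coordinates.
    Returns (label, distance_bp). Labels other than "within" are hypothetical.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0


# Coordinates taken from the summary table above.
print(relative_position((707273, 707863), (705338, 715899)))
```

For clpP, both gene boundaries fall inside the prophage region, so the gene is classified as within it at distance 0.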


Gene organization within MGE regions


Location: 705338..715899
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E0F39_RS03820 (SAMEA3713867_00714) - 705338..705895 (+) 558 WP_000004139.1 TetR/AcrR family transcriptional regulator -
  E0F39_RS03825 (SAMEA3713867_00715) - 705914..706381 (+) 468 WP_000136976.1 deoxycytidylate deaminase -
  E0F39_RS03830 (SAMEA3713867_00716) upp 706470..707099 (+) 630 WP_001812010.1 uracil phosphoribosyltransferase -
  E0F39_RS03835 (SAMEA3713867_00717) clpP 707273..707863 (+) 591 WP_000613477.1 ATP-dependent Clp protease proteolytic subunit ClpP Regulator
  E0F39_RS03845 (SAMEA3713867_00718) - 707942..708190 (+) 249 WP_000462126.1 YlbG family protein -
  E0F39_RS03850 (SAMEA3713867_00719) - 708291..709451 (+) 1161 WP_000726149.1 ABC transporter substrate-binding protein -
  E0F39_RS03855 (SAMEA3713867_00720) - 709719..710588 (+) 870 WP_001865036.1 branched-chain amino acid ABC transporter permease -
  E0F39_RS03860 (SAMEA3713867_00721) - 710592..711548 (+) 957 WP_000662293.1 branched-chain amino acid ABC transporter permease -
  E0F39_RS03865 (SAMEA3713867_00722) - 711548..712312 (+) 765 WP_001186003.1 ABC transporter ATP-binding protein -
  E0F39_RS03870 (SAMEA3713867_00723) - 712312..713022 (+) 711 WP_000062219.1 ABC transporter ATP-binding protein -
  E0F39_RS03875 (SAMEA3713867_00724) - 713330..713986 (+) 657 WP_000268677.1 CBS domain-containing protein -
  E0F39_RS03880 (SAMEA3713867_00725) prfB 714094..715189 (+) 1096 WP_115264891.1 peptide chain release factor 2 -
  E0F39_RS03885 (SAMEA3713867_00726) ftsE 715207..715899 (+) 693 WP_000022265.1 cell division ATP-binding protein FtsE -

Sequence


Protein


Length: 196 a.a.        Molecular weight: 21358.36 Da        Isoelectric point: 4.4910

>NTDB_id=1126177 E0F39_RS03835 WP_000613477.1 707273..707863(+) (clpP) [Streptococcus pneumoniae strain GPSC47 substr. ST315]
MIPVVIEQTSRGERSYDIYSRLLKDRIIMLTGPVEDNMANSVIAQLLFLDAQDSTKDIYLYVNTPGGSVSAGLAIVDTMN
FIKADVQTIVMGMAASMGTVIASSGAKGKRFMLPNAEYMIHQPMGGTGGGTQQTDMAIAAEHLLKTRNTLEKILAENSGQ
SMEKVHADAERDNWMSAQETLEYGFIDEIMANNSLN

Nucleotide


Length: 591 bp

>NTDB_id=1126177 E0F39_RS03835 WP_000613477.1 707273..707863(+) (clpP) [Streptococcus pneumoniae strain GPSC47 substr. ST315]
ATGATTCCTGTAGTTATTGAACAAACAAGCCGTGGAGAACGTTCTTACGATATTTACTCACGTCTTCTCAAAGACCGCAT
CATTATGCTGACAGGTCCGGTTGAAGACAATATGGCTAACTCTGTTATTGCCCAATTGCTTTTCTTGGATGCCCAAGATA
GTACAAAAGATATTTACCTTTATGTCAATACACCAGGTGGTTCTGTTTCAGCTGGTTTGGCAATCGTAGATACCATGAAC
TTTATCAAGGCAGATGTCCAAACCATTGTTATGGGAATGGCTGCATCTATGGGAACAGTCATCGCATCAAGTGGAGCAAA
AGGCAAACGTTTCATGCTTCCAAATGCTGAATACATGATTCACCAACCAATGGGCGGTACAGGTGGTGGTACCCAACAAA
CTGATATGGCTATCGCTGCAGAACACTTGCTCAAAACTCGTAATACCTTGGAAAAAATCTTGGCTGAAAATTCAGGTCAG
TCAATGGAAAAAGTCCATGCAGATGCAGAACGTGATAACTGGATGAGCGCCCAGGAAACACTTGAATATGGCTTTATTGA
TGAAATTATGGCCAACAATTCATTGAACTAA
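The nucleotide and protein records can be checked against each other by translating the coding sequence. This is a minimal sketch, assuming the standard genetic code and translation in frame 1 up to the first stop codon; the names `CODON_TABLE`, `CDS`, `PROTEIN`, and `translate` are hypothetical, and the sequences are copied from the two FASTA records above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G and the
# third position varying fastest ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

CDS = (
    "ATGATTCCTGTAGTTATTGAACAAACAAGCCGTGGAGAACGTTCTTACGATATTTACTCACGTCTTCTCAAAGACCGCAT"
    "CATTATGCTGACAGGTCCGGTTGAAGACAATATGGCTAACTCTGTTATTGCCCAATTGCTTTTCTTGGATGCCCAAGATA"
    "GTACAAAAGATATTTACCTTTATGTCAATACACCAGGTGGTTCTGTTTCAGCTGGTTTGGCAATCGTAGATACCATGAAC"
    "TTTATCAAGGCAGATGTCCAAACCATTGTTATGGGAATGGCTGCATCTATGGGAACAGTCATCGCATCAAGTGGAGCAAA"
    "AGGCAAACGTTTCATGCTTCCAAATGCTGAATACATGATTCACCAACCAATGGGCGGTACAGGTGGTGGTACCCAACAAA"
    "CTGATATGGCTATCGCTGCAGAACACTTGCTCAAAACTCGTAATACCTTGGAAAAAATCTTGGCTGAAAATTCAGGTCAG"
    "TCAATGGAAAAAGTCCATGCAGATGCAGAACGTGATAACTGGATGAGCGCCCAGGAAACACTTGAATATGGCTTTATTGA"
    "TGAAATTATGGCCAACAATTCATTGAACTAA"
)

PROTEIN = (
    "MIPVVIEQTSRGERSYDIYSRLLKDRIIMLTGPVEDNMANSVIAQLLFLDAQDSTKDIYLYVNTPGGSVSAGLAIVDTMN"
    "FIKADVQTIVMGMAASMGTVIASSGAKGKRFMLPNAEYMIHQPMGGTGGGTQQTDMAIAAEHLLKTRNTLEKILAENSGQ"
    "SMEKVHADAERDNWMSAQETLEYGFIDEIMANNSLN"
)


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(len(CDS), len(translate(CDS)), translate(CDS) == PROTEIN)
```

A lookup table built from the compact 64-letter string keeps the example self-contained; a library such as Biopython could be used instead where it is available.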

Domains


Predicted by InterProScan.

Predicted domain: residues 11-192


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A064BX72

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpP Streptococcus pneumoniae Rx1 100 100 1
  clpP Streptococcus pneumoniae D39 100 100 1
  clpP Streptococcus pneumoniae R6 100 100 1
  clpP Streptococcus pneumoniae TIGR4 100 100 1
  clpP Streptococcus thermophilus LMG 18311 92.821 99.49 0.923
  clpP Streptococcus thermophilus LMD-9 92.821 99.49 0.923
  clpP Streptococcus pyogenes JRS4 90.769 99.49 0.903
  clpP Streptococcus pyogenes MGAS315 90.769 99.49 0.903
  clpP Streptococcus mutans UA159 85.714 100 0.857
  clpP Lactococcus lactis subsp. lactis strain DGCC12653 85.128 99.49 0.847
  clpP Lactococcus lactis subsp. cremoris KW2 84.615 99.49 0.842
  clpP Bacillus subtilis subsp. subtilis str. 168 58.854 97.959 0.577
  clpP Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819 58.031 98.469 0.571
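The record does not define the Ha-value, but every value listed above is consistent with the product of the identity and coverage fractions, rounded to three decimals. The sketch below checks that relationship on a few rows; the formula is an assumption inferred from the numbers rather than a documented definition, and the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: (identity / 100) * (coverage / 100), 3 decimals."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


# (organism, identities %, coverage %, listed Ha-value) copied from the table.
rows = [
    ("Streptococcus pneumoniae Rx1", 100, 100, 1),
    ("Streptococcus thermophilus LMG 18311", 92.821, 99.49, 0.923),
    ("Streptococcus pyogenes JRS4", 90.769, 99.49, 0.903),
    ("Streptococcus mutans UA159", 85.714, 100, 0.857),
    ("Bacillus subtilis subsp. subtilis str. 168", 58.854, 97.959, 0.577),
    ("Campylobacter jejuni subsp. jejuni NCTC 11168", 58.031, 98.469, 0.571),
]

for organism, identity, coverage, listed in rows:
    print(organism, ha_value(identity, coverage), listed)
```

If the assumption holds, a hit needs both high identity and near-complete coverage to score close to 1.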


Multiple sequence alignment