Detailed information    

In silico (bioinformatically predicted)

Overview


Name: clpE
Type: Regulator
Locus tag: EQH33_RS06500
Genome accession: NZ_CP035247
Coordinates: 1305488..1307746 (+)
Length: 752 a.a.
NCBI ID: WP_000882497.1
UniProt ID: -
Organism: Streptococcus pneumoniae strain TVO_1901940
Function: degradation of ComX (predicted from homology)
Competence regulation
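
As a quick consistency check on the overview fields, the gene coordinates can be related to the nucleotide and protein lengths. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends with a single stop codon not counted in the protein length; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Number of residues encoded by a CDS that includes one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the terminal stop codon


cds_bp = span_length(1305488, 1307746)
print(cds_bp, protein_length(cds_bp))  # 2259 752
```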

Related MGE


Note: This gene co-localizes in the genome with putative mobile genetic elements (MGEs) predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IScluster/Tn 1308346..1312886 1305488..1307746 flank 600
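
The reported distance of 600 bp equals the difference between the MGE start coordinate and the gene end coordinate. The sketch below reproduces that arithmetic; it assumes this simple difference is how the distance is defined, and `flank_distance` is a hypothetical helper, not a VRprofile2 function.

```python
def flank_distance(gene: tuple[int, int], mge: tuple[int, int]) -> int:
    """Distance in bp between a gene and an MGE; 0 if the intervals overlap.

    Assumption: distance is the plain coordinate difference between the
    nearest ends of the two intervals.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if g_end < m_start:      # gene lies upstream of the MGE
        return m_start - g_end
    if m_end < g_start:      # gene lies downstream of the MGE
        return g_start - m_end
    return 0                 # gene overlaps or sits inside the MGE


print(flank_distance((1305488, 1307746), (1308346, 1312886)))  # 600
```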


Gene organization within MGE regions


Location: 1305488..1312886
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH33_RS06500 (EQH33_06920) clpE 1305488..1307746 (+) 2259 WP_000882497.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  EQH33_RS06505 (EQH33_06925) - 1307953..1308675 (-) 723 Protein_1301 IS630 transposase-related protein -
  EQH33_RS06510 (EQH33_06930) - 1308757..1309208 (+) 452 Protein_1302 NUDIX hydrolase -
  EQH33_RS06515 (EQH33_06940) - 1309373..1309672 (+) 300 WP_000767193.1 DUF1827 family protein -
  EQH33_RS06520 (EQH33_06945) - 1309813..1311161 (+) 1349 Protein_1304 IS3 family transposase -
  EQH33_RS06525 (EQH33_06950) tnpB 1311376..1311699 (+) 324 WP_000586643.1 IS66 family insertion sequence element accessory protein TnpB -
  EQH33_RS06530 (EQH33_06955) - 1311840..1312886 (-) 1047 Protein_1306 IS630 family transposase -

Sequence


Protein


Length: 752 a.a.        Molecular weight: 83840.26 Da        Isoelectric point: 5.3713

>NTDB_id=337515 EQH33_RS06500 WP_000882497.1 1305488..1307746(+) (clpE) [Streptococcus pneumoniae strain TVO_1901940]
MLCQNCKINDSTIHLYTNLNGKQKQIDLCQNCYKIIKTDPNNGLFKGMTDLNNRDFDPFGDFFNDLNNFRPSSNTPPIPP
TQSGGGYGGNGGYGSQNRGSAQTPPPSQEKGLLEEFGINVTEIARRGDIDPVIGRDDEIIRVIEILNRRTKNNPVLIGEP
GVGKTAVVEGLAQKIVDGDVPHKLQGKQVIRLDVVSLVQGTGIRGQFEERMQKLMEEIRKREDIILFIDEIHEIVGAGSA
SDGNMDAGNILKPALARGELQLVGATTLNEYRIIEKDAALERRMQPVKVDEPTVDETITILKGIQKKYEDYHHVQYTDAA
IEAAATLSNRYIQDRFLPDKAIDLLDEAGSKMNLTLNFVDPKVIDQRLIEAENLKSQATREEDFEKAAYFRDQIAKYKEM
QKKKITDQDTPIISEKTIEHIIEQKTNIPVGDLKEKEQSQLIHLAEDLKSHVIGQDDAVDKIAKAIRRNRVGLGTPNRPI
GSFLFVGPTGVGKTELSKQLAIELFGSADSMIRFDMSEYMEKHSVAKLVGAPPGYVGYDEAGQLTEKVRHNPYSLILLDE
VEKAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDAIIIMTSNAGTGKTEASVGFGAAREGRTNSVLGELGNFFSPEFMNRF
DGIIEFKALSKDNLLQIVELMLADVNKRLSSNNIRLDVTDKVKEKLVDLGYDPKMGARPLRRTIQDYIEDTITDYYLENP
SEKDLKAVMTSKGNIQIKSAKKTEVKSSEKEK

Nucleotide


Length: 2259 bp

>NTDB_id=337515 EQH33_RS06500 WP_000882497.1 1305488..1307746(+) (clpE) [Streptococcus pneumoniae strain TVO_1901940]
ATGCTTTGTCAAAACTGTAAAATTAACGACTCAACAATTCATCTTTACACCAATCTCAATGGAAAACAAAAACAAATTGA
CCTCTGTCAAAACTGCTATAAGATTATCAAAACAGATCCTAACAATGGTCTCTTCAAAGGTATGACGGATCTGAACAATC
GTGACTTCGATCCCTTTGGTGATTTCTTCAATGATCTAAACAATTTCAGACCTTCTAGCAATACTCCTCCTATTCCCCCA
ACCCAATCAGGTGGAGGTTACGGTGGAAACGGCGGTTATGGTTCCCAAAATCGTGGATCTGCTCAAACTCCGCCACCTAG
CCAAGAAAAAGGCCTGCTGGAAGAATTTGGTATTAATGTAACTGAAATTGCCCGTCGTGGAGACATTGACCCCGTTATTG
GGCGCGACGATGAGATTATCCGTGTCATCGAGATTCTCAATCGTAGAACCAAGAATAATCCTGTCCTTATCGGTGAACCT
GGTGTCGGAAAAACGGCCGTTGTCGAAGGTCTAGCTCAGAAAATTGTCGATGGCGATGTGCCACATAAACTCCAAGGTAA
ACAAGTCATCCGTCTGGATGTGGTTAGCTTAGTTCAAGGAACGGGGATTAGAGGACAATTTGAAGAACGCATGCAAAAAC
TCATGGAAGAAATTCGCAAACGTGAAGACATCATCCTCTTTATCGATGAAATCCATGAAATTGTTGGTGCTGGTTCTGCG
AGTGATGGTAATATGGACGCAGGAAATATCCTCAAGCCAGCCCTTGCTCGTGGAGAACTGCAACTAGTCGGTGCTACTAC
CCTCAATGAATACCGTATCATTGAAAAGGATGCTGCCCTCGAGCGTCGTATGCAGCCTGTTAAAGTCGATGAACCAACGG
TGGATGAAACAATCACTATTCTCAAAGGGATTCAAAAGAAATACGAAGATTACCACCACGTTCAATATACCGATGCTGCG
ATTGAAGCAGCTGCAACTCTTTCCAATCGCTACATCCAAGATCGCTTCTTGCCTGACAAGGCCATTGACCTCCTAGATGA
AGCTGGTTCTAAGATGAACTTGACCTTGAATTTTGTGGATCCTAAAGTAATTGATCAGCGCTTGATTGAGGCTGAAAATC
TCAAGTCTCAAGCTACACGAGAAGAAGATTTTGAGAAGGCGGCCTACTTCCGCGACCAGATTGCCAAGTATAAGGAAATG
CAAAAGAAAAAGATCACAGACCAGGATACTCCTATCATCAGCGAGAAAACTATTGAGCACATTATCGAGCAGAAAACCAA
TATCCCTGTTGGTGATTTGAAAGAGAAAGAACAATCTCAACTCATCCATCTAGCTGAAGATCTCAAGTCTCATGTTATTG
GCCAAGATGATGCAGTCGATAAGATTGCCAAGGCTATTCGCCGTAATCGTGTCGGACTTGGTACCCCTAACCGCCCAATC
GGAAGCTTCCTCTTCGTTGGGCCAACTGGTGTCGGTAAGACAGAACTTTCCAAACAACTGGCTATCGAACTTTTTGGTTC
TGCTGATAGTATGATTCGCTTTGATATGAGTGAATACATGGAAAAACATAGTGTGGCTAAGTTGGTCGGCGCTCCTCCAG
GTTATGTTGGCTATGATGAGGCTGGTCAATTAACTGAAAAAGTTCGCCACAATCCATATTCTCTCATCCTTCTCGATGAA
GTGGAAAAAGCTCACCCAGATGTTATGCACATGTTTCTTCAAGTCTTGGACGATGGTCGTTTGACAGACGGGCAAGGACG
CACCGTTAGCTTCAAGGATGCCATCATTATCATGACCTCAAATGCAGGTACAGGAAAGACCGAAGCTAGCGTTGGATTTG
GTGCTGCTAGAGAAGGACGTACCAATTCTGTCCTCGGTGAACTCGGTAACTTCTTTAGCCCAGAGTTTATGAACCGTTTT
GATGGCATTATCGAATTTAAGGCTCTCAGCAAGGATAACCTCCTTCAGATTGTCGAGCTCATGCTAGCAGATGTTAACAA
GCGCCTCTCTAGCAACAACATTCGTTTGGATGTAACTGATAAGGTCAAGGAAAAGTTGGTTGACCTAGGTTATGATCCAA
AAATGGGAGCACGCCCACTTCGTCGGACTATTCAAGACTATATTGAGGACACAATCACTGACTACTACCTTGAAAATCCA
AGCGAAAAAGATCTCAAAGCAGTTATGACTAGCAAGGGAAACATTCAGATTAAATCTGCCAAAAAAACTGAAGTTAAAAG
TTCTGAAAAAGAAAAATAA
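
The nucleotide and protein records can be cross-checked by translating the coding sequence. The following is a minimal sketch that assumes the standard genetic code, written here in compact form; `translate` is an illustrative helper, and only the first 24 and last 15 nucleotides of the sequence above are used to keep the example short.

```python
from itertools import product

# Standard genetic code in compact form: codons enumerated over T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCTTTGTCAAAACTGTAAAATT"))  # MLCQNCKI
print(translate("GAAAAAGAAAAATAA"))           # EKEK
```

Both fragments agree with the start (MLCQNCKI...) and end (...EKEK) of the protein sequence listed above.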


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism  Identities (%)  Coverage (%)  Ha-value
  clpE  Streptococcus pneumoniae Rx1  99.734  100  0.997
  clpE  Streptococcus pneumoniae D39  99.734  100  0.997
  clpE  Streptococcus pneumoniae R6  99.734  100  0.997
  clpE  Streptococcus pneumoniae TIGR4  99.601  100  0.996
  clpE  Streptococcus mutans UA159  82.865  99.335  0.823
  clpC  Lactococcus lactis subsp. cremoris KW2  76.839  97.606  0.75
  clpC  Bacillus subtilis subsp. subtilis str. 168  51.108  90.027  0.46
  clpC  Lactococcus lactis subsp. lactis strain DGCC12653  47.687  83.378  0.398
  clpC  Streptococcus pneumoniae TIGR4  46.91  83.91  0.394
  clpC  Streptococcus pneumoniae Rx1  46.91  83.91  0.394
  clpC  Streptococcus pneumoniae D39  46.91  83.91  0.394
  clpC  Streptococcus thermophilus LMD-9  46.795  82.979  0.388
  clpC  Streptococcus mutans UA159  46.571  83.378  0.388
  clpC  Streptococcus thermophilus LMG 18311  46.4  83.112  0.386
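
The page does not define the Ha-value, but the listed values are numerically consistent with the product of fractional identity and fractional coverage, rounded to three decimals. The sketch below checks that observation against the distinct rows of the table; treat the formula as an inference from these rows, not as the database's stated definition.

```python
# (identity %, coverage %, reported Ha-value) for the distinct rows above
ROWS = [
    (99.734, 100.0, 0.997),
    (99.601, 100.0, 0.996),
    (82.865, 99.335, 0.823),
    (76.839, 97.606, 0.75),
    (51.108, 90.027, 0.46),
    (47.687, 83.378, 0.398),
    (46.91, 83.91, 0.394),
    (46.795, 82.979, 0.388),
    (46.571, 83.378, 0.388),
    (46.4, 83.112, 0.386),
]


def identity_coverage_product(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relation: (identity / 100) * (coverage / 100), 3 decimals."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


mismatches = [row for row in ROWS
              if abs(identity_coverage_product(row[0], row[1]) - row[2]) > 1e-9]
print(mismatches)  # [] when every row fits the assumed relation
```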


Multiple sequence alignment