Detailed information    

In silico (bioinformatically predicted)

Overview


Name: clpE
Type: Regulator
Locus tag: H020_RS0104030
Genome accession: NZ_AKVY01000001
Coordinates: 772187..774445 (-)
Length: 752 a.a.
NCBI ID: WP_000882523.1
UniProt ID: A0A4J1WRX6
Organism: Streptococcus pneumoniae TIGR4
Function: degradation of ComX (predicted from homology); competence regulation
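
The coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database: the helper name `span_length` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon.

```python
def span_length(coords: str) -> int:
    """Length in bp of a 'start..end' span (assumed 1-based, inclusive)."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1

# Coordinates as listed in the overview of this record.
gene_bp = span_length("772187..774445")

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_aa = codons - 1

print(gene_bp, remainder, protein_aa)
```

Under these assumptions the span is 2259 bp, which divides evenly into 753 codons and agrees with the listed protein length of 752 a.a.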

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp)
IScluster/Tn | 767158..771980 | 772187..774445 | flank | 207
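
The distance in the summary above can be reproduced from the two coordinate pairs. This is a sketch under an assumption inferred from the numbers rather than documented in this record: the distance is taken as the difference between the start of the downstream feature and the end of the upstream feature. The function names `parse_span` and `gap_bp` are hypothetical.

```python
def parse_span(coords: str) -> tuple[int, int]:
    """Parse a 'start..end' string into an integer (start, end) pair."""
    start, end = coords.split("..")
    return int(start), int(end)

def gap_bp(span_a: tuple[int, int], span_b: tuple[int, int]) -> int:
    """Coordinate difference between the nearest ends of two spans.

    Returns 0 when the spans overlap or touch at the same coordinate.
    """
    (_, first_end), (second_start, _) = sorted([span_a, span_b])
    return max(0, second_start - first_end)

mge = parse_span("767158..771980")   # IScluster/Tn coordinates from the table
gene = parse_span("772187..774445")  # clpE coordinates from the table

distance = gap_bp(mge, gene)
print(distance)
```

With this definition the result matches the listed 207 bp. The number of bases lying strictly between the two features would be one less (206).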


Gene organization within MGE regions


Location: 767158..774445
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
H020_RS0103985 | tnpB | 768156..768443 (-) | 288 | WP_000586646.1 | IS66 family insertion sequence element accessory protein TnpB | -
H020_RS13395 | - | 768684..770021 (+) | 1338 | Protein_760 | IS3 family transposase | -
H020_RS0104010 | - | 770130..770429 (-) | 300 | WP_000767193.1 | DUF1827 family protein | -
H020_RS0104015 | - | 770594..771016 (-) | 423 | WP_001849729.1 | NUDIX hydrolase | -
H020_RS12645 | - | 771127..771980 (+) | 854 | Protein_763 | IS630 family transposase | -
H020_RS0104030 | clpE | 772187..774445 (-) | 2259 | WP_000882523.1 | ATP-dependent Clp protease ATP-binding subunit | Regulator

Sequence


Protein


Length: 752 a.a. | Molecular weight: 83814.18 Da | Isoelectric point: 5.3713

>NTDB_id=64925 H020_RS0104030 WP_000882523.1 772187..774445(-) (clpE) [Streptococcus pneumoniae TIGR4]
MLCQNCKINDSTIHLYTNLNGKQKQIDLCQNCYKIIKTDPNNSLFKGMTDLNNRDFDPFGDFFNDLNNFRPSSNTPPIPP
TQSGGGYGGNGGYGSQNRGSAQTPPPSQEKGLLEEFGINVTEIARRGDIDPVIGRDDEIIRVIEILNRRTKNNPVLIGEP
GVGKTAVVEGLAQKIVDGDVPHKLQGKQVIRLDVVSLVQGTGIRGQFEERMQKLMEEIRKREDIILFIDEIHEIVGAGSA
SDGNMDAGNILKPALARGELQLVGATTLNEYRIIEKDAALERRMQPVKVDEPTVDETITILKGIQKKYEDYHHVQYTDAA
IEAAATLSNRYIQDRFLPDKAIDLLDEAGSKMNLTLNFVDPKVIDQRLIEAENLKSQATREEDFEKAAYFRDQIAKYKEM
QKKKITDQDTPSISEKTIEHIIEQKTNIPVGDLKEKEQSQLIHLAEDLKSHVIGQDDAVDKIAKAIRRNRVGLGTPNRPI
GSFLFVGPTGVGKTELSKQLAIELFGSADSMIRFDMSEYMEKHSVAKLVGAPPGYVGYDEAGQLTEKVRHNPYSLILLDE
VEKAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDAIIIMTSNAGTGKTEASVGFGAAREGRTNSVLGELGNFFSPEFMNRF
DGIIEFKALSKDNLLQIVELMLADVNKRLSSNNIRLDVTDKVKEKLVDLGYDPKMGARPLRRTIQDYIEDTITDYYLENP
SEKDLKAVMTSKGNIQIKSAKKAEVKSSEKEK

Nucleotide


Length: 2259 bp

>NTDB_id=64925 H020_RS0104030 WP_000882523.1 772187..774445(-) (clpE) [Streptococcus pneumoniae TIGR4]
ATGCTTTGTCAAAACTGTAAAATTAACGACTCAACAATTCATCTTTACACCAATCTCAATGGAAAACAAAAACAAATTGA
CCTCTGTCAAAACTGCTATAAGATTATCAAAACAGATCCTAACAATAGCCTCTTCAAAGGTATGACGGATCTGAACAATC
GTGACTTCGATCCCTTTGGTGATTTCTTCAATGATCTAAACAATTTCAGACCTTCTAGCAATACTCCTCCTATTCCCCCA
ACCCAATCAGGTGGAGGTTACGGTGGAAACGGCGGTTATGGTTCCCAAAATCGTGGATCTGCTCAAACTCCGCCACCTAG
CCAAGAAAAAGGCCTGCTGGAAGAATTTGGTATTAATGTAACTGAAATTGCCCGTCGTGGAGACATTGACCCCGTTATTG
GGCGCGACGATGAGATTATCCGTGTCATCGAGATTCTCAATCGTAGAACCAAGAATAATCCTGTCCTTATCGGTGAACCT
GGTGTCGGAAAAACGGCCGTTGTCGAAGGTCTAGCTCAGAAAATTGTCGATGGCGATGTGCCACATAAACTCCAAGGTAA
ACAAGTCATCCGTCTGGATGTGGTTAGCTTAGTTCAAGGAACGGGGATTCGAGGACAATTTGAAGAACGCATGCAAAAAC
TCATGGAAGAAATTCGCAAACGTGAAGACATCATCCTCTTTATCGATGAAATCCATGAAATTGTTGGTGCTGGTTCTGCG
AGTGATGGTAATATGGACGCAGGAAATATCCTCAAGCCAGCCCTTGCTCGTGGAGAACTGCAACTAGTCGGTGCTACTAC
CCTCAATGAATACCGTATCATTGAAAAGGATGCTGCCCTCGAGCGTCGTATGCAGCCTGTTAAAGTCGATGAACCAACGG
TGGACGAAACAATCACTATTCTCAAAGGGATTCAAAAGAAATACGAAGATTACCACCACGTTCAATATACAGATGCTGCG
ATTGAAGCAGCTGCAACTCTTTCCAATCGCTACATCCAAGATCGCTTCTTGCCTGACAAGGCCATTGACCTCCTAGATGA
AGCTGGTTCTAAGATGAACTTGACCTTGAATTTTGTGGATCCTAAAGTAATTGATCAGCGCTTGATTGAGGCTGAAAATC
TCAAGTCTCAAGCTACACGAGAAGAAGATTTTGAGAAGGCGGCCTACTTCCGCGACCAGATTGCCAAGTATAAGGAAATG
CAAAAGAAAAAGATCACAGACCAGGATACTCCTAGCATCAGCGAGAAAACTATTGAGCACATTATCGAGCAGAAAACCAA
TATCCCTGTTGGTGATTTGAAAGAGAAAGAACAATCTCAACTCATCCATCTAGCCGAAGATCTCAAGTCTCATGTTATTG
GTCAAGATGATGCAGTCGATAAGATTGCCAAGGCTATTCGCCGTAATCGTGTCGGACTTGGTACCCCTAACCGCCCAATC
GGAAGCTTCCTCTTCGTTGGGCCAACTGGTGTCGGTAAGACAGAACTTTCCAAACAACTGGCTATCGAACTTTTTGGTTC
TGCTGATAGTATGATTCGCTTTGATATGAGTGAATACATGGAAAAACATAGTGTGGCTAAGTTGGTCGGCGCTCCTCCAG
GTTATGTTGGCTATGATGAGGCTGGTCAATTAACTGAAAAAGTTCGCCACAATCCATATTCTCTCATCCTTCTCGATGAA
GTGGAAAAAGCTCACCCAGATGTTATGCACATGTTTCTTCAAGTCTTGGACGATGGTCGTTTGACAGACGGGCAAGGACG
CACCGTTAGCTTCAAGGATGCCATCATTATCATGACCTCAAATGCAGGTACAGGAAAGACCGAAGCTAGCGTTGGATTTG
GTGCTGCTAGAGAAGGACGTACCAATTCTGTCCTCGGTGAACTCGGTAACTTCTTTAGCCCAGAGTTTATGAACCGTTTT
GATGGCATTATCGAATTTAAGGCTCTCAGCAAGGATAACCTCCTTCAGATTGTCGAGCTCATGCTAGCAGATGTTAACAA
GCGCCTCTCTAGCAACAACATTCGTTTGGATGTAACTGATAAGGTCAAGGAAAAGTTGGTTGACCTAGGTTATGATCCAA
AAATGGGAGCACGCCCACTTCGTCGGACTATTCAAGACTATATTGAGGACACAATCACTGACTACTACCTTGAAAATCCA
AGCGAAAAAGATCTCAAAGCAGTTATGACTAGCAAGGGAAACATTCAGATTAAATCTGCCAAAAAAGCTGAAGTTAAAAG
TTCTGAAAAAGAAAAATAA
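
The nucleotide record begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. As a minimal sketch of how the nucleotide and protein records relate, the code below translates the first 30 and last 18 bases of the sequence above with the standard genetic code. The function name `translate` is hypothetical, and the table is built by hand rather than taken from any library.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon.

    Trailing bases that do not form a full codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases and last 18 bases of the nucleotide record above.
n_terminus = translate("ATGCTTTGTCAAAACTGTAAAATTAACGAC")
c_terminus = translate("TCTGAAAAAGAAAAATAA")

print(n_terminus)  # MLCQNCKIND
print(c_terminus)  # SEKEK*
```

Both fragments agree with the start (MLCQNCKIND) and end (SEKEK) of the protein record, with the final TAA read as the stop codon.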


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | A0A4J1WRX6

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
clpE | Streptococcus pneumoniae TIGR4 | 100 | 100 | 1
clpE | Streptococcus pneumoniae Rx1 | 99.867 | 100 | 0.999
clpE | Streptococcus pneumoniae D39 | 99.867 | 100 | 0.999
clpE | Streptococcus pneumoniae R6 | 99.867 | 100 | 0.999
clpE | Streptococcus mutans UA159 | 82.731 | 99.335 | 0.822
clpC | Lactococcus lactis subsp. cremoris KW2 | 76.839 | 97.606 | 0.75
clpC | Bacillus subtilis subsp. subtilis str. 168 | 52.308 | 86.436 | 0.452
clpC | Lactococcus lactis subsp. lactis strain DGCC12653 | 47.687 | 83.378 | 0.398
clpC | Streptococcus pneumoniae TIGR4 | 46.91 | 83.91 | 0.394
clpC | Streptococcus pneumoniae Rx1 | 46.91 | 83.91 | 0.394
clpC | Streptococcus pneumoniae D39 | 46.91 | 83.91 | 0.394
clpC | Streptococcus mutans UA159 | 47.36 | 83.112 | 0.394
clpC | Streptococcus thermophilus LMD-9 | 46.88 | 83.112 | 0.39
clpC | Streptococcus thermophilus LMG 18311 | 46.486 | 83.245 | 0.387
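
The Ha-values listed above are numerically consistent with the product of the identity and coverage fractions. That relationship is inferred from the table itself and is not stated in this record, so the sketch below treats it as an assumption. The function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed formula: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)

# (protein, organism, identities %, coverage %, listed Ha-value) from the table.
rows = [
    ("clpE", "Streptococcus pneumoniae TIGR4", 100.0, 100.0, 1.0),
    ("clpE", "Streptococcus pneumoniae Rx1", 99.867, 100.0, 0.999),
    ("clpE", "Streptococcus mutans UA159", 82.731, 99.335, 0.822),
    ("clpC", "Lactococcus lactis subsp. cremoris KW2", 76.839, 97.606, 0.75),
    ("clpC", "Bacillus subtilis subsp. subtilis str. 168", 52.308, 86.436, 0.452),
    ("clpC", "Streptococcus pneumoniae TIGR4", 46.91, 83.91, 0.394),
    ("clpC", "Streptococcus thermophilus LMG 18311", 46.486, 83.245, 0.387),
]

# Largest difference between the recomputed and listed values.
max_deviation = max(
    abs(ha_value(identity, coverage) - listed)
    for _, _, identity, coverage, listed in rows
)

for protein, organism, identity, coverage, listed in rows:
    print(f"{protein} {organism}: computed {ha_value(identity, coverage):.3f}, listed {listed}")
```

For the rows checked here, the recomputed values differ from the listed ones only at the level of three-decimal rounding.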


Multiple sequence alignment