Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpE   Type   Regulator
Locus tag   NQZ97_RS09155 Genome accession   NZ_CP102136
Coordinates   1878302..1880530 (+) Length   742 a.a.
NCBI ID   WP_043026227.1    Uniprot ID   A0A116M7Z4
Organism   Streptococcus suis strain M105052_S26     
Function   degradation of ComX (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1872713..1884951 1878302..1880530 within 0


Gene organization within MGE regions


Location: 1872713..1884951
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NQZ97_RS09120 (NQZ97_09120) - 1872713..1873414 (+) 702 WP_015647212.1 CPBP family intramembrane glutamic endopeptidase -
  NQZ97_RS09125 (NQZ97_09125) - 1873667..1874398 (-) 732 WP_170244159.1 hypothetical protein -
  NQZ97_RS09130 (NQZ97_09130) - 1874395..1874967 (-) 573 WP_170238178.1 hypothetical protein -
  NQZ97_RS11955 - 1874927..1875052 (-) 126 WP_079270322.1 YSIRK-type signal peptide-containing protein -
  NQZ97_RS09135 (NQZ97_09135) - 1875328..1876176 (-) 849 WP_105130177.1 bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase -
  NQZ97_RS09140 (NQZ97_09140) - 1876306..1877040 (-) 735 WP_043026224.1 amino acid ABC transporter ATP-binding protein -
  NQZ97_RS09145 (NQZ97_09145) - 1877033..1877722 (-) 690 WP_002938702.1 amino acid ABC transporter permease -
  NQZ97_RS09150 (NQZ97_09150) - 1877848..1878078 (-) 231 WP_002938704.1 DUF1797 family protein -
  NQZ97_RS09155 (NQZ97_09155) clpE 1878302..1880530 (+) 2229 WP_043026227.1 ATP-dependent Clp protease ATP-binding subunit Regulator
  NQZ97_RS09160 (NQZ97_09160) - 1880713..1881174 (+) 462 WP_029174038.1 NUDIX hydrolase -
  NQZ97_RS09165 (NQZ97_09165) - 1881229..1881537 (+) 309 WP_002938710.1 DUF1827 family protein -
  NQZ97_RS09170 (NQZ97_09170) ileS 1881708..1884497 (-) 2790 WP_105130175.1 isoleucine--tRNA ligase -
  NQZ97_RS09175 (NQZ97_09175) - 1884508..1884951 (-) 444 WP_014637568.1 GNAT family N-acetyltransferase -

Sequence


Protein


Download         Length: 742 a.a.        Molecular weight: 82596.91 Da        Isoelectric Point: 5.2123

>NTDB_id=714644 NQZ97_RS09155 WP_043026227.1 1878302..1880530(+) (clpE) [Streptococcus suis strain M105052_S26]
MLCKNCNINDATIHLYTNLNGKQQQVDLCHNCYQIMKTDPNNAILRGLGDLTNPNNMDPFSEFFNHLGGYPGNTPAGKNR
EQTPPTQAGGHNGRGGQTPPPQQPQQPNGLLEEFGINVTEIARRGDIDPVIGRDQEITRVIEILNRRTKNNPVLIGEPGV
GKTAVVEGLAQKIVDGDVPQKLRDKEVIRLDVVSLVQGTGIRGQFEERMQKLMEEIRNRREIILFIDEIHEIVGAGSAGD
GNMDAGNILKPALARGEMQLVGATTLNEYRIIEKDAALERRMQPVKVEEPSVEETITILKGIQNKYQDYHHVKYSDAAIE
AAAVLSNRYIQDRFLPDKAIDLLDEAGSKMNLTLNFIDPKEIDQRLIDAENRKAQATQDEDYEKAAYFRDQIAKYKEMQK
ATISEEDIPLITEKEIEAIVEQKTNIPVGDLKEKEQSQLVNLASDLKAHVIGQDEAVDKIAKAIRRNRVGLGAPNRPIGS
FLFVGPTGVGKTELSKQLAIELFGSADSMIRFDMSEYMEKHAVAKLVGAPPGYVGYEEAGQLTEKVRRNPYSLILLDEVE
KAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDTIIIMTSNAGTGKVEASVGFGAAMEGRTQSVLGQLSNFFTPEFMNRFDG
IIEFQPLSRENLLEIVSLMLDDVNKRLSHNGISLHVTDKVKEKLVDLGYDPKMGARPLRRTIQDQIEDAITDFYLEHPAE
KDLRAVMSSKGTIQIKAQTKTK

Nucleotide


Download         Length: 2229 bp        

>NTDB_id=714644 NQZ97_RS09155 WP_043026227.1 1878302..1880530(+) (clpE) [Streptococcus suis strain M105052_S26]
ATGCTCTGCAAAAATTGTAATATCAACGACGCAACGATCCATCTCTACACCAACTTAAACGGTAAACAGCAGCAGGTAGA
CCTCTGTCACAACTGCTACCAAATCATGAAAACAGACCCTAATAATGCCATCCTACGTGGGTTGGGAGATTTAACAAATC
CTAACAATATGGATCCCTTTAGTGAATTTTTCAATCACCTAGGTGGCTATCCTGGCAATACCCCTGCTGGAAAAAATCGC
GAACAGACTCCTCCCACTCAAGCCGGTGGGCATAATGGACGTGGTGGACAGACCCCTCCTCCGCAACAACCCCAACAACC
AAATGGGCTCCTTGAAGAATTTGGTATCAACGTTACCGAAATTGCCCGTCGTGGGGATATTGATCCAGTAATCGGTCGCG
ATCAAGAAATTACCCGTGTCATCGAAATTCTCAACCGTCGTACAAAAAACAACCCTGTGCTTATCGGTGAGCCTGGTGTA
GGTAAAACAGCGGTTGTTGAAGGCTTGGCTCAAAAAATTGTCGATGGCGATGTCCCACAAAAATTACGAGACAAGGAAGT
CATTCGTCTGGATGTTGTCAGCCTCGTACAAGGAACAGGTATCCGCGGACAATTTGAAGAACGCATGCAAAAACTCATGG
AAGAAATCCGTAATCGTCGCGAAATCATTCTTTTTATTGACGAGATTCACGAAATTGTCGGTGCTGGCTCTGCAGGCGAT
GGTAACATGGATGCAGGCAATATCCTCAAACCAGCCCTCGCACGTGGTGAAATGCAGCTAGTCGGTGCAACAACGCTCAA
CGAGTACCGTATTATCGAAAAAGATGCCGCCCTAGAACGCCGTATGCAACCTGTAAAAGTTGAAGAACCAAGTGTCGAAG
AAACCATTACTATTCTCAAAGGTATCCAAAATAAATACCAGGATTACCATCATGTTAAATACTCTGATGCTGCTATCGAA
GCGGCCGCAGTACTGTCCAACCGCTACATTCAAGACCGTTTCCTACCAGACAAAGCCATTGACCTTCTGGATGAAGCTGG
TTCAAAAATGAATTTGACCCTCAACTTTATTGATCCAAAGGAAATTGACCAACGCCTGATTGATGCCGAAAATCGTAAGG
CACAAGCTACACAGGATGAGGACTATGAAAAAGCAGCCTATTTCCGTGACCAAATTGCTAAGTACAAGGAGATGCAAAAG
GCAACTATCAGCGAAGAAGATATTCCACTCATTACTGAAAAAGAAATTGAGGCCATCGTTGAGCAAAAAACAAATATCCC
AGTTGGTGATTTAAAAGAAAAGGAACAGTCCCAGCTTGTGAACCTCGCTAGTGACTTAAAAGCCCATGTTATCGGACAAG
ATGAAGCTGTTGACAAGATTGCCAAGGCCATTCGCCGTAACCGTGTTGGTCTTGGTGCCCCAAACCGTCCAATCGGCTCC
TTCCTCTTTGTCGGACCAACCGGTGTCGGTAAAACAGAGCTATCCAAGCAACTAGCGATTGAACTCTTCGGTTCAGCCGA
TAGTATGATTCGCTTCGACATGTCCGAATACATGGAAAAACACGCTGTAGCTAAACTCGTCGGAGCTCCTCCAGGCTATG
TAGGCTATGAGGAGGCTGGACAATTGACTGAAAAAGTCCGTCGTAATCCTTACTCACTCATCCTCCTTGACGAAGTTGAA
AAAGCCCACCCTGATGTCATGCACATGTTCCTTCAGGTCTTAGACGATGGGCGTCTGACAGATGGACAAGGTCGTACTGT
CAGCTTCAAAGATACCATCATCATCATGACATCAAATGCTGGTACTGGAAAAGTTGAAGCAAGTGTCGGTTTTGGTGCGG
CTATGGAAGGGCGGACTCAATCAGTCCTTGGCCAACTTAGCAATTTCTTCACACCTGAATTTATGAACCGATTTGACGGT
ATTATTGAGTTCCAACCACTGAGCAGGGAAAACCTACTTGAAATCGTCAGCCTCATGCTAGACGATGTAAACAAACGCCT
TTCTCACAATGGCATCAGCCTCCATGTAACAGATAAAGTTAAGGAGAAATTAGTGGACCTTGGCTACGATCCAAAGATGG
GAGCTCGTCCACTACGCAGAACCATCCAAGATCAGATTGAAGATGCTATCACCGACTTCTACTTAGAACATCCAGCAGAA
AAAGACCTACGTGCTGTCATGTCAAGTAAAGGTACAATTCAAATCAAGGCACAAACAAAGACAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A116M7Z4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpE Streptococcus pneumoniae Rx1

82.095

100

0.834

  clpE Streptococcus pneumoniae D39

82.095

100

0.834

  clpE Streptococcus pneumoniae R6

82.095

100

0.834

  clpE Streptococcus pneumoniae TIGR4

82.095

100

0.834

  clpE Streptococcus mutans UA159

82.109

100

0.829

  clpC Lactococcus lactis subsp. cremoris KW2

77.517

97.709

0.757

  clpC Bacillus subtilis subsp. subtilis str. 168

52.462

87.601

0.46

  clpC Streptococcus mutans UA159

48.558

84.097

0.408

  clpC Streptococcus thermophilus LMD-9

48.248

84.636

0.408

  clpC Streptococcus thermophilus LMG 18311

47.93

84.636

0.406

  clpC Streptococcus pneumoniae TIGR4

47.84

84.232

0.403

  clpC Streptococcus pneumoniae Rx1

47.84

84.232

0.403

  clpC Streptococcus pneumoniae D39

47.84

84.232

0.403

  clpC Lactococcus lactis subsp. lactis strain DGCC12653

46.815

84.636

0.396