Detailed information of component protein
Summary
External database links
Locus tag (Gene) | EC042_RS24435 |
Coordinate (Strand) | 4908894..4911443 (+) |
NCBI ID | WP_000029849.1 |
RefSeq | NC_017626 |
Uniprot ID | D3GV04_ECO44, A0A3J8YFL8_ECOLX, B7LFQ0_ECO55 |
KEGG ID | elo:EC042_4577, eck:EC55989_3283 |
PDB ID | - |
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
AAA_2 | PF07724.16 | 2.4e-40 | 577..740 |
AAA_lid_9 | PF17871.3 | 1.4e-24 | 353..444 |
ClpB_D2-small | PF10431.11 | 9.2e-10 | 747..821 |
AAA | PF00004.31 | 9.8e-09 | 213..325 |
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Outside | 1-849 | VSIYLKPIINKLTPESRNTLDSAINYAISRSHHEVDCLHLLWKLLQEHKYIAEVLYEQSLFNPEWVLNAIESELIRINTVPQSSPVFSESMQTLLEKTWIHASTKWQIDHIDIPVFLSTMINFRDSIFPLNVSDALCCDMDVAEELLISFSDEAEHSASHRPTDRSSHEYLSKYTENLSLRAETGKLDPVTGREKEVRQLIDILLRRRQNNPILTGEPGVGKSSIVEGLALQIASGRVPDVLKNVRIHALDMGALLAGASVRGEFENRLKSLLTELNSLDGTAILFIDEAHSLIGAGGLPGQTDAANLLKPALARGELRIIAATTWGEYKKYFEKDGALARRFQIVKVAEPNQDVTAEMLRSLLPMMEKHHNVSIREEAITATVHLSDRYLHGRRQPDKSVSLLDTACSRVAVSQSTSPDAIQDVEASLVRYQGELALLTQERSDVLRQEMLANKIAQLEEELEQLKSAWRHQSELVAKIQSSDDISSKNMYRKELESAYKKDSPMVFECVDKNCVADVVSGWTGVPLGVCLDGEQQKASGLLRCLEQRVLGQRYAMSAIASQVLICRADLKDPVKPDGVFLLAGPSGTGKTETARALAEFVYGDENKLITINMTEFQEAHTVSTLKGAPPGYVGFGQGGTLTERVSHNPYSVILLDEIEKAHPDVLEFFFQIFDSGIIEDAEGKMVSFRDCLIIMTSNFASENITNIWNDGETNRDRIKEMLLPLFNEHFGAAFMGRTNLIPFTPLHSKTLRDIVLIKIDKICQRFEQASGQMYKIEYNDSLIDWITHHCQCDKSGARDIDSVLNSTVLPVLARYLTDSEDNRTPKKIRISVRKNNITLRSSQFATRN |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9974 | - | - |
Protein sequence: 850 a.a.
>T6CP005149 NC_017626:4908894-4911443 [Escherichia coli 042] [TssH]
VSIYLKPIINKLTPESRNTLDSAINYAISRSHHEVDCLHLLWKLLQEHKYIAEVLYEQSLFNPEWVLNAIESELIRINTV
PQSSPVFSESMQTLLEKTWIHASTKWQIDHIDIPVFLSTMINFRDSIFPLNVSDALCCDMDVAEELLISFSDEAEHSASH
RPTDRSSHEYLSKYTENLSLRAETGKLDPVTGREKEVRQLIDILLRRRQNNPILTGEPGVGKSSIVEGLALQIASGRVPD
VLKNVRIHALDMGALLAGASVRGEFENRLKSLLTELNSLDGTAILFIDEAHSLIGAGGLPGQTDAANLLKPALARGELRI
IAATTWGEYKKYFEKDGALARRFQIVKVAEPNQDVTAEMLRSLLPMMEKHHNVSIREEAITATVHLSDRYLHGRRQPDKS
VSLLDTACSRVAVSQSTSPDAIQDVEASLVRYQGELALLTQERSDVLRQEMLANKIAQLEEELEQLKSAWRHQSELVAKI
QSSDDISSKNMYRKELESAYKKDSPMVFECVDKNCVADVVSGWTGVPLGVCLDGEQQKASGLLRCLEQRVLGQRYAMSAI
ASQVLICRADLKDPVKPDGVFLLAGPSGTGKTETARALAEFVYGDENKLITINMTEFQEAHTVSTLKGAPPGYVGFGQGG
TLTERVSHNPYSVILLDEIEKAHPDVLEFFFQIFDSGIIEDAEGKMVSFRDCLIIMTSNFASENITNIWNDGETNRDRIK
EMLLPLFNEHFGAAFMGRTNLIPFTPLHSKTLRDIVLIKIDKICQRFEQASGQMYKIEYNDSLIDWITHHCQCDKSGARD
IDSVLNSTVLPVLARYLTDSEDNRTPKKIRISVRKNNITLRSSQFATRN*
VSIYLKPIINKLTPESRNTLDSAINYAISRSHHEVDCLHLLWKLLQEHKYIAEVLYEQSLFNPEWVLNAIESELIRINTV
PQSSPVFSESMQTLLEKTWIHASTKWQIDHIDIPVFLSTMINFRDSIFPLNVSDALCCDMDVAEELLISFSDEAEHSASH
RPTDRSSHEYLSKYTENLSLRAETGKLDPVTGREKEVRQLIDILLRRRQNNPILTGEPGVGKSSIVEGLALQIASGRVPD
VLKNVRIHALDMGALLAGASVRGEFENRLKSLLTELNSLDGTAILFIDEAHSLIGAGGLPGQTDAANLLKPALARGELRI
IAATTWGEYKKYFEKDGALARRFQIVKVAEPNQDVTAEMLRSLLPMMEKHHNVSIREEAITATVHLSDRYLHGRRQPDKS
VSLLDTACSRVAVSQSTSPDAIQDVEASLVRYQGELALLTQERSDVLRQEMLANKIAQLEEELEQLKSAWRHQSELVAKI
QSSDDISSKNMYRKELESAYKKDSPMVFECVDKNCVADVVSGWTGVPLGVCLDGEQQKASGLLRCLEQRVLGQRYAMSAI
ASQVLICRADLKDPVKPDGVFLLAGPSGTGKTETARALAEFVYGDENKLITINMTEFQEAHTVSTLKGAPPGYVGFGQGG
TLTERVSHNPYSVILLDEIEKAHPDVLEFFFQIFDSGIIEDAEGKMVSFRDCLIIMTSNFASENITNIWNDGETNRDRIK
EMLLPLFNEHFGAAFMGRTNLIPFTPLHSKTLRDIVLIKIDKICQRFEQASGQMYKIEYNDSLIDWITHHCQCDKSGARD
IDSVLNSTVLPVLARYLTDSEDNRTPKKIRISVRKNNITLRSSQFATRN*
Nucleotide sequence: 2550 bp
>T6CP005149 NC_017626:4908894-4911443 [Escherichia coli 042] [TssH]
GTGAGTATCTATCTGAAACCAATTATTAATAAATTAACTCCAGAAAGTCGAAATACTCTGGATTCTGCAATTAATTACGC
AATATCAAGGTCACATCATGAAGTAGACTGTCTACATCTTCTATGGAAGTTATTGCAGGAACATAAATATATAGCCGAAG
TGCTTTATGAGCAGTCCTTATTTAATCCTGAGTGGGTTCTTAATGCTATTGAAAGTGAACTTATCCGTATCAATACAGTA
CCACAATCCTCACCTGTATTTTCTGAGTCGATGCAGACTCTACTGGAGAAAACATGGATACATGCCAGTACTAAATGGCA
AATCGATCATATAGATATCCCTGTCTTTTTGAGTACTATGATTAATTTCAGAGATTCAATATTTCCTCTCAATGTTTCTG
ATGCGTTATGTTGTGATATGGATGTAGCTGAGGAGTTACTTATCTCTTTTTCAGACGAAGCAGAGCATTCTGCTTCTCAT
CGTCCCACCGATAGGTCTTCACATGAATATCTCAGTAAATATACTGAGAATCTCTCTCTGCGGGCAGAAACAGGTAAATT
AGATCCGGTTACTGGCAGAGAGAAGGAAGTCAGACAACTTATCGATATTTTGCTTCGTCGCAGACAGAATAATCCAATAC
TGACCGGTGAACCGGGAGTAGGGAAAAGTAGTATAGTAGAAGGTTTAGCACTGCAAATTGCGTCTGGCCGTGTGCCTGAT
GTACTGAAGAATGTCCGGATCCATGCCCTTGATATGGGGGCTCTGCTTGCGGGAGCCAGTGTCAGAGGGGAATTCGAAAA
TCGCCTAAAATCACTGTTGACCGAACTTAATTCTTTGGATGGCACAGCTATTTTGTTTATTGATGAGGCTCATTCCCTTA
TCGGTGCAGGAGGACTTCCGGGACAGACTGATGCAGCTAATTTGCTTAAGCCTGCTCTTGCCCGTGGAGAATTGCGCATT
ATAGCCGCAACGACATGGGGAGAATATAAAAAGTATTTTGAAAAGGATGGGGCGTTAGCCCGGCGTTTTCAGATAGTAAA
AGTAGCAGAGCCAAATCAGGATGTCACAGCGGAAATGCTACGTTCTCTTTTGCCCATGATGGAAAAACACCATAATGTTT
CTATCAGGGAAGAAGCAATAACTGCAACGGTACATCTATCAGACCGTTATCTGCATGGGCGCCGTCAACCAGATAAATCT
GTAAGTCTGTTAGATACAGCTTGTTCTCGGGTTGCTGTATCTCAGTCTACATCACCAGATGCCATTCAGGATGTCGAAGC
TTCATTGGTAAGATACCAAGGGGAACTTGCGCTGCTGACACAGGAAAGGTCAGATGTTTTGCGTCAGGAAATGCTTGCAA
ATAAAATTGCTCAACTTGAGGAGGAGCTTGAGCAACTCAAATCAGCGTGGCGTCATCAAAGTGAACTGGTTGCAAAAATA
CAAAGTTCAGATGATATTTCATCAAAGAATATGTACAGAAAAGAACTGGAAAGTGCATATAAGAAAGACAGTCCTATGGT
TTTTGAGTGTGTGGATAAAAATTGTGTTGCTGATGTTGTTTCTGGGTGGACAGGTGTTCCACTGGGTGTATGCCTTGATG
GTGAACAACAAAAAGCATCCGGTCTGTTAAGATGCTTGGAGCAGCGAGTTTTGGGGCAGCGATATGCTATGTCCGCTATT
GCATCACAAGTTCTTATTTGTCGGGCAGATCTTAAAGATCCTGTAAAACCTGATGGGGTATTTTTACTTGCGGGCCCTTC
TGGCACAGGCAAAACGGAGACAGCCAGGGCGCTGGCCGAGTTTGTTTATGGGGATGAGAATAAACTCATCACGATTAATA
TGACTGAATTTCAGGAAGCTCACACTGTTTCCACTCTGAAGGGGGCACCTCCTGGATATGTTGGTTTTGGTCAGGGAGGG
ACTCTTACTGAAAGGGTGAGTCATAATCCATACAGTGTTATTCTGCTTGATGAAATTGAAAAAGCACATCCGGATGTCCT
GGAGTTCTTCTTCCAGATATTTGACTCAGGTATTATCGAGGATGCTGAAGGTAAGATGGTGTCTTTCAGAGATTGTCTCA
TAATAATGACATCAAACTTTGCCAGTGAAAATATCACAAACATATGGAATGATGGTGAGACAAATAGAGACAGAATAAAA
GAAATGCTTCTTCCTTTGTTTAATGAGCATTTTGGGGCGGCATTTATGGGGAGAACTAACTTAATTCCGTTCACACCGTT
GCATTCCAAAACACTGAGAGATATCGTACTCATTAAAATTGATAAGATCTGCCAACGTTTTGAACAGGCATCAGGTCAGA
TGTATAAGATAGAATACAATGATTCTCTTATTGACTGGATTACACATCATTGTCAATGTGATAAGTCTGGAGCTAGGGAT
ATAGACTCAGTACTTAACAGCACGGTTCTTCCTGTCTTGGCCAGATATTTGACTGACTCTGAAGATAATAGAACCCCCAA
GAAAATAAGAATATCAGTAAGGAAAAATAACATTACATTACGTAGTTCGCAATTTGCAACTCGTAATTGA
GTGAGTATCTATCTGAAACCAATTATTAATAAATTAACTCCAGAAAGTCGAAATACTCTGGATTCTGCAATTAATTACGC
AATATCAAGGTCACATCATGAAGTAGACTGTCTACATCTTCTATGGAAGTTATTGCAGGAACATAAATATATAGCCGAAG
TGCTTTATGAGCAGTCCTTATTTAATCCTGAGTGGGTTCTTAATGCTATTGAAAGTGAACTTATCCGTATCAATACAGTA
CCACAATCCTCACCTGTATTTTCTGAGTCGATGCAGACTCTACTGGAGAAAACATGGATACATGCCAGTACTAAATGGCA
AATCGATCATATAGATATCCCTGTCTTTTTGAGTACTATGATTAATTTCAGAGATTCAATATTTCCTCTCAATGTTTCTG
ATGCGTTATGTTGTGATATGGATGTAGCTGAGGAGTTACTTATCTCTTTTTCAGACGAAGCAGAGCATTCTGCTTCTCAT
CGTCCCACCGATAGGTCTTCACATGAATATCTCAGTAAATATACTGAGAATCTCTCTCTGCGGGCAGAAACAGGTAAATT
AGATCCGGTTACTGGCAGAGAGAAGGAAGTCAGACAACTTATCGATATTTTGCTTCGTCGCAGACAGAATAATCCAATAC
TGACCGGTGAACCGGGAGTAGGGAAAAGTAGTATAGTAGAAGGTTTAGCACTGCAAATTGCGTCTGGCCGTGTGCCTGAT
GTACTGAAGAATGTCCGGATCCATGCCCTTGATATGGGGGCTCTGCTTGCGGGAGCCAGTGTCAGAGGGGAATTCGAAAA
TCGCCTAAAATCACTGTTGACCGAACTTAATTCTTTGGATGGCACAGCTATTTTGTTTATTGATGAGGCTCATTCCCTTA
TCGGTGCAGGAGGACTTCCGGGACAGACTGATGCAGCTAATTTGCTTAAGCCTGCTCTTGCCCGTGGAGAATTGCGCATT
ATAGCCGCAACGACATGGGGAGAATATAAAAAGTATTTTGAAAAGGATGGGGCGTTAGCCCGGCGTTTTCAGATAGTAAA
AGTAGCAGAGCCAAATCAGGATGTCACAGCGGAAATGCTACGTTCTCTTTTGCCCATGATGGAAAAACACCATAATGTTT
CTATCAGGGAAGAAGCAATAACTGCAACGGTACATCTATCAGACCGTTATCTGCATGGGCGCCGTCAACCAGATAAATCT
GTAAGTCTGTTAGATACAGCTTGTTCTCGGGTTGCTGTATCTCAGTCTACATCACCAGATGCCATTCAGGATGTCGAAGC
TTCATTGGTAAGATACCAAGGGGAACTTGCGCTGCTGACACAGGAAAGGTCAGATGTTTTGCGTCAGGAAATGCTTGCAA
ATAAAATTGCTCAACTTGAGGAGGAGCTTGAGCAACTCAAATCAGCGTGGCGTCATCAAAGTGAACTGGTTGCAAAAATA
CAAAGTTCAGATGATATTTCATCAAAGAATATGTACAGAAAAGAACTGGAAAGTGCATATAAGAAAGACAGTCCTATGGT
TTTTGAGTGTGTGGATAAAAATTGTGTTGCTGATGTTGTTTCTGGGTGGACAGGTGTTCCACTGGGTGTATGCCTTGATG
GTGAACAACAAAAAGCATCCGGTCTGTTAAGATGCTTGGAGCAGCGAGTTTTGGGGCAGCGATATGCTATGTCCGCTATT
GCATCACAAGTTCTTATTTGTCGGGCAGATCTTAAAGATCCTGTAAAACCTGATGGGGTATTTTTACTTGCGGGCCCTTC
TGGCACAGGCAAAACGGAGACAGCCAGGGCGCTGGCCGAGTTTGTTTATGGGGATGAGAATAAACTCATCACGATTAATA
TGACTGAATTTCAGGAAGCTCACACTGTTTCCACTCTGAAGGGGGCACCTCCTGGATATGTTGGTTTTGGTCAGGGAGGG
ACTCTTACTGAAAGGGTGAGTCATAATCCATACAGTGTTATTCTGCTTGATGAAATTGAAAAAGCACATCCGGATGTCCT
GGAGTTCTTCTTCCAGATATTTGACTCAGGTATTATCGAGGATGCTGAAGGTAAGATGGTGTCTTTCAGAGATTGTCTCA
TAATAATGACATCAAACTTTGCCAGTGAAAATATCACAAACATATGGAATGATGGTGAGACAAATAGAGACAGAATAAAA
GAAATGCTTCTTCCTTTGTTTAATGAGCATTTTGGGGCGGCATTTATGGGGAGAACTAACTTAATTCCGTTCACACCGTT
GCATTCCAAAACACTGAGAGATATCGTACTCATTAAAATTGATAAGATCTGCCAACGTTTTGAACAGGCATCAGGTCAGA
TGTATAAGATAGAATACAATGATTCTCTTATTGACTGGATTACACATCATTGTCAATGTGATAAGTCTGGAGCTAGGGAT
ATAGACTCAGTACTTAACAGCACGGTTCTTCCTGTCTTGGCCAGATATTTGACTGACTCTGAAGATAATAGAACCCCCAA
GAAAATAAGAATATCAGTAAGGAAAAATAACATTACATTACGTAGTTCGCAATTTGCAACTCGTAATTGA