Detailed information on the component protein
Summary
Locus tag (Gene) | EC042_RS24200 |
Coordinate (Strand) | 4859340..4861997 (+) |
External database links
NCBI ID | WP_000431646.1 |
RefSeq | NC_017626 |
Uniprot ID | D3GUW1_ECO44 |
KEGG ID | elo:EC042_4530 |
PDB ID | 4HH6, 4HH5 |
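The Coordinate (Strand) field can be checked against the sequence lengths reported further down the record. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which matches the 2658 bp stated for the nucleotide sequence); the helper name `parse_coordinate` is hypothetical and not part of any database API.

```python
def parse_coordinate(field):
    """Split a field such as '4859340..4861997 (+)' into (start, end, strand)."""
    span, strand = field.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


start, end, strand = parse_coordinate("4859340..4861997 (+)")

length_bp = end - start + 1   # assumption: both endpoints are included
codons = length_bp // 3       # codon count, terminal stop codon included
residues = codons - 1         # translated residues, stop codon excluded

print(length_bp, codons, residues, strand)
```

Under that assumption the gene spans 2658 bp, that is 886 codons, of which 885 encode residues and the last is the stop codon.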
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
AAA_2 | PF07724.16 | 1.5e-39 | 615..778 |
AAA_lid_9 | PF17871.3 | 2.8e-31 | 381..472 |
ClpB_D2-small | PF10431.11 | 7.6e-12 | 784..859 |
AAA | PF00004.31 | 2.7e-08 | 241..353 |
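The table above lists hits by E-value. To read the domain architecture from N-terminus to C-terminus, the rows can be re-sorted by the start of the aligned region. This is a small sketch with the four rows copied from the table; the tuple layout and variable names are illustrative assumptions.

```python
# (domain, Pfam ID, E-value, aligned start, aligned end), copied from the table
domains = [
    ("AAA_2", "PF07724.16", 1.5e-39, 615, 778),
    ("AAA_lid_9", "PF17871.3", 2.8e-31, 381, 472),
    ("ClpB_D2-small", "PF10431.11", 7.6e-12, 784, 859),
    ("AAA", "PF00004.31", 2.7e-08, 241, 353),
]

# Sort by aligned start to get the N-to-C order along the protein.
by_position = sorted(domains, key=lambda d: d[3])

for name, pfam_id, evalue, start, end in by_position:
    span = end - start + 1  # aligned region length, endpoints inclusive
    print(f"{name:<14} {pfam_id:<11} {start:>3}..{end:<3} {span:>3} aa")

architecture = [d[0] for d in by_position]
spans = {d[0]: d[4] - d[3] + 1 for d in by_position}

# True when each aligned region ends before the next one starts.
non_overlapping = all(
    a[4] < b[3] for a, b in zip(by_position, by_position[1:])
)
```

Sorted this way, the hits read AAA, AAA_lid_9, AAA_2, ClpB_D2-small, and the aligned regions do not overlap.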
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Outside | 1-885 | MENSAALLRRLNHYCARALEGAASLCQTRAHAEITPEHWLLKLLEQGEGDLTVLGRRYDWDMDAIWQSLLGWLDNQPRSVRSRPQLAQSLNALLKQAWMVASLQGEEHIRSVHLLGALTENPHLVRCDGLWPLLTLSQSQLQRLSPLLDAQSDECPETLQDAEPVLPQGDSVTFIGRPVGADTAGIPSGDLPPVLQGALDKFTRDITASAREGKIDPVSGRDTEIRQMVDILSRRRKNNPILVGDPGVGKTALVEGLALRIVEGNVPESLRPVTLRTLDLGLLQAGAGVKGEFEQRLKNVIDAVQLSPAPVLLFIDEAHTLIGAGNQAGGADAANLLKPALARGELRTIAATTWSEYKQYLERDAALERRFQMVKVDEPDDETACLMLRSLKSRYAEHHNVHITDEAVRAAVTLSRRYLTERQLPDKAVDLLDTAAARVRMSLDTVPEQLTRIRSQLASLGMEKQALLEDIAVGHQNHGERLSAIEQEENVLMTARDDLEQQYARECELTGELLESRSDISRQSETHHLQQALHDIQQNQPLLSVDVDVRTVAGVVADWTGVPLSSLMKDEQTELLHLEKDIGRRVVGQDVALESIAQRLRAAKTGLTSGNGPQEVFLLVGPSGVGKTETALALADVMYGGEKSLITINLSEYQEPHTVSQLKGSPPGYVGYGQGGILTEAVRKRPYSVVLLDEVEKAHRDVLNLFYQVFDRGFMRDGEGREIDFRNTVILMTSNLGSDLLMQQLSEKPETTESELHELIRPLLRDHFQPALLARFQTVIYRPLTPSAMRTIVEMKLAQVCERLHCHYGLSTSVDERVYDALTSACLLPDTGARNVESLLNQQLLPVLSRQLLSHMAAKQKPQALALAWSDEDGMVIELRQECAL |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.5740 | - | - |
Protein sequence: 885 a.a.
>T6CP005162 NC_017626:4859340-4861997 [Escherichia coli 042] [TssH]
MENSAALLRRLNHYCARALEGAASLCQTRAHAEITPEHWLLKLLEQGEGDLTVLGRRYDWDMDAIWQSLLGWLDNQPRSV
RSRPQLAQSLNALLKQAWMVASLQGEEHIRSVHLLGALTENPHLVRCDGLWPLLTLSQSQLQRLSPLLDAQSDECPETLQ
DAEPVLPQGDSVTFIGRPVGADTAGIPSGDLPPVLQGALDKFTRDITASAREGKIDPVSGRDTEIRQMVDILSRRRKNNP
ILVGDPGVGKTALVEGLALRIVEGNVPESLRPVTLRTLDLGLLQAGAGVKGEFEQRLKNVIDAVQLSPAPVLLFIDEAHT
LIGAGNQAGGADAANLLKPALARGELRTIAATTWSEYKQYLERDAALERRFQMVKVDEPDDETACLMLRSLKSRYAEHHN
VHITDEAVRAAVTLSRRYLTERQLPDKAVDLLDTAAARVRMSLDTVPEQLTRIRSQLASLGMEKQALLEDIAVGHQNHGE
RLSAIEQEENVLMTARDDLEQQYARECELTGELLESRSDISRQSETHHLQQALHDIQQNQPLLSVDVDVRTVAGVVADWT
GVPLSSLMKDEQTELLHLEKDIGRRVVGQDVALESIAQRLRAAKTGLTSGNGPQEVFLLVGPSGVGKTETALALADVMYG
GEKSLITINLSEYQEPHTVSQLKGSPPGYVGYGQGGILTEAVRKRPYSVVLLDEVEKAHRDVLNLFYQVFDRGFMRDGEG
REIDFRNTVILMTSNLGSDLLMQQLSEKPETTESELHELIRPLLRDHFQPALLARFQTVIYRPLTPSAMRTIVEMKLAQV
CERLHCHYGLSTSVDERVYDALTSACLLPDTGARNVESLLNQQLLPVLSRQLLSHMAAKQKPQALALAWSDEDGMVIELR
QECAL*
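The FASTA header above packs the record ID, replicon, coordinates, organism and component name into one line. The sketch below splits it with a regular expression. The pattern is inferred from this single header and is an assumption, not a documented format, so other records may need adjustments; the function name `parse_header` is hypothetical.

```python
import re

# Pattern inferred from the one header shown in this record (assumption).
HEADER_RE = re.compile(
    r"^>(?P<record>\S+)\s+"
    r"(?P<replicon>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\s+"
    r"\[(?P<organism>[^\]]+)\]\s+\[(?P<component>[^\]]+)\]$"
)


def parse_header(line):
    """Return the header fields as a dict, with coordinates as integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = ">T6CP005162 NC_017626:4859340-4861997 [Escherichia coli 042] [TssH]"
info = parse_header(header)
print(info)
```

Here the replicon and coordinates in the header agree with the RefSeq and Coordinate fields given earlier in the record.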
Nucleotide sequence: 2658 bp
>T6CP005162 NC_017626:4859340-4861997 [Escherichia coli 042] [TssH]
ATGGAAAATTCGGCAGCCCTGTTACGTCGTCTTAATCATTACTGTGCCCGTGCACTGGAAGGCGCAGCCTCCCTTTGCCA
GACCCGGGCCCATGCGGAAATCACCCCGGAGCACTGGTTACTGAAACTGCTGGAGCAGGGGGAAGGAGACCTTACCGTGC
TGGGCAGGCGTTATGACTGGGATATGGACGCGATATGGCAGTCGCTGCTCGGCTGGCTGGATAACCAGCCCCGTAGCGTA
CGCAGTCGCCCGCAGCTTGCGCAGTCCCTGAATGCTCTGCTGAAACAGGCCTGGATGGTGGCCTCATTACAGGGAGAAGA
ACATATCCGCAGCGTGCATCTGCTGGGTGCCCTGACGGAAAATCCGCACCTGGTTCGCTGTGACGGGCTGTGGCCTCTGC
TGACACTGAGTCAGAGCCAGTTGCAGCGTCTGTCCCCGCTGCTGGATGCGCAGTCTGATGAGTGTCCGGAGACGTTACAG
GATGCAGAGCCTGTGCTGCCTCAGGGAGACAGTGTGACCTTTATCGGGCGCCCTGTCGGTGCGGATACGGCAGGTATACC
GTCAGGTGACCTGCCGCCGGTGTTACAGGGTGCGCTGGATAAATTCACCCGGGACATCACGGCCAGCGCGAGAGAGGGGA
AAATTGACCCGGTATCGGGACGCGATACGGAAATCCGTCAGATGGTGGATATTCTCTCCCGCCGTCGCAAGAATAACCCG
ATTCTGGTGGGGGACCCCGGTGTGGGGAAAACGGCTCTGGTGGAAGGGCTCGCCCTGCGTATTGTGGAAGGTAACGTACC
AGAATCTCTCAGACCCGTCACCCTGCGCACCCTTGACCTCGGCCTGCTGCAGGCCGGTGCCGGCGTGAAAGGTGAATTTG
AACAGCGTCTGAAAAACGTGATTGATGCCGTGCAGCTGTCACCGGCTCCTGTACTGCTGTTTATAGATGAAGCCCATACC
CTTATCGGTGCCGGTAATCAGGCCGGTGGCGCGGATGCGGCCAACCTGCTGAAGCCTGCGCTGGCGCGAGGCGAACTGCG
CACCATTGCTGCCACCACCTGGTCTGAGTACAAACAATACCTGGAACGTGACGCGGCTCTGGAGCGGCGTTTTCAGATGG
TCAAAGTGGACGAGCCGGATGATGAGACGGCCTGTCTGATGCTGCGCTCCCTGAAATCCCGTTATGCGGAACATCATAAC
GTGCATATCACCGATGAGGCAGTACGTGCTGCCGTCACACTCTCGCGCCGTTATCTGACAGAACGTCAGTTACCGGACAA
GGCCGTTGACCTGCTTGATACTGCTGCTGCCCGTGTGCGGATGAGCCTCGACACGGTGCCGGAACAGCTGACCCGAATCC
GTTCACAGCTCGCTTCCCTCGGTATGGAAAAACAGGCACTGCTGGAAGATATTGCCGTTGGTCATCAGAATCACGGCGAA
CGCCTGTCTGCCATTGAGCAGGAAGAGAACGTGCTGATGACAGCACGTGATGACCTGGAGCAGCAGTACGCCCGTGAGTG
CGAACTCACCGGTGAGTTACTTGAAAGCCGCAGCGATATTTCCCGCCAGAGTGAGACACATCACCTGCAGCAGGCACTGC
ACGACATTCAGCAGAATCAGCCCCTGCTCAGTGTGGATGTTGACGTACGCACCGTTGCCGGTGTGGTCGCGGACTGGACC
GGAGTGCCGTTATCCTCACTGATGAAGGATGAACAGACGGAACTGCTGCATCTGGAAAAGGATATCGGCAGACGGGTGGT
CGGACAGGACGTGGCACTGGAATCCATTGCACAGCGTCTGCGTGCGGCGAAAACGGGTCTGACATCCGGTAACGGCCCCC
AGGAGGTGTTCCTGCTGGTCGGCCCCAGTGGTGTGGGAAAAACCGAAACGGCACTGGCGCTGGCAGATGTGATGTACGGC
GGCGAAAAATCACTGATTACCATCAATCTGTCGGAATATCAGGAGCCCCATACGGTTTCCCAGCTTAAGGGTTCACCGCC
CGGGTACGTGGGATACGGTCAGGGGGGTATTCTGACGGAAGCTGTACGTAAGCGCCCTTACAGCGTGGTGCTGCTGGATG
AAGTGGAAAAGGCCCACCGGGATGTGCTGAATCTGTTCTACCAGGTGTTTGACCGGGGCTTTATGCGCGACGGTGAGGGG
CGTGAAATTGACTTCCGTAATACGGTCATTCTGATGACCTCCAATCTGGGCAGTGACCTTCTGATGCAGCAACTGAGCGA
AAAGCCGGAGACAACGGAATCGGAGCTGCATGAGCTTATCCGTCCACTGCTGCGCGACCACTTCCAGCCAGCACTGCTTG
CCCGTTTCCAGACCGTGATTTACCGTCCGCTGACACCGTCTGCTATGCGCACCATTGTGGAGATGAAGCTTGCACAGGTC
TGCGAACGCCTGCACTGCCATTATGGGCTGAGCACATCGGTTGATGAACGTGTGTACGATGCCCTGACATCCGCCTGCCT
GCTGCCGGATACGGGCGCCCGGAATGTGGAGAGTCTGCTGAACCAGCAACTGCTGCCGGTACTGAGCCGGCAGTTGCTGA
GCCATATGGCCGCGAAGCAGAAACCGCAGGCGCTGGCTCTGGCATGGAGTGACGAAGACGGTATGGTGATTGAGCTGCGG
CAGGAATGCGCGTTATGA