Detailed information of component protein
Summary
External database links
Locus tag (Gene) | A364_RS23120 |
Coordinate (Strand) | 18766..21525 (-) |
NCBI ID | 446537043 |
RefSeq | NZ_AOGL01000004 |
Uniprot ID | U9Z0Q2_ECOLX, A0A0L6XW37_ECOLX, F4TA57_ECOLX |
KEGG ID | - |
PDB ID | - |
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
AAA_2 | PF07724.16 | 1.3e-37 | 642..803 |
AAA_lid_9 | PF17871.3 | 2.1e-27 | 362..453 |
ClpB_D2-small | PF10431.11 | 3e-09 | 809..886 |
AAA | PF00004.31 | 1.7e-06 | 223..333 |
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Outside | 1-919 | MIQIDLPTLVKRLNLFSRQALEMAASECMSQQAAEITVSHVLIQMLAMPRSDLRVITRQGDIGMEELRQALTVENYTTARSADSYPAFSPMLVEWLKEGWLLASAEMQHSELRGGVLLLALLHSPLRYIPPAAARLLTGINRDRLQQDFVQWTQESAESVVPDADGKGAGTLTDASDTLLARYAKNMTEDARNGRLDPVLCRDHEIDLMIDILCRRRKNNPVVVGEAGVGKSALIEGLALRIVAGQVPDKLKNTDIMTLDLGALQAGASVKGEFEKRFKGLMAEVISSPVPVILFIDEAHTLIGAGNQQGGLDISNLLKPALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPSAAEATIILRGLSAVYEQSHGVLIDDDALQAAATLSERYLSGRQLPDKAIDVLDTACARVAINLSSPPKQISALTTLSHQQEAEIRQLERELRIGLRTDTSRMTEVLVQYDETLTALDELEAAWHQQQTLVREIIALRQQLLGVAEDDAAPLPDADTVEDTQPESEQDNTGAVPADEAGSVQPEETAETVSPVQRLAQLTAELDALHNDRLLVSPHVDKKQIAAVIAEWTGVPLNRLSQNEMSVITDLPKWLGDTIKGQDLAIASLHKHLLTARADLRRPGRPLGAFLLAGPSGVGKTETVLQLAELLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYSVVLLDEVEKAHPDVLNLFYQAFDKGEMADGEGRLIDCKNIVFFLTSNLGYQVIVEHADDPETMQEVLYPVLADFFKPALLARMEVVPYLPLSKETLATIIAGKLARLDNVLRSRFGAEVVIEPEVTDEIMSRVTRAENGARMLESVIDGDMLPPLSLLLLQKMAANTAIARIRLSAVDGAFTADVEDAQNDESVTKDETVL |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9850 | - | - |
Protein sequence: 920 a.a.
>T6CP016046 NZ_AOGL01000004:c21525-18766 [Escherichia coli SEPT362] [TssH]
MIQIDLPTLVKRLNLFSRQALEMAASECMSQQAAEITVSHVLIQMLAMPRSDLRVITRQGDIGMEELRQALTVENYTTAR
SADSYPAFSPMLVEWLKEGWLLASAEMQHSELRGGVLLLALLHSPLRYIPPAAARLLTGINRDRLQQDFVQWTQESAESV
VPDADGKGAGTLTDASDTLLARYAKNMTEDARNGRLDPVLCRDHEIDLMIDILCRRRKNNPVVVGEAGVGKSALIEGLAL
RIVAGQVPDKLKNTDIMTLDLGALQAGASVKGEFEKRFKGLMAEVISSPVPVILFIDEAHTLIGAGNQQGGLDISNLLKP
ALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPSAAEATIILRGLSAVYEQSHGVLIDDDALQAAATLSERYL
SGRQLPDKAIDVLDTACARVAINLSSPPKQISALTTLSHQQEAEIRQLERELRIGLRTDTSRMTEVLVQYDETLTALDEL
EAAWHQQQTLVREIIALRQQLLGVAEDDAAPLPDADTVEDTQPESEQDNTGAVPADEAGSVQPEETAETVSPVQRLAQLT
AELDALHNDRLLVSPHVDKKQIAAVIAEWTGVPLNRLSQNEMSVITDLPKWLGDTIKGQDLAIASLHKHLLTARADLRRP
GRPLGAFLLAGPSGVGKTETVLQLAELLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYSVV
LLDEVEKAHPDVLNLFYQAFDKGEMADGEGRLIDCKNIVFFLTSNLGYQVIVEHADDPETMQEVLYPVLADFFKPALLAR
MEVVPYLPLSKETLATIIAGKLARLDNVLRSRFGAEVVIEPEVTDEIMSRVTRAENGARMLESVIDGDMLPPLSLLLLQK
MAANTAIARIRLSAVDGAFTADVEDAQNDESVTKDETVL*
MIQIDLPTLVKRLNLFSRQALEMAASECMSQQAAEITVSHVLIQMLAMPRSDLRVITRQGDIGMEELRQALTVENYTTAR
SADSYPAFSPMLVEWLKEGWLLASAEMQHSELRGGVLLLALLHSPLRYIPPAAARLLTGINRDRLQQDFVQWTQESAESV
VPDADGKGAGTLTDASDTLLARYAKNMTEDARNGRLDPVLCRDHEIDLMIDILCRRRKNNPVVVGEAGVGKSALIEGLAL
RIVAGQVPDKLKNTDIMTLDLGALQAGASVKGEFEKRFKGLMAEVISSPVPVILFIDEAHTLIGAGNQQGGLDISNLLKP
ALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPSAAEATIILRGLSAVYEQSHGVLIDDDALQAAATLSERYL
SGRQLPDKAIDVLDTACARVAINLSSPPKQISALTTLSHQQEAEIRQLERELRIGLRTDTSRMTEVLVQYDETLTALDEL
EAAWHQQQTLVREIIALRQQLLGVAEDDAAPLPDADTVEDTQPESEQDNTGAVPADEAGSVQPEETAETVSPVQRLAQLT
AELDALHNDRLLVSPHVDKKQIAAVIAEWTGVPLNRLSQNEMSVITDLPKWLGDTIKGQDLAIASLHKHLLTARADLRRP
GRPLGAFLLAGPSGVGKTETVLQLAELLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYSVV
LLDEVEKAHPDVLNLFYQAFDKGEMADGEGRLIDCKNIVFFLTSNLGYQVIVEHADDPETMQEVLYPVLADFFKPALLAR
MEVVPYLPLSKETLATIIAGKLARLDNVLRSRFGAEVVIEPEVTDEIMSRVTRAENGARMLESVIDGDMLPPLSLLLLQK
MAANTAIARIRLSAVDGAFTADVEDAQNDESVTKDETVL*
Nucleotide sequence: 2760 bp
>T6CP016046 NZ_AOGL01000004:c21525-18766 [Escherichia coli SEPT362] [TssH]
ATGATCCAGATTGATCTTCCCACGCTGGTAAAACGGCTGAACCTGTTCTCCCGCCAGGCGCTGGAGATGGCCGCCTCAGA
ATGTATGAGCCAGCAGGCAGCGGAAATTACCGTCAGCCATGTGCTTATTCAGATGCTCGCCATGCCACGCAGTGACCTGC
GGGTTATCACCCGGCAGGGCGATATTGGCATGGAAGAGTTGCGCCAGGCGCTGACGGTGGAGAACTACACAACCGCCCGT
TCTGCAGACAGCTACCCGGCGTTTTCCCCGATGCTGGTTGAGTGGCTGAAAGAGGGCTGGCTGCTGGCGTCGGCTGAGAT
GCAGCACAGCGAACTGCGCGGCGGCGTGTTGCTGCTGGCCCTGCTGCATTCGCCGCTGCGTTATATACCGCCTGCTGCCG
CCCGGCTGTTGACCGGCATTAACCGTGACCGTCTGCAACAGGACTTTGTGCAGTGGACACAGGAGTCGGCGGAATCAGTC
GTGCCGGATGCAGACGGTAAAGGCGCAGGCACACTGACGGACGCCTCTGACACCCTGCTTGCCCGCTACGCCAAAAACAT
GACAGAAGATGCCCGTAACGGCAGGCTTGACCCGGTACTGTGCCGCGACCACGAAATCGACCTGATGATCGACATTCTCT
GCCGCCGCCGTAAAAACAACCCGGTGGTGGTGGGCGAAGCAGGCGTGGGTAAAAGCGCGCTGATTGAAGGGCTGGCGCTG
CGCATCGTGGCAGGCCAGGTGCCGGACAAACTGAAAAACACCGATATCATGACCCTTGACCTGGGCGCATTGCAGGCCGG
GGCGTCGGTGAAGGGTGAATTCGAAAAACGCTTCAAGGGGCTGATGGCGGAGGTCATTTCCTCTCCGGTGCCGGTCATTC
TGTTTATCGACGAAGCACATACCCTGATTGGCGCGGGCAACCAGCAGGGCGGGCTGGATATCTCCAACCTGCTCAAACCG
GCGCTGGCGCGCGGCGAGCTGAAAACCATCGCCGCCACCACCTGGAGCGAGTACAAAAAATACTTCGAGAAAGACGCCGC
CCTGTCGCGCCGCTTCCAGTTGGTGAAGGTCAGCGAACCCAGTGCCGCCGAAGCCACCATTATTCTGCGCGGTCTGTCGG
CGGTCTATGAACAGTCTCACGGCGTCCTGATTGATGATGACGCCTTGCAGGCCGCTGCGACATTAAGTGAACGTTATCTC
TCCGGGCGTCAGTTGCCGGACAAAGCGATTGATGTGCTGGATACCGCCTGCGCCCGTGTGGCAATCAACCTGTCATCGCC
ACCGAAACAAATCTCGGCGCTGACCACCCTGAGCCACCAGCAGGAGGCGGAAATTCGCCAGCTTGAGCGCGAGCTTCGCA
TCGGACTGCGTACCGACACATCACGGATGACCGAGGTGCTGGTGCAGTATGATGAAACGCTGACGGCGCTGGATGAACTG
GAAGCGGCCTGGCACCAGCAGCAGACGCTGGTCCGGGAGATTATCGCGCTGCGCCAGCAGTTACTGGGCGTGGCAGAGGA
CGATGCGGCGCCGTTGCCGGACGCAGATACCGTGGAGGATACGCAGCCAGAGTCAGAACAGGATAATACCGGTGCTGTAC
CGGCTGATGAAGCTGGCAGCGTACAGCCGGAAGAGACCGCTGAAACAGTTTCCCCGGTACAGCGGCTGGCCCAGCTCACT
GCCGAGCTGGACGCCCTGCATAACGACCGGTTGCTGGTCTCCCCGCACGTCGATAAAAAACAGATTGCGGCGGTGATTGC
CGAATGGACCGGCGTGCCGCTCAACCGCCTGTCGCAGAATGAAATGTCGGTCATCACCGACCTGCCGAAATGGCTGGGCG
ACACCATCAAAGGCCAGGACCTGGCGATTGCCAGCCTGCATAAGCATCTACTGACCGCACGCGCCGACCTGCGTCGTCCG
GGACGCCCGCTCGGTGCGTTCCTGCTGGCTGGCCCCAGCGGTGTGGGTAAAACCGAAACCGTCCTGCAACTGGCAGAGCT
GCTCTACGGCGGTCGCCAGTACCTGACCACCATCAATATGTCCGAATTCCAGGAGAAACACACCGTCTCGCGGCTGATTG
GTTCGCCTCCGGGCTACGTTGGCTACGGCGAAGGCGGCGTACTGACCGAAGCGATTCGCCAGAAACCCTACTCGGTAGTA
CTGCTCGATGAAGTGGAAAAAGCGCACCCGGATGTGCTCAACCTGTTCTACCAGGCGTTCGATAAGGGCGAAATGGCAGA
CGGTGAAGGCCGCCTGATTGACTGCAAAAATATCGTCTTCTTCCTGACGTCCAACCTCGGCTACCAGGTAATAGTCGAGC
ATGCCGATGACCCGGAAACCATGCAGGAAGTACTGTATCCGGTGCTGGCCGACTTCTTCAAACCTGCCCTGCTGGCGCGT
ATGGAAGTGGTGCCGTATCTGCCGCTGTCGAAAGAGACGCTCGCCACCATTATCGCCGGGAAACTGGCCCGTCTGGATAA
CGTGCTGCGCAGTCGCTTTGGTGCAGAAGTGGTCATTGAACCGGAAGTGACGGACGAAATCATGAGCCGCGTCACCCGCG
CGGAAAACGGCGCGAGGATGCTGGAATCGGTCATCGATGGCGACATGCTACCGCCGCTCTCGCTGCTGCTGTTGCAGAAA
ATGGCGGCTAACACGGCGATTGCCCGGATTCGGTTGTCGGCAGTGGACGGCGCATTTACGGCAGACGTGGAAGATGCTCA
GAACGACGAGTCCGTCACAAAGGATGAAACGGTTTTATGA
ATGATCCAGATTGATCTTCCCACGCTGGTAAAACGGCTGAACCTGTTCTCCCGCCAGGCGCTGGAGATGGCCGCCTCAGA
ATGTATGAGCCAGCAGGCAGCGGAAATTACCGTCAGCCATGTGCTTATTCAGATGCTCGCCATGCCACGCAGTGACCTGC
GGGTTATCACCCGGCAGGGCGATATTGGCATGGAAGAGTTGCGCCAGGCGCTGACGGTGGAGAACTACACAACCGCCCGT
TCTGCAGACAGCTACCCGGCGTTTTCCCCGATGCTGGTTGAGTGGCTGAAAGAGGGCTGGCTGCTGGCGTCGGCTGAGAT
GCAGCACAGCGAACTGCGCGGCGGCGTGTTGCTGCTGGCCCTGCTGCATTCGCCGCTGCGTTATATACCGCCTGCTGCCG
CCCGGCTGTTGACCGGCATTAACCGTGACCGTCTGCAACAGGACTTTGTGCAGTGGACACAGGAGTCGGCGGAATCAGTC
GTGCCGGATGCAGACGGTAAAGGCGCAGGCACACTGACGGACGCCTCTGACACCCTGCTTGCCCGCTACGCCAAAAACAT
GACAGAAGATGCCCGTAACGGCAGGCTTGACCCGGTACTGTGCCGCGACCACGAAATCGACCTGATGATCGACATTCTCT
GCCGCCGCCGTAAAAACAACCCGGTGGTGGTGGGCGAAGCAGGCGTGGGTAAAAGCGCGCTGATTGAAGGGCTGGCGCTG
CGCATCGTGGCAGGCCAGGTGCCGGACAAACTGAAAAACACCGATATCATGACCCTTGACCTGGGCGCATTGCAGGCCGG
GGCGTCGGTGAAGGGTGAATTCGAAAAACGCTTCAAGGGGCTGATGGCGGAGGTCATTTCCTCTCCGGTGCCGGTCATTC
TGTTTATCGACGAAGCACATACCCTGATTGGCGCGGGCAACCAGCAGGGCGGGCTGGATATCTCCAACCTGCTCAAACCG
GCGCTGGCGCGCGGCGAGCTGAAAACCATCGCCGCCACCACCTGGAGCGAGTACAAAAAATACTTCGAGAAAGACGCCGC
CCTGTCGCGCCGCTTCCAGTTGGTGAAGGTCAGCGAACCCAGTGCCGCCGAAGCCACCATTATTCTGCGCGGTCTGTCGG
CGGTCTATGAACAGTCTCACGGCGTCCTGATTGATGATGACGCCTTGCAGGCCGCTGCGACATTAAGTGAACGTTATCTC
TCCGGGCGTCAGTTGCCGGACAAAGCGATTGATGTGCTGGATACCGCCTGCGCCCGTGTGGCAATCAACCTGTCATCGCC
ACCGAAACAAATCTCGGCGCTGACCACCCTGAGCCACCAGCAGGAGGCGGAAATTCGCCAGCTTGAGCGCGAGCTTCGCA
TCGGACTGCGTACCGACACATCACGGATGACCGAGGTGCTGGTGCAGTATGATGAAACGCTGACGGCGCTGGATGAACTG
GAAGCGGCCTGGCACCAGCAGCAGACGCTGGTCCGGGAGATTATCGCGCTGCGCCAGCAGTTACTGGGCGTGGCAGAGGA
CGATGCGGCGCCGTTGCCGGACGCAGATACCGTGGAGGATACGCAGCCAGAGTCAGAACAGGATAATACCGGTGCTGTAC
CGGCTGATGAAGCTGGCAGCGTACAGCCGGAAGAGACCGCTGAAACAGTTTCCCCGGTACAGCGGCTGGCCCAGCTCACT
GCCGAGCTGGACGCCCTGCATAACGACCGGTTGCTGGTCTCCCCGCACGTCGATAAAAAACAGATTGCGGCGGTGATTGC
CGAATGGACCGGCGTGCCGCTCAACCGCCTGTCGCAGAATGAAATGTCGGTCATCACCGACCTGCCGAAATGGCTGGGCG
ACACCATCAAAGGCCAGGACCTGGCGATTGCCAGCCTGCATAAGCATCTACTGACCGCACGCGCCGACCTGCGTCGTCCG
GGACGCCCGCTCGGTGCGTTCCTGCTGGCTGGCCCCAGCGGTGTGGGTAAAACCGAAACCGTCCTGCAACTGGCAGAGCT
GCTCTACGGCGGTCGCCAGTACCTGACCACCATCAATATGTCCGAATTCCAGGAGAAACACACCGTCTCGCGGCTGATTG
GTTCGCCTCCGGGCTACGTTGGCTACGGCGAAGGCGGCGTACTGACCGAAGCGATTCGCCAGAAACCCTACTCGGTAGTA
CTGCTCGATGAAGTGGAAAAAGCGCACCCGGATGTGCTCAACCTGTTCTACCAGGCGTTCGATAAGGGCGAAATGGCAGA
CGGTGAAGGCCGCCTGATTGACTGCAAAAATATCGTCTTCTTCCTGACGTCCAACCTCGGCTACCAGGTAATAGTCGAGC
ATGCCGATGACCCGGAAACCATGCAGGAAGTACTGTATCCGGTGCTGGCCGACTTCTTCAAACCTGCCCTGCTGGCGCGT
ATGGAAGTGGTGCCGTATCTGCCGCTGTCGAAAGAGACGCTCGCCACCATTATCGCCGGGAAACTGGCCCGTCTGGATAA
CGTGCTGCGCAGTCGCTTTGGTGCAGAAGTGGTCATTGAACCGGAAGTGACGGACGAAATCATGAGCCGCGTCACCCGCG
CGGAAAACGGCGCGAGGATGCTGGAATCGGTCATCGATGGCGACATGCTACCGCCGCTCTCGCTGCTGCTGTTGCAGAAA
ATGGCGGCTAACACGGCGATTGCCCGGATTCGGTTGTCGGCAGTGGACGGCGCATTTACGGCAGACGTGGAAGATGCTCA
GAACGACGAGTCCGTCACAAAGGATGAAACGGTTTTATGA