Detailed information of component protein
Summary
External database links
Locus tag (Gene) | EC042_RS24390 |
Coordinate (Strand) | 4898712..4900694 (+) |
NCBI ID | WP_000198270.1 |
RefSeq | NC_017626 |
Uniprot ID | A0A0E0XVY5_ECO1C, A0A222QR63_ECOLX, D3GUZ5_ECO44, A0A2H3M840_ECOLX, B7LFQ9_ECO55 |
KEGG ID | eck:EC55989_3292, esl:O3K_04390, elo:EC042_4569 |
PDB ID | - |
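The coordinate field above fixes the gene length directly. The following is a minimal sketch, assuming the `start..end (strand)` layout used in this record and 1-based inclusive coordinates; `parse_coordinate` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def parse_coordinate(field: str) -> tuple[int, int, str]:
    """Parse a 'start..end (strand)' string (hypothetical helper for this record layout)."""
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", field)
    if m is None:
        raise ValueError(f"unrecognised coordinate field: {field!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)

start, end, strand = parse_coordinate("4898712..4900694 (+)")
length_bp = end - start + 1   # inclusive coordinates (assumption)
codons = length_bp // 3       # count includes the terminal stop codon
print(length_bp, codons, strand)
```

For this record the span is 1983 bp, which corresponds to 661 codons when the stop codon is counted.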
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
Phage_GPD | PF05954.13 | 5.5e-38 | 37..319 |
Phage_base_V | PF04717.14 | 8.8e-14 | 382..451 |
DUF2345 | PF10106.11 | 3.3e-05 | 494..631 |
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Outside | 1-660 | MTRQRFISDDHYKFQSEHFIRLHGCDISLFPLEIKGKESISDVYSYEIKCFSRTDHNSLDKLHGTRLSCEIGEQYNSLPARFIHGVVTKVIYNYDNSMQHTCIIILQPEIAELASGKRTRVWSNIKPSDIVSTILEDSLFSPPQVILYKEQNLLEYKIQYQESDLDFINRILSEAGIYYFFVHNRDKHIMTLADDPASHPKASYDKLEHLPCENLKESRHGYIREWSFTESLKCPSVTLSGYNESNVSEITIKSESKLTDVKPSKGVYEDIIPERKRELIKKRSEALIASCDSELKIWSGKTDSWWLSCGECFSLDKQDYRITSLYLNAVNNDNEHAGLCYCDLCASDNKSSLNFSREFKTPIIPGVLLARVVGPDSEEYYTDDNGRVKISFLWGEKSAAGTDKTSCWVRVSQVWSGEGFGSQFIPRIGSEVLVSFIQGNPDYPVIVGTVYNGQNTSPFSLPENNCKSGFITRSVKNGKKGEGHQLVFDDKEGEEKVIVTSSGDSLLTVKKDMISTINRSMSLTIAEGRNAEIKRGNDRLVLKEGDLHNDVHGNINIKVSNGDYNLKVSGGSGSLITDKNLTLESTQSIKIKVGANEITISTSGIDIKAAKITIEGQVSAEVKAATLKFESQAISEVKGTMLTLQGSAMTQIKGGIVNIG |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9809 | - | - |
Protein sequence: 660 a.a. (661 codons including the stop, shown as *)
>T6CP005145 NC_017626:4898712-4900694 [Escherichia coli 042] [TssI]
MTRQRFISDDHYKFQSEHFIRLHGCDISLFPLEIKGKESISDVYSYEIKCFSRTDHNSLDKLHGTRLSCEIGEQYNSLPA
RFIHGVVTKVIYNYDNSMQHTCIIILQPEIAELASGKRTRVWSNIKPSDIVSTILEDSLFSPPQVILYKEQNLLEYKIQY
QESDLDFINRILSEAGIYYFFVHNRDKHIMTLADDPASHPKASYDKLEHLPCENLKESRHGYIREWSFTESLKCPSVTLS
GYNESNVSEITIKSESKLTDVKPSKGVYEDIIPERKRELIKKRSEALIASCDSELKIWSGKTDSWWLSCGECFSLDKQDY
RITSLYLNAVNNDNEHAGLCYCDLCASDNKSSLNFSREFKTPIIPGVLLARVVGPDSEEYYTDDNGRVKISFLWGEKSAA
GTDKTSCWVRVSQVWSGEGFGSQFIPRIGSEVLVSFIQGNPDYPVIVGTVYNGQNTSPFSLPENNCKSGFITRSVKNGKK
GEGHQLVFDDKEGEEKVIVTSSGDSLLTVKKDMISTINRSMSLTIAEGRNAEIKRGNDRLVLKEGDLHNDVHGNINIKVS
NGDYNLKVSGGSGSLITDKNLTLESTQSIKIKVGANEITISTSGIDIKAAKITIEGQVSAEVKAATLKFESQAISEVKGT
MLTLQGSAMTQIKGGIVNIG*
Nucleotide sequence: 1983 bp
>T6CP005145 NC_017626:4898712-4900694 [Escherichia coli 042] [TssI]
ATGACTAGGCAAAGATTTATATCTGACGATCATTATAAATTCCAGAGTGAACATTTTATCCGGTTGCACGGCTGTGATAT
ATCATTATTTCCACTTGAGATTAAAGGTAAAGAGTCAATATCTGATGTATATAGTTATGAAATAAAATGCTTTAGTAGGA
CCGACCACAACTCTCTGGATAAGTTACACGGCACGCGTTTGAGTTGTGAGATCGGGGAACAATACAATTCGTTACCTGCG
CGCTTCATTCATGGTGTGGTCACAAAAGTGATATATAATTATGATAACAGCATGCAACATACCTGCATTATTATCTTACA
ACCAGAGATAGCCGAATTGGCATCTGGTAAAAGAACAAGAGTATGGTCAAATATAAAGCCATCAGATATTGTAAGCACAA
TTCTGGAAGACAGTTTATTTAGTCCGCCACAAGTAATTTTATATAAAGAACAGAATTTGCTGGAGTATAAGATCCAGTAC
CAGGAGTCAGATCTTGACTTTATTAATAGAATTTTGTCTGAGGCGGGCATTTACTATTTCTTTGTTCATAATAGAGATAA
ACATATTATGACATTGGCAGATGATCCTGCATCTCATCCTAAAGCATCTTATGATAAACTGGAACACTTACCCTGTGAAA
ATCTGAAGGAATCGCGTCATGGCTATATCAGGGAATGGTCTTTCACAGAAAGTCTCAAATGTCCTTCTGTGACACTTTCA
GGTTATAATGAGAGTAATGTCAGTGAAATTACTATAAAGTCAGAAAGTAAACTGACGGATGTAAAACCCAGTAAGGGAGT
TTATGAAGATATTATCCCAGAGAGAAAGCGTGAGTTGATAAAGAAACGTTCTGAAGCACTGATAGCATCATGTGACAGTG
AACTCAAAATCTGGTCAGGAAAAACGGATTCATGGTGGTTGTCATGCGGGGAGTGTTTCAGTCTGGACAAACAGGATTAC
AGAATTACTTCGTTATATCTTAATGCTGTCAACAATGACAATGAACATGCTGGTTTGTGTTATTGTGATTTGTGTGCTTC
AGATAATAAAAGCTCACTTAATTTCAGCCGTGAATTTAAAACACCTATTATTCCCGGTGTTTTACTTGCCAGAGTGGTCG
GTCCTGACTCCGAAGAATATTATACAGATGATAATGGGCGTGTAAAAATCAGCTTTCTCTGGGGAGAAAAGTCAGCGGCA
GGTACAGATAAAACCTCCTGTTGGGTCCGTGTATCTCAGGTATGGTCCGGGGAGGGATTCGGTAGTCAGTTTATTCCTCG
CATTGGCAGTGAGGTTCTGGTTAGTTTTATACAGGGTAATCCTGACTATCCAGTTATCGTTGGTACAGTGTATAATGGTC
AGAATACATCGCCTTTTTCTCTTCCTGAGAATAATTGTAAGTCTGGATTTATCACCCGTAGTGTTAAGAATGGTAAAAAA
GGAGAAGGTCATCAACTGGTTTTTGATGATAAAGAAGGTGAGGAAAAAGTTATAGTGACTTCATCTGGTGATTCTCTCCT
GACTGTAAAGAAGGATATGATCAGCACTATTAACCGCTCTATGTCGTTAACTATTGCTGAAGGGCGTAATGCTGAAATAA
AAAGAGGGAATGACAGGCTTGTTCTTAAAGAAGGTGACCTACATAATGATGTACATGGTAATATTAATATAAAAGTTTCG
AACGGAGATTATAATCTTAAAGTGAGCGGGGGAAGTGGTAGTCTTATCACCGATAAGAATCTTACTCTTGAATCAACACA
GTCAATAAAAATAAAAGTGGGTGCAAATGAGATAACGATATCCACATCAGGTATAGATATAAAAGCCGCTAAAATTACTA
TAGAAGGGCAGGTATCGGCAGAGGTCAAAGCGGCCACACTTAAGTTTGAAAGCCAGGCTATTTCTGAGGTAAAAGGCACC
ATGTTGACGCTTCAGGGCTCTGCAATGACACAGATAAAAGGTGGAATCGTGAATATAGGATAA
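As a consistency check between the nucleotide and protein records, the coding sequence can be translated and compared with the protein FASTA. The sketch below is one way to do that, assuming the standard genetic code (for sense and stop codons this matches the bacterial table 11); `translate` is a hypothetical helper, and only the first 30 and last 15 nucleotides of the sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()   # drop FASTA line breaks and whitespace
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

five_prime = "ATGACTAGGCAAAGATTTATATCTGACGAT"   # first 30 nt of the record
three_prime = "GTGAATATAGGATAA"                 # last 15 nt of the record
print(translate(five_prime))    # MTRQRFISDD
print(translate(three_prime))   # VNIG*
```

Both fragments agree with the start and end of the protein sequence listed above. Passing the full 1983 bp sequence to the same function would give the complete comparison.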