Detailed information of component protein
Summary
External database links
Field | Value |
---|---|
Locus tag (Gene) | A364_RS23055 |
Coordinate (Strand) | 32490..34631 (+) |
NCBI ID | 487471238 |
RefSeq | NZ_AOGL01000004 |
Uniprot ID | A0A2X6ZP85_ECOLX |
KEGG ID | - |
PDB ID | - |
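The coordinate field can be cross-checked against the sequence lengths reported further down in this record. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which matches the 2142 bp nucleotide sequence listed for this entry); the helper name `parse_coordinate` is hypothetical and not part of any database API.

```python
import re

def parse_coordinate(field: str):
    """Parse a 'start..end (strand)' field into (start, end, strand)."""
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", field)
    if m is None:
        raise ValueError(f"unrecognised coordinate field: {field!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)

start, end, strand = parse_coordinate("32490..34631 (+)")

# Assumption: coordinates are 1-based and inclusive, so add 1.
length_bp = end - start + 1
codons = length_bp // 3          # includes the terminal stop codon
residues = codons - 1            # amino acids, excluding the stop

print(length_bp, codons, residues)  # 2142 714 713
```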
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
Phage_GPD | PF05954.13 | 1e-69 | 28..327 |
Phage_base_V | PF04717.14 | 1.2e-14 | 384..450 |
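As an illustration of how the domain table might be consumed programmatically, the sketch below parses the two rows above and reports the length of each aligned region. The `PfamHit` type and `parse_hit` function are hypothetical names introduced for this example, and the rows are assumed to follow the pipe-delimited layout shown in the table.

```python
from typing import NamedTuple

class PfamHit(NamedTuple):
    domain: str
    accession: str
    evalue: float
    start: int
    end: int

def parse_hit(row: str) -> PfamHit:
    """Parse one 'Domain | Pfam ID | E-value | start..end |' row."""
    cells = [c.strip() for c in row.strip().strip("|").split("|")]
    domain, accession, evalue, region = cells
    start, end = (int(x) for x in region.split(".."))
    return PfamHit(domain, accession, float(evalue), start, end)

rows = [
    "Phage_GPD | PF05954.13 | 1e-69 | 28..327 |",
    "Phage_base_V | PF04717.14 | 1.2e-14 | 384..450 |",
]
hits = [parse_hit(r) for r in rows]

for h in hits:
    # Aligned regions are treated as inclusive residue ranges.
    print(h.domain, h.end - h.start + 1)
# Phage_GPD 300
# Phage_base_V 67
```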
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Outside | 1-713 | MSTGLRFTLEVDGLPPDAFAVVSFHLNQSLSSLFSLDLSLVSQQFLSLEFQQILDKMAYLTIWQGDDVQRRVKGVVTWFELGENDKNQKLYSMKVCPPLWRTGLRQNFRIFQNEDIESILGTILQENGVTEWSPLFSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHAQKSTDQSLVLCDTVRYLPESFEIPWNPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQYQDYQRTQYEVYDYPGRFKGAHGQNFARWQMDGWRNNAEVARGTSRSPEIWPGRQIVLTGHPQANLNREWQVVASDLHGEQPQAVPGRRGSGTTLDNHFAVIPADRTWRPQPLLKPLVDGPQSAVVTGPAGEEIFCDEHGRVRVKFNWDRYNPSNQDSSCWIRVAQAWAGTGFGNLAIPRVGQEVIVDFLNGDPDQPIIMGRTYHQENRTPGSLPGTKTQMTIRSKTYKGSGFNELKFDDATGKEQVYIHAQKNMNTEVLNNRTTDVINNHAEKIGNNQAITVTNNQIQNIGVNQIQTVGVNQVETVGSNQIIKVGSNQVEKVGIIRALTVGVAYQTTVGGIMNTSVALLQSSQVGLHKSLMVGMGYSVNVGNNVTFSVGKTMKENTGQTAVYSAGEHLELCCGKARLVLTKDGSIFLNGTHIHLEGESDVNGDAPVINWNCGATQPVPDAPVPKDLPPGMPDMRQF |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9976 | - | - |
Protein sequence: 713 a.a. (714 characters including the terminal stop symbol *)
>T6CP016056 NZ_AOGL01000004:32490-34631 [Escherichia coli SEPT362] [TssI]
MSTGLRFTLEVDGLPPDAFAVVSFHLNQSLSSLFSLDLSLVSQQFLSLEFQQILDKMAYLTIWQGDDVQRRVKGVVTWFE
LGENDKNQKLYSMKVCPPLWRTGLRQNFRIFQNEDIESILGTILQENGVTEWSPLFSEPHPSREFCVQYGETDYDFLCRM
AAEEGIFFYEEHAQKSTDQSLVLCDTVRYLPESFEIPWNPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGR
FDQEGQYQDYQRTQYEVYDYPGRFKGAHGQNFARWQMDGWRNNAEVARGTSRSPEIWPGRQIVLTGHPQANLNREWQVVA
SDLHGEQPQAVPGRRGSGTTLDNHFAVIPADRTWRPQPLLKPLVDGPQSAVVTGPAGEEIFCDEHGRVRVKFNWDRYNPS
NQDSSCWIRVAQAWAGTGFGNLAIPRVGQEVIVDFLNGDPDQPIIMGRTYHQENRTPGSLPGTKTQMTIRSKTYKGSGFN
ELKFDDATGKEQVYIHAQKNMNTEVLNNRTTDVINNHAEKIGNNQAITVTNNQIQNIGVNQIQTVGVNQVETVGSNQIIK
VGSNQVEKVGIIRALTVGVAYQTTVGGIMNTSVALLQSSQVGLHKSLMVGMGYSVNVGNNVTFSVGKTMKENTGQTAVYS
AGEHLELCCGKARLVLTKDGSIFLNGTHIHLEGESDVNGDAPVINWNCGATQPVPDAPVPKDLPPGMPDMRQF*
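The FASTA header above packs several fields into one line. The sketch below shows one way to split it and to count residues without the trailing `*`; the header layout is inferred from this single record, so the regular expression and the names `parse_header` and `residue_count` are assumptions rather than a documented format.

```python
import re

# Assumed layout: >ID REPLICON:START-END [Organism] [Component]
HEADER_RE = re.compile(
    r"^>(?P<id>\S+)\s+(?P<replicon>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)"
    r"\s+\[(?P<organism>[^\]]+)\]\s+\[(?P<component>[^\]]+)\]$"
)

def parse_header(line: str) -> dict:
    """Split a FASTA header of the assumed layout into named fields."""
    m = HEADER_RE.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = m.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields

def residue_count(seq_lines) -> int:
    """Count residues across wrapped lines, ignoring a trailing stop '*'."""
    seq = "".join(line.strip() for line in seq_lines)
    return len(seq.rstrip("*"))

header = (">T6CP016056 NZ_AOGL01000004:32490-34631 "
          "[Escherichia coli SEPT362] [TssI]")
info = parse_header(header)
print(info["id"], info["component"])     # T6CP016056 TssI
print(residue_count(["MSTGL", "RQF*"]))  # 8 (toy two-line input)
```

Applied to the full sequence block, `residue_count` would exclude the final `*`, which is consistent with the 1-713 range shown in the transmembrane helix table.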
Nucleotide sequence: 2142 bp
>T6CP016056 NZ_AOGL01000004:32490-34631 [Escherichia coli SEPT362] [TssI]
ATGTCAACCGGATTACGTTTCACGCTGGAAGTGGACGGCCTGCCACCGGATGCTTTTGCGGTGGTCTCCTTTCATCTGAA
CCAGTCACTCTCTTCGCTTTTTTCCCTCGATCTCTCTCTGGTCAGCCAGCAGTTTCTCTCCCTTGAATTCCAGCAGATCC
TCGACAAAATGGCCTACCTGACGATATGGCAGGGCGATGACGTACAGCGCCGGGTGAAAGGTGTGGTGACCTGGTTTGAA
CTGGGGGAGAACGACAAAAACCAGAAGCTGTACAGCATGAAGGTATGCCCGCCGCTGTGGCGCACAGGGCTGCGCCAGAA
CTTCCGTATCTTCCAGAATGAGGACATCGAAAGCATCCTCGGCACGATATTGCAGGAAAACGGGGTGACCGAGTGGAGCC
CGCTGTTCAGCGAGCCACATCCTTCCCGTGAGTTTTGTGTCCAGTACGGTGAGACTGATTACGATTTCCTGTGCCGGATG
GCGGCGGAGGAAGGCATCTTCTTTTATGAGGAGCACGCGCAAAAAAGTACCGACCAAAGCCTGGTCCTGTGCGATACCGT
GCGTTATCTGCCGGAGTCCTTTGAGATCCCCTGGAATCCGAACACCCGTACCGAGGTAAGCACCCTCTGCATCAGCCAGT
TTCGCTACAGCGCACAAATCCGCCCTTCTTCCGTGGTGACCAAAGACTACACCTTTAAACGACCCGGCTGGGCAGGGCGT
TTTGATCAGGAAGGCCAGTACCAGGATTACCAGCGCACACAGTATGAAGTGTATGACTACCCCGGACGTTTCAAGGGCGC
CCACGGGCAGAACTTTGCCCGCTGGCAGATGGATGGCTGGCGCAACAACGCAGAAGTGGCGCGCGGAACAAGCCGTTCGC
CGGAGATATGGCCGGGACGGCAAATTGTGCTGACGGGGCATCCGCAGGCGAACCTGAACCGGGAATGGCAGGTGGTGGCG
AGCGATCTTCACGGCGAACAGCCGCAGGCGGTACCGGGACGCAGGGGTTCAGGCACCACGCTGGATAACCATTTTGCGGT
AATACCGGCAGACAGAACATGGCGACCACAGCCGTTGCTGAAACCGCTGGTGGACGGCCCGCAGAGCGCCGTCGTGACGG
GACCGGCAGGCGAGGAAATCTTCTGTGATGAACATGGCCGCGTGCGGGTGAAATTTAACTGGGACCGTTATAACCCGTCA
AACCAGGACAGTTCATGCTGGATCCGTGTGGCACAGGCGTGGGCAGGCACCGGATTCGGTAACCTTGCGATACCGCGCGT
GGGTCAGGAGGTGATTGTGGACTTCCTCAACGGCGATCCGGACCAGCCGATCATTATGGGGCGCACCTACCACCAGGAAA
ACCGCACACCCGGCAGCCTGCCGGGAACAAAGACGCAGATGACCATTCGTTCGAAAACCTATAAGGGCAGCGGGTTTAAT
GAACTGAAGTTTGACGATGCGACAGGGAAAGAACAGGTCTACATCCACGCGCAGAAGAACATGAACACCGAGGTGCTGAA
TAACCGCACCACGGATGTGATAAACAACCATGCCGAAAAAATAGGTAACAACCAGGCGATCACCGTTACCAATAACCAGA
TCCAGAACATTGGCGTTAATCAGATACAGACGGTTGGTGTCAACCAGGTGGAAACGGTAGGCAGTAACCAGATTATCAAA
GTGGGATCAAACCAGGTTGAAAAGGTGGGGATCATTCGTGCGCTGACGGTGGGTGTGGCGTATCAGACGACGGTAGGCGG
CATTATGAATACCTCGGTGGCGTTGTTGCAGTCCTCACAGGTAGGGCTGCATAAATCACTGATGGTGGGAATGGGCTACA
GCGTCAATGTAGGGAATAACGTCACCTTCTCGGTGGGCAAGACGATGAAGGAAAACACCGGACAAACAGCAGTTTATTCT
GCCGGTGAGCATCTTGAACTCTGCTGTGGTAAGGCAAGGCTGGTGCTGACGAAGGACGGAAGCATATTTCTTAATGGTAC
GCACATTCATCTGGAAGGGGAGTCGGATGTGAACGGTGATGCGCCGGTGATTAACTGGAACTGTGGTGCCACACAACCTG
TACCGGATGCGCCTGTGCCGAAAGATTTACCCCCCGGAATGCCGGATATGCGGCAATTTTGA
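The nucleotide and protein sequences in this record can be checked against each other by translating the coding sequence. The following is a minimal sketch using the standard genetic code (the codon assignments are the same in the bacterial table); the `translate` function is written for this example and only short excerpts of the sequence are used to keep it compact.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string; stop codons become '*'."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

# First 30 and last 12 nucleotides of the sequence above.
print(translate("ATGTCAACCGGATTACGTTTCACGCTGGAA"))  # MSTGLRFTLE
print(translate("CGGCAATTTTGA"))                    # RQF*
```

Both excerpts agree with the start (`MSTGLRFTLE`) and end (`RQF*`) of the protein sequence listed in this entry.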