Detailed information of component protein
Summary
External database links
Locus tag (Gene) | H1D38_RS04435 |
Coordinate (Strand) | 929314..931839 (-) |
NCBI ID | 446977247 |
RefSeq | NZ_JACEFV010000001 |
Uniprot ID | D3GUW4_ECO44, B7LG62_ECO55 |
KEGG ID | eck:EC55989_3331, elo:EC042_4533 |
PDB ID | - |
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
Phage_GPD | PF05954.13 | 2.9e-59 | 39..360 |
DUF2345 | PF10106.11 | 2.3e-46 | 616..763 |
T6SS_Vgr | PF13296.8 | 1.1e-34 | 496..598 |
Phage_base_V | PF04717.14 | 1.8e-07 | 406..477 |
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Outside | 1-841 | MNLTDSLQNVLSGVGTLNRYRLDIPSCPSSLDVEDFSGTEGISKIYRYDIIFTSTDRYPDAAWFLRKSATLIMGTGLLESLTEQKKVHGVITDFRRISGSEDQAQYRITLKPFISLLDKQFRTHRFFVNKSVPEVVEQILTEHGLKGWEYEFSLKRTYPKREQINQYQESDLRFIQRLLAEVGIFYFFTLHPDAQTEVIHFGDVQAALIFDKTLPVNSPSGMSDSGTDSVWALNVEHRVVESRVITNDYNHREAQNILMSVPADMTRGEGYDTSYGEVYHYRPRHTERGDKIDPAPETANFWARLDHERFLAEQTRITGKSTDASLLPAQVLTITDSTPPVLPAPLQEPVLLTQLLFTGSRKSALQVMLSAVPYSEIVCWRPPLLTRPKITGTMTARVTSAKEGDIYAWQDASGMYRVKFDADRDDKNPGQESMPVRLAKPYSGDAYGFHFPLIQGTEVAIAFEEGDPDRPYIAHALHDSRHVDHVTDKNGTRNVIRTPANNKLRMEDKRGEEHIKLSTEYGGKTQLNLGHNVDASRELRGEGAELRTDDWISIRGGKGIFISADMQPQAQGKMLDMDEAIRQLEQALSLARSMAKAATAANATQGDISCQQRLNASLTDLTAPGMLLHAPDGIGMVSARALRIASGSESVGIMSGDNTDITAGQSFTVVAEGAVSLLSRNQGMQLLAAKGRVNIQAQSDDLSMSSQQNLDIQSSEGKVTVSANQELILACGGAYIKLSGGNIELGCPGQILLKSTNMQKMGPTSLDIASVEMPRGFGGGFILTDEAGVPQPSTPYRLTTAEGDILQGITDENGKTAPVNTSIPSVVKVEFGKVKIHGETE |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9745 | - | - |
Protein sequence: 842 a.a.
>T6CP016075 NZ_JACEFV010000001:c931839-929314 [Escherichia coli] [TssI]
MNLTDSLQNVLSGVGTLNRYRLDIPSCPSSLDVEDFSGTEGISKIYRYDIIFTSTDRYPDAAWFLRKSATLIMGTGLLES
LTEQKKVHGVITDFRRISGSEDQAQYRITLKPFISLLDKQFRTHRFFVNKSVPEVVEQILTEHGLKGWEYEFSLKRTYPK
REQINQYQESDLRFIQRLLAEVGIFYFFTLHPDAQTEVIHFGDVQAALIFDKTLPVNSPSGMSDSGTDSVWALNVEHRVV
ESRVITNDYNHREAQNILMSVPADMTRGEGYDTSYGEVYHYRPRHTERGDKIDPAPETANFWARLDHERFLAEQTRITGK
STDASLLPAQVLTITDSTPPVLPAPLQEPVLLTQLLFTGSRKSALQVMLSAVPYSEIVCWRPPLLTRPKITGTMTARVTS
AKEGDIYAWQDASGMYRVKFDADRDDKNPGQESMPVRLAKPYSGDAYGFHFPLIQGTEVAIAFEEGDPDRPYIAHALHDS
RHVDHVTDKNGTRNVIRTPANNKLRMEDKRGEEHIKLSTEYGGKTQLNLGHNVDASRELRGEGAELRTDDWISIRGGKGI
FISADMQPQAQGKMLDMDEAIRQLEQALSLARSMAKAATAANATQGDISCQQRLNASLTDLTAPGMLLHAPDGIGMVSAR
ALRIASGSESVGIMSGDNTDITAGQSFTVVAEGAVSLLSRNQGMQLLAAKGRVNIQAQSDDLSMSSQQNLDIQSSEGKVT
VSANQELILACGGAYIKLSGGNIELGCPGQILLKSTNMQKMGPTSLDIASVEMPRGFGGGFILTDEAGVPQPSTPYRLTT
AEGDILQGITDENGKTAPVNTSIPSVVKVEFGKVKIHGETE*
MNLTDSLQNVLSGVGTLNRYRLDIPSCPSSLDVEDFSGTEGISKIYRYDIIFTSTDRYPDAAWFLRKSATLIMGTGLLES
LTEQKKVHGVITDFRRISGSEDQAQYRITLKPFISLLDKQFRTHRFFVNKSVPEVVEQILTEHGLKGWEYEFSLKRTYPK
REQINQYQESDLRFIQRLLAEVGIFYFFTLHPDAQTEVIHFGDVQAALIFDKTLPVNSPSGMSDSGTDSVWALNVEHRVV
ESRVITNDYNHREAQNILMSVPADMTRGEGYDTSYGEVYHYRPRHTERGDKIDPAPETANFWARLDHERFLAEQTRITGK
STDASLLPAQVLTITDSTPPVLPAPLQEPVLLTQLLFTGSRKSALQVMLSAVPYSEIVCWRPPLLTRPKITGTMTARVTS
AKEGDIYAWQDASGMYRVKFDADRDDKNPGQESMPVRLAKPYSGDAYGFHFPLIQGTEVAIAFEEGDPDRPYIAHALHDS
RHVDHVTDKNGTRNVIRTPANNKLRMEDKRGEEHIKLSTEYGGKTQLNLGHNVDASRELRGEGAELRTDDWISIRGGKGI
FISADMQPQAQGKMLDMDEAIRQLEQALSLARSMAKAATAANATQGDISCQQRLNASLTDLTAPGMLLHAPDGIGMVSAR
ALRIASGSESVGIMSGDNTDITAGQSFTVVAEGAVSLLSRNQGMQLLAAKGRVNIQAQSDDLSMSSQQNLDIQSSEGKVT
VSANQELILACGGAYIKLSGGNIELGCPGQILLKSTNMQKMGPTSLDIASVEMPRGFGGGFILTDEAGVPQPSTPYRLTT
AEGDILQGITDENGKTAPVNTSIPSVVKVEFGKVKIHGETE*
Nucleotide sequence: 2526 bp
>T6CP016075 NZ_JACEFV010000001:c931839-929314 [Escherichia coli] [TssI]
ATGAATCTCACTGACTCCCTGCAAAATGTTTTATCCGGGGTGGGAACCCTGAACCGCTACAGACTGGATATACCTTCCTG
CCCGTCATCGCTGGATGTTGAAGATTTCAGCGGAACTGAAGGCATAAGTAAGATTTACCGGTATGACATCATTTTTACCA
GTACGGACAGGTATCCTGATGCTGCCTGGTTCCTGCGTAAATCCGCAACCCTGATAATGGGAACCGGATTACTGGAAAGC
CTGACTGAACAGAAAAAAGTGCACGGTGTTATTACTGATTTCAGACGTATTTCTGGCTCTGAGGACCAGGCACAATACCG
GATCACACTTAAACCTTTCATTTCCCTGCTGGATAAACAGTTCCGTACACACCGTTTTTTCGTTAATAAATCAGTGCCTG
AGGTGGTGGAGCAAATCCTGACGGAACACGGGCTGAAAGGCTGGGAGTATGAGTTCAGCCTCAAAAGGACATATCCGAAG
CGCGAACAGATTAACCAGTATCAGGAAAGCGATCTGAGATTTATCCAGCGCCTGCTGGCAGAAGTGGGCATTTTTTACTT
CTTTACGTTGCACCCGGATGCGCAGACAGAGGTGATCCACTTCGGTGACGTCCAGGCGGCGTTGATATTTGATAAAACAC
TGCCCGTTAACAGCCCGTCCGGTATGAGTGACAGTGGCACAGACTCAGTATGGGCATTGAATGTGGAGCACCGGGTGGTG
GAAAGCCGTGTTATCACGAACGACTATAACCACCGCGAGGCCCAGAACATACTGATGTCCGTACCGGCGGACATGACCCG
TGGAGAAGGTTACGACACCAGTTATGGCGAGGTTTATCATTACCGGCCACGCCATACAGAAAGGGGCGATAAAATTGACC
CTGCGCCGGAGACGGCTAACTTCTGGGCCCGGCTCGATCACGAGCGTTTTCTGGCTGAGCAAACACGTATCACAGGTAAA
AGCACCGATGCCAGTCTGTTACCGGCACAGGTGCTGACTATTACTGACAGTACTCCACCAGTGTTGCCTGCACCGTTACA
GGAGCCGGTTCTGCTCACGCAGCTTCTGTTCACCGGTAGCCGGAAATCTGCGCTACAGGTGATGCTGAGTGCTGTTCCTT
ACAGCGAGATTGTCTGCTGGCGACCACCGTTACTTACTCGTCCGAAAATCACCGGGACCATGACGGCCCGTGTGACCAGT
GCAAAAGAAGGGGATATCTACGCCTGGCAGGATGCTTCAGGCATGTACCGGGTGAAATTTGATGCTGACCGTGATGATAA
AAATCCGGGACAGGAAAGTATGCCGGTACGTCTGGCCAAACCCTACAGTGGTGATGCTTATGGTTTTCATTTTCCGCTGA
TTCAGGGAACGGAAGTGGCCATTGCTTTTGAGGAAGGTGATCCGGATCGTCCCTATATAGCGCATGCCCTGCATGATTCA
CGTCATGTTGATCACGTTACCGATAAGAACGGTACGCGCAATGTTATCCGTACGCCGGCAAATAACAAGTTGCGGATGGA
GGACAAACGCGGTGAGGAGCATATCAAGCTCAGTACCGAGTATGGCGGTAAGACACAACTTAATCTTGGCCATAACGTTG
ATGCCTCCCGGGAACTCAGGGGAGAAGGCGCTGAGCTGAGGACTGATGACTGGATAAGTATCCGGGGAGGAAAAGGAATA
TTTATCAGTGCAGATATGCAGCCTCAGGCACAAGGAAAGATGCTGGATATGGATGAAGCTATCCGCCAATTGGAGCAGGC
ATTATCTCTGGCCCGAAGTATGGCGAAGGCCGCAACTGCTGCAAATGCTACTCAGGGAGATATAAGTTGTCAGCAACGGT
TGAATGCATCACTAACAGATCTCACTGCGCCGGGAATGCTGTTGCATGCTCCTGATGGGATTGGCATGGTTAGTGCGCGG
GCGTTACGAATTGCTTCAGGTAGTGAAAGCGTGGGCATAATGTCGGGAGATAACACTGATATTACTGCCGGGCAGTCATT
TACAGTTGTAGCAGAAGGTGCTGTTAGCCTGTTGAGCAGAAACCAGGGGATGCAACTGTTAGCAGCAAAAGGGCGGGTAA
ATATTCAGGCTCAAAGTGACGATTTGTCAATGAGTTCCCAACAGAATCTTGATATTCAAAGTTCTGAGGGCAAAGTGACT
GTTTCTGCTAATCAGGAACTGATACTGGCATGTGGTGGAGCTTATATCAAACTAAGCGGTGGAAATATTGAGCTGGGATG
CCCGGGACAGATTCTGTTAAAAAGCACGAATATGCAGAAGATGGGACCAACAAGTCTTGATATTGCGTCTGTTGAAATGC
CGCGAGGGTTTGGGGGCGGTTTTATTCTTACCGATGAAGCGGGAGTGCCGCAACCTTCGACGCCTTATCGATTAACAACG
GCTGAAGGTGATATTCTACAGGGGATCACCGATGAAAATGGTAAAACAGCGCCAGTCAACACCTCCATTCCGTCAGTAGT
AAAAGTTGAGTTTGGGAAGGTAAAAATTCATGGAGAAACAGAATGA
ATGAATCTCACTGACTCCCTGCAAAATGTTTTATCCGGGGTGGGAACCCTGAACCGCTACAGACTGGATATACCTTCCTG
CCCGTCATCGCTGGATGTTGAAGATTTCAGCGGAACTGAAGGCATAAGTAAGATTTACCGGTATGACATCATTTTTACCA
GTACGGACAGGTATCCTGATGCTGCCTGGTTCCTGCGTAAATCCGCAACCCTGATAATGGGAACCGGATTACTGGAAAGC
CTGACTGAACAGAAAAAAGTGCACGGTGTTATTACTGATTTCAGACGTATTTCTGGCTCTGAGGACCAGGCACAATACCG
GATCACACTTAAACCTTTCATTTCCCTGCTGGATAAACAGTTCCGTACACACCGTTTTTTCGTTAATAAATCAGTGCCTG
AGGTGGTGGAGCAAATCCTGACGGAACACGGGCTGAAAGGCTGGGAGTATGAGTTCAGCCTCAAAAGGACATATCCGAAG
CGCGAACAGATTAACCAGTATCAGGAAAGCGATCTGAGATTTATCCAGCGCCTGCTGGCAGAAGTGGGCATTTTTTACTT
CTTTACGTTGCACCCGGATGCGCAGACAGAGGTGATCCACTTCGGTGACGTCCAGGCGGCGTTGATATTTGATAAAACAC
TGCCCGTTAACAGCCCGTCCGGTATGAGTGACAGTGGCACAGACTCAGTATGGGCATTGAATGTGGAGCACCGGGTGGTG
GAAAGCCGTGTTATCACGAACGACTATAACCACCGCGAGGCCCAGAACATACTGATGTCCGTACCGGCGGACATGACCCG
TGGAGAAGGTTACGACACCAGTTATGGCGAGGTTTATCATTACCGGCCACGCCATACAGAAAGGGGCGATAAAATTGACC
CTGCGCCGGAGACGGCTAACTTCTGGGCCCGGCTCGATCACGAGCGTTTTCTGGCTGAGCAAACACGTATCACAGGTAAA
AGCACCGATGCCAGTCTGTTACCGGCACAGGTGCTGACTATTACTGACAGTACTCCACCAGTGTTGCCTGCACCGTTACA
GGAGCCGGTTCTGCTCACGCAGCTTCTGTTCACCGGTAGCCGGAAATCTGCGCTACAGGTGATGCTGAGTGCTGTTCCTT
ACAGCGAGATTGTCTGCTGGCGACCACCGTTACTTACTCGTCCGAAAATCACCGGGACCATGACGGCCCGTGTGACCAGT
GCAAAAGAAGGGGATATCTACGCCTGGCAGGATGCTTCAGGCATGTACCGGGTGAAATTTGATGCTGACCGTGATGATAA
AAATCCGGGACAGGAAAGTATGCCGGTACGTCTGGCCAAACCCTACAGTGGTGATGCTTATGGTTTTCATTTTCCGCTGA
TTCAGGGAACGGAAGTGGCCATTGCTTTTGAGGAAGGTGATCCGGATCGTCCCTATATAGCGCATGCCCTGCATGATTCA
CGTCATGTTGATCACGTTACCGATAAGAACGGTACGCGCAATGTTATCCGTACGCCGGCAAATAACAAGTTGCGGATGGA
GGACAAACGCGGTGAGGAGCATATCAAGCTCAGTACCGAGTATGGCGGTAAGACACAACTTAATCTTGGCCATAACGTTG
ATGCCTCCCGGGAACTCAGGGGAGAAGGCGCTGAGCTGAGGACTGATGACTGGATAAGTATCCGGGGAGGAAAAGGAATA
TTTATCAGTGCAGATATGCAGCCTCAGGCACAAGGAAAGATGCTGGATATGGATGAAGCTATCCGCCAATTGGAGCAGGC
ATTATCTCTGGCCCGAAGTATGGCGAAGGCCGCAACTGCTGCAAATGCTACTCAGGGAGATATAAGTTGTCAGCAACGGT
TGAATGCATCACTAACAGATCTCACTGCGCCGGGAATGCTGTTGCATGCTCCTGATGGGATTGGCATGGTTAGTGCGCGG
GCGTTACGAATTGCTTCAGGTAGTGAAAGCGTGGGCATAATGTCGGGAGATAACACTGATATTACTGCCGGGCAGTCATT
TACAGTTGTAGCAGAAGGTGCTGTTAGCCTGTTGAGCAGAAACCAGGGGATGCAACTGTTAGCAGCAAAAGGGCGGGTAA
ATATTCAGGCTCAAAGTGACGATTTGTCAATGAGTTCCCAACAGAATCTTGATATTCAAAGTTCTGAGGGCAAAGTGACT
GTTTCTGCTAATCAGGAACTGATACTGGCATGTGGTGGAGCTTATATCAAACTAAGCGGTGGAAATATTGAGCTGGGATG
CCCGGGACAGATTCTGTTAAAAAGCACGAATATGCAGAAGATGGGACCAACAAGTCTTGATATTGCGTCTGTTGAAATGC
CGCGAGGGTTTGGGGGCGGTTTTATTCTTACCGATGAAGCGGGAGTGCCGCAACCTTCGACGCCTTATCGATTAACAACG
GCTGAAGGTGATATTCTACAGGGGATCACCGATGAAAATGGTAAAACAGCGCCAGTCAACACCTCCATTCCGTCAGTAGT
AAAAGTTGAGTTTGGGAAGGTAAAAATTCATGGAGAAACAGAATGA