Detailed information of component protein
Summary
External database links
Locus tag (Gene) | BPS_RS26920 |
Coordinate (Strand) | 2049856..2052879 (+) |
NCBI ID | WP_009940775.1 |
RefSeq | NC_006351 |
Uniprot ID | Q63K62_BURPS |
KEGG ID | bps:BPSS1503 |
PDB ID | - |
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
Phage_GPD | PF05954.13 | 2.4e-73 | 46..349 |
Phage_base_V | PF04717.14 | 1.7e-06 | 418..480 |
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Outside | 1-1007 | MPSSPRHDAPASRANAAPSANARRFTFASDAYDPATFDVVDINGRDAISQPYRFEITLVSRQLRIDFAKMLSRGATLAILPPFGEAGTTRYAGVLAEFEQKKRFRDFTVYRATLVPRLWRLSLYKASDVYLNEQTIPDIVKRVLRAASFGSRDFRMRHGGGYRKRSFVCQYDESHLDFVSRWMEKEGLYYYFEHDGRHETLVIVDDRRHQPGPADDLALRYLPATSLDAGIEADRVQAFTCRATPLPREVVLRDFNHRKAELSLEVRERVARDGVGERVSSDEHFHTKDEGQRYAKLRAEALVCEGRRFAGESTAAGLRAGRFFALSGHYRQDFDGRYLVTALTHRGSQAHLLFPDLDAPFGATPGEPIYRAEFEAIAADLQYRPPRTTPKPRAAGVVSAIVDGEGGGKLAELDEYGQYKVRFPFAHTAHPANKASARIRMATPYAGDDRGMHLPLLKRTEVKIAFDGGDPDRPVIVGAVPNSSHRSVVTRRNPAEHRILTEHNQLYMKDGSGAATWLHAPNNHIGIGAVGPGDGLALLTSGNKFDFSLGNAYSFSGGLKCSVSMGGNTDVYVGVRNSLDVSANFLTTLQGNLRWMLPGSRSFEINDSASTLLQTLHKQSATGAIRLSAGQDASALLQKQLDKLKGTVRKFMIVSGLANAGSAAAAAGLIKGGGKLADLPWAGFGISAAQFAGATGVSTALMATSRTLLANVAKLQEALPLVADLSLGKQGIALAAKNLTHATRMSLTVDGVSWSTHAKGPGAAGAAMSVGKGRWGVEAAKHAHVHASDTLLFAVPADPTTQFDLKDLIGLRRDLDECMKDIADLEADISENEVLSTDQNTFGVSALIPTPPSPAGALAAVAIKVKQAKLVELKAKQKLVALKVDNLQQKFAKHVQHLSAVRMSASDAQLGFKGNRLVATADGVTLAHAQGKAKLDVREAKIGVTAGKSSVELDEGKIAAGCGSASLKLGSDGAIDVSATDVKLNGTNVKLNGSASLKFDGQLIQLG |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.8969 | - | - |
Protein sequence: 1008 a.a.
>T6CP000669 NC_006351:2049856-2052879 [Burkholderia pseudomallei K96243] [TssI]
MPSSPRHDAPASRANAAPSANARRFTFASDAYDPATFDVVDINGRDAISQPYRFEITLVSRQLRIDFAKMLSRGATLAIL
PPFGEAGTTRYAGVLAEFEQKKRFRDFTVYRATLVPRLWRLSLYKASDVYLNEQTIPDIVKRVLRAASFGSRDFRMRHGG
GYRKRSFVCQYDESHLDFVSRWMEKEGLYYYFEHDGRHETLVIVDDRRHQPGPADDLALRYLPATSLDAGIEADRVQAFT
CRATPLPREVVLRDFNHRKAELSLEVRERVARDGVGERVSSDEHFHTKDEGQRYAKLRAEALVCEGRRFAGESTAAGLRA
GRFFALSGHYRQDFDGRYLVTALTHRGSQAHLLFPDLDAPFGATPGEPIYRAEFEAIAADLQYRPPRTTPKPRAAGVVSA
IVDGEGGGKLAELDEYGQYKVRFPFAHTAHPANKASARIRMATPYAGDDRGMHLPLLKRTEVKIAFDGGDPDRPVIVGAV
PNSSHRSVVTRRNPAEHRILTEHNQLYMKDGSGAATWLHAPNNHIGIGAVGPGDGLALLTSGNKFDFSLGNAYSFSGGLK
CSVSMGGNTDVYVGVRNSLDVSANFLTTLQGNLRWMLPGSRSFEINDSASTLLQTLHKQSATGAIRLSAGQDASALLQKQ
LDKLKGTVRKFMIVSGLANAGSAAAAAGLIKGGGKLADLPWAGFGISAAQFAGATGVSTALMATSRTLLANVAKLQEALP
LVADLSLGKQGIALAAKNLTHATRMSLTVDGVSWSTHAKGPGAAGAAMSVGKGRWGVEAAKHAHVHASDTLLFAVPADPT
TQFDLKDLIGLRRDLDECMKDIADLEADISENEVLSTDQNTFGVSALIPTPPSPAGALAAVAIKVKQAKLVELKAKQKLV
ALKVDNLQQKFAKHVQHLSAVRMSASDAQLGFKGNRLVATADGVTLAHAQGKAKLDVREAKIGVTAGKSSVELDEGKIAA
GCGSASLKLGSDGAIDVSATDVKLNGTNVKLNGSASLKFDGQLIQLG*
MPSSPRHDAPASRANAAPSANARRFTFASDAYDPATFDVVDINGRDAISQPYRFEITLVSRQLRIDFAKMLSRGATLAIL
PPFGEAGTTRYAGVLAEFEQKKRFRDFTVYRATLVPRLWRLSLYKASDVYLNEQTIPDIVKRVLRAASFGSRDFRMRHGG
GYRKRSFVCQYDESHLDFVSRWMEKEGLYYYFEHDGRHETLVIVDDRRHQPGPADDLALRYLPATSLDAGIEADRVQAFT
CRATPLPREVVLRDFNHRKAELSLEVRERVARDGVGERVSSDEHFHTKDEGQRYAKLRAEALVCEGRRFAGESTAAGLRA
GRFFALSGHYRQDFDGRYLVTALTHRGSQAHLLFPDLDAPFGATPGEPIYRAEFEAIAADLQYRPPRTTPKPRAAGVVSA
IVDGEGGGKLAELDEYGQYKVRFPFAHTAHPANKASARIRMATPYAGDDRGMHLPLLKRTEVKIAFDGGDPDRPVIVGAV
PNSSHRSVVTRRNPAEHRILTEHNQLYMKDGSGAATWLHAPNNHIGIGAVGPGDGLALLTSGNKFDFSLGNAYSFSGGLK
CSVSMGGNTDVYVGVRNSLDVSANFLTTLQGNLRWMLPGSRSFEINDSASTLLQTLHKQSATGAIRLSAGQDASALLQKQ
LDKLKGTVRKFMIVSGLANAGSAAAAAGLIKGGGKLADLPWAGFGISAAQFAGATGVSTALMATSRTLLANVAKLQEALP
LVADLSLGKQGIALAAKNLTHATRMSLTVDGVSWSTHAKGPGAAGAAMSVGKGRWGVEAAKHAHVHASDTLLFAVPADPT
TQFDLKDLIGLRRDLDECMKDIADLEADISENEVLSTDQNTFGVSALIPTPPSPAGALAAVAIKVKQAKLVELKAKQKLV
ALKVDNLQQKFAKHVQHLSAVRMSASDAQLGFKGNRLVATADGVTLAHAQGKAKLDVREAKIGVTAGKSSVELDEGKIAA
GCGSASLKLGSDGAIDVSATDVKLNGTNVKLNGSASLKFDGQLIQLG*
Nucleotide sequence: 3024 bp
>T6CP000669 NC_006351:2049856-2052879 [Burkholderia pseudomallei K96243] [TssI]
ATGCCTTCGTCCCCCCGCCACGATGCGCCCGCGTCGCGCGCGAACGCCGCGCCGTCCGCGAACGCGCGACGCTTCACGTT
CGCGAGCGACGCGTACGACCCCGCGACGTTCGACGTCGTCGACATCAACGGCCGCGACGCGATCTCGCAGCCGTACCGGT
TCGAGATCACGCTCGTGAGCAGGCAGTTGCGGATCGACTTCGCGAAGATGCTGAGCCGCGGGGCGACGCTCGCGATCCTG
CCGCCGTTCGGCGAGGCCGGCACCACCCGCTATGCCGGCGTGCTCGCCGAATTCGAGCAGAAGAAGCGCTTTCGCGACTT
CACCGTCTATCGCGCGACGCTCGTGCCGCGCCTCTGGCGACTATCGCTGTACAAGGCGTCGGACGTCTATCTGAACGAGC
AGACGATTCCCGACATCGTCAAGCGCGTGCTGCGCGCCGCCTCGTTCGGCAGCCGCGATTTCCGCATGCGGCACGGCGGC
GGCTACCGCAAGCGCAGCTTCGTCTGCCAGTACGACGAGAGCCATCTCGATTTCGTGTCGCGCTGGATGGAGAAGGAAGG
CCTCTACTACTACTTCGAGCATGACGGCCGGCACGAAACGCTCGTGATCGTCGACGACCGCCGCCATCAGCCCGGCCCCG
CCGACGATCTCGCGCTGCGCTACCTACCCGCGACCAGCCTCGACGCGGGCATCGAAGCGGACCGCGTGCAGGCGTTCACA
TGCCGGGCGACGCCGCTGCCGCGCGAAGTCGTGCTGCGCGACTTCAACCACCGCAAGGCGGAGCTCTCGCTCGAAGTCCG
CGAGCGCGTGGCGCGCGACGGCGTCGGCGAGCGGGTGTCGAGCGACGAGCACTTCCACACGAAGGACGAAGGGCAGCGCT
ACGCGAAGCTGCGCGCCGAGGCGCTCGTCTGCGAAGGGCGCCGATTCGCCGGCGAATCGACCGCGGCCGGGCTGCGCGCC
GGCCGCTTCTTCGCGCTGTCGGGCCACTACCGCCAGGATTTCGACGGCCGCTATCTGGTGACGGCGCTCACGCATCGCGG
CTCGCAGGCACACCTGCTGTTTCCCGATCTCGACGCGCCGTTCGGCGCGACGCCGGGCGAGCCCATCTACCGCGCCGAGT
TCGAGGCGATTGCCGCCGACCTCCAGTACCGGCCGCCGCGCACGACGCCCAAGCCGCGCGCGGCGGGCGTCGTCAGCGCG
ATCGTCGACGGCGAGGGCGGCGGCAAGCTCGCCGAGCTCGACGAATACGGCCAGTACAAGGTGCGCTTTCCGTTCGCGCA
CACCGCGCATCCGGCGAACAAGGCCTCCGCGCGCATCCGGATGGCGACGCCCTACGCGGGCGACGACCGCGGCATGCACC
TGCCGCTGCTGAAGCGCACCGAAGTGAAGATCGCATTCGACGGCGGCGATCCGGACCGCCCCGTGATCGTCGGCGCGGTG
CCCAACTCGTCGCACCGCAGCGTCGTCACGCGCCGCAACCCCGCCGAGCATCGCATCCTCACCGAGCACAACCAGCTCTA
CATGAAGGACGGCAGCGGCGCGGCGACGTGGCTGCACGCGCCGAACAACCACATCGGCATCGGCGCGGTCGGGCCGGGCG
ACGGCCTCGCGCTCCTCACGTCCGGCAACAAGTTCGACTTCTCGCTCGGCAACGCGTACAGCTTCTCGGGCGGGCTCAAG
TGCTCGGTGTCGATGGGCGGCAACACCGACGTCTACGTCGGCGTGCGCAACAGCCTCGACGTCAGCGCGAACTTCCTGAC
GACGCTGCAGGGCAACCTGCGCTGGATGCTGCCCGGCAGCCGGAGCTTCGAGATCAACGACAGCGCATCGACACTGCTGC
AGACGCTGCACAAGCAGTCCGCGACGGGCGCGATCCGGCTGTCCGCCGGGCAGGACGCGTCCGCGCTGCTGCAAAAGCAG
CTCGACAAGCTCAAGGGCACGGTGCGCAAGTTCATGATCGTGTCGGGCCTCGCGAACGCCGGGTCCGCGGCCGCCGCCGC
GGGGCTCATCAAGGGCGGCGGCAAGCTCGCCGATCTGCCGTGGGCGGGCTTCGGCATATCCGCCGCGCAGTTCGCCGGCG
CGACCGGCGTCAGCACGGCGCTGATGGCGACCTCGCGCACGCTGCTCGCGAACGTCGCGAAGCTCCAGGAGGCGTTGCCG
CTCGTCGCCGATCTGTCGCTCGGCAAGCAGGGCATCGCGCTCGCCGCGAAGAACCTCACGCACGCGACGCGGATGTCGCT
CACCGTCGACGGCGTCTCGTGGTCGACGCACGCGAAGGGGCCCGGCGCGGCGGGCGCCGCGATGAGCGTCGGCAAGGGCC
GCTGGGGCGTCGAAGCGGCGAAGCACGCGCACGTCCACGCGAGCGACACGCTGCTGTTCGCGGTGCCGGCCGACCCGACG
ACCCAGTTCGACCTGAAGGACCTGATCGGGCTGCGCCGCGATCTCGACGAATGCATGAAGGACATCGCCGATCTCGAAGC
CGACATTTCGGAAAACGAAGTGCTGTCGACCGATCAGAACACGTTCGGCGTCAGCGCGCTCATCCCCACGCCGCCGTCGC
CCGCCGGCGCGCTCGCGGCGGTCGCGATCAAGGTGAAGCAAGCGAAGCTCGTCGAGCTGAAGGCCAAGCAGAAGCTCGTC
GCGCTGAAGGTCGACAACCTGCAGCAGAAGTTCGCGAAGCACGTGCAGCACCTGAGCGCCGTGCGGATGAGCGCTTCCGA
CGCGCAGCTCGGCTTCAAGGGCAACCGGCTCGTCGCGACGGCCGACGGCGTCACGCTCGCGCATGCGCAGGGCAAGGCGA
AGCTCGACGTGCGCGAAGCGAAGATCGGCGTCACGGCGGGCAAATCGAGCGTCGAGCTCGACGAAGGCAAGATCGCGGCC
GGCTGCGGCAGCGCATCGCTGAAGCTCGGCAGCGACGGCGCGATCGACGTGAGCGCGACCGACGTCAAGCTGAACGGCAC
CAACGTCAAGCTGAACGGCAGCGCGTCGCTGAAGTTCGACGGCCAACTGATCCAGCTCGGCTGA
ATGCCTTCGTCCCCCCGCCACGATGCGCCCGCGTCGCGCGCGAACGCCGCGCCGTCCGCGAACGCGCGACGCTTCACGTT
CGCGAGCGACGCGTACGACCCCGCGACGTTCGACGTCGTCGACATCAACGGCCGCGACGCGATCTCGCAGCCGTACCGGT
TCGAGATCACGCTCGTGAGCAGGCAGTTGCGGATCGACTTCGCGAAGATGCTGAGCCGCGGGGCGACGCTCGCGATCCTG
CCGCCGTTCGGCGAGGCCGGCACCACCCGCTATGCCGGCGTGCTCGCCGAATTCGAGCAGAAGAAGCGCTTTCGCGACTT
CACCGTCTATCGCGCGACGCTCGTGCCGCGCCTCTGGCGACTATCGCTGTACAAGGCGTCGGACGTCTATCTGAACGAGC
AGACGATTCCCGACATCGTCAAGCGCGTGCTGCGCGCCGCCTCGTTCGGCAGCCGCGATTTCCGCATGCGGCACGGCGGC
GGCTACCGCAAGCGCAGCTTCGTCTGCCAGTACGACGAGAGCCATCTCGATTTCGTGTCGCGCTGGATGGAGAAGGAAGG
CCTCTACTACTACTTCGAGCATGACGGCCGGCACGAAACGCTCGTGATCGTCGACGACCGCCGCCATCAGCCCGGCCCCG
CCGACGATCTCGCGCTGCGCTACCTACCCGCGACCAGCCTCGACGCGGGCATCGAAGCGGACCGCGTGCAGGCGTTCACA
TGCCGGGCGACGCCGCTGCCGCGCGAAGTCGTGCTGCGCGACTTCAACCACCGCAAGGCGGAGCTCTCGCTCGAAGTCCG
CGAGCGCGTGGCGCGCGACGGCGTCGGCGAGCGGGTGTCGAGCGACGAGCACTTCCACACGAAGGACGAAGGGCAGCGCT
ACGCGAAGCTGCGCGCCGAGGCGCTCGTCTGCGAAGGGCGCCGATTCGCCGGCGAATCGACCGCGGCCGGGCTGCGCGCC
GGCCGCTTCTTCGCGCTGTCGGGCCACTACCGCCAGGATTTCGACGGCCGCTATCTGGTGACGGCGCTCACGCATCGCGG
CTCGCAGGCACACCTGCTGTTTCCCGATCTCGACGCGCCGTTCGGCGCGACGCCGGGCGAGCCCATCTACCGCGCCGAGT
TCGAGGCGATTGCCGCCGACCTCCAGTACCGGCCGCCGCGCACGACGCCCAAGCCGCGCGCGGCGGGCGTCGTCAGCGCG
ATCGTCGACGGCGAGGGCGGCGGCAAGCTCGCCGAGCTCGACGAATACGGCCAGTACAAGGTGCGCTTTCCGTTCGCGCA
CACCGCGCATCCGGCGAACAAGGCCTCCGCGCGCATCCGGATGGCGACGCCCTACGCGGGCGACGACCGCGGCATGCACC
TGCCGCTGCTGAAGCGCACCGAAGTGAAGATCGCATTCGACGGCGGCGATCCGGACCGCCCCGTGATCGTCGGCGCGGTG
CCCAACTCGTCGCACCGCAGCGTCGTCACGCGCCGCAACCCCGCCGAGCATCGCATCCTCACCGAGCACAACCAGCTCTA
CATGAAGGACGGCAGCGGCGCGGCGACGTGGCTGCACGCGCCGAACAACCACATCGGCATCGGCGCGGTCGGGCCGGGCG
ACGGCCTCGCGCTCCTCACGTCCGGCAACAAGTTCGACTTCTCGCTCGGCAACGCGTACAGCTTCTCGGGCGGGCTCAAG
TGCTCGGTGTCGATGGGCGGCAACACCGACGTCTACGTCGGCGTGCGCAACAGCCTCGACGTCAGCGCGAACTTCCTGAC
GACGCTGCAGGGCAACCTGCGCTGGATGCTGCCCGGCAGCCGGAGCTTCGAGATCAACGACAGCGCATCGACACTGCTGC
AGACGCTGCACAAGCAGTCCGCGACGGGCGCGATCCGGCTGTCCGCCGGGCAGGACGCGTCCGCGCTGCTGCAAAAGCAG
CTCGACAAGCTCAAGGGCACGGTGCGCAAGTTCATGATCGTGTCGGGCCTCGCGAACGCCGGGTCCGCGGCCGCCGCCGC
GGGGCTCATCAAGGGCGGCGGCAAGCTCGCCGATCTGCCGTGGGCGGGCTTCGGCATATCCGCCGCGCAGTTCGCCGGCG
CGACCGGCGTCAGCACGGCGCTGATGGCGACCTCGCGCACGCTGCTCGCGAACGTCGCGAAGCTCCAGGAGGCGTTGCCG
CTCGTCGCCGATCTGTCGCTCGGCAAGCAGGGCATCGCGCTCGCCGCGAAGAACCTCACGCACGCGACGCGGATGTCGCT
CACCGTCGACGGCGTCTCGTGGTCGACGCACGCGAAGGGGCCCGGCGCGGCGGGCGCCGCGATGAGCGTCGGCAAGGGCC
GCTGGGGCGTCGAAGCGGCGAAGCACGCGCACGTCCACGCGAGCGACACGCTGCTGTTCGCGGTGCCGGCCGACCCGACG
ACCCAGTTCGACCTGAAGGACCTGATCGGGCTGCGCCGCGATCTCGACGAATGCATGAAGGACATCGCCGATCTCGAAGC
CGACATTTCGGAAAACGAAGTGCTGTCGACCGATCAGAACACGTTCGGCGTCAGCGCGCTCATCCCCACGCCGCCGTCGC
CCGCCGGCGCGCTCGCGGCGGTCGCGATCAAGGTGAAGCAAGCGAAGCTCGTCGAGCTGAAGGCCAAGCAGAAGCTCGTC
GCGCTGAAGGTCGACAACCTGCAGCAGAAGTTCGCGAAGCACGTGCAGCACCTGAGCGCCGTGCGGATGAGCGCTTCCGA
CGCGCAGCTCGGCTTCAAGGGCAACCGGCTCGTCGCGACGGCCGACGGCGTCACGCTCGCGCATGCGCAGGGCAAGGCGA
AGCTCGACGTGCGCGAAGCGAAGATCGGCGTCACGGCGGGCAAATCGAGCGTCGAGCTCGACGAAGGCAAGATCGCGGCC
GGCTGCGGCAGCGCATCGCTGAAGCTCGGCAGCGACGGCGCGATCGACGTGAGCGCGACCGACGTCAAGCTGAACGGCAC
CAACGTCAAGCTGAACGGCAGCGCGTCGCTGAAGTTCGACGGCCAACTGATCCAGCTCGGCTGA