Detailed information of component protein
Summary
External database links
Locus tag (Gene) | AZO_RS19450 |
Coordinate (Strand) | 4249321..4251393 (+) |
NCBI ID | WP_011767598.1 |
RefSeq | NC_008702 |
Uniprot ID | A1KCD6_AZOSB |
KEGG ID | azo:azo3876 |
PDB ID | - |
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
Phage_GPD | PF05954.13 | 4.8e-105 | 27..326 |
Phage_base_V | PF04717.14 | 2e-15 | 382..450 |
Gp5_C | PF06715.14 | 2.8e-05 | 558..581 |
Gp5_C | PF06715.14 | 9.8e-05 | 535..557 |
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Outside | 1-690 | VADQSQFELITPLGSGVLVFQALAADEALSQVSEFEIDCLSDKTDINLDDILGKQLSLRVMLPTGGKRYFSGYVTRFAHTGTQGRYQTYRASVRPWLWFLGRTADCRIFQSRTVPDIVKQVFSEHGVADFRNALSGSYRTREYCVQYRETDLAFVSRLLEEEGIYYFFEHDENKNTLVLADAYAAHSSTAGYEEIEFVENRQGGGDELGFVSQWQFERQVMPGKVMLRDYDFTKPSVGLEASAVVPREHDEGKHEVFDYPGEFDQVADGRHYAKVRVDELHAQFELSRARSSARGLAVGHLFKLAGHARRDQNREYLITAAQTRVRAEGRESEAGAGSQFECHFSALNSRQDYRPPRLTPRPQVKGPQTAVVVGPSGNEIYTDEYGRVKVQFHWDRYGTKDEHSSCWIRVSHPWAGKNWGFIAIPRIGQEVIVEFLEGDPDQPIITGRVYNAEQMPPYALPDNMTQTGIKTRSTLGGGTDNFNEIRFEDKKGEEQLFIHAEKNQDIEVENDETHWVGHDRTKNIDHNETVTVGNDRTESVGNNETISIGNNRTESVGANETISVGGNRAEDVGQNETVSIGKDRSESIGNNDAHDVAKNQTIAIGENQTLEVGKARSTTVGDDDKLQVGKKLVIEAGDEITIKTGSASITMKKDGTIQIKGKDITIKGDGQIGIKAASDVVIKGAKISEN |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.6980 | - | - |
Protein sequence: 691 a.a.
>T6CP001572 NC_008702:4249321-4251393 [Azoarcus olearius] [TssI]
VADQSQFELITPLGSGVLVFQALAADEALSQVSEFEIDCLSDKTDINLDDILGKQLSLRVMLPTGGKRYFSGYVTRFAHT
GTQGRYQTYRASVRPWLWFLGRTADCRIFQSRTVPDIVKQVFSEHGVADFRNALSGSYRTREYCVQYRETDLAFVSRLLE
EEGIYYFFEHDENKNTLVLADAYAAHSSTAGYEEIEFVENRQGGGDELGFVSQWQFERQVMPGKVMLRDYDFTKPSVGLE
ASAVVPREHDEGKHEVFDYPGEFDQVADGRHYAKVRVDELHAQFELSRARSSARGLAVGHLFKLAGHARRDQNREYLITA
AQTRVRAEGRESEAGAGSQFECHFSALNSRQDYRPPRLTPRPQVKGPQTAVVVGPSGNEIYTDEYGRVKVQFHWDRYGTK
DEHSSCWIRVSHPWAGKNWGFIAIPRIGQEVIVEFLEGDPDQPIITGRVYNAEQMPPYALPDNMTQTGIKTRSTLGGGTD
NFNEIRFEDKKGEEQLFIHAEKNQDIEVENDETHWVGHDRTKNIDHNETVTVGNDRTESVGNNETISIGNNRTESVGANE
TISVGGNRAEDVGQNETVSIGKDRSESIGNNDAHDVAKNQTIAIGENQTLEVGKARSTTVGDDDKLQVGKKLVIEAGDEI
TIKTGSASITMKKDGTIQIKGKDITIKGDGQIGIKAASDVVIKGAKISEN*
VADQSQFELITPLGSGVLVFQALAADEALSQVSEFEIDCLSDKTDINLDDILGKQLSLRVMLPTGGKRYFSGYVTRFAHT
GTQGRYQTYRASVRPWLWFLGRTADCRIFQSRTVPDIVKQVFSEHGVADFRNALSGSYRTREYCVQYRETDLAFVSRLLE
EEGIYYFFEHDENKNTLVLADAYAAHSSTAGYEEIEFVENRQGGGDELGFVSQWQFERQVMPGKVMLRDYDFTKPSVGLE
ASAVVPREHDEGKHEVFDYPGEFDQVADGRHYAKVRVDELHAQFELSRARSSARGLAVGHLFKLAGHARRDQNREYLITA
AQTRVRAEGRESEAGAGSQFECHFSALNSRQDYRPPRLTPRPQVKGPQTAVVVGPSGNEIYTDEYGRVKVQFHWDRYGTK
DEHSSCWIRVSHPWAGKNWGFIAIPRIGQEVIVEFLEGDPDQPIITGRVYNAEQMPPYALPDNMTQTGIKTRSTLGGGTD
NFNEIRFEDKKGEEQLFIHAEKNQDIEVENDETHWVGHDRTKNIDHNETVTVGNDRTESVGNNETISIGNNRTESVGANE
TISVGGNRAEDVGQNETVSIGKDRSESIGNNDAHDVAKNQTIAIGENQTLEVGKARSTTVGDDDKLQVGKKLVIEAGDEI
TIKTGSASITMKKDGTIQIKGKDITIKGDGQIGIKAASDVVIKGAKISEN*
Nucleotide sequence: 2073 bp
>T6CP001572 NC_008702:4249321-4251393 [Azoarcus olearius] [TssI]
GTGGCAGATCAATCGCAATTCGAACTCATCACCCCGCTGGGCAGCGGCGTGCTCGTATTCCAGGCGCTCGCCGCCGACGA
GGCGCTGTCGCAGGTCAGCGAATTCGAGATCGACTGCCTCTCCGACAAGACCGACATCAACCTCGACGACATCCTCGGCA
AGCAGCTCAGCCTGCGGGTGATGCTGCCCACCGGCGGCAAGCGCTACTTCAGCGGCTACGTCACCCGCTTCGCCCACACC
GGCACCCAGGGCCGCTACCAGACCTACCGCGCGAGCGTGCGGCCGTGGCTGTGGTTCCTCGGCCGCACCGCCGACTGCCG
CATCTTCCAGTCGCGCACGGTGCCCGACATCGTCAAGCAGGTCTTCTCCGAACACGGCGTCGCCGACTTCCGCAACGCGC
TCAGCGGCAGCTACCGCACGCGCGAATACTGCGTGCAGTACCGCGAGACCGACCTCGCCTTCGTCAGCCGGCTGCTGGAA
GAAGAGGGCATCTATTACTTCTTCGAGCACGACGAGAACAAGAACACGCTGGTGCTGGCCGACGCCTACGCCGCGCACAG
CAGCACCGCCGGCTACGAGGAGATCGAGTTCGTCGAGAACCGCCAGGGCGGCGGCGACGAGCTGGGCTTCGTCAGCCAGT
GGCAGTTCGAGCGCCAGGTGATGCCGGGCAAGGTCATGCTGCGCGACTACGACTTCACCAAGCCCTCGGTCGGGCTGGAA
GCCTCGGCGGTGGTGCCGCGCGAACACGACGAGGGCAAGCACGAGGTGTTCGACTACCCCGGCGAGTTCGACCAGGTGGC
CGACGGCCGCCACTACGCCAAGGTGCGCGTCGATGAGCTGCACGCGCAGTTCGAACTGTCCCGCGCGCGCAGCAGCGCCC
GCGGCCTGGCGGTGGGCCACCTGTTCAAGCTCGCCGGTCACGCCCGCCGCGACCAGAACCGCGAGTACCTGATCACCGCG
GCGCAGACCCGCGTGCGTGCCGAAGGGCGGGAATCCGAAGCGGGTGCGGGCAGCCAGTTCGAGTGCCACTTCAGCGCCCT
CAACAGCCGCCAGGATTACCGCCCGCCGCGGCTGACCCCGCGCCCGCAGGTGAAGGGGCCGCAGACCGCGGTAGTGGTCG
GCCCGTCCGGCAACGAGATCTACACCGACGAATACGGCCGCGTGAAGGTGCAGTTCCACTGGGACCGCTACGGCACCAAG
GACGAGCACAGCTCGTGCTGGATCCGCGTCTCCCATCCCTGGGCGGGCAAGAACTGGGGCTTCATCGCCATCCCCCGCAT
CGGCCAGGAGGTGATCGTCGAATTCCTCGAGGGCGATCCCGACCAGCCGATCATCACCGGCCGGGTCTACAACGCCGAGC
AGATGCCGCCATACGCGCTGCCCGACAACATGACCCAGACCGGCATCAAGACGCGCTCCACGCTCGGCGGCGGCACCGAC
AACTTCAACGAGATCCGCTTCGAGGACAAGAAGGGCGAGGAGCAGCTCTTCATCCACGCCGAGAAGAACCAGGACATCGA
GGTCGAGAACGACGAGACCCACTGGGTCGGCCACGACCGCACCAAGAACATCGACCACAACGAGACGGTCACCGTGGGCA
ACGACCGCACCGAGTCGGTCGGCAACAACGAAACCATCAGCATCGGCAACAACCGCACCGAATCGGTGGGCGCCAACGAA
ACCATCTCGGTGGGCGGCAACCGCGCCGAGGACGTCGGCCAGAACGAAACCGTCAGCATCGGCAAGGACCGCAGCGAGAG
CATCGGCAACAACGACGCCCACGACGTCGCCAAGAACCAGACCATCGCGATCGGCGAAAACCAGACGCTGGAGGTGGGCA
AGGCGCGCTCCACCACGGTCGGCGATGACGACAAGCTGCAGGTCGGCAAGAAACTGGTGATCGAGGCCGGCGACGAGATC
ACCATCAAGACCGGCAGCGCCAGCATCACCATGAAGAAGGACGGCACGATCCAGATCAAGGGCAAGGACATCACCATCAA
GGGCGACGGCCAGATCGGCATCAAGGCCGCTTCCGACGTGGTGATCAAGGGCGCCAAGATCTCCGAGAACTGA
GTGGCAGATCAATCGCAATTCGAACTCATCACCCCGCTGGGCAGCGGCGTGCTCGTATTCCAGGCGCTCGCCGCCGACGA
GGCGCTGTCGCAGGTCAGCGAATTCGAGATCGACTGCCTCTCCGACAAGACCGACATCAACCTCGACGACATCCTCGGCA
AGCAGCTCAGCCTGCGGGTGATGCTGCCCACCGGCGGCAAGCGCTACTTCAGCGGCTACGTCACCCGCTTCGCCCACACC
GGCACCCAGGGCCGCTACCAGACCTACCGCGCGAGCGTGCGGCCGTGGCTGTGGTTCCTCGGCCGCACCGCCGACTGCCG
CATCTTCCAGTCGCGCACGGTGCCCGACATCGTCAAGCAGGTCTTCTCCGAACACGGCGTCGCCGACTTCCGCAACGCGC
TCAGCGGCAGCTACCGCACGCGCGAATACTGCGTGCAGTACCGCGAGACCGACCTCGCCTTCGTCAGCCGGCTGCTGGAA
GAAGAGGGCATCTATTACTTCTTCGAGCACGACGAGAACAAGAACACGCTGGTGCTGGCCGACGCCTACGCCGCGCACAG
CAGCACCGCCGGCTACGAGGAGATCGAGTTCGTCGAGAACCGCCAGGGCGGCGGCGACGAGCTGGGCTTCGTCAGCCAGT
GGCAGTTCGAGCGCCAGGTGATGCCGGGCAAGGTCATGCTGCGCGACTACGACTTCACCAAGCCCTCGGTCGGGCTGGAA
GCCTCGGCGGTGGTGCCGCGCGAACACGACGAGGGCAAGCACGAGGTGTTCGACTACCCCGGCGAGTTCGACCAGGTGGC
CGACGGCCGCCACTACGCCAAGGTGCGCGTCGATGAGCTGCACGCGCAGTTCGAACTGTCCCGCGCGCGCAGCAGCGCCC
GCGGCCTGGCGGTGGGCCACCTGTTCAAGCTCGCCGGTCACGCCCGCCGCGACCAGAACCGCGAGTACCTGATCACCGCG
GCGCAGACCCGCGTGCGTGCCGAAGGGCGGGAATCCGAAGCGGGTGCGGGCAGCCAGTTCGAGTGCCACTTCAGCGCCCT
CAACAGCCGCCAGGATTACCGCCCGCCGCGGCTGACCCCGCGCCCGCAGGTGAAGGGGCCGCAGACCGCGGTAGTGGTCG
GCCCGTCCGGCAACGAGATCTACACCGACGAATACGGCCGCGTGAAGGTGCAGTTCCACTGGGACCGCTACGGCACCAAG
GACGAGCACAGCTCGTGCTGGATCCGCGTCTCCCATCCCTGGGCGGGCAAGAACTGGGGCTTCATCGCCATCCCCCGCAT
CGGCCAGGAGGTGATCGTCGAATTCCTCGAGGGCGATCCCGACCAGCCGATCATCACCGGCCGGGTCTACAACGCCGAGC
AGATGCCGCCATACGCGCTGCCCGACAACATGACCCAGACCGGCATCAAGACGCGCTCCACGCTCGGCGGCGGCACCGAC
AACTTCAACGAGATCCGCTTCGAGGACAAGAAGGGCGAGGAGCAGCTCTTCATCCACGCCGAGAAGAACCAGGACATCGA
GGTCGAGAACGACGAGACCCACTGGGTCGGCCACGACCGCACCAAGAACATCGACCACAACGAGACGGTCACCGTGGGCA
ACGACCGCACCGAGTCGGTCGGCAACAACGAAACCATCAGCATCGGCAACAACCGCACCGAATCGGTGGGCGCCAACGAA
ACCATCTCGGTGGGCGGCAACCGCGCCGAGGACGTCGGCCAGAACGAAACCGTCAGCATCGGCAAGGACCGCAGCGAGAG
CATCGGCAACAACGACGCCCACGACGTCGCCAAGAACCAGACCATCGCGATCGGCGAAAACCAGACGCTGGAGGTGGGCA
AGGCGCGCTCCACCACGGTCGGCGATGACGACAAGCTGCAGGTCGGCAAGAAACTGGTGATCGAGGCCGGCGACGAGATC
ACCATCAAGACCGGCAGCGCCAGCATCACCATGAAGAAGGACGGCACGATCCAGATCAAGGGCAAGGACATCACCATCAA
GGGCGACGGCCAGATCGGCATCAAGGCCGCTTCCGACGTGGTGATCAAGGGCGCCAAGATCTCCGAGAACTGA