Detailed information of component protein
Summary
External database links
Locus tag (Gene) | AZO_RS19585 |
Coordinate (Strand) | 4280714..4283434 (+) |
NCBI ID | WP_011767625.1 |
RefSeq | NC_008702 |
Uniprot ID | A1KCG3_AZOSB |
KEGG ID | aoa:dqs_4047, azo:azo3903 |
PDB ID | - |
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
AAA_2 | PF07724.16 | 8e-37 | 638..804 |
AAA_lid_9 | PF17871.3 | 1e-28 | 372..463 |
ClpB_D2-small | PF10431.11 | 3.7e-11 | 811..887 |
AAA | PF00004.31 | 3.5e-08 | 232..343 |
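The Pfam hits above are listed by E-value rather than by position. The following is a minimal sketch, not part of the original record, that reorders them from N- to C-terminus and checks whether neighbouring aligned regions overlap. The tuple layout and the helper names `domain_order` and `overlaps` are illustrative assumptions; the values are copied from the table.

```python
# Pfam hits copied from the table above:
# (domain, Pfam ID, E-value, aligned start, aligned end)
hits = [
    ("AAA_2", "PF07724.16", 8e-37, 638, 804),
    ("AAA_lid_9", "PF17871.3", 1e-28, 372, 463),
    ("ClpB_D2-small", "PF10431.11", 3.7e-11, 811, 887),
    ("AAA", "PF00004.31", 3.5e-08, 232, 343),
]


def domain_order(hits):
    """Return the hits sorted by start position (N- to C-terminus)."""
    return sorted(hits, key=lambda h: h[3])


def overlaps(hits):
    """Return name pairs of neighbouring hits whose aligned regions overlap."""
    ordered = domain_order(hits)
    return [(a[0], b[0]) for a, b in zip(ordered, ordered[1:]) if b[3] <= a[4]]


# Print the domain layout along the protein.
for name, pfam_id, evalue, start, end in domain_order(hits):
    print(f"{start:>4}-{end:<4} {name:<14} {pfam_id:<11} {evalue:.1e}")

print("overlapping neighbours:", overlaps(hits))
```

Read in this order, the layout is AAA, AAA_lid_9, AAA_2, ClpB_D2-small, with no overlap between neighbouring aligned regions.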
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Outside | 1-906 | MSEISRVASFGKLNSLAYKAMEGATVFCKLRGNPYVELEHWLQQLLQNEDTDLHRIVRHFDVDAARLAADMTTALDRLPRGSTSISGLSENLDTLVERGWVYASLMFGDRQIRSGYLLVGMLKTPSLRNALMAISRQFERVRPDVLTDEFTKIVAGSAEENLTARDGSGGAPGEDSGAIAPAQMGKQEALKKFTVDLTEQARSGKMDPIVGRDEEIRQVVDILMRRRQNNPILVGEAGVGKTAVVEGFAQRIARGDVPPALKDVSLLALDVGLLQAGASMKGEFEQRLRSVIDEVQASPKPIILFVDETHTLVGAGGAAGTGDAANLLKPALARGTLRTVGATTWAEYKKYIEKDPALTRRFQNVQVDEPDEKKAVLMMRGVASTMEKHHQVQILDEALEAAVKLSHRYIPARQLPDKSVSLLDTACARVAVSLHATPAEVDDSRKRIDALNTELEIIGRESNIGIEVGERRAHAEALLAEEQQRLAELEARWAAEKTLVDELLALRAKLRSGSRPVEGTGSALEAAAEAAAPEAAAESAPEPEREALFARLREVQAELATLQGEDPLILPTVDYQAVASVVADWTGIPVGRMARNEIENVLRLPQLLGQRVIGQDHAMEMIAKRIQTSRAGLDNPNKPIGVFMLAGTSGVGKTETALALAEALYGGEQNVVTINMSEFQEAHTVSTLKGAPPGYVGYGEGGVLTEAVRRKPYSVVLLDEVEKAHPDVHEMFFQVFDKGFMEDGEGRFIDFKNTLILLTTNAGTDLIASMCKDPDLLPDPEGLAKALRDPLLKIFPPALLGRLVTIPYYPLTDAMLGAIVRLQLGRIKKRVEARYKIPFEYGDDVVELVVSRCTESESGGRMIDAILTNTMLPDISREFLNRTMEGKPIVRVAMGVANADFTYNFD |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9943 | - | - |
Protein sequence: 906 a.a.
>T6CP001585 NC_008702:4280714-4283434 [Azoarcus olearius] [TssH]
MSEISRVASFGKLNSLAYKAMEGATVFCKLRGNPYVELEHWLQQLLQNEDTDLHRIVRHFDVDAARLAADMTTALDRLPR
GSTSISGLSENLDTLVERGWVYASLMFGDRQIRSGYLLVGMLKTPSLRNALMAISRQFERVRPDVLTDEFTKIVAGSAEE
NLTARDGSGGAPGEDSGAIAPAQMGKQEALKKFTVDLTEQARSGKMDPIVGRDEEIRQVVDILMRRRQNNPILVGEAGVG
KTAVVEGFAQRIARGDVPPALKDVSLLALDVGLLQAGASMKGEFEQRLRSVIDEVQASPKPIILFVDETHTLVGAGGAAG
TGDAANLLKPALARGTLRTVGATTWAEYKKYIEKDPALTRRFQNVQVDEPDEKKAVLMMRGVASTMEKHHQVQILDEALE
AAVKLSHRYIPARQLPDKSVSLLDTACARVAVSLHATPAEVDDSRKRIDALNTELEIIGRESNIGIEVGERRAHAEALLA
EEQQRLAELEARWAAEKTLVDELLALRAKLRSGSRPVEGTGSALEAAAEAAAPEAAAESAPEPEREALFARLREVQAELA
TLQGEDPLILPTVDYQAVASVVADWTGIPVGRMARNEIENVLRLPQLLGQRVIGQDHAMEMIAKRIQTSRAGLDNPNKPI
GVFMLAGTSGVGKTETALALAEALYGGEQNVVTINMSEFQEAHTVSTLKGAPPGYVGYGEGGVLTEAVRRKPYSVVLLDE
VEKAHPDVHEMFFQVFDKGFMEDGEGRFIDFKNTLILLTTNAGTDLIASMCKDPDLLPDPEGLAKALRDPLLKIFPPALL
GRLVTIPYYPLTDAMLGAIVRLQLGRIKKRVEARYKIPFEYGDDVVELVVSRCTESESGGRMIDAILTNTMLPDISREFL
NRTMEGKPIVRVAMGVANADFTYNFD*
Nucleotide sequence: 2721 bp
>T6CP001585 NC_008702:4280714-4283434 [Azoarcus olearius] [TssH]
ATGAGCGAAATCAGTCGGGTAGCCTCGTTCGGCAAACTCAACAGCCTCGCCTACAAGGCGATGGAAGGCGCCACCGTCTT
CTGCAAGCTGCGCGGCAATCCCTATGTGGAGCTGGAACACTGGCTGCAGCAGCTGCTGCAGAACGAGGACACCGACCTGC
ACCGCATCGTGCGCCACTTCGACGTGGACGCGGCGCGCCTGGCCGCCGACATGACCACCGCGCTCGACCGCCTGCCGCGC
GGCTCGACCTCGATCTCGGGCCTCTCCGAAAACCTCGACACGCTCGTCGAACGCGGCTGGGTGTATGCCTCTCTGATGTT
CGGCGACCGCCAGATCCGCTCCGGCTACCTGCTGGTCGGCATGCTCAAGACGCCCTCGCTGCGCAACGCGCTGATGGCGA
TCTCGCGCCAGTTCGAGCGCGTGCGCCCCGATGTGCTGACCGACGAATTCACCAAGATCGTCGCCGGCTCGGCGGAAGAA
AACCTCACCGCGCGCGATGGCTCGGGCGGCGCGCCGGGCGAGGACAGCGGCGCGATTGCGCCCGCGCAGATGGGCAAGCA
GGAGGCGCTGAAGAAATTCACCGTCGACCTCACCGAGCAGGCCCGCTCCGGCAAGATGGACCCGATCGTCGGCCGCGACG
AAGAAATCCGCCAGGTGGTCGACATCCTGATGCGCCGCCGCCAGAACAACCCCATCCTGGTCGGCGAGGCCGGCGTCGGC
AAGACCGCGGTGGTCGAAGGCTTCGCCCAGCGCATCGCGCGCGGCGACGTGCCGCCGGCGCTCAAGGACGTCTCGCTGCT
GGCGCTGGATGTCGGCCTGCTGCAGGCCGGCGCCAGCATGAAGGGCGAATTCGAGCAGCGACTGCGCTCGGTCATCGACG
AAGTCCAGGCCAGCCCCAAGCCCATCATCCTGTTCGTCGATGAGACCCACACCCTGGTCGGCGCCGGCGGCGCGGCCGGC
ACCGGCGATGCGGCCAACCTGCTCAAGCCGGCGCTCGCGCGCGGCACGCTGCGCACCGTCGGCGCCACCACCTGGGCCGA
GTACAAGAAGTACATCGAGAAGGACCCGGCGCTGACCCGGCGCTTCCAGAACGTGCAGGTCGACGAGCCGGACGAGAAGA
AGGCGGTGCTCATGATGCGCGGCGTTGCCAGCACCATGGAGAAGCACCACCAGGTGCAGATCCTCGACGAGGCGCTGGAG
GCCGCGGTCAAGCTGTCGCACCGCTACATCCCGGCGCGCCAGCTGCCGGACAAGTCCGTCTCGCTGCTCGACACCGCCTG
CGCCCGCGTCGCGGTGAGCCTGCACGCCACCCCGGCCGAGGTGGACGACAGCCGCAAGCGCATCGACGCGCTGAACACCG
AGCTGGAAATCATCGGCCGCGAGAGCAACATCGGCATCGAGGTCGGCGAGCGCCGTGCCCATGCCGAGGCCCTGCTGGCC
GAAGAACAGCAGCGGCTGGCCGAACTCGAGGCGCGCTGGGCGGCCGAGAAGACGCTGGTCGATGAACTCCTCGCGCTGCG
CGCCAAGCTGCGCAGCGGCAGCCGCCCGGTGGAAGGCACCGGCAGCGCGCTGGAAGCCGCGGCCGAAGCGGCTGCGCCGG
AGGCCGCCGCCGAGAGCGCGCCCGAACCCGAGCGCGAGGCGCTGTTCGCGCGCCTGCGCGAAGTGCAGGCCGAACTCGCC
ACGCTGCAGGGCGAAGACCCGCTGATCCTGCCGACAGTGGACTACCAGGCGGTGGCCTCGGTGGTCGCCGACTGGACCGG
CATCCCGGTCGGCCGCATGGCGCGCAACGAAATCGAGAACGTGCTGCGCCTGCCGCAGCTGCTCGGCCAGCGCGTCATCG
GCCAGGACCACGCGATGGAGATGATCGCCAAGCGCATCCAGACTTCGCGCGCCGGGCTCGACAACCCCAACAAGCCGATC
GGCGTGTTCATGCTCGCCGGCACCTCGGGCGTCGGCAAGACCGAAACCGCGCTGGCACTGGCCGAAGCGCTGTACGGCGG
CGAGCAGAACGTCGTCACGATCAACATGAGCGAATTCCAGGAAGCGCACACCGTGTCCACGCTCAAGGGCGCGCCTCCGG
GCTACGTCGGCTACGGCGAAGGCGGCGTGCTGACCGAGGCGGTGCGGCGCAAGCCCTACAGCGTGGTGCTGCTCGACGAG
GTCGAGAAGGCCCACCCCGATGTGCACGAGATGTTCTTCCAGGTCTTCGACAAGGGCTTCATGGAAGACGGCGAAGGCCG
CTTCATCGACTTCAAGAACACCCTGATCCTGCTCACCACCAATGCCGGCACCGACCTCATCGCCAGCATGTGCAAGGACC
CCGACCTGCTGCCCGACCCGGAAGGCCTCGCCAAGGCGCTGCGCGACCCGCTGCTGAAGATCTTCCCGCCGGCGCTGCTC
GGCCGCCTCGTCACCATTCCCTACTACCCGCTGACCGACGCCATGCTGGGCGCCATCGTCCGGCTGCAGCTCGGGCGCAT
CAAGAAGCGCGTGGAAGCGCGCTACAAGATCCCGTTCGAGTATGGCGACGACGTGGTCGAGCTGGTGGTCAGCCGCTGCA
CCGAGAGCGAATCCGGCGGCCGCATGATCGACGCCATCCTGACCAACACCATGCTGCCCGACATCAGCCGCGAGTTCCTG
AACCGGACGATGGAAGGCAAGCCGATCGTGCGCGTGGCGATGGGAGTCGCGAATGCCGACTTCACCTACAACTTCGATTG
A
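As a consistency check on the record, the sketch below derives the gene length from the coordinates 4280714..4283434 and translates the first and last few codons of the nucleotide sequence. It is illustrative only: `span_length` and `translate` are hypothetical helpers, and the codon table is deliberately restricted to the eight codons needed here (a full table, such as the standard bacterial code, would be needed for the whole sequence).

```python
# Partial codon table: only the codons used in this check.
CODONS = {
    "ATG": "M", "AGC": "S", "GAA": "E", "ATC": "I", "AGT": "S",
    "TTC": "F", "GAT": "D", "TGA": "*",
}


def span_length(start, end):
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(dna):
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


n_bp = span_length(4280714, 4283434)
n_codons, remainder = divmod(n_bp, 3)

head = translate("ATGAGCGAAATCAGT")  # first 15 bases of the sequence above
tail = translate("TTCGATTGA")        # last 9 bases of the sequence above

print(n_bp, n_codons, remainder, head, tail)
```

The span is 2721 bp, which is 907 codons with no remainder. The final codon is the TGA stop, shown as `*` in the protein FASTA, and the translated ends (`MSEIS` and `FD*`) match the start and end of the protein sequence.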