Detailed information of component protein
Summary
External database links
| Locus tag (Gene) | AZO_RS19445 |
| Coordinate (Strand) | 4247215..4249182 (+) |
| NCBI ID | WP_011767597.1 |
| RefSeq | NC_008702 |
| Uniprot ID | A1KCD5_AZOSB |
| KEGG ID | azo:azo3875 |
| PDB ID | - |
Pfam domain hit(s)
| Domain | Pfam ID | E-value | Aligned region |
|---|---|---|---|
| Sigma54_activat | PF00158.28 | 1.3e-63 | 333..495 |
| HTH_8 | PF02954.21 | 5.9e-09 | 614..646 |
| GAF | PF01590.28 | 2.9e-06 | 71..203 |
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
| Prediction | Region | Sequence |
|---|---|---|
| Outside | 1-655 | MLAQSQREHIDHVLAVSQAGEPATDPIHRSWVRCVNDYGLDPSRPTRAHIVEQARLREHQDQIEDFLGVARTGMEQLYKRMAGIGYVLLLTDAHGIAVDFIGNDAWARELKHSGLYLGADWSETRAGTCAVGTCIVDKAPITCHHTDHFDASHITLTCNAAPLFDPTGGFLGVLDVSALTSPSPRDSQHLALHMVTMYAQMVEDASFLRYFRDRWVLRLGSAWSMVDVSGEIMLAFDADGLIVGANSGARRALRPLDGVTDTLIGRPLGEVFREGMDAIWRIARAGFSADRTALSTSGGEVYYGAVLQPRVAPLRPAPAVAATPAAPALERLAGNDPQMQRLIAQASRLVNRPVNILVHGETGTGKEVLAKALHESSSRATKPFVAVNCAAIPESLIESELFGYTAGTFTGGRSKGMRGLIQQADGGTLFLDEIGDMPLHLQTRLLRVLAEREVLPLGAEKPVPVNLTVVAASHRDLRKLIAEGSFREDLYYRLCGAILYLPPLRERRDRDYLIDRLLAAEGAELGTSAALDPAARGLLLGYDWPGNVRQLRNVLRFALAVCDGEHILPQDLPAELMASAPQAPSRTPAIATATAAVTAPAAVTTPGADPVRGELEAALRRHHWNITAAAAELGVCRATIYRQMKRHGIVPPHLL |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
| Prediction | Probability | Cleavage site | Signal peptide sequence |
|---|---|---|---|
| Other | 0.9753 | - | - |
Protein sequence: 656 a.a.
>T6CP012464 NC_008702:4247215-4249182 [Azoarcus olearius] [Sfa3]
MLAQSQREHIDHVLAVSQAGEPATDPIHRSWVRCVNDYGLDPSRPTRAHIVEQARLREHQDQIEDFLGVARTGMEQLYKR
MAGIGYVLLLTDAHGIAVDFIGNDAWARELKHSGLYLGADWSETRAGTCAVGTCIVDKAPITCHHTDHFDASHITLTCNA
APLFDPTGGFLGVLDVSALTSPSPRDSQHLALHMVTMYAQMVEDASFLRYFRDRWVLRLGSAWSMVDVSGEIMLAFDADG
LIVGANSGARRALRPLDGVTDTLIGRPLGEVFREGMDAIWRIARAGFSADRTALSTSGGEVYYGAVLQPRVAPLRPAPAV
AATPAAPALERLAGNDPQMQRLIAQASRLVNRPVNILVHGETGTGKEVLAKALHESSSRATKPFVAVNCAAIPESLIESE
LFGYTAGTFTGGRSKGMRGLIQQADGGTLFLDEIGDMPLHLQTRLLRVLAEREVLPLGAEKPVPVNLTVVAASHRDLRKL
IAEGSFREDLYYRLCGAILYLPPLRERRDRDYLIDRLLAAEGAELGTSAALDPAARGLLLGYDWPGNVRQLRNVLRFALA
VCDGEHILPQDLPAELMASAPQAPSRTPAIATATAAVTAPAAVTTPGADPVRGELEAALRRHHWNITAAAAELGVCRATI
YRQMKRHGIVPPHLL*
MLAQSQREHIDHVLAVSQAGEPATDPIHRSWVRCVNDYGLDPSRPTRAHIVEQARLREHQDQIEDFLGVARTGMEQLYKR
MAGIGYVLLLTDAHGIAVDFIGNDAWARELKHSGLYLGADWSETRAGTCAVGTCIVDKAPITCHHTDHFDASHITLTCNA
APLFDPTGGFLGVLDVSALTSPSPRDSQHLALHMVTMYAQMVEDASFLRYFRDRWVLRLGSAWSMVDVSGEIMLAFDADG
LIVGANSGARRALRPLDGVTDTLIGRPLGEVFREGMDAIWRIARAGFSADRTALSTSGGEVYYGAVLQPRVAPLRPAPAV
AATPAAPALERLAGNDPQMQRLIAQASRLVNRPVNILVHGETGTGKEVLAKALHESSSRATKPFVAVNCAAIPESLIESE
LFGYTAGTFTGGRSKGMRGLIQQADGGTLFLDEIGDMPLHLQTRLLRVLAEREVLPLGAEKPVPVNLTVVAASHRDLRKL
IAEGSFREDLYYRLCGAILYLPPLRERRDRDYLIDRLLAAEGAELGTSAALDPAARGLLLGYDWPGNVRQLRNVLRFALA
VCDGEHILPQDLPAELMASAPQAPSRTPAIATATAAVTAPAAVTTPGADPVRGELEAALRRHHWNITAAAAELGVCRATI
YRQMKRHGIVPPHLL*
Nucleotide sequence: 1968 bp
>T6CP012464 NC_008702:4247215-4249182 [Azoarcus olearius] [Sfa3]
ATGCTGGCGCAATCCCAGCGCGAACACATCGACCATGTGCTCGCCGTATCACAGGCGGGCGAACCCGCCACCGACCCGAT
CCACCGCTCGTGGGTCCGCTGTGTGAACGACTACGGCCTCGACCCGAGCCGGCCGACCCGCGCGCATATCGTCGAACAGG
CCCGCCTGCGCGAGCACCAGGACCAGATCGAGGACTTCCTCGGCGTCGCCCGCACCGGCATGGAGCAGCTCTACAAGCGC
ATGGCCGGCATCGGCTACGTGCTGCTGCTGACCGACGCCCACGGCATCGCGGTGGACTTCATCGGCAACGACGCCTGGGC
GCGCGAGCTGAAGCACTCCGGCCTCTATCTCGGCGCCGACTGGAGCGAAACCCGTGCCGGCACCTGTGCGGTGGGCACCT
GCATCGTCGACAAGGCGCCGATCACCTGTCACCACACCGATCACTTCGACGCGTCGCACATCACGCTGACCTGTAATGCC
GCGCCGCTGTTCGACCCCACCGGCGGCTTCCTCGGCGTGCTCGACGTCTCGGCGCTGACCTCGCCGTCGCCGCGCGACAG
CCAGCATCTGGCGCTGCACATGGTGACGATGTATGCGCAGATGGTGGAGGACGCCAGCTTCCTGCGCTATTTCCGCGACC
GCTGGGTGCTGCGGCTGGGCTCGGCGTGGTCGATGGTCGATGTGTCGGGCGAGATCATGCTCGCCTTCGATGCCGATGGC
CTCATCGTCGGCGCCAACAGCGGCGCCCGGCGCGCACTGCGCCCGCTCGACGGCGTCACCGACACCCTGATCGGGCGCCC
GCTCGGCGAGGTTTTCCGCGAGGGCATGGACGCGATCTGGCGCATCGCCCGCGCCGGTTTTTCCGCCGACCGCACCGCGC
TGTCCACCAGCGGCGGCGAGGTCTATTACGGTGCCGTGCTGCAACCGCGCGTGGCCCCGCTGCGCCCGGCGCCCGCCGTC
GCCGCCACCCCGGCGGCGCCCGCGCTGGAGCGCCTCGCCGGCAACGACCCGCAGATGCAGCGCCTGATCGCACAGGCGTC
GCGGCTGGTGAACCGGCCGGTGAACATCCTGGTGCACGGCGAAACCGGCACCGGCAAGGAAGTGCTGGCGAAAGCGCTGC
ACGAATCCAGCAGCCGCGCCACCAAGCCCTTCGTCGCGGTCAATTGCGCGGCGATCCCGGAATCGCTGATCGAGAGCGAA
CTGTTCGGCTACACCGCCGGCACCTTCACCGGCGGGCGCAGCAAGGGCATGCGCGGGCTGATCCAGCAGGCCGACGGCGG
CACGCTGTTCCTCGACGAGATCGGCGACATGCCGCTGCACCTGCAGACCCGGCTGCTGCGCGTGCTGGCCGAGCGCGAGG
TGCTGCCGCTGGGCGCCGAGAAGCCGGTGCCGGTGAATCTCACCGTGGTCGCCGCCTCGCACCGCGACCTGCGCAAGCTG
ATCGCCGAAGGCAGCTTCCGCGAGGATCTCTACTACCGCCTGTGCGGCGCCATCCTGTACCTGCCGCCGCTGCGCGAACG
GCGCGACCGCGACTACCTGATCGACCGCCTGCTCGCGGCCGAAGGCGCGGAACTGGGCACCAGCGCCGCGCTGGACCCGG
CCGCGCGCGGCCTGCTGCTGGGCTACGACTGGCCCGGCAACGTGCGCCAGCTGCGCAATGTGCTGCGCTTCGCGCTGGCG
GTGTGCGATGGCGAGCACATCCTGCCGCAGGACCTGCCCGCCGAACTGATGGCGAGCGCACCGCAAGCGCCGTCACGCAC
GCCCGCCATCGCGACCGCGACCGCGGCCGTCACTGCGCCCGCGGCGGTCACCACGCCGGGCGCCGACCCGGTGCGCGGCG
AACTGGAGGCCGCGCTGCGCCGTCATCACTGGAACATCACCGCCGCCGCGGCCGAACTCGGCGTCTGCCGCGCGACCATC
TACCGGCAGATGAAACGCCACGGCATCGTGCCGCCCCATCTGCTGTAG
ATGCTGGCGCAATCCCAGCGCGAACACATCGACCATGTGCTCGCCGTATCACAGGCGGGCGAACCCGCCACCGACCCGAT
CCACCGCTCGTGGGTCCGCTGTGTGAACGACTACGGCCTCGACCCGAGCCGGCCGACCCGCGCGCATATCGTCGAACAGG
CCCGCCTGCGCGAGCACCAGGACCAGATCGAGGACTTCCTCGGCGTCGCCCGCACCGGCATGGAGCAGCTCTACAAGCGC
ATGGCCGGCATCGGCTACGTGCTGCTGCTGACCGACGCCCACGGCATCGCGGTGGACTTCATCGGCAACGACGCCTGGGC
GCGCGAGCTGAAGCACTCCGGCCTCTATCTCGGCGCCGACTGGAGCGAAACCCGTGCCGGCACCTGTGCGGTGGGCACCT
GCATCGTCGACAAGGCGCCGATCACCTGTCACCACACCGATCACTTCGACGCGTCGCACATCACGCTGACCTGTAATGCC
GCGCCGCTGTTCGACCCCACCGGCGGCTTCCTCGGCGTGCTCGACGTCTCGGCGCTGACCTCGCCGTCGCCGCGCGACAG
CCAGCATCTGGCGCTGCACATGGTGACGATGTATGCGCAGATGGTGGAGGACGCCAGCTTCCTGCGCTATTTCCGCGACC
GCTGGGTGCTGCGGCTGGGCTCGGCGTGGTCGATGGTCGATGTGTCGGGCGAGATCATGCTCGCCTTCGATGCCGATGGC
CTCATCGTCGGCGCCAACAGCGGCGCCCGGCGCGCACTGCGCCCGCTCGACGGCGTCACCGACACCCTGATCGGGCGCCC
GCTCGGCGAGGTTTTCCGCGAGGGCATGGACGCGATCTGGCGCATCGCCCGCGCCGGTTTTTCCGCCGACCGCACCGCGC
TGTCCACCAGCGGCGGCGAGGTCTATTACGGTGCCGTGCTGCAACCGCGCGTGGCCCCGCTGCGCCCGGCGCCCGCCGTC
GCCGCCACCCCGGCGGCGCCCGCGCTGGAGCGCCTCGCCGGCAACGACCCGCAGATGCAGCGCCTGATCGCACAGGCGTC
GCGGCTGGTGAACCGGCCGGTGAACATCCTGGTGCACGGCGAAACCGGCACCGGCAAGGAAGTGCTGGCGAAAGCGCTGC
ACGAATCCAGCAGCCGCGCCACCAAGCCCTTCGTCGCGGTCAATTGCGCGGCGATCCCGGAATCGCTGATCGAGAGCGAA
CTGTTCGGCTACACCGCCGGCACCTTCACCGGCGGGCGCAGCAAGGGCATGCGCGGGCTGATCCAGCAGGCCGACGGCGG
CACGCTGTTCCTCGACGAGATCGGCGACATGCCGCTGCACCTGCAGACCCGGCTGCTGCGCGTGCTGGCCGAGCGCGAGG
TGCTGCCGCTGGGCGCCGAGAAGCCGGTGCCGGTGAATCTCACCGTGGTCGCCGCCTCGCACCGCGACCTGCGCAAGCTG
ATCGCCGAAGGCAGCTTCCGCGAGGATCTCTACTACCGCCTGTGCGGCGCCATCCTGTACCTGCCGCCGCTGCGCGAACG
GCGCGACCGCGACTACCTGATCGACCGCCTGCTCGCGGCCGAAGGCGCGGAACTGGGCACCAGCGCCGCGCTGGACCCGG
CCGCGCGCGGCCTGCTGCTGGGCTACGACTGGCCCGGCAACGTGCGCCAGCTGCGCAATGTGCTGCGCTTCGCGCTGGCG
GTGTGCGATGGCGAGCACATCCTGCCGCAGGACCTGCCCGCCGAACTGATGGCGAGCGCACCGCAAGCGCCGTCACGCAC
GCCCGCCATCGCGACCGCGACCGCGGCCGTCACTGCGCCCGCGGCGGTCACCACGCCGGGCGCCGACCCGGTGCGCGGCG
AACTGGAGGCCGCGCTGCGCCGTCATCACTGGAACATCACCGCCGCCGCGGCCGAACTCGGCGTCTGCCGCGCGACCATC
TACCGGCAGATGAAACGCCACGGCATCGTGCCGCCCCATCTGCTGTAG