Detailed information of component protein
Summary
External database links
Locus tag (Gene) | ATU_RS20345 |
Coordinate (Strand) | 1477260..1479938 (+) |
NCBI ID | WP_010973763.1 |
RefSeq | NC_003063 |
Uniprot ID | Q7CUP5_AGRFC, A0A083ZID1_RHIRD |
KEGG ID | atu:Atu4344 |
PDB ID | - |
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
AAA_2 | PF07724.16 | 3.7e-36 | 594..758 |
AAA_lid_9 | PF17871.3 | 9.7e-27 | 363..462 |
ClpB_D2-small | PF10431.11 | 5.2e-11 | 766..840 |
AAA | PF00004.31 | 6e-11 | 223..332 |
Clp_N | PF02861.22 | 5.6e-06 | 24..73 |
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Outside | 1-892 | MSHIDLNRLVGALEPDLRVTLEAAASVAVRMGHRYVDIPHWLLAVVDAGIYAETFDELKIPLPVLKAEIGRSLEEAIIGDGEALSLSQNILTAAREAWILASLEAGRDRVTLCDLLLAMDEETSLRSFVRSAFPSLKAMDRGALERLRSSAENGAGVDVPSALTSSGEAGSAQAAGQNDFLRLYTHDMTTDARNGKVDPVIGRDDELRQLVDILTRRRQNNPILVGEAGVGKTAVAEALALEIASGNVPEKLRNVCLLNLDISLLQAGAGVKGEFERRLHGVIDAVKRSAEPVILFIDEAHGLVGAGGAAGQGDAANILKPALARGEVRTVAATTWSEYKKYFEKDAALTRRFQPVHVREPDEATAIRMLRGVADTFVSHHNVTVRDEAIVAAVQLSARYMPARQLPDKAVSLLDTAAAAVSLARQTLPERLRAMESERHLLSDELNWLLREPQDEDMQNRIQSIRDELERLEAGIDDLRGRYDAEMAELAALSEEQPAETGASNVSHLRPATEMRPANAERLVPTVVDREAIAAVVSRWTGIPLGKLLADQIESARTLDVRMRQRVVGQDAAITRIADAMRTARAGLSDPRRPPAVFFLVGMSGTGKTETALSLADLLYGGNSHLTTINMSEFKEEHKVSLLLGSPPGYVGFGEGGVLTEAVRRRPFGVLLLDEIDKAHPGVQDIFYQVFDKGVLRDGEGRDVDFKNTTIFMTANTGSELLSALSADPDTMPEGEALEALLMPELTKQFKPAFLGRTIILPFMPLGAEELASVVDMQIGKIRDRVLATYGTDLRLSDAARDALVARAGASEIGARAIEIMIGKDLLPPLSSFFLEKVIAGERVGKIVVDFGENGFGIRAEAAGEADEFAVTEEVGVDKVAASDGATRRMRH |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9980 | - | - |
Protein sequence: 893 a.a.
>T6CP000122 NC_003063:1477260-1479938 [Agrobacterium fabrum str. C58] [TssH]
MSHIDLNRLVGALEPDLRVTLEAAASVAVRMGHRYVDIPHWLLAVVDAGIYAETFDELKIPLPVLKAEIGRSLEEAIIGD
GEALSLSQNILTAAREAWILASLEAGRDRVTLCDLLLAMDEETSLRSFVRSAFPSLKAMDRGALERLRSSAENGAGVDVP
SALTSSGEAGSAQAAGQNDFLRLYTHDMTTDARNGKVDPVIGRDDELRQLVDILTRRRQNNPILVGEAGVGKTAVAEALA
LEIASGNVPEKLRNVCLLNLDISLLQAGAGVKGEFERRLHGVIDAVKRSAEPVILFIDEAHGLVGAGGAAGQGDAANILK
PALARGEVRTVAATTWSEYKKYFEKDAALTRRFQPVHVREPDEATAIRMLRGVADTFVSHHNVTVRDEAIVAAVQLSARY
MPARQLPDKAVSLLDTAAAAVSLARQTLPERLRAMESERHLLSDELNWLLREPQDEDMQNRIQSIRDELERLEAGIDDLR
GRYDAEMAELAALSEEQPAETGASNVSHLRPATEMRPANAERLVPTVVDREAIAAVVSRWTGIPLGKLLADQIESARTLD
VRMRQRVVGQDAAITRIADAMRTARAGLSDPRRPPAVFFLVGMSGTGKTETALSLADLLYGGNSHLTTINMSEFKEEHKV
SLLLGSPPGYVGFGEGGVLTEAVRRRPFGVLLLDEIDKAHPGVQDIFYQVFDKGVLRDGEGRDVDFKNTTIFMTANTGSE
LLSALSADPDTMPEGEALEALLMPELTKQFKPAFLGRTIILPFMPLGAEELASVVDMQIGKIRDRVLATYGTDLRLSDAA
RDALVARAGASEIGARAIEIMIGKDLLPPLSSFFLEKVIAGERVGKIVVDFGENGFGIRAEAAGEADEFAVTEEVGVDKV
AASDGATRRMRH*
MSHIDLNRLVGALEPDLRVTLEAAASVAVRMGHRYVDIPHWLLAVVDAGIYAETFDELKIPLPVLKAEIGRSLEEAIIGD
GEALSLSQNILTAAREAWILASLEAGRDRVTLCDLLLAMDEETSLRSFVRSAFPSLKAMDRGALERLRSSAENGAGVDVP
SALTSSGEAGSAQAAGQNDFLRLYTHDMTTDARNGKVDPVIGRDDELRQLVDILTRRRQNNPILVGEAGVGKTAVAEALA
LEIASGNVPEKLRNVCLLNLDISLLQAGAGVKGEFERRLHGVIDAVKRSAEPVILFIDEAHGLVGAGGAAGQGDAANILK
PALARGEVRTVAATTWSEYKKYFEKDAALTRRFQPVHVREPDEATAIRMLRGVADTFVSHHNVTVRDEAIVAAVQLSARY
MPARQLPDKAVSLLDTAAAAVSLARQTLPERLRAMESERHLLSDELNWLLREPQDEDMQNRIQSIRDELERLEAGIDDLR
GRYDAEMAELAALSEEQPAETGASNVSHLRPATEMRPANAERLVPTVVDREAIAAVVSRWTGIPLGKLLADQIESARTLD
VRMRQRVVGQDAAITRIADAMRTARAGLSDPRRPPAVFFLVGMSGTGKTETALSLADLLYGGNSHLTTINMSEFKEEHKV
SLLLGSPPGYVGFGEGGVLTEAVRRRPFGVLLLDEIDKAHPGVQDIFYQVFDKGVLRDGEGRDVDFKNTTIFMTANTGSE
LLSALSADPDTMPEGEALEALLMPELTKQFKPAFLGRTIILPFMPLGAEELASVVDMQIGKIRDRVLATYGTDLRLSDAA
RDALVARAGASEIGARAIEIMIGKDLLPPLSSFFLEKVIAGERVGKIVVDFGENGFGIRAEAAGEADEFAVTEEVGVDKV
AASDGATRRMRH*
Nucleotide sequence: 2679 bp
>T6CP000122 NC_003063:1477260-1479938 [Agrobacterium fabrum str. C58] [TssH]
ATGTCGCATATCGATCTCAATCGCCTCGTTGGAGCGCTTGAGCCAGACCTGCGCGTTACCCTCGAAGCTGCGGCTTCCGT
TGCGGTGCGTATGGGGCACAGGTACGTGGATATACCGCATTGGCTGCTGGCGGTTGTAGATGCCGGTATCTATGCGGAAA
CATTCGACGAATTAAAAATTCCACTTCCGGTATTGAAGGCTGAAATCGGCCGCAGTCTGGAAGAGGCCATTATCGGCGAC
GGGGAGGCCTTGTCGCTCTCCCAGAACATCCTCACGGCAGCCCGGGAGGCATGGATTCTTGCGTCGCTGGAGGCTGGCCG
TGACCGCGTGACGCTTTGCGATCTCCTGTTGGCGATGGATGAGGAAACCTCACTTCGCTCCTTTGTCCGCTCGGCATTTC
CGTCTCTCAAGGCGATGGATCGCGGTGCGCTTGAGCGTCTTCGCAGCTCGGCCGAGAACGGCGCAGGCGTGGATGTTCCT
TCCGCGCTGACAAGTTCAGGAGAGGCAGGCTCGGCACAGGCCGCCGGCCAGAATGATTTCTTGCGCCTTTACACCCACGA
CATGACCACCGATGCCCGCAACGGCAAGGTCGATCCTGTTATCGGCCGCGATGATGAGTTGCGCCAGCTCGTTGATATTC
TGACCCGCCGCCGACAGAACAATCCTATTCTGGTGGGCGAGGCCGGCGTGGGCAAGACCGCGGTCGCCGAAGCGCTGGCG
TTGGAAATCGCCTCCGGCAACGTTCCGGAGAAACTGCGCAATGTCTGCCTGCTGAACCTCGATATTTCGCTGCTGCAGGC
CGGTGCCGGCGTGAAAGGTGAGTTCGAGCGGCGCCTGCACGGCGTGATCGATGCGGTCAAGCGTTCGGCCGAACCGGTCA
TCCTGTTCATCGATGAGGCCCATGGCCTTGTTGGCGCGGGTGGTGCGGCGGGCCAGGGTGATGCGGCGAATATTCTTAAG
CCCGCACTCGCACGAGGCGAGGTGCGGACCGTCGCGGCAACGACGTGGAGTGAATACAAGAAATATTTCGAGAAGGATGC
GGCGCTGACGCGTCGTTTCCAGCCAGTGCATGTGCGCGAGCCGGACGAGGCGACTGCGATACGGATGTTGCGCGGTGTGG
CGGATACTTTCGTCAGCCATCACAACGTTACCGTTCGTGATGAGGCGATCGTGGCTGCCGTGCAATTATCCGCGCGCTAC
ATGCCCGCTCGGCAATTGCCGGACAAGGCGGTCAGCCTGCTCGATACCGCCGCCGCGGCCGTTTCGCTGGCGCGGCAGAC
GTTGCCAGAGCGGTTACGTGCCATGGAAAGCGAGCGACACCTTCTGTCTGACGAACTGAACTGGCTTCTGCGTGAACCGC
AGGATGAGGACATGCAAAACCGCATCCAGTCGATACGCGACGAGCTTGAACGGCTTGAGGCCGGGATCGATGATCTGCGC
GGCCGTTATGACGCGGAAATGGCTGAACTGGCGGCGCTCAGTGAAGAGCAGCCGGCCGAAACCGGCGCGTCGAATGTTTC
GCACCTGCGGCCTGCCACGGAGATGCGGCCCGCAAATGCCGAAAGACTTGTTCCGACAGTGGTTGATCGTGAAGCGATCG
CCGCCGTCGTCTCGCGATGGACCGGGATTCCGCTCGGCAAACTTCTGGCGGACCAGATCGAAAGCGCCCGCACTCTGGAT
GTGCGCATGCGGCAGCGCGTGGTTGGGCAGGATGCCGCAATTACCCGGATTGCCGATGCCATGCGCACCGCCCGTGCCGG
ATTGTCCGATCCGCGCCGTCCGCCGGCGGTGTTTTTCCTTGTCGGCATGTCCGGAACGGGCAAGACCGAAACGGCGCTGT
CGCTGGCCGATCTTCTCTATGGCGGCAACAGCCATCTGACGACGATCAACATGTCGGAGTTCAAGGAGGAGCACAAGGTT
TCCCTGCTTCTCGGCTCACCGCCTGGCTATGTCGGTTTCGGCGAAGGTGGGGTGCTGACGGAAGCGGTGCGCCGCCGGCC
GTTCGGCGTTCTGCTTCTCGATGAAATCGACAAGGCTCACCCCGGCGTCCAGGATATTTTTTATCAGGTGTTCGACAAGG
GCGTGCTTCGCGATGGCGAAGGCCGCGACGTCGATTTCAAGAACACCACGATCTTCATGACGGCCAATACCGGTTCGGAA
TTGCTGTCGGCGCTTTCCGCCGATCCCGACACTATGCCGGAAGGTGAGGCGCTGGAAGCGCTTCTGATGCCGGAGCTTAC
CAAACAGTTCAAGCCCGCATTCCTCGGGCGCACGATCATCCTGCCCTTCATGCCGCTTGGCGCGGAGGAGCTTGCAAGTG
TCGTGGACATGCAGATTGGCAAGATCAGGGACCGTGTGCTCGCAACATATGGCACCGACCTGAGATTGTCCGATGCGGCG
CGCGATGCCTTGGTGGCGCGCGCCGGTGCAAGTGAAATCGGCGCACGCGCCATCGAAATCATGATCGGCAAGGATCTCCT
GCCGCCTCTTTCCAGCTTTTTCCTCGAGAAGGTCATCGCGGGAGAACGCGTCGGGAAGATCGTTGTCGATTTTGGTGAAA
ACGGGTTCGGCATTCGTGCGGAAGCGGCTGGGGAAGCAGATGAATTCGCCGTAACCGAAGAGGTTGGGGTGGATAAAGTT
GCCGCGTCGGATGGGGCTACCCGACGCATGCGGCATTAA
ATGTCGCATATCGATCTCAATCGCCTCGTTGGAGCGCTTGAGCCAGACCTGCGCGTTACCCTCGAAGCTGCGGCTTCCGT
TGCGGTGCGTATGGGGCACAGGTACGTGGATATACCGCATTGGCTGCTGGCGGTTGTAGATGCCGGTATCTATGCGGAAA
CATTCGACGAATTAAAAATTCCACTTCCGGTATTGAAGGCTGAAATCGGCCGCAGTCTGGAAGAGGCCATTATCGGCGAC
GGGGAGGCCTTGTCGCTCTCCCAGAACATCCTCACGGCAGCCCGGGAGGCATGGATTCTTGCGTCGCTGGAGGCTGGCCG
TGACCGCGTGACGCTTTGCGATCTCCTGTTGGCGATGGATGAGGAAACCTCACTTCGCTCCTTTGTCCGCTCGGCATTTC
CGTCTCTCAAGGCGATGGATCGCGGTGCGCTTGAGCGTCTTCGCAGCTCGGCCGAGAACGGCGCAGGCGTGGATGTTCCT
TCCGCGCTGACAAGTTCAGGAGAGGCAGGCTCGGCACAGGCCGCCGGCCAGAATGATTTCTTGCGCCTTTACACCCACGA
CATGACCACCGATGCCCGCAACGGCAAGGTCGATCCTGTTATCGGCCGCGATGATGAGTTGCGCCAGCTCGTTGATATTC
TGACCCGCCGCCGACAGAACAATCCTATTCTGGTGGGCGAGGCCGGCGTGGGCAAGACCGCGGTCGCCGAAGCGCTGGCG
TTGGAAATCGCCTCCGGCAACGTTCCGGAGAAACTGCGCAATGTCTGCCTGCTGAACCTCGATATTTCGCTGCTGCAGGC
CGGTGCCGGCGTGAAAGGTGAGTTCGAGCGGCGCCTGCACGGCGTGATCGATGCGGTCAAGCGTTCGGCCGAACCGGTCA
TCCTGTTCATCGATGAGGCCCATGGCCTTGTTGGCGCGGGTGGTGCGGCGGGCCAGGGTGATGCGGCGAATATTCTTAAG
CCCGCACTCGCACGAGGCGAGGTGCGGACCGTCGCGGCAACGACGTGGAGTGAATACAAGAAATATTTCGAGAAGGATGC
GGCGCTGACGCGTCGTTTCCAGCCAGTGCATGTGCGCGAGCCGGACGAGGCGACTGCGATACGGATGTTGCGCGGTGTGG
CGGATACTTTCGTCAGCCATCACAACGTTACCGTTCGTGATGAGGCGATCGTGGCTGCCGTGCAATTATCCGCGCGCTAC
ATGCCCGCTCGGCAATTGCCGGACAAGGCGGTCAGCCTGCTCGATACCGCCGCCGCGGCCGTTTCGCTGGCGCGGCAGAC
GTTGCCAGAGCGGTTACGTGCCATGGAAAGCGAGCGACACCTTCTGTCTGACGAACTGAACTGGCTTCTGCGTGAACCGC
AGGATGAGGACATGCAAAACCGCATCCAGTCGATACGCGACGAGCTTGAACGGCTTGAGGCCGGGATCGATGATCTGCGC
GGCCGTTATGACGCGGAAATGGCTGAACTGGCGGCGCTCAGTGAAGAGCAGCCGGCCGAAACCGGCGCGTCGAATGTTTC
GCACCTGCGGCCTGCCACGGAGATGCGGCCCGCAAATGCCGAAAGACTTGTTCCGACAGTGGTTGATCGTGAAGCGATCG
CCGCCGTCGTCTCGCGATGGACCGGGATTCCGCTCGGCAAACTTCTGGCGGACCAGATCGAAAGCGCCCGCACTCTGGAT
GTGCGCATGCGGCAGCGCGTGGTTGGGCAGGATGCCGCAATTACCCGGATTGCCGATGCCATGCGCACCGCCCGTGCCGG
ATTGTCCGATCCGCGCCGTCCGCCGGCGGTGTTTTTCCTTGTCGGCATGTCCGGAACGGGCAAGACCGAAACGGCGCTGT
CGCTGGCCGATCTTCTCTATGGCGGCAACAGCCATCTGACGACGATCAACATGTCGGAGTTCAAGGAGGAGCACAAGGTT
TCCCTGCTTCTCGGCTCACCGCCTGGCTATGTCGGTTTCGGCGAAGGTGGGGTGCTGACGGAAGCGGTGCGCCGCCGGCC
GTTCGGCGTTCTGCTTCTCGATGAAATCGACAAGGCTCACCCCGGCGTCCAGGATATTTTTTATCAGGTGTTCGACAAGG
GCGTGCTTCGCGATGGCGAAGGCCGCGACGTCGATTTCAAGAACACCACGATCTTCATGACGGCCAATACCGGTTCGGAA
TTGCTGTCGGCGCTTTCCGCCGATCCCGACACTATGCCGGAAGGTGAGGCGCTGGAAGCGCTTCTGATGCCGGAGCTTAC
CAAACAGTTCAAGCCCGCATTCCTCGGGCGCACGATCATCCTGCCCTTCATGCCGCTTGGCGCGGAGGAGCTTGCAAGTG
TCGTGGACATGCAGATTGGCAAGATCAGGGACCGTGTGCTCGCAACATATGGCACCGACCTGAGATTGTCCGATGCGGCG
CGCGATGCCTTGGTGGCGCGCGCCGGTGCAAGTGAAATCGGCGCACGCGCCATCGAAATCATGATCGGCAAGGATCTCCT
GCCGCCTCTTTCCAGCTTTTTCCTCGAGAAGGTCATCGCGGGAGAACGCGTCGGGAAGATCGTTGTCGATTTTGGTGAAA
ACGGGTTCGGCATTCGTGCGGAAGCGGCTGGGGAAGCAGATGAATTCGCCGTAACCGAAGAGGTTGGGGTGGATAAAGTT
GCCGCGTCGGATGGGGCTACCCGACGCATGCGGCATTAA