Detailed information of component protein
Summary
External database links
Locus tag (Gene) | ATU_RS20285 |
Coordinate (Strand) | 1460756..1464235 (-) |
NCBI ID | WP_010973755.1 |
RefSeq | NC_003063 |
UniProt ID | A9CGF9_AGRFC, A0A6V7AAH0_RHIRD |
KEGG ID | atu:Atu4332 |
PDB ID | - |
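The coordinate span can be cross-checked against the sequence lengths reported in this record with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive; the helper names `parse_coordinate` and `span_length` are illustrative only and do not belong to any database API.

```python
def parse_coordinate(text):
    """Parse a string such as '1460756..1464235 (-)' into (start, end, strand)."""
    span, strand = text.split()
    start, end = (int(value) for value in span.split(".."))
    return start, end, strand.strip("()")


def span_length(start, end):
    """Length in bp of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


start, end, strand = parse_coordinate("1460756..1464235 (-)")
length_bp = span_length(start, end)

# Number of complete codons in the span, counting the terminal stop codon.
codons = length_bp // 3

print(start, end, strand, length_bp, codons)
```

Under that assumption the span is 3480 bp on the minus strand, which corresponds to 1160 codons: 1159 residues plus the stop codon.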
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
IcmF-related_N | PF14331.8 | 2.3e-78 | 203..442 |
IcmF-related | PF06761.14 | 9.1e-61 | 496..793 |
IcmF_C | PF06744.14 | 6.6e-18 | 1037..1140 |
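The rows of the Pfam table above can be turned into structured records for sorting or filtering. This is a minimal sketch that assumes the pipe-delimited layout shown here; `parse_pfam_row` is a hypothetical helper, not part of Pfam or any other tool.

```python
PFAM_ROWS = """\
IcmF-related_N | PF14331.8 | 2.3e-78 | 203..442 |
IcmF-related | PF06761.14 | 9.1e-61 | 496..793 |
IcmF_C | PF06744.14 | 6.6e-18 | 1037..1140 |
"""


def parse_pfam_row(line):
    """Split one 'Domain | Pfam ID | E-value | Aligned region |' row into a dict."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    domain, pfam_id, evalue, region = cells
    start, end = (int(value) for value in region.split(".."))
    return {
        "domain": domain,
        "pfam_id": pfam_id,
        "evalue": float(evalue),
        "start": start,
        "end": end,
        "length": end - start + 1,  # inclusive residue range
    }


hits = [parse_pfam_row(line) for line in PFAM_ROWS.splitlines() if line.strip()]

# A smaller E-value means a more significant hit.
best_hit = min(hits, key=lambda hit: hit["evalue"])

for hit in hits:
    print(hit["domain"], hit["start"], hit["end"], hit["length"])
```

With the three rows above, the aligned regions span 240, 298 and 104 residues, and the most significant hit is IcmF-related_N.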
Transmembrane helices
- Transmembrane helices were predicted with TMHMM 2.0.
Prediction | Region | Sequence |
---|---|---|
Inside | 1-24 | MNPLSYFYTIRSYVESYAALVGRR |
Transmembrane helix | 25-47 | FLSLLWVIALCVVVWFYGYLIAL |
Outside | 48-56 | GDFKPLGTV |
Transmembrane helix | 57-79 | QARLIAIGIIVAVWLVYIIVTIY |
Inside | 80-432 | RGRKQDKELVDSIEREALANRQAEIGEIQTRLKEALALLRRVTKKRFGYIYDLPWYVIFGAPGSGKTTALTNSGLQFPLGDALGENAVKGIGGTRNCNWWFADEAILIDTAGRYTTQDDLDGSSKAGWEGFLGLLRRYRRSQPINGALVTLSIPDLLNRDPEEQRQELRSIRQRLSELDEYLHARVPVYIVLTKADLLHGFVEFFDGFNKTDRQQVWGTTFKLDESYSAENLPQRLTEEFELLQQRVDAMLIERLQQEQNAEIRGRIFRFPAELARLKDRLHEALAELCASSPLIEAPLLRGVYFASGTQPETEKSPAASRTRRSYFLSRLFKDVIFPEAALVTRDKRLSRRQ |
Transmembrane helix | 433-455 | LLVRRIAYAVSATAVAIVFTGWI |
Outside | 456-1159 | FTYFANTQALAEADRKLGAYEQLVQGIPVREVADADFLRILPALDNLRDVNSGFARERVWNVSFGLDQEDKIAGRQRDAYQRALNALLLPRMIVQLQKQLKDEKDVTRTFNSLKLYGMLGGMGGLDRDFLTTQTHQMFASLYPGDGRAAAREALDQHAKALADGVLAPIELDARLIATARETIRDQAIGTRAYDILAGLPQVRELMEWTPATAFGPLGERAFERRSKAPMAEGIEGLFTADGYRRVVIPQVAHAARVALSEGWVRGSDDAIKGATVEQVAQAALQIYFDRFEKIWADTLSDLRVKPSQSLGDAVETTRALANERNIVVEAARSIAEATDLRPGANPAALASAAEGDATAAVLAATVNAADPYARLRDMLATKGAATTGEQPNGDKTGGSPSEQLLAHFKLLNEQLARSATTSDEVAKVFDVDSQLTKANQDLLQQARELPAPLDVWVAGVAADVGSLAVKSARSRIAELWTADSASLCSSIVTGRYPFDRASSRDVAIADFTRLFAPTGVFQSFFKQRMEPFVDKTTTPWSWKGTFGAAGIPSSAVAQFENADKISRAFFPNGSETPTVSINVKPVSLTNASSAVMLEIEGERVVYYHGPIQAKSITWPSRENTASLSRIAFQPGGWQQAKTENGDWSPFRLFDGANIENQSGELLRVRFENGVQAAEFDIQFGSVLNPFKLDAIASFACPAQF |
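A quick consistency check on the topology table is to confirm that the predicted regions tile the protein without gaps or overlaps. The sketch below copies the region boundaries from the table above; the function names are illustrative and are not part of TMHMM.

```python
# (label, start, end) tuples copied from the prediction table, 1-based inclusive.
SEGMENTS = [
    ("Inside", 1, 24),
    ("Transmembrane helix", 25, 47),
    ("Outside", 48, 56),
    ("Transmembrane helix", 57, 79),
    ("Inside", 80, 432),
    ("Transmembrane helix", 433, 455),
    ("Outside", 456, 1159),
]


def is_contiguous(segments, first=1):
    """Return True if each segment starts right after the previous one ends."""
    expected_start = first
    for _label, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return True


def covered_length(segments):
    """Total number of residues covered by the segments."""
    return sum(end - start + 1 for _label, start, end in segments)


helix_lengths = [
    end - start + 1
    for label, start, end in SEGMENTS
    if label == "Transmembrane helix"
]

print(is_contiguous(SEGMENTS), covered_length(SEGMENTS), helix_lengths)
```

For this record the regions are contiguous, cover residues 1 to 1159, and include three predicted helices of 23 residues each, leaving the large C-terminal region (456-1159) on the outside.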
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9912 | - | - |
Protein sequence: 1159 a.a. (1160 codons including the stop codon, shown as *)
>T6CP000114 NC_003063:c1464235-1460756 [Agrobacterium fabrum str. C58] [TssM]
MNPLSYFYTIRSYVESYAALVGRRFLSLLWVIALCVVVWFYGYLIALGDFKPLGTVQARLIAIGIIVAVWLVYIIVTIYR
GRKQDKELVDSIEREALANRQAEIGEIQTRLKEALALLRRVTKKRFGYIYDLPWYVIFGAPGSGKTTALTNSGLQFPLGD
ALGENAVKGIGGTRNCNWWFADEAILIDTAGRYTTQDDLDGSSKAGWEGFLGLLRRYRRSQPINGALVTLSIPDLLNRDP
EEQRQELRSIRQRLSELDEYLHARVPVYIVLTKADLLHGFVEFFDGFNKTDRQQVWGTTFKLDESYSAENLPQRLTEEFE
LLQQRVDAMLIERLQQEQNAEIRGRIFRFPAELARLKDRLHEALAELCASSPLIEAPLLRGVYFASGTQPETEKSPAASR
TRRSYFLSRLFKDVIFPEAALVTRDKRLSRRQLLVRRIAYAVSATAVAIVFTGWIFTYFANTQALAEADRKLGAYEQLVQ
GIPVREVADADFLRILPALDNLRDVNSGFARERVWNVSFGLDQEDKIAGRQRDAYQRALNALLLPRMIVQLQKQLKDEKD
VTRTFNSLKLYGMLGGMGGLDRDFLTTQTHQMFASLYPGDGRAAAREALDQHAKALADGVLAPIELDARLIATARETIRD
QAIGTRAYDILAGLPQVRELMEWTPATAFGPLGERAFERRSKAPMAEGIEGLFTADGYRRVVIPQVAHAARVALSEGWVR
GSDDAIKGATVEQVAQAALQIYFDRFEKIWADTLSDLRVKPSQSLGDAVETTRALANERNIVVEAARSIAEATDLRPGAN
PAALASAAEGDATAAVLAATVNAADPYARLRDMLATKGAATTGEQPNGDKTGGSPSEQLLAHFKLLNEQLARSATTSDEV
AKVFDVDSQLTKANQDLLQQARELPAPLDVWVAGVAADVGSLAVKSARSRIAELWTADSASLCSSIVTGRYPFDRASSRD
VAIADFTRLFAPTGVFQSFFKQRMEPFVDKTTTPWSWKGTFGAAGIPSSAVAQFENADKISRAFFPNGSETPTVSINVKP
VSLTNASSAVMLEIEGERVVYYHGPIQAKSITWPSRENTASLSRIAFQPGGWQQAKTENGDWSPFRLFDGANIENQSGEL
LRVRFENGVQAAEFDIQFGSVLNPFKLDAIASFACPAQF*
Nucleotide sequence: 3480 bp
>T6CP000114 NC_003063:c1464235-1460756 [Agrobacterium fabrum str. C58] [TssM]
ATGAATCCATTGAGCTATTTTTATACGATCCGTTCCTATGTCGAGAGCTATGCGGCCCTCGTCGGCCGGCGTTTCCTCTC
GCTTTTATGGGTGATCGCGCTCTGCGTCGTCGTCTGGTTCTACGGCTATCTGATCGCGCTCGGCGATTTCAAACCGCTTG
GAACGGTGCAGGCGCGGCTCATCGCCATCGGCATCATCGTCGCCGTCTGGCTGGTCTACATCATCGTCACGATCTACCGC
GGCCGCAAACAGGACAAGGAACTCGTCGACAGTATCGAGCGCGAGGCGCTGGCCAATCGGCAGGCGGAAATCGGCGAAAT
CCAGACCCGCCTGAAAGAAGCGCTCGCGCTCTTGCGGCGCGTCACGAAAAAGCGCTTCGGTTACATTTACGACTTGCCCT
GGTACGTGATCTTCGGCGCACCCGGCTCCGGCAAGACGACGGCGCTGACCAATTCCGGCCTGCAATTTCCGCTTGGCGAT
GCGCTCGGTGAAAATGCGGTGAAGGGTATCGGTGGCACGCGCAACTGTAACTGGTGGTTCGCGGACGAGGCGATCCTTAT
CGACACTGCCGGTCGCTATACCACCCAGGATGACCTCGACGGTTCCTCGAAGGCCGGCTGGGAAGGCTTTCTTGGCCTTT
TGAGGCGATATCGTCGTTCGCAGCCGATCAATGGCGCATTGGTCACGCTGTCCATTCCCGATCTGCTCAACCGCGATCCC
GAAGAACAGCGGCAGGAGCTGCGCTCCATCCGCCAGCGTCTTTCGGAACTTGACGAATATCTGCATGCCCGCGTTCCGGT
CTATATCGTCCTGACGAAAGCCGACCTGCTGCATGGTTTCGTGGAATTTTTCGATGGATTCAACAAGACCGACCGCCAGC
AGGTGTGGGGCACCACCTTCAAGCTGGATGAGAGCTACTCCGCCGAAAACCTGCCGCAGCGTCTGACGGAAGAATTCGAA
CTGCTACAGCAGCGGGTCGATGCCATGCTGATCGAACGCCTGCAGCAGGAGCAGAATGCGGAAATACGCGGCCGTATCTT
CCGTTTTCCAGCCGAACTCGCGCGGCTGAAAGATCGCCTGCACGAGGCCCTTGCCGAACTTTGCGCAAGTTCGCCTCTCA
TCGAGGCGCCGCTTCTGCGCGGCGTTTATTTCGCGTCGGGAACGCAGCCGGAAACGGAAAAATCTCCCGCGGCATCCAGA
ACCCGCCGCAGCTATTTCCTGTCGCGGTTGTTCAAGGACGTCATCTTCCCTGAAGCCGCCCTGGTGACGCGTGACAAGCG
CCTTTCCCGGCGGCAATTGCTGGTGCGGCGCATTGCTTATGCCGTATCCGCCACGGCGGTCGCCATCGTCTTCACCGGCT
GGATTTTCACCTATTTCGCCAATACGCAGGCGCTGGCCGAAGCGGATCGCAAACTTGGCGCTTATGAGCAGCTCGTTCAG
GGAATTCCCGTGCGCGAGGTCGCGGACGCCGATTTCCTGCGGATCCTCCCCGCGCTCGACAATCTGCGCGATGTCAATTC
CGGCTTTGCCCGCGAACGCGTGTGGAATGTCAGCTTCGGTCTTGATCAGGAAGACAAGATTGCCGGCCGGCAGCGTGACG
CCTATCAGCGCGCACTCAACGCCCTTCTGCTGCCGCGCATGATCGTGCAGCTGCAAAAGCAGCTGAAGGACGAAAAGGAT
GTCACCCGCACGTTCAATTCGCTGAAACTCTACGGAATGCTGGGCGGGATGGGTGGTCTGGACCGCGATTTCCTGACCAC
GCAGACGCATCAGATGTTCGCATCCCTTTATCCCGGTGACGGAAGGGCGGCGGCGAGAGAAGCGCTTGACCAGCATGCAA
AGGCGCTCGCTGACGGCGTGCTCGCCCCGATCGAGCTCGATGCGCGGCTCATCGCCACCGCCCGCGAAACCATCCGCGAT
CAGGCGATTGGCACACGCGCCTATGATATTCTCGCCGGCCTTCCGCAAGTGCGGGAATTGATGGAATGGACACCCGCTAC
GGCTTTCGGCCCGCTGGGGGAGCGTGCTTTCGAGCGTCGCAGCAAAGCGCCGATGGCCGAGGGGATAGAGGGCCTTTTCA
CCGCCGATGGATATCGCCGCGTTGTCATCCCGCAGGTCGCGCATGCGGCCCGCGTCGCGCTTTCCGAAGGATGGGTGCGC
GGTTCTGACGATGCCATCAAGGGTGCGACTGTCGAGCAAGTGGCGCAGGCCGCGCTGCAAATCTATTTCGACAGGTTCGA
GAAAATATGGGCCGACACGCTGTCCGATCTGCGCGTCAAACCATCCCAGAGCCTCGGCGACGCGGTGGAGACAACACGGG
CTCTTGCCAATGAGCGCAATATCGTGGTGGAGGCTGCCAGATCGATCGCAGAGGCGACCGATCTCCGCCCCGGAGCAAAC
CCCGCCGCCCTCGCCTCCGCTGCGGAAGGAGACGCAACCGCAGCCGTTCTGGCCGCTACGGTGAATGCGGCCGATCCCTA
TGCGAGACTGCGCGACATGCTGGCGACGAAAGGCGCAGCTACAACGGGCGAGCAGCCCAATGGCGACAAGACCGGTGGTT
CGCCATCGGAACAGTTACTCGCGCATTTCAAGCTGCTGAACGAACAGCTTGCCCGTTCGGCAACCACCTCCGATGAGGTC
GCCAAGGTGTTCGACGTGGACAGCCAGCTCACCAAGGCCAATCAGGATCTGTTGCAGCAGGCACGCGAATTGCCGGCACC
GCTCGACGTCTGGGTGGCAGGCGTGGCGGCGGATGTCGGCTCGCTCGCGGTCAAGTCCGCACGCAGCCGCATCGCCGAAC
TCTGGACGGCCGATTCAGCCTCCCTGTGCTCCTCGATCGTGACCGGTCGTTATCCCTTCGACCGAGCCTCCTCCCGCGAT
GTCGCGATTGCGGATTTTACCCGTCTCTTCGCGCCGACGGGGGTTTTCCAGAGCTTCTTCAAACAGCGCATGGAACCGTT
TGTCGACAAGACGACGACCCCATGGAGCTGGAAAGGCACCTTCGGCGCCGCCGGCATTCCCAGCAGCGCCGTCGCCCAGT
TCGAAAATGCCGACAAGATTTCGCGCGCCTTTTTCCCGAATGGCAGCGAGACGCCGACGGTTTCGATCAATGTCAAACCC
GTCTCGCTCACCAATGCCTCCAGCGCGGTGATGCTGGAAATCGAAGGCGAGCGGGTCGTTTATTATCATGGACCGATCCA
GGCGAAATCGATTACCTGGCCCTCGCGCGAAAACACGGCGAGCCTGTCGCGCATCGCCTTCCAGCCGGGCGGCTGGCAGC
AGGCGAAGACCGAAAACGGCGACTGGTCGCCCTTCCGGCTGTTTGACGGTGCCAATATCGAAAATCAGTCGGGTGAGCTG
TTGCGGGTGCGGTTTGAGAATGGCGTCCAGGCTGCCGAGTTCGATATTCAGTTCGGCTCGGTTCTCAATCCATTCAAGCT
GGATGCGATTGCCAGCTTCGCCTGCCCTGCGCAGTTCTAG
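The nucleotide record is given on the coding strand, so it can be translated directly to check it against the protein sequence. The sketch below builds the standard codon table by hand (the bacterial table 11 uses the same codon assignments and differs only in permitted start codons) and translates two short excerpts copied from the record above; `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Standard genetic code in NCBI order: bases T, C, A, G for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 21 nt of the nucleotide record (both in frame).
first_codons = "ATGAATCCATTGAGCTATTTTTATACGATC"
last_codons = "GCCTGCCCTGCGCAGTTCTAG"

print(translate(first_codons))
print(translate(last_codons))
```

The two excerpts translate to the first ten residues of the protein record and to its final residues followed by the stop symbol, which matches the FASTA entry. Translating the full 3480 bp sequence the same way should give 1159 residues plus the terminal stop.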