Detailed information on component protein
Summary
Component ID | T6CP005150 |
Component protein type | TssM |
T6SS ID (Type) | T6SS00485 (Type i1) |
Strain | Escherichia coli 042 |
Replicon | chromosome |
Sequence | Protein sequence (1146 a.a.); Nucleotide sequence (3438 bp) |
Reference |
External database links
Locus tag (Gene) | EC042_RS01150 |
Coordinate (Strand) | 245395..248832 (-) |
NCBI ID | WP_122986661.1 |
RefSeq | NC_017626 |
Uniprot ID | - |
KEGG ID | - |
PDB ID | - |
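The coordinate field above can be cross-checked against the reported nucleotide length. The following is a minimal sketch, assuming the field always has the form `start..end (strand)` with inclusive coordinates; the function name `parse_coordinate` is hypothetical and not part of any database tooling.

```python
def parse_coordinate(field: str) -> tuple[int, int, str]:
    """Split a 'start..end (strand)' field into start, end and strand."""
    span, strand = field.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


start, end, strand = parse_coordinate("245395..248832 (-)")
length_bp = end - start + 1  # both endpoints are counted
print(length_bp, strand)     # 3438 -
```

The computed span agrees with the 3438 bp listed for the nucleotide sequence, and the length is a multiple of three, as expected for a coding region.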
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
IcmF-related | PF06761.14 | 2e-66 | 478..789 |
IcmF-related_N | PF14331.8 | 2.1e-59 | 181..426 |
IcmF_C | PF06744.14 | 8.5e-20 | 1019..1123 |
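The table lists hits in order of E-value rather than position. As an illustration only, the sketch below re-orders the three hits along the protein and checks that their aligned regions do not overlap; the tuple layout and variable names are assumptions made for this example.

```python
# (domain name, aligned start, aligned end), copied from the table above
hits = [
    ("IcmF-related", 478, 789),
    ("IcmF-related_N", 181, 426),
    ("IcmF_C", 1019, 1123),
]

# Sort by start position to obtain the N- to C-terminal domain order.
ordered = sorted(hits, key=lambda h: h[1])
order = [name for name, _, _ in ordered]

# After sorting by start, checking neighbouring pairs is enough to find overlaps.
overlaps = [(a[0], b[0]) for a, b in zip(ordered, ordered[1:]) if b[1] <= a[2]]

print(order)     # ['IcmF-related_N', 'IcmF-related', 'IcmF_C']
print(overlaps)  # []
```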
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Inside | 1-1 | M |
Transmembrane helix | 2-16 | LALAWIFLLVWIWWQ |
Outside | 17-30 | GPKWTLYEQHWLAP |
Transmembrane helix | 31-53 | LTNRWLATAVWGLIALIWLTWRV |
Inside | 54-421 | MKRLQKLEKQQKQQREEEKDPLTVELHRQQQYLDHWLLRLRRHLDNRRYLWQLPWYMVIGPAGSGKSALLREGFPSDIIYTPESIRGTEYHPLITPRVGNQAVIFDVDGVLTSPGGDDLLHRRLREHWLGWLMQTRARQPLNGLILTLDLPDLLTADKSRRETLVQNLRQQLQEIRQSLHCRLPVYVVLTRLDLLTGFAALFHSLDKKDRDAILGVTFTRRAHESDDWRSELGAFWQTWVQQVNLALSDLMLAQTGAAPRSAVFSFSRQMQGTGEIVTALLAALLDGENMDVMLRGVWLTSSLQRGQVDDIFTQSAARQYGLGNSSLATWPLVETTPYFTRRLFPEVLLAEPNLAGENSVWLNSSRRR |
Transmembrane helix | 422-444 | LTAFSACGAALAALLVGSWHHYY |
Outside | 445-1145 | NQNWQSGVNVLAQAKAFMDVPPPQGTDEFGNLQLSLLNPVRDATLAYGDYRDRGFLADMGLYQGVRVGPYVEQTYIQLLEQRYLPSLMNGLIRDLNNAPPESEEKLAVLRVLRMMEDKSGRNNEAVKQYMARRWSNEFHGQRDIQAQLMAHLDYALEHTDWHAQRQSGDSDAVSRWTPYDKPVINAQQELSKLPIYQRVYQTLRTKALSVLPADLNLRDQVGPTFDNVFVAGNDEKLVIPQFLTRYGLQSYFVKQREGLVELTALDSWVLNLTQSVAYSEADREEIQRHITEQYISDYTATWRAGMDNLNVRDYEAMSALTDALEQIISGDQPFQRALTALRDNTHALTLSGKLDDKAREAAINEMDYRLLSRLGHEFAPENSALEEQKDKASTLQAVYQQLTELHRYLLAIQNSPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGKLADQAWHVVMVEAVRYMEVDWRDNVVKPFNEQLADNYPFNPRATQDASLDSFERFFKPDGILDNFYKNNLRLFLENDLTFGDDGRVLIREDIRQQLDTAQKIRDIFFSQQNGLGAQFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGRAPHSIAFSGPWAQFRLFGAGQLTNVTSDTFNVRFNVDGGAMVYRVHVDTEDNPFTGGLFSLFRLPDTLY |
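The predicted regions should tile the protein without gaps. The sketch below, which works only from the region coordinates in the table and is not part of TMHMM itself, checks that each segment starts one residue after the previous one ends and reports the length of each predicted helix. The helper name `is_contiguous` is hypothetical.

```python
# (prediction, start, end), copied from the TMHMM table above
segments = [
    ("Inside", 1, 1),
    ("Transmembrane helix", 2, 16),
    ("Outside", 17, 30),
    ("Transmembrane helix", 31, 53),
    ("Inside", 54, 421),
    ("Transmembrane helix", 422, 444),
    ("Outside", 445, 1145),
]


def is_contiguous(segs):
    """True if segments start at residue 1 and follow each other without gaps."""
    if segs[0][1] != 1:
        return False
    return all(nxt[1] == prev[2] + 1 for prev, nxt in zip(segs, segs[1:]))


helix_lengths = [end - start + 1
                 for label, start, end in segments
                 if label == "Transmembrane helix"]

print(is_contiguous(segments))  # True
print(helix_lengths)            # [15, 23, 23]
```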
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9962 | - | - |
Protein sequence: 1146 a.a. (1145 residues plus the terminal stop symbol `*`)
>T6CP005150 NC_017626:c248832-245395 [Escherichia coli 042] [TssM]
MLALAWIFLLVWIWWQGPKWTLYEQHWLAPLTNRWLATAVWGLIALIWLTWRVMKRLQKLEKQQKQQREEEKDPLTVELH
RQQQYLDHWLLRLRRHLDNRRYLWQLPWYMVIGPAGSGKSALLREGFPSDIIYTPESIRGTEYHPLITPRVGNQAVIFDV
DGVLTSPGGDDLLHRRLREHWLGWLMQTRARQPLNGLILTLDLPDLLTADKSRRETLVQNLRQQLQEIRQSLHCRLPVYV
VLTRLDLLTGFAALFHSLDKKDRDAILGVTFTRRAHESDDWRSELGAFWQTWVQQVNLALSDLMLAQTGAAPRSAVFSFS
RQMQGTGEIVTALLAALLDGENMDVMLRGVWLTSSLQRGQVDDIFTQSAARQYGLGNSSLATWPLVETTPYFTRRLFPEV
LLAEPNLAGENSVWLNSSRRRLTAFSACGAALAALLVGSWHHYYNQNWQSGVNVLAQAKAFMDVPPPQGTDEFGNLQLSL
LNPVRDATLAYGDYRDRGFLADMGLYQGVRVGPYVEQTYIQLLEQRYLPSLMNGLIRDLNNAPPESEEKLAVLRVLRMME
DKSGRNNEAVKQYMARRWSNEFHGQRDIQAQLMAHLDYALEHTDWHAQRQSGDSDAVSRWTPYDKPVINAQQELSKLPIY
QRVYQTLRTKALSVLPADLNLRDQVGPTFDNVFVAGNDEKLVIPQFLTRYGLQSYFVKQREGLVELTALDSWVLNLTQSV
AYSEADREEIQRHITEQYISDYTATWRAGMDNLNVRDYEAMSALTDALEQIISGDQPFQRALTALRDNTHALTLSGKLDD
KAREAAINEMDYRLLSRLGHEFAPENSALEEQKDKASTLQAVYQQLTELHRYLLAIQNSPVPGKSALKAVQLRLDQNSSD
PIFATRQMAKTLPAPLNRWVGKLADQAWHVVMVEAVRYMEVDWRDNVVKPFNEQLADNYPFNPRATQDASLDSFERFFKP
DGILDNFYKNNLRLFLENDLTFGDDGRVLIREDIRQQLDTAQKIRDIFFSQQNGLGAQFAVETVSLSGNKRRSVLNLDGQ
LVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGRAPHSIAFSGPWAQFRLFGAGQLTNVTSDTFNVRFNVDGGAMVYR
VHVDTEDNPFTGGLFSLFRLPDTLY*
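The FASTA header packs the component ID, replicon, coordinates, strain and protein type into one line. The following is a minimal sketch of pulling those fields apart with a regular expression. It assumes the header layout seen in this record, with a leading `c` marking the complementary strand; that reading is consistent with the `(-)` strand listed above, but the pattern and the function name `parse_header` are illustrative rather than an official format definition.

```python
import re

HEADER_RE = re.compile(
    r"^>(\S+)\s+(\S+):(c?)(\d+)-(\d+)\s+\[(.+?)\]\s+\[(.+?)\]$"
)


def parse_header(line: str) -> dict:
    """Parse a header of the form '>ID REPLICON:[c]A-B [strain] [type]'."""
    m = HEADER_RE.match(line.strip())
    if m is None:
        raise ValueError("unrecognised header layout")
    comp_id, replicon, comp, a, b, strain, ptype = m.groups()
    a, b = int(a), int(b)
    return {
        "component_id": comp_id,
        "replicon": replicon,
        "strand": "-" if comp else "+",
        "start": min(a, b),
        "end": max(a, b),
        "strain": strain,
        "type": ptype,
    }


record = parse_header(
    ">T6CP005150 NC_017626:c248832-245395 [Escherichia coli 042] [TssM]"
)
print(record["strand"], record["start"], record["end"])  # - 245395 248832
```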
Nucleotide sequence: 3438 bp
>T6CP005150 NC_017626:c248832-245395 [Escherichia coli 042] [TssM]
CTGCTGGCGCTGGCATGGATTTTTCTGCTGGTGTGGATCTGGTGGCAAGGTCCGAAATGGACGCTCTATGAGCAGCACTG
GCTGGCTCCTCTAACAAACCGCTGGCTGGCGACCGCCGTCTGGGGGCTTATCGCTCTGATCTGGCTCACCTGGCGGGTAA
TGAAGCGTCTGCAAAAGCTGGAAAAACAGCAGAAACAGCAGCGGGAGGAAGAAAAAGATCCGTTGACCGTGGAACTCCAC
CGCCAGCAGCAATATCTGGATCACTGGCTGCTGCGCCTGCGCCGCCATCTGGATAACCGCCGTTATCTGTGGCAGTTGCC
GTGGTATATGGTCATTGGTCCTGCGGGTAGCGGTAAAAGCGCTCTGCTGCGCGAGGGCTTTCCATCTGACATTATTTACA
CGCCGGAAAGCATCCGGGGTACGGAATACCATCCGCTGATCACACCGCGAGTGGGCAACCAAGCGGTGATTTTCGATGTT
GACGGCGTACTGACCTCGCCCGGCGGGGATGATCTGCTCCACCGCCGCCTGCGCGAACACTGGCTGGGCTGGCTGATGCA
AACGCGCGCGCGCCAGCCGCTCAACGGCCTGATCCTGACGCTCGATCTTCCCGATCTGCTGACGGCGGATAAATCCCGCC
GTGAGACACTGGTACAAAATTTGCGCCAGCAACTTCAGGAGATCCGCCAGAGTCTGCACTGCCGTCTGCCCGTTTACGTG
GTGCTGACACGGCTGGATCTGCTGACCGGCTTTGCCGCGCTGTTCCATTCACTGGATAAAAAAGACCGCGATGCGATCCT
CGGCGTCACGTTTACCCGCCGCGCCCATGAAAGTGACGACTGGCGCAGCGAACTGGGGGCTTTCTGGCAGACGTGGGTAC
AACAGGTGAACCTGGCGCTGTCGGATCTGATGCTCGCACAAACCGGTGCTGCTCCCCGCAGCGCCGTGTTCAGCTTCTCC
CGTCAGATGCAGGGAACAGGAGAAATCGTCACCGCACTGCTCGCCGCATTGCTGGACGGTGAGAACATGGATGTAATGCT
GCGTGGCGTCTGGCTCACATCATCGCTACAGCGTGGCCAGGTGGATGATATTTTCACGCAGTCCGCCGCCCGCCAGTACG
GGCTGGGTAACAGCTCGCTGGCAACCTGGCCTCTGGTGGAGACGACGCCGTATTTTACTCGCCGCCTCTTCCCTGAAGTC
CTGCTGGCTGAGCCGAACCTGGCGGGTGAAAACAGCGTCTGGCTGAACAGCTCCCGGCGCAGGCTGACCGCCTTTTCCGC
CTGTGGCGCGGCGCTGGCGGCATTGCTGGTCGGAAGCTGGCACCATTATTACAATCAGAACTGGCAGTCCGGCGTTAACG
TACTGGCACAGGCTAAAGCCTTTATGGACGTACCACCACCGCAAGGAACGGATGAATTCGGCAATCTGCAACTGTCGTTG
CTTAATCCGGTACGCGATGCCACCCTGGCCTATGGCGATTACCGCGATCGCGGTTTTCTGGCGGATATGGGATTGTACCA
GGGCGTCCGCGTAGGGCCGTATGTGGAGCAAACCTACATTCAGCTTCTTGAGCAGCGTTATCTCCCCTCGTTAATGAACG
GCCTGATCCGGGATCTAAACAATGCCCCGCCAGAGAGCGAAGAAAAGCTCGCCGTGCTGCGCGTACTGCGCATGATGGAA
GACAAAAGTGGGCGCAACAACGAGGCGGTAAAACAGTACATGGCGCGGCGCTGGAGCAATGAATTTCACGGCCAGCGCGA
TATTCAGGCGCAACTGATGGCGCATCTGGACTATGCGCTGGAGCACACCGACTGGCACGCGCAGCGCCAGAGCGGTGACA
GCGATGCTGTCAGCCGCTGGACCCCCTATGATAAACCGGTCATTAATGCGCAGCAGGAACTGAGCAAGCTGCCCATATAC
CAGCGTGTCTACCAGACCCTGCGCACCAAAGCATTAAGCGTGTTGCCCGCCGATTTGAATTTGCGCGACCAGGTTGGTCC
CACCTTCGACAACGTGTTCGTCGCCGGTAATGATGAAAAACTGGTGATCCCGCAGTTCCTCACCCGCTATGGACTGCAAA
GCTATTTTGTCAAACAGCGTGAGGGCCTCGTTGAGCTGACCGCGCTGGATTCGTGGGTACTGAACCTGACGCAAAGCGTC
GCCTACAGCGAGGCCGACCGTGAAGAGATCCAGCGCCATATCACCGAACAGTACATCAGTGACTATACCGCCACCTGGCG
TGCCGGAATGGATAACCTCAACGTCCGTGACTATGAGGCCATGTCGGCGCTGACCGACGCGCTGGAGCAGATTATCAGCG
GCGATCAGCCATTCCAGCGTGCGCTGACGGCGCTGCGCGATAATACCCACGCGCTGACGCTCTCCGGCAAACTGGATGAT
AAGGCGAGGGAAGCGGCAATAAATGAGATGGATTACCGCCTGTTATCCCGGCTGGGGCATGAGTTCGCACCGGAAAACAG
CGCACTGGAGGAGCAAAAGGACAAGGCGAGTACGCTACAGGCCGTGTACCAGCAACTGACCGAGCTGCACCGTTACCTGC
TGGCGATCCAGAACTCGCCAGTGCCGGGGAAATCGGCGCTGAAAGCAGTACAGCTACGTCTGGATCAAAACAGCAGCGAT
CCAATCTTCGCTACCCGCCAGATGGCAAAAACTCTCCCTGCACCGCTTAACCGCTGGGTAGGTAAGCTCGCGGATCAGGC
CTGGCATGTGGTGATGGTGGAAGCCGTTCGTTACATGGAAGTGGACTGGCGCGACAATGTAGTGAAACCCTTCAACGAGC
AGCTTGCCGATAACTATCCGTTTAATCCGCGCGCCACACAGGATGCCTCACTGGATTCGTTTGAACGTTTCTTTAAACCG
GATGGCATTCTGGACAATTTCTACAAGAACAACCTGCGCCTGTTCCTTGAAAACGATCTGACCTTTGGCGACGACGGCAG
AGTGTTAATCCGTGAAGATATCCGGCAGCAACTGGATACCGCGCAGAAAATCCGCGACATCTTCTTCAGCCAGCAGAACG
GGCTGGGCGCACAGTTTGCCGTGGAAACCGTATCGCTTTCCGGCAATAAGCGGCGCAGCGTACTTAACCTGGACGGCCAG
TTAGTGGACTACAGCCAGGGACGCAACTACACCGCCCATCTGGTCTGGCCGAACAACATGCGTGAAGGCAATGAAAGCAA
GCTGACGCTGATTGGCACCAGCGGCAGAGCACCGCACAGTATCGCGTTCAGTGGACCGTGGGCGCAGTTCCGCCTGTTCG
GCGCGGGCCAGTTGACCAATGTGACCAGTGACACCTTTAACGTGCGCTTTAACGTGGACGGCGGCGCAATGGTTTACCGG
GTGCATGTGGATACCGAAGATAACCCGTTCACCGGCGGTCTGTTCAGCCTGTTCCGTTTACCGGATACGTTGTATTAA
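As a consistency check between the two sequences, the last line of the nucleotide record can be translated and compared with the last line of the protein record. The sketch below builds the standard genetic code table by hand and translates only that final line, assuming it begins in frame, which holds if each preceding line carries 80 nucleotides. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    if len(dna) % 3 != 0:
        raise ValueError("length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


last_line = ("GTGCATGTGGATACCGAAGATAACCCGTTCACCGGCGGTCTGTTCAGC"
             "CTGTTCCGTTTACCGGATACGTTGTATTAA")
tail = translate(last_line)
print(tail)  # VHVDTEDNPFTGGLFSLFRLPDTLY*
```

The result matches the final line of the protein sequence, including the terminal `*` contributed by the TAA stop codon.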