Detailed information of component protein
Summary
Component ID | T6CP010896 |
Component protein type | TssM |
T6SS ID (Type) | T6SS00897 (Type i5) |
Strain | Rhizobium leguminosarum bv. trifolii |
Replicon | chromosome |
Sequence | Protein sequence (1159 a.a.); Nucleotide sequence (3477 bp) |
Reference |
External database links
Locus tag (Gene) | - |
Coordinate (Strand) | 13661..17137 (+) |
NCBI ID | 16326462 |
GenBank | AF361470 |
Uniprot ID | Q93EC2_RHILT |
KEGG ID | - |
PDB ID | - |
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
IcmF-related_N | PF14331.8 | 1.1e-71 | 207..444 |
IcmF-related | PF06761.14 | 4.9e-57 | 498..794 |
IcmF_C | PF06744.14 | 1.9e-20 | 1037..1139 |
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Inside | 1-19 | MNPLSYFYTLRSYVEAYAG |
Transmembrane helix | 20-42 | LIGRRFISIIWVAAICVVIWFYG |
Outside | 43-56 | YLAVYGDFKPLASA |
Transmembrane helix | 57-79 | SARLTLIGIILAAWLAYLVFTTI |
Inside | 80-434 | RDRRRDKQLVDGIERDAEAEAAASQQAEVGEIHGRLKEALQLLRRITKKRFGYIYELPWYVIFGAPGSGKTTALTNSGLKFPLGDALGSNSVQGIGGTRNCNWWFTDEAILIDTAGRYTTQDDLNGTAKAGWEGFLGLLRKYRRSQPINGALVTLSIGDLLTRDPEAQREEIRAIRQRLSELDELLHARVPVYLLLTKADFLTGFVEFFDGFNKSDREQVWGTTFGLDESYKAANLPERFMEEFTLLQQRVDTMLIERLQQEPNPELRGRIFRFPAELTSLKERLHEVVTELCSGSKLVEAPLLRGIYFASSTQREETVTVPRMRRSYFLSRLFKEVIFGEASLVARDKRLSGRQ |
Transmembrane helix | 435-457 | LLFRRAAYASAVLLLAVVLTSWT |
Outside | 458-1158 | ATYIQNTTALAGAERRIDAYEQLVRGVPVRDVSDADFLRILPALDNLAAVTTDFSKTRVWPISLGLDQEGKVASRQREAYQRALNTLLLPRMLVQLQKDMTETTDITRTFDALKLYGMLGGLGPVNADFASLEAEEMFTRLYPDEGRAAARQALIAHADVMARGALPPIELDKALIAKAREVIRSQNIANRAYDILAEYRESRALPAWSPAGALGPLGEQAFERTSKAPLNEGIPGLFSATGYRTVVLPGITDAAREALDEQWVRGDPNPAGVTVDTIAQATLQLYFDAFEQRWSMILTDIRVKPSQTIGDAAETTRILAGKPGPVETITKSIVAATDLRAPGAEIASAAGDSLPASTTALASAANAPDPFGRLRDALKAPAGQSSPDDQPKDAPRSEIGQLEPILQRVQEQLSRATTSTAEVAKVFDVDSQLTDANQDLLQQARKLPAPVDTWMAGLAADVGSLAVKSARSRISDAWAAEGASFCSDVVAGRYPFDRKSPRDVAMSDFIRLFGPEGLFKTFFKEKLEAFVDTSASPWGWKGTFGAVSIPSAAIAQFENADKINRAFFPAGSENPSISINVKPVSLTEAASAVMLEIEGERVVYFHGPIQSKSITWPSTDASNVSRLAFQPGGWQQALTENGDWSPFRLFDDADLAIQGDDLFRAKFRQNGQAAEFDVQFGSVLNPFRLEALGAFSCPAQF |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9895 | - | - |
Protein sequence: 1159 a.a.
>T6CP010896 AF361470:13661-17137 [Rhizobium leguminosarum bv. trifolii] [TssM]
MNPLSYFYTLRSYVEAYAGLIGRRFISIIWVAAICVVIWFYGYLAVYGDFKPLASASARLTLIGIILAAWLAYLVFTTIR
DRRRDKQLVDGIERDAEAEAAASQQAEVGEIHGRLKEALQLLRRITKKRFGYIYELPWYVIFGAPGSGKTTALTNSGLKF
PLGDALGSNSVQGIGGTRNCNWWFTDEAILIDTAGRYTTQDDLNGTAKAGWEGFLGLLRKYRRSQPINGALVTLSIGDLL
TRDPEAQREEIRAIRQRLSELDELLHARVPVYLLLTKADFLTGFVEFFDGFNKSDREQVWGTTFGLDESYKAANLPERFM
EEFTLLQQRVDTMLIERLQQEPNPELRGRIFRFPAELTSLKERLHEVVTELCSGSKLVEAPLLRGIYFASSTQREETVTV
PRMRRSYFLSRLFKEVIFGEASLVARDKRLSGRQLLFRRAAYASAVLLLAVVLTSWTATYIQNTTALAGAERRIDAYEQL
VRGVPVRDVSDADFLRILPALDNLAAVTTDFSKTRVWPISLGLDQEGKVASRQREAYQRALNTLLLPRMLVQLQKDMTET
TDITRTFDALKLYGMLGGLGPVNADFASLEAEEMFTRLYPDEGRAAARQALIAHADVMARGALPPIELDKALIAKAREVI
RSQNIANRAYDILAEYRESRALPAWSPAGALGPLGEQAFERTSKAPLNEGIPGLFSATGYRTVVLPGITDAAREALDEQW
VRGDPNPAGVTVDTIAQATLQLYFDAFEQRWSMILTDIRVKPSQTIGDAAETTRILAGKPGPVETITKSIVAATDLRAPG
AEIASAAGDSLPASTTALASAANAPDPFGRLRDALKAPAGQSSPDDQPKDAPRSEIGQLEPILQRVQEQLSRATTSTAEV
AKVFDVDSQLTDANQDLLQQARKLPAPVDTWMAGLAADVGSLAVKSARSRISDAWAAEGASFCSDVVAGRYPFDRKSPRD
VAMSDFIRLFGPEGLFKTFFKEKLEAFVDTSASPWGWKGTFGAVSIPSAAIAQFENADKINRAFFPAGSENPSISINVKP
VSLTEAASAVMLEIEGERVVYFHGPIQSKSITWPSTDASNVSRLAFQPGGWQQALTENGDWSPFRLFDDADLAIQGDDLF
RAKFRQNGQAAEFDVQFGSVLNPFRLEALGAFSCPAQF*
MNPLSYFYTLRSYVEAYAGLIGRRFISIIWVAAICVVIWFYGYLAVYGDFKPLASASARLTLIGIILAAWLAYLVFTTIR
DRRRDKQLVDGIERDAEAEAAASQQAEVGEIHGRLKEALQLLRRITKKRFGYIYELPWYVIFGAPGSGKTTALTNSGLKF
PLGDALGSNSVQGIGGTRNCNWWFTDEAILIDTAGRYTTQDDLNGTAKAGWEGFLGLLRKYRRSQPINGALVTLSIGDLL
TRDPEAQREEIRAIRQRLSELDELLHARVPVYLLLTKADFLTGFVEFFDGFNKSDREQVWGTTFGLDESYKAANLPERFM
EEFTLLQQRVDTMLIERLQQEPNPELRGRIFRFPAELTSLKERLHEVVTELCSGSKLVEAPLLRGIYFASSTQREETVTV
PRMRRSYFLSRLFKEVIFGEASLVARDKRLSGRQLLFRRAAYASAVLLLAVVLTSWTATYIQNTTALAGAERRIDAYEQL
VRGVPVRDVSDADFLRILPALDNLAAVTTDFSKTRVWPISLGLDQEGKVASRQREAYQRALNTLLLPRMLVQLQKDMTET
TDITRTFDALKLYGMLGGLGPVNADFASLEAEEMFTRLYPDEGRAAARQALIAHADVMARGALPPIELDKALIAKAREVI
RSQNIANRAYDILAEYRESRALPAWSPAGALGPLGEQAFERTSKAPLNEGIPGLFSATGYRTVVLPGITDAAREALDEQW
VRGDPNPAGVTVDTIAQATLQLYFDAFEQRWSMILTDIRVKPSQTIGDAAETTRILAGKPGPVETITKSIVAATDLRAPG
AEIASAAGDSLPASTTALASAANAPDPFGRLRDALKAPAGQSSPDDQPKDAPRSEIGQLEPILQRVQEQLSRATTSTAEV
AKVFDVDSQLTDANQDLLQQARKLPAPVDTWMAGLAADVGSLAVKSARSRISDAWAAEGASFCSDVVAGRYPFDRKSPRD
VAMSDFIRLFGPEGLFKTFFKEKLEAFVDTSASPWGWKGTFGAVSIPSAAIAQFENADKINRAFFPAGSENPSISINVKP
VSLTEAASAVMLEIEGERVVYFHGPIQSKSITWPSTDASNVSRLAFQPGGWQQALTENGDWSPFRLFDDADLAIQGDDLF
RAKFRQNGQAAEFDVQFGSVLNPFRLEALGAFSCPAQF*
Nucleotide sequence: 3477 bp
>T6CP010896 AF361470:13661-17137 [Rhizobium leguminosarum bv. trifolii] [TssM]
ATGAACCCGCTCAGTTATTTCTATACGCTCCGCTCCTATGTGGAGGCCTATGCCGGCCTCATCGGCCGTCGCTTCATCTC
GATCATCTGGGTGGCGGCAATCTGTGTCGTCATCTGGTTCTACGGCTATCTCGCCGTCTATGGCGACTTCAAGCCGCTGG
CGAGCGCCAGCGCGCGGCTGACGCTGATCGGCATCATCCTCGCTGCCTGGCTCGCTTATCTCGTCTTCACGACCATCCGC
GACCGACGCCGCGACAAGCAGCTCGTCGACGGTATCGAGCGAGACGCGGAAGCAGAAGCCGCGGCTAGCCAACAGGCTGA
GGTCGGCGAGATCCACGGTCGGCTCAAGGAAGCGCTGCAGCTGCTCCGCCGCATCACCAAGAAGCGCTTCGGCTATATCT
ATGAGCTGCCATGGTACGTCATCTTCGGCGCGCCGGGGTCCGGCAAGACGACGGCGCTCACCAATTCCGGCCTCAAATTC
CCGCTCGGCGATGCGCTCGGCAGCAATTCGGTGCAGGGGATCGGCGGCACGCGCAACTGCAATTGGTGGTTCACCGACGA
GGCCATCCTCATCGACACGGCCGGCCGCTATACGACGCAGGACGATCTGAACGGCACCGCCAAGGCCGGCTGGGAAGGAT
TTCTCGGCCTGCTGCGCAAGTACCGTCGTTCTCAGCCGATCAACGGCGCCCTGGTGACGCTGTCGATCGGCGATCTCCTG
ACGCGCGATCCGGAAGCGCAGCGCGAGGAGATCAGGGCGATCCGCCAGCGGCTTTCCGAACTCGATGAACTCCTGCATGC
GCGCGTGCCGGTCTATCTGCTGCTGACCAAAGCCGATTTCTTGACCGGCTTCGTGGAGTTTTTCGACGGTTTCAACAAAA
GCGACCGCGAACAGGTCTGGGGCACGACGTTCGGGCTCGACGAGAGCTACAAGGCGGCGAACCTGCCGGAACGCTTCATG
GAGGAATTCACCCTGCTGCAGCAGCGCGTCGATACCATGCTGATCGAACGCCTGCAGCAGGAGCCTAATCCGGAATTGCG
CGGCCGCATCTTCCGTTTCCCGGCCGAACTCACCTCTTTGAAGGAGCGTCTGCACGAGGTCGTCACCGAGCTTTGCTCGG
GTTCCAAGCTGGTGGAGGCGCCGCTGCTGCGCGGCATCTATTTCGCTTCCAGCACCCAGAGGGAGGAGACAGTCACCGTG
CCCCGCATGCGGCGCAGCTATTTCCTCTCGCGCCTCTTCAAGGAGGTGATCTTCGGCGAGGCCTCGCTGGTCGCCCGCGA
CAAGCGGCTTTCGGGGCGGCAACTGCTCTTCCGCCGCGCGGCCTATGCGAGCGCGGTCCTGCTGCTTGCCGTGGTGTTGA
CCAGCTGGACGGCCACCTACATTCAGAACACCACGGCGCTGGCCGGCGCGGAAAGACGCATAGACGCCTACGAACAACTG
GTGCGTGGCGTGCCGGTTCGCGACGTTTCGGATGCGGATTTCCTGCGCATCCTGCCGGCGCTCGACAATCTCGCCGCCGT
CACCACCGACTTCTCGAAGACCCGCGTCTGGCCGATAAGCCTCGGCCTTGACCAGGAAGGCAAGGTCGCCAGCCGGCAGC
GCGAGGCCTATCAGCGGGCGCTGAACACGCTGCTCCTGCCGCGCATGCTGGTGCAGCTCCAGAAGGACATGACCGAGACC
ACCGACATCACCCGCACCTTCGACGCGTTGAAGCTCTACGGCATGCTCGGCGGGCTCGGCCCGGTGAATGCGGACTTTGC
CTCGCTGGAAGCAGAAGAGATGTTCACGAGGCTCTATCCCGACGAAGGCAGGGCGGCGGCCCGGCAGGCGCTGATTGCCC
ATGCCGATGTCATGGCGCGCGGCGCGCTGCCGCCGATCGAGCTCGACAAGGCGCTGATCGCCAAGGCGCGCGAGGTCATC
CGCTCGCAGAACATCGCCAACCGCGCCTATGACATCCTGGCGGAATATCGCGAGTCCCGCGCGCTTCCCGCCTGGAGCCC
GGCAGGCGCGCTGGGGCCGCTTGGCGAACAGGCTTTCGAGCGCACCTCGAAGGCGCCGCTCAACGAAGGCATTCCCGGCC
TCTTCTCCGCGACCGGCTATCGCACCGTCGTTCTGCCTGGGATCACCGACGCCGCGCGCGAGGCGCTCGACGAGCAATGG
GTCCGCGGCGACCCCAATCCCGCCGGCGTCACGGTCGACACGATCGCCCAGGCGACGCTTCAGCTTTACTTCGACGCATT
CGAGCAGCGCTGGTCGATGATCCTCACCGATATCAGGGTGAAGCCTTCCCAGACGATTGGCGATGCGGCGGAAACCACCC
GCATCCTGGCGGGCAAGCCCGGCCCGGTCGAGACGATCACCAAATCAATCGTCGCTGCCACCGATCTACGCGCGCCCGGG
GCGGAAATCGCCTCGGCCGCCGGCGACAGCCTGCCAGCCTCGACTACGGCGCTTGCCAGCGCAGCGAATGCGCCCGATCC
CTTCGGCCGTCTTCGAGATGCGCTGAAGGCGCCGGCCGGCCAATCGTCGCCGGACGACCAGCCGAAGGATGCGCCGCGAT
CGGAGATCGGGCAGCTCGAGCCGATCCTGCAGAGGGTCCAGGAGCAGCTTTCGCGCGCCACGACGTCGACGGCCGAAGTC
GCGAAAGTCTTCGACGTCGACAGCCAGCTCACCGACGCCAATCAGGACCTGCTGCAGCAGGCGCGCAAGCTCCCGGCGCC
GGTCGATACCTGGATGGCGGGGCTGGCTGCGGATGTCGGATCGCTGGCGGTCAAATCCGCCCGATCACGCATCAGCGACG
CCTGGGCTGCCGAAGGCGCGAGCTTCTGCTCGGATGTCGTCGCCGGTCGTTATCCTTTCGACCGAAAATCTCCGCGCGAC
GTGGCGATGAGCGATTTCATCAGGCTTTTCGGACCGGAGGGGCTGTTCAAGACCTTCTTCAAGGAAAAGCTCGAAGCCTT
CGTCGACACCAGCGCCTCGCCATGGGGCTGGAAGGGCACGTTCGGCGCGGTGAGCATTCCGAGTGCGGCGATTGCCCAGT
TCGAGAATGCCGACAAAATCAACCGCGCCTTCTTCCCGGCCGGCAGCGAGAACCCGTCGATTTCAATCAACGTCAAGCCG
GTATCGCTGACCGAGGCGGCCAGCGCCGTGATGCTGGAGATCGAGGGCGAACGGGTGGTCTATTTCCACGGGCCCATCCA
ATCGAAATCGATCACCTGGCCGTCGACGGACGCCAGCAACGTTTCGCGCCTCGCCTTCCAGCCGGGCGGCTGGCAGCAGG
CGCTGACGGAGAACGGCGACTGGTCGCCCTTCCGGCTCTTCGACGATGCCGATCTCGCCATACAGGGCGACGACCTCTTC
CGGGCAAAGTTCCGGCAGAACGGGCAGGCAGCAGAGTTCGACGTGCAGTTCGGTTCGGTGCTCAATCCATTCCGGCTGGA
GGCGCTCGGCGCCTTCTCATGCCCGGCGCAATTCTGA
ATGAACCCGCTCAGTTATTTCTATACGCTCCGCTCCTATGTGGAGGCCTATGCCGGCCTCATCGGCCGTCGCTTCATCTC
GATCATCTGGGTGGCGGCAATCTGTGTCGTCATCTGGTTCTACGGCTATCTCGCCGTCTATGGCGACTTCAAGCCGCTGG
CGAGCGCCAGCGCGCGGCTGACGCTGATCGGCATCATCCTCGCTGCCTGGCTCGCTTATCTCGTCTTCACGACCATCCGC
GACCGACGCCGCGACAAGCAGCTCGTCGACGGTATCGAGCGAGACGCGGAAGCAGAAGCCGCGGCTAGCCAACAGGCTGA
GGTCGGCGAGATCCACGGTCGGCTCAAGGAAGCGCTGCAGCTGCTCCGCCGCATCACCAAGAAGCGCTTCGGCTATATCT
ATGAGCTGCCATGGTACGTCATCTTCGGCGCGCCGGGGTCCGGCAAGACGACGGCGCTCACCAATTCCGGCCTCAAATTC
CCGCTCGGCGATGCGCTCGGCAGCAATTCGGTGCAGGGGATCGGCGGCACGCGCAACTGCAATTGGTGGTTCACCGACGA
GGCCATCCTCATCGACACGGCCGGCCGCTATACGACGCAGGACGATCTGAACGGCACCGCCAAGGCCGGCTGGGAAGGAT
TTCTCGGCCTGCTGCGCAAGTACCGTCGTTCTCAGCCGATCAACGGCGCCCTGGTGACGCTGTCGATCGGCGATCTCCTG
ACGCGCGATCCGGAAGCGCAGCGCGAGGAGATCAGGGCGATCCGCCAGCGGCTTTCCGAACTCGATGAACTCCTGCATGC
GCGCGTGCCGGTCTATCTGCTGCTGACCAAAGCCGATTTCTTGACCGGCTTCGTGGAGTTTTTCGACGGTTTCAACAAAA
GCGACCGCGAACAGGTCTGGGGCACGACGTTCGGGCTCGACGAGAGCTACAAGGCGGCGAACCTGCCGGAACGCTTCATG
GAGGAATTCACCCTGCTGCAGCAGCGCGTCGATACCATGCTGATCGAACGCCTGCAGCAGGAGCCTAATCCGGAATTGCG
CGGCCGCATCTTCCGTTTCCCGGCCGAACTCACCTCTTTGAAGGAGCGTCTGCACGAGGTCGTCACCGAGCTTTGCTCGG
GTTCCAAGCTGGTGGAGGCGCCGCTGCTGCGCGGCATCTATTTCGCTTCCAGCACCCAGAGGGAGGAGACAGTCACCGTG
CCCCGCATGCGGCGCAGCTATTTCCTCTCGCGCCTCTTCAAGGAGGTGATCTTCGGCGAGGCCTCGCTGGTCGCCCGCGA
CAAGCGGCTTTCGGGGCGGCAACTGCTCTTCCGCCGCGCGGCCTATGCGAGCGCGGTCCTGCTGCTTGCCGTGGTGTTGA
CCAGCTGGACGGCCACCTACATTCAGAACACCACGGCGCTGGCCGGCGCGGAAAGACGCATAGACGCCTACGAACAACTG
GTGCGTGGCGTGCCGGTTCGCGACGTTTCGGATGCGGATTTCCTGCGCATCCTGCCGGCGCTCGACAATCTCGCCGCCGT
CACCACCGACTTCTCGAAGACCCGCGTCTGGCCGATAAGCCTCGGCCTTGACCAGGAAGGCAAGGTCGCCAGCCGGCAGC
GCGAGGCCTATCAGCGGGCGCTGAACACGCTGCTCCTGCCGCGCATGCTGGTGCAGCTCCAGAAGGACATGACCGAGACC
ACCGACATCACCCGCACCTTCGACGCGTTGAAGCTCTACGGCATGCTCGGCGGGCTCGGCCCGGTGAATGCGGACTTTGC
CTCGCTGGAAGCAGAAGAGATGTTCACGAGGCTCTATCCCGACGAAGGCAGGGCGGCGGCCCGGCAGGCGCTGATTGCCC
ATGCCGATGTCATGGCGCGCGGCGCGCTGCCGCCGATCGAGCTCGACAAGGCGCTGATCGCCAAGGCGCGCGAGGTCATC
CGCTCGCAGAACATCGCCAACCGCGCCTATGACATCCTGGCGGAATATCGCGAGTCCCGCGCGCTTCCCGCCTGGAGCCC
GGCAGGCGCGCTGGGGCCGCTTGGCGAACAGGCTTTCGAGCGCACCTCGAAGGCGCCGCTCAACGAAGGCATTCCCGGCC
TCTTCTCCGCGACCGGCTATCGCACCGTCGTTCTGCCTGGGATCACCGACGCCGCGCGCGAGGCGCTCGACGAGCAATGG
GTCCGCGGCGACCCCAATCCCGCCGGCGTCACGGTCGACACGATCGCCCAGGCGACGCTTCAGCTTTACTTCGACGCATT
CGAGCAGCGCTGGTCGATGATCCTCACCGATATCAGGGTGAAGCCTTCCCAGACGATTGGCGATGCGGCGGAAACCACCC
GCATCCTGGCGGGCAAGCCCGGCCCGGTCGAGACGATCACCAAATCAATCGTCGCTGCCACCGATCTACGCGCGCCCGGG
GCGGAAATCGCCTCGGCCGCCGGCGACAGCCTGCCAGCCTCGACTACGGCGCTTGCCAGCGCAGCGAATGCGCCCGATCC
CTTCGGCCGTCTTCGAGATGCGCTGAAGGCGCCGGCCGGCCAATCGTCGCCGGACGACCAGCCGAAGGATGCGCCGCGAT
CGGAGATCGGGCAGCTCGAGCCGATCCTGCAGAGGGTCCAGGAGCAGCTTTCGCGCGCCACGACGTCGACGGCCGAAGTC
GCGAAAGTCTTCGACGTCGACAGCCAGCTCACCGACGCCAATCAGGACCTGCTGCAGCAGGCGCGCAAGCTCCCGGCGCC
GGTCGATACCTGGATGGCGGGGCTGGCTGCGGATGTCGGATCGCTGGCGGTCAAATCCGCCCGATCACGCATCAGCGACG
CCTGGGCTGCCGAAGGCGCGAGCTTCTGCTCGGATGTCGTCGCCGGTCGTTATCCTTTCGACCGAAAATCTCCGCGCGAC
GTGGCGATGAGCGATTTCATCAGGCTTTTCGGACCGGAGGGGCTGTTCAAGACCTTCTTCAAGGAAAAGCTCGAAGCCTT
CGTCGACACCAGCGCCTCGCCATGGGGCTGGAAGGGCACGTTCGGCGCGGTGAGCATTCCGAGTGCGGCGATTGCCCAGT
TCGAGAATGCCGACAAAATCAACCGCGCCTTCTTCCCGGCCGGCAGCGAGAACCCGTCGATTTCAATCAACGTCAAGCCG
GTATCGCTGACCGAGGCGGCCAGCGCCGTGATGCTGGAGATCGAGGGCGAACGGGTGGTCTATTTCCACGGGCCCATCCA
ATCGAAATCGATCACCTGGCCGTCGACGGACGCCAGCAACGTTTCGCGCCTCGCCTTCCAGCCGGGCGGCTGGCAGCAGG
CGCTGACGGAGAACGGCGACTGGTCGCCCTTCCGGCTCTTCGACGATGCCGATCTCGCCATACAGGGCGACGACCTCTTC
CGGGCAAAGTTCCGGCAGAACGGGCAGGCAGCAGAGTTCGACGTGCAGTTCGGTTCGGTGCTCAATCCATTCCGGCTGGA
GGCGCTCGGCGCCTTCTCATGCCCGGCGCAATTCTGA