Detailed information of component protein
Summary
External database links
Locus tag (Gene) | C_RS16085 |
Coordinate (Strand) | 3227152..3228804 (+) |
NCBI ID | WP_001246526.1 |
RefSeq | NC_004431 |
Uniprot ID | A0A0H2VA37_ECOL6, A0A1L1K0F2_ECOLX |
KEGG ID | ecc:c3389 |
PDB ID | - |
Pfam domain hit(s)
Domain | Pfam ID | E-value | Aligned region |
---|---|---|---|
OmpA | PF00691.22 | 7.5e-17 | 437..533 |
Transmembrane helices
- Transmembrane helices are predicted using TMHMM 2.0 software.
Prediction | Region | Sequence |
---|---|---|
Inside | 1-6 | VRNTLK |
Transmembrane helix | 7-25 | QAIVLWGMVLLLVLWSVFI |
Outside | 26-29 | SPSG |
Transmembrane helix | 30-49 | VLRWAGAAAIVLAVAALLIY |
Inside | 50-316 | RRRQAWTEMTGDAGLSSLPPETYRQPVVLVCGGLSAHLSTDSPVRQVSEGLYLHVPDEEQLVAQVERLLTLRPAWASQLAVAYTIMPGIHRDVAVLAGRLRRFAHSMATVRRRAGVNVPWLLWSGLSGSPLPERASSPWFICTGGEVQVATSTETTMPAQWIAQSGVQERSQRLCYLLKAESLMQWLNLNVLTALNGPEAKCPPLAMTVGLVPSLPAVDNNLWQLWITARTGLTPDIADTGTDDALPFPDALLRQLPRQSGFTPLRR |
Transmembrane helix | 317-339 | ACVTMLGVTTVAGIAALCLSATA |
Outside | 340-550 | NRQLLRQVGDDLHRFYAVPVEEFITKARHLSVLKDDATMLDGYYREGEPLRLGLGLYPGERIRQPVLRAIRDWRPPEQKMEVTASLQVQTVRLDSMSLFDVGQARLKDGSTKVLVDALVNIRAKPGWLILVAGYTDATGDEKSNQQLSLRRAEAVRNWMLQTSDIPATCFAVQGLGESQPAATNDTPQGRAVNRRVEISLVPRSDACQDVK |
Signal peptides
- Sec/SPI: "standard" secretory signal peptides transported by the Sec translocon and cleaved by Signal Peptidase I (Lep).
- Sec/SPII: lipoprotein signal peptides transported by the Sec translocon and cleaved by Signal Peptidase II (Lsp).
- Tat/SPI: Tat signal peptides transported by the Tat translocon and cleaved by Signal Peptidase I (Lep).
Prediction | Probability | Cleavage site | Signal peptide sequence |
---|---|---|---|
Other | 0.9923 | - | - |
Protein sequence: 551 a.a.
>T6CP011356 NC_004431:3227152-3228804 [Escherichia coli CFT073] [TagL]
VRNTLKQAIVLWGMVLLLVLWSVFISPSGVLRWAGAAAIVLAVAALLIYRRRQAWTEMTGDAGLSSLPPETYRQPVVLVC
GGLSAHLSTDSPVRQVSEGLYLHVPDEEQLVAQVERLLTLRPAWASQLAVAYTIMPGIHRDVAVLAGRLRRFAHSMATVR
RRAGVNVPWLLWSGLSGSPLPERASSPWFICTGGEVQVATSTETTMPAQWIAQSGVQERSQRLCYLLKAESLMQWLNLNV
LTALNGPEAKCPPLAMTVGLVPSLPAVDNNLWQLWITARTGLTPDIADTGTDDALPFPDALLRQLPRQSGFTPLRRACVT
MLGVTTVAGIAALCLSATANRQLLRQVGDDLHRFYAVPVEEFITKARHLSVLKDDATMLDGYYREGEPLRLGLGLYPGER
IRQPVLRAIRDWRPPEQKMEVTASLQVQTVRLDSMSLFDVGQARLKDGSTKVLVDALVNIRAKPGWLILVAGYTDATGDE
KSNQQLSLRRAEAVRNWMLQTSDIPATCFAVQGLGESQPAATNDTPQGRAVNRRVEISLVPRSDACQDVK*
VRNTLKQAIVLWGMVLLLVLWSVFISPSGVLRWAGAAAIVLAVAALLIYRRRQAWTEMTGDAGLSSLPPETYRQPVVLVC
GGLSAHLSTDSPVRQVSEGLYLHVPDEEQLVAQVERLLTLRPAWASQLAVAYTIMPGIHRDVAVLAGRLRRFAHSMATVR
RRAGVNVPWLLWSGLSGSPLPERASSPWFICTGGEVQVATSTETTMPAQWIAQSGVQERSQRLCYLLKAESLMQWLNLNV
LTALNGPEAKCPPLAMTVGLVPSLPAVDNNLWQLWITARTGLTPDIADTGTDDALPFPDALLRQLPRQSGFTPLRRACVT
MLGVTTVAGIAALCLSATANRQLLRQVGDDLHRFYAVPVEEFITKARHLSVLKDDATMLDGYYREGEPLRLGLGLYPGER
IRQPVLRAIRDWRPPEQKMEVTASLQVQTVRLDSMSLFDVGQARLKDGSTKVLVDALVNIRAKPGWLILVAGYTDATGDE
KSNQQLSLRRAEAVRNWMLQTSDIPATCFAVQGLGESQPAATNDTPQGRAVNRRVEISLVPRSDACQDVK*
Nucleotide sequence: 1653 bp
>T6CP011356 NC_004431:3227152-3228804 [Escherichia coli CFT073] [TagL]
GTGAGGAACACGCTGAAACAGGCCATCGTGCTGTGGGGAATGGTGTTACTGCTGGTGCTGTGGTCAGTGTTTATCAGTCC
GTCTGGCGTGCTGAGATGGGCCGGTGCGGCGGCTATCGTTCTGGCGGTTGCCGCGTTGTTGATTTATCGGCGCAGGCAGG
CGTGGACGGAGATGACCGGCGATGCCGGGTTGTCATCGCTGCCGCCGGAAACCTACCGACAGCCGGTAGTGCTGGTCTGT
GGCGGTCTGTCGGCGCACCTGTCCACTGACAGCCCGGTCCGCCAGGTTTCAGAAGGGCTGTATCTGCATGTTCCTGATGA
AGAACAGCTTGTGGCGCAGGTGGAGCGATTGCTGACCCTTCGCCCGGCGTGGGCATCGCAGCTTGCCGTGGCGTATACCA
TCATGCCCGGCATACACCGGGATGTGGCGGTTCTGGCCGGACGGCTGCGACGGTTCGCCCACAGTATGGCGACGGTGCGT
CGTCGGGCAGGCGTAAACGTCCCCTGGCTTCTCTGGAGCGGGCTGTCCGGCTCGCCGTTGCCGGAAAGAGCGAGTTCACC
GTGGTTTATCTGTACCGGCGGCGAAGTTCAGGTAGCAACATCCACAGAGACCACCATGCCCGCGCAGTGGATTGCACAAT
CCGGCGTACAGGAGCGCAGTCAGCGACTCTGTTACCTGCTGAAAGCTGAAAGCCTGATGCAGTGGCTGAATCTTAATGTG
CTGACGGCACTGAACGGCCCGGAGGCGAAATGTCCACCACTGGCGATGACCGTGGGGCTGGTCCCCTCGTTGCCTGCGGT
GGATAACAACCTGTGGCAGTTGTGGATCACCGCCAGAACCGGCCTGACGCCGGATATCGCGGACACCGGCACAGACGATG
CGCTGCCATTCCCGGATGCCCTGTTACGGCAGTTGCCGCGTCAGTCGGGCTTTACCCCGCTGCGACGAGCCTGCGTGACC
ATGCTGGGCGTCACCACCGTGGCGGGTATCGCCGCGCTGTGCCTGTCAGCCACGGCAAATCGCCAGTTATTACGGCAGGT
CGGTGACGATCTGCACCGGTTTTATGCCGTCCCGGTGGAGGAATTTATCACCAAAGCCCGTCACCTGTCGGTGCTGAAAG
ACGATGCGACCATGCTCGATGGGTATTACCGGGAAGGAGAACCCCTGCGCCTCGGTCTGGGGTTATACCCCGGCGAACGC
ATCCGCCAGCCGGTATTACGCGCCATTCGCGACTGGCGTCCGCCTGAACAAAAAATGGAGGTGACGGCTTCGCTTCAGGT
TCAGACCGTGCGTCTTGACAGTATGTCGCTGTTTGACGTCGGACAGGCCCGCCTGAAAGACGGCTCGACAAAAGTGCTGG
TGGACGCACTGGTGAACATCCGGGCAAAACCGGGCTGGCTGATCCTCGTGGCCGGATATACCGATGCCACCGGCGATGAA
AAAAGCAATCAGCAGTTATCGCTGCGGCGTGCCGAAGCGGTGCGCAACTGGATGCTGCAGACCAGCGACATCCCGGCCAC
CTGTTTTGCCGTACAGGGACTGGGCGAGAGCCAGCCTGCGGCGACCAACGACACGCCACAGGGCCGGGCAGTCAACCGGC
GTGTCGAAATCAGTCTTGTTCCGCGTTCTGACGCCTGTCAGGACGTGAAATAA
GTGAGGAACACGCTGAAACAGGCCATCGTGCTGTGGGGAATGGTGTTACTGCTGGTGCTGTGGTCAGTGTTTATCAGTCC
GTCTGGCGTGCTGAGATGGGCCGGTGCGGCGGCTATCGTTCTGGCGGTTGCCGCGTTGTTGATTTATCGGCGCAGGCAGG
CGTGGACGGAGATGACCGGCGATGCCGGGTTGTCATCGCTGCCGCCGGAAACCTACCGACAGCCGGTAGTGCTGGTCTGT
GGCGGTCTGTCGGCGCACCTGTCCACTGACAGCCCGGTCCGCCAGGTTTCAGAAGGGCTGTATCTGCATGTTCCTGATGA
AGAACAGCTTGTGGCGCAGGTGGAGCGATTGCTGACCCTTCGCCCGGCGTGGGCATCGCAGCTTGCCGTGGCGTATACCA
TCATGCCCGGCATACACCGGGATGTGGCGGTTCTGGCCGGACGGCTGCGACGGTTCGCCCACAGTATGGCGACGGTGCGT
CGTCGGGCAGGCGTAAACGTCCCCTGGCTTCTCTGGAGCGGGCTGTCCGGCTCGCCGTTGCCGGAAAGAGCGAGTTCACC
GTGGTTTATCTGTACCGGCGGCGAAGTTCAGGTAGCAACATCCACAGAGACCACCATGCCCGCGCAGTGGATTGCACAAT
CCGGCGTACAGGAGCGCAGTCAGCGACTCTGTTACCTGCTGAAAGCTGAAAGCCTGATGCAGTGGCTGAATCTTAATGTG
CTGACGGCACTGAACGGCCCGGAGGCGAAATGTCCACCACTGGCGATGACCGTGGGGCTGGTCCCCTCGTTGCCTGCGGT
GGATAACAACCTGTGGCAGTTGTGGATCACCGCCAGAACCGGCCTGACGCCGGATATCGCGGACACCGGCACAGACGATG
CGCTGCCATTCCCGGATGCCCTGTTACGGCAGTTGCCGCGTCAGTCGGGCTTTACCCCGCTGCGACGAGCCTGCGTGACC
ATGCTGGGCGTCACCACCGTGGCGGGTATCGCCGCGCTGTGCCTGTCAGCCACGGCAAATCGCCAGTTATTACGGCAGGT
CGGTGACGATCTGCACCGGTTTTATGCCGTCCCGGTGGAGGAATTTATCACCAAAGCCCGTCACCTGTCGGTGCTGAAAG
ACGATGCGACCATGCTCGATGGGTATTACCGGGAAGGAGAACCCCTGCGCCTCGGTCTGGGGTTATACCCCGGCGAACGC
ATCCGCCAGCCGGTATTACGCGCCATTCGCGACTGGCGTCCGCCTGAACAAAAAATGGAGGTGACGGCTTCGCTTCAGGT
TCAGACCGTGCGTCTTGACAGTATGTCGCTGTTTGACGTCGGACAGGCCCGCCTGAAAGACGGCTCGACAAAAGTGCTGG
TGGACGCACTGGTGAACATCCGGGCAAAACCGGGCTGGCTGATCCTCGTGGCCGGATATACCGATGCCACCGGCGATGAA
AAAAGCAATCAGCAGTTATCGCTGCGGCGTGCCGAAGCGGTGCGCAACTGGATGCTGCAGACCAGCGACATCCCGGCCAC
CTGTTTTGCCGTACAGGGACTGGGCGAGAGCCAGCCTGCGGCGACCAACGACACGCCACAGGGCCGGGCAGTCAACCGGC
GTGTCGAAATCAGTCTTGTTCCGCGTTCTGACGCCTGTCAGGACGTGAAATAA