Detailed information of oriT
oriT
Information on the oriT region
| oriTDB ID | 100710 |
| Name | oriT_pC227-11-1 |
| Organism | Escherichia coli O104:H4 str. C227-11 isolate 368 shch |
| Sequence Completeness | intact |
| NCBI accession of oriT (coordinates [strand]) | NZ_CP011332 (17000..17449 [+], 450 nt) |
| oriT length | 450 nt |
| IRs (inverted repeats) | IR1: 201..215, 222..236 (ATTCATTGGTGAATC..GATTCACCAATGAAT) IR2: 308..315, 318..325 (GCAAAAAC..GTTTTTGC) |
| Location of nic site | 334..335 |
| Conserved sequence flanking the nic site | GTGGGGTGT|GG |
| Note | predicted by oriTfinder |
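The two IR entries in the table can be cross-checked without the full sequence: each right arm should be the reverse complement of its left arm, and each coordinate span should match the arm length. The following is a minimal Python sketch under those assumptions; the names `reverse_complement`, `check_ir` and `IRS` are illustrative only and are not part of oriTDB or oriTfinder.

```python
# Complement table for unambiguous DNA bases only (assumption: no IUPAC codes).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an A/C/G/T string."""
    return seq.translate(COMPLEMENT)[::-1]


# (left span, right span, left arm, right arm), copied from the IR row above.
# Spans are 1-based and inclusive, relative to the 450 nt oriT region.
IRS = {
    "IR1": ((201, 215), (222, 236), "ATTCATTGGTGAATC", "GATTCACCAATGAAT"),
    "IR2": ((308, 315), (318, 325), "GCAAAAAC", "GTTTTTGC"),
}


def check_ir(left_span, right_span, left_seq, right_seq):
    """Return (consistent, spacer_length) for one inverted repeat."""
    spans_ok = (
        left_span[1] - left_span[0] + 1 == len(left_seq)
        and right_span[1] - right_span[0] + 1 == len(right_seq)
    )
    arms_ok = reverse_complement(left_seq) == right_seq
    spacer = right_span[0] - left_span[1] - 1  # nt between the two arms
    return spans_ok and arms_ok, spacer


for name, ir in IRS.items():
    ok, spacer = check_ir(*ir)
    print(name, ok, spacer)
# IR1 True 6
# IR2 True 2
```

Both repeats come out as perfect inverted repeats, with spacers of 6 nt and 2 nt between the arms.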
oriT sequence
Length: 450 nt
>oriT_pC227-11-1
ATATGTATACCTCTTTATTTTCTTATCGCATCAAATTTATTTCACATATAAAAAAATGACAATTCATCGATGAATCGAATCGTGACGCAAGTTACGAAAATTACGATTCATGCAGCAAATCACATCACGAATTGAATCTAGAGTCAATTCGCTTTAACTCGTTGTTTTATTATTACTGATTCATCTATGAGTTGCGTTAAATTCATTGGTGAATCATATGCGATTCACCAATGAATCCTTTTTATAATGTAAAATAAATTAAAATACATTATTTAAAACATAAGTTAATGATTCAAATAGCAAATCAGCAAAAACTTGTTTTTGCGTGGGGTGTGGTGCTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCTTTTTGTGGAGTGGGTTAAATTATTTACGGATAAAGTCACCAGAGGTGGAAAAATGAAAAAATGGATGTTAGCAATCT
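The annotated coordinates can be checked against the sequence itself. The sketch below is a hedged Python example that assumes all coordinates in the record are 1-based and inclusive; the helper `region` is hypothetical, and the flank boundaries 326 and 336 are derived here from the 9 nt and 2 nt on either side of the nic site, not stated in the record.

```python
# oriT_pC227-11-1 as listed above, wrapped at 50 nt per line for readability.
ORIT = "".join("""
ATATGTATACCTCTTTATTTTCTTATCGCATCAAATTTATTTCACATATA
AAAAAATGACAATTCATCGATGAATCGAATCGTGACGCAAGTTACGAAAA
TTACGATTCATGCAGCAAATCACATCACGAATTGAATCTAGAGTCAATTC
GCTTTAACTCGTTGTTTTATTATTACTGATTCATCTATGAGTTGCGTTAA
ATTCATTGGTGAATCATATGCGATTCACCAATGAATCCTTTTTATAATGT
AAAATAAATTAAAATACATTATTTAAAACATAAGTTAATGATTCAAATAG
CAAATCAGCAAAAACTTGTTTTTGCGTGGGGTGTGGTGCTTTTGGTGGTG
AGAACCACCAACCTGTTGAGCCTTTTTGTGGAGTGGGTTAAATTATTTAC
GGATAAAGTCACCAGAGGTGGAAAAATGAAAAAATGGATGTTAGCAATCT
""".split())


def region(seq, start, end):
    """Return seq[start..end] using 1-based inclusive coordinates."""
    return seq[start - 1:end]


# Nic site lies between positions 334 and 335 of the oriT region.
NIC_LEFT, NIC_RIGHT = 334, 335

# Rebuild the "conserved sequence flanking the nic site" with the nick marked.
flank = region(ORIT, 326, NIC_LEFT) + "|" + region(ORIT, NIC_RIGHT, 336)

print(len(ORIT))               # 450
print(region(ORIT, 201, 215))  # ATTCATTGGTGAATC (IR1, left arm)
print(region(ORIT, 308, 315))  # GCAAAAAC (IR2, left arm)
print(flank)                   # GTGGGGTGT|GG
```

Under these assumptions the nic site falls directly after the right arm of IR2 (318..325), with the conserved flank starting at position 326.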
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
| ID | 782 | GenBank | WP_001390493 |
| Name | TraI_pC227-11-1 | UniProt ID | _ |
| Length | 1231 a.a. | PDB ID | |
| Note | putative relaxase |
Relaxase protein sequence
Length: 1231 a.a.; Molecular weight: 133814.33 Da; Isoelectric point: 6.0682
>WP_001390493.1 conjugative transfer relaxase/helicase TraI [Escherichia coli]
MAREQGREVQIIAADRRSQMNLKQDERLSGELITGRRQLQDGMTFTPGSTVIVDQGEKLSLKETLTLLDG
AARHNVQVLITDSGQRTGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFAASVKAG
EESVAQVSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLDSRSRYLRDMYRPGMVMEQWNPETR
SHDRYVIDRVTAQSHSLTLRDAQGETQVVRISSLDSSWSLFRPEKMPVADGERLRVTGKTPGLRVSGGDR
LQVASVSEDAMTVVVPGRAEPATLPVADSPFTALKLENGWVETPGHSVSDSATVFASVTQMAMDNATLNG
LARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQIKARAGETSLETAISLQKAGLHTPAQQAIHLALPV
LESKNLAFSMVDLLTEAKSFAAEGTGFTELGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSIL
RHILEGKEAVTPLMERVPGELMETLTSGQRAATRMILEASDRFTVVQGYAGVGKTTQFRAVMSAVNMLPA
SERPRVVGLGPTHRAVGEMRSAGVDAQTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDMARAY
ALIAAGGGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVAIMKEIVRQTPELREAVYSLINRDVERALSG
LESVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAMLKGEAFPDVPMTLYEAIVRDYTGRTPEAR
EQTLIVTHLNEDRRVLNSMIHDAREKAGELGKEQVMVPVLNTANIRDGELRRLSTWEKNPDALALVDSVY
HRIAGISKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDKIRVGTGDRMRFTKSDRERGYVANSVWTV
TAVSGDSVTLSDGQQTRVIRPGQERAEQHIDLAYAITAHGAQGASETFAIALEGTEGNRKQMAGFESAYV
ALSRMKQHVQVYTDNRQGWTDAINNAVQKGTAHDVFEPKPDREVMNAQRLFSTARELRDVAAGRAVLRQA
GLSGGDSPARFIAPGRKYPQPYVALPAFDRNGRSAGIWLNPLTTDDGNGLRGFSGEGRVKGSEDAQFVAL
QGSRNGESLLADNMQDGVRIARDNPDSGVVVRIAGEGRPWNPGAITGGRVWGDIPDSSVQPGAGNGEPVT
AEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTE
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
Host bacterium
| ID | 1170 | GenBank | NZ_CP011332 |
| Plasmid name | pC227-11-1 | Incompatibility group | IncFII |
| Plasmid size | 75079 bp | Coordinate of oriT [Strand] | 17000..17449 [+] |
| Host bacterium | Escherichia coli O104:H4 str. C227-11 isolate 368 shch |
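Because the oriT lies on the [+] strand at 17000..17449, positions given relative to the oriT region can be shifted onto plasmid coordinates by a simple offset. The sketch below illustrates that arithmetic only; it assumes 1-based inclusive coordinates throughout, and `to_plasmid` is a hypothetical helper. The resulting plasmid positions are derived values and do not appear in the record.

```python
# Values copied from the host table above (NZ_CP011332, plasmid pC227-11-1).
ORIT_START, ORIT_END = 17000, 17449  # oriT on the [+] strand
PLASMID_SIZE = 75079                 # bp


def to_plasmid(pos, orit_start=ORIT_START):
    """Map a 1-based oriT-local position to a 1-based plasmid coordinate.

    Valid only for a [+] strand oriT; a [-] strand region would need the
    offset taken from the region end instead.
    """
    return orit_start + pos - 1


# The span agrees with the stated oriT length and fits inside the plasmid.
assert ORIT_END - ORIT_START + 1 == 450
assert ORIT_END <= PLASMID_SIZE

print(to_plasmid(334), to_plasmid(335))  # nic site: 17333 17334
print(to_plasmid(201), to_plasmid(215))  # IR1 left arm: 17200 17214
```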
Cargo genes
| Drug resistance gene | - |
| Virulence gene | aap/aspU, aggA, aggC, aggD |
| Metal resistance gene | - |
| Degradation gene | - |
| Symbiosis gene | - |
| Anti-CRISPR | - |