Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 100428 |
Name | oriT_pAA-EA11 |
Organism | Escherichia coli O104:H4 str. 2011C-3493 |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | NC_018666 (52043..52492 [+], 450 nt) |
oriT length | 450 nt |
IRs (inverted repeats) | IR1: 201..215, 222..236 (ATTCATTGGTGAATC..GATTCACCAATGAAT) IR2: 308..315, 318..325 (GCAAAAAC..GTTTTTGC) |
Location of nic site | 334..335 |
Conserved sequence flanking the nic site |
GTGGGGTGT|GG |
Note | predicted by the oriTfinder |
oriT sequence
Download Length: 450 nt
>oriT_pAA-EA11
ATATGTATACCTCTTTATTTTCTTATCGCATCAAATTTATTTCACATATAAAAAAATGACAATTCATCGATGAATCGAATCGTGACGCAAGTTACGAAAATTACGATTCATGCAGCAAATCACATCACGAATTGAATCTAGAGTCAATTCGCTTTAACTCGTTGTTTTATTATTACTGATTCATCTATGAGTTGCGTTAAATTCATTGGTGAATCATATGCGATTCACCAATGAATCCTTTTTATAATGTAAAATAAATTAAAATACATTATTTAAAACATAAGTTAATGATTCAAATAGCAAATCAGCAAAAACTTGTTTTTGCGTGGGGTGTGGTGCTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCTTTTTGTGGAGTGGGTTAAATTATTTACGGATAAAGTCACCAGAGGTGGAAAAATGAAAAAATGGATGTTAGCAATCT
ATATGTATACCTCTTTATTTTCTTATCGCATCAAATTTATTTCACATATAAAAAAATGACAATTCATCGATGAATCGAATCGTGACGCAAGTTACGAAAATTACGATTCATGCAGCAAATCACATCACGAATTGAATCTAGAGTCAATTCGCTTTAACTCGTTGTTTTATTATTACTGATTCATCTATGAGTTGCGTTAAATTCATTGGTGAATCATATGCGATTCACCAATGAATCCTTTTTATAATGTAAAATAAATTAAAATACATTATTTAAAACATAAGTTAATGATTCAAATAGCAAATCAGCAAAAACTTGTTTTTGCGTGGGGTGTGGTGCTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCTTTTTGTGGAGTGGGTTAAATTATTTACGGATAAAGTCACCAGAGGTGGAAAAATGAAAAAATGGATGTTAGCAATCT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 499 | GenBank | YP_006792604 |
Name | TraI_pAA-EA11 | UniProt ID | A0A0E0Y6S8 |
Length | 1237 a.a. | PDB ID | |
Note | putative relaxase |
Relaxase protein sequence
Download Length: 1237 a.a. Molecular weight: 134489.20 Da Isoelectric Point: 5.9884
>YP_006792604.1 conjugal transfer nickase/helicase TraI (plasmid) [Escherichia coli O104:H4 str. 2011C-3493]
MAELVMMAREQGREVQIIAADRRSQMNLKQDERLSGELITGRRQLQDGMTFTPGSTVIVDQGEKLSLKET
LTLLDGAARHNVQVLITDSGQRTGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFA
ASVKAGEESVAQVSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLDSRSRYLRDMYRPGMVMEQ
WNPETRSHDRYVIDRVTAQSHSLTLRDAQGETQVVRISSLDSSWSLFRPEKMPVADGERLRVTGKTPGLR
VSGGDRLQVASVSEDAMTVVVPGRAEPATLPVADSPFTALKLENGWVETPGHSVSDSATVFASVTQMAMD
NATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQIKARAGETSLETAISLQKAGLHTPAQQAI
HLALPVLESKNLAFSMVDLLTEAKSFAAEGTGFTELGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYE
AEKSILRHILEGKEAVTPLMERVPGELMETLTSGQRAATRMILEASDRFTVVQGYAGVGKTTQFRAVMSA
VNMLPASERPRVVGLGPTHRAVGEMRSAGVDAQTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNT
DMARAYALIAAGGGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVAIMKEIVRQTPELREAVYSLINRDV
ERALSGLESVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAMLKGEAFPDVPMTLYEAIVRDYTG
RTPEAREQTLIVTHLNEDRRVLNSMIHDAREKAGELGKEQVMVPVLNTANIRDGELRRLSTWEKNPDALA
LVDSVYHRIAGISKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDKIRVGTGDRMRFTKSDRERGYVA
NSVWTVTAVSGDSVTLSDGQQTRVIRPGQERAEQHIDLAYAITAHGAQGASETFAIALEGTEGNRKQMAG
FESAYVALSRMKQHVQVYTDNRQGWTDAINNAVQKGTAHDVFEPKPDREVMNAQRLFSTARELRDVAAGR
AVLRQAGLSGGDSPARFIAPGRKYPQPYVALPAFDRNGRSAGIWLNPLTTDDGNGLRGFSGEGRVKGSED
AQFVALQGSRNGESLLADNMQDGVRIARDNPDSGVVVRIAGEGRPWNPGAITGGRVWGDIPDSSVQPGAG
NGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTE
MAELVMMAREQGREVQIIAADRRSQMNLKQDERLSGELITGRRQLQDGMTFTPGSTVIVDQGEKLSLKET
LTLLDGAARHNVQVLITDSGQRTGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFA
ASVKAGEESVAQVSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLDSRSRYLRDMYRPGMVMEQ
WNPETRSHDRYVIDRVTAQSHSLTLRDAQGETQVVRISSLDSSWSLFRPEKMPVADGERLRVTGKTPGLR
VSGGDRLQVASVSEDAMTVVVPGRAEPATLPVADSPFTALKLENGWVETPGHSVSDSATVFASVTQMAMD
NATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQIKARAGETSLETAISLQKAGLHTPAQQAI
HLALPVLESKNLAFSMVDLLTEAKSFAAEGTGFTELGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYE
AEKSILRHILEGKEAVTPLMERVPGELMETLTSGQRAATRMILEASDRFTVVQGYAGVGKTTQFRAVMSA
VNMLPASERPRVVGLGPTHRAVGEMRSAGVDAQTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNT
DMARAYALIAAGGGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVAIMKEIVRQTPELREAVYSLINRDV
ERALSGLESVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAMLKGEAFPDVPMTLYEAIVRDYTG
RTPEAREQTLIVTHLNEDRRVLNSMIHDAREKAGELGKEQVMVPVLNTANIRDGELRRLSTWEKNPDALA
LVDSVYHRIAGISKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDKIRVGTGDRMRFTKSDRERGYVA
NSVWTVTAVSGDSVTLSDGQQTRVIRPGQERAEQHIDLAYAITAHGAQGASETFAIALEGTEGNRKQMAG
FESAYVALSRMKQHVQVYTDNRQGWTDAINNAVQKGTAHDVFEPKPDREVMNAQRLFSTARELRDVAAGR
AVLRQAGLSGGDSPARFIAPGRKYPQPYVALPAFDRNGRSAGIWLNPLTTDDGNGLRGFSGEGRVKGSED
AQFVALQGSRNGESLLADNMQDGVRIARDNPDSGVVVRIAGEGRPWNPGAITGGRVWGDIPDSSVQPGAG
NGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTE
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A0E0Y6S8 |
Host bacterium
ID | 889 | GenBank | NC_018666 |
Plasmid name | pAA-EA11 | Incompatibility group | IncFIB |
Plasmid size | 74217 bp | Coordinate of oriT [Strand] | 52043..52492 [+] |
Host baterium | Escherichia coli O104:H4 str. 2011C-3493 |
Cargo genes
Drug resistance gene | - |
Virulence gene | aap/aspU, aggA, aggB, aggC, aggD |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |