Detailed information of oriT
oriT
The information of the oriT region
| oriTDB ID | 100342 |
| Name | oriT_pO103 |
| Organism | Escherichia coli O103:H2 str. 12009 |
| Sequence Completeness | intact |
| NCBI accession of oriT (coordinates [strand]) | NC_013354 (72592..72881 [-], 290 nt) |
| oriT length | 290 nt |
| IRs (inverted repeats) | 225..232, 235..242 nt (GCAAAAAC..GTTTTTGC) |
| Location of nic site | 251..252 |
| Conserved sequence flanking the nic site | GTGGGGTGT\|GG |
| Note | predicted by the oriTfinder |
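The table entries can be cross-checked against each other with a few lines of code. The following is a minimal sketch and not part of the oriTDB record: the helper names `reverse_complement` and `span_length` are hypothetical, and the only inputs are values copied from the table above.

```python
# Values copied from the oriT table above.
ORIT_START, ORIT_END = 72592, 72881            # NC_013354 coordinates, minus strand
IR_LEFT, IR_RIGHT = "GCAAAAAC", "GTTTTTGC"     # the two inverted-repeat arms
IR_LEFT_SPAN, IR_RIGHT_SPAN = (225, 232), (235, 242)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


orit_length = span_length(ORIT_START, ORIT_END)
arms_pair = reverse_complement(IR_LEFT) == IR_RIGHT
spacer = IR_RIGHT_SPAN[0] - IR_LEFT_SPAN[1] - 1   # bases between the two arms

print(orit_length)  # 290
print(arms_pair)    # True
print(spacer)       # 2
```

The coordinate span reproduces the stated 290 nt length, the two arms are exact reverse complements of each other, and they are separated by a 2 nt spacer.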
oriT sequence
Length: 290 nt
>oriT_pO103
CGCACCGCTAGCAGTGCCCCTGGCGGTATCCTATAAAGAAACACACCGCGCCGCTAGCAGCACCCCTAATATAAAATAATGTTTTTTATAAAAATAGTCAGTACTACCCCTACAAAGCGGTGTCGGTGCGTTGTTGTAGCCGCGCCGACACCGCTTTTTTAAATATCATAAAGAGAGTAAGAGAAACTAATTTTTCATAAAACTCTATTTATAAATAAAAATCAGCAAAAACTTGTTTTTGCGTGGGGTGTGGTGCTTTTGGTGGTGAGAACCACCAATCTGTTGAGCCT
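The annotated features can be located in the sequence by slicing with the 1-based, inclusive coordinates given in the oriT table. This is a minimal sketch: `slice_1based` is a hypothetical helper, and the sequence is the oriT_pO103 record copied here in 50 nt chunks for readability.

```python
# oriT_pO103 sequence from the FASTA record, split into 50 nt chunks.
ORIT_SEQ = (
    "CGCACCGCTAGCAGTGCCCCTGGCGGTATCCTATAAAGAAACACACCGCG"
    "CCGCTAGCAGCACCCCTAATATAAAATAATGTTTTTTATAAAAATAGTCA"
    "GTACTACCCCTACAAAGCGGTGTCGGTGCGTTGTTGTAGCCGCGCCGACA"
    "CCGCTTTTTTAAATATCATAAAGAGAGTAAGAGAAACTAATTTTTCATAA"
    "AACTCTATTTATAAATAAAAATCAGCAAAAACTTGTTTTTGCGTGGGGTG"
    "TGGTGCTTTTGGTGGTGAGAACCACCAATCTGTTGAGCCT"
)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] for 1-based, inclusive coordinates."""
    return seq[start - 1:end]


left_arm = slice_1based(ORIT_SEQ, 225, 232)
right_arm = slice_1based(ORIT_SEQ, 235, 242)

# The nic site lies between positions 251 and 252.
upstream, downstream = ORIT_SEQ[:251], ORIT_SEQ[251:]
nic_flank = f"{upstream[-9:]}|{downstream[:2]}"

print(len(ORIT_SEQ))        # 290
print(left_arm, right_arm)  # GCAAAAAC GTTTTTGC
print(nic_flank)            # GTGGGGTGT|GG
```

The slices agree with the inverted repeats and the conserved nic-site flank listed in the table.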
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
| ID | 413 | GenBank | WP_000987003 |
| Name | TraI_pO103 | UniProt ID | A0A140RHP8 |
| Length | 1756 a.a. | PDB ID | |
| Note | putative relaxase |
Relaxase protein sequence
Length: 1756 a.a.; Molecular weight: 191902.49 Da; Isoelectric point: 5.8525
>WP_000987003.1 conjugative transfer relaxase/helicase TraI [Escherichia coli]
MMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGRGAEQLGLQGSVDKDVFTRLLEGRLPDGADLSRMQ
DGSNRHRPGYDLTFSAPKSVSMMAMLGGDKRLIDAHNQAVDFAVRQVEALASTRVMTDGQSETVLTGNLV
MALFNHDTSRDQEPQLHTHAVVANVTQHNGEWKTLSSDKVGKTGFIENVYANQIAFGRLYREKLKEQVEA
LGYETEVVGKHGMWEMPGVPVEAFSGRSQAIREAVGEDASLKSRDVAALDTRKSKQHVDPEVRMTEWMQT
LKETGFDIRAYRDAADQRAETRTQAPGAVSQEGPDVQQAVTQAIAGLSERKVQFTYTDVLARTVGILPPE
NGVIERARAGIDEAISREQLIPLDREKGLFTSGIHVLNELSVRALSRDIMKQNRVTVHPEKSVPRTAGYS
DAVSVLAQDRPSLAIVSGQGGAAGQRERVAELVMMAREQGREVQIIAADRRSQMNLKQDERLSGELITGR
RQLLEGMAFTPGSTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQRTGTGSALMAMKDAGVNTYRW
QGGEQRPATIISEPDRNVRYARLAGDFAASVKAGEESVAQVSGVREQAILTQAIRSELKTQGVLGHQEVT
MTALSPVWLDSRSRYLRDMYRPGMVMEQWNPETRSHDRYVIDRVTAQSHSLTLRDAQGETQVVRISSLDS
SWSLFRPEKMPVADGERLRVTGKISGLRVSGGDRLQVASVSEDAMTVVVPGRAEPASLPVSDSPFTALKL
ENGWVETPGHSVSDSAKVFASVTQMAMDNATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQI
KARAGETLLETAISLQKAGLHTPAQQAIHLALPVVESKNLAFSMVDLLTEAKSFAAEGTSFADLGREINT
QIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEGKEAVTPLMERVPGELMEKLTSGQRAATRMI
LETSDRFTVVQGYAGVGKTTQFRAVMSAVNLLPESERPRVVGLGPTHRAVGEMRSAGVDAQTLASFLHDT
QLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAAGGGRAVASGDTDQLQAIAPGQPFRLQQTRSA
ADVAIMKEIVRQTPELREAVYSLINRDVERALSGLESVKPSQVPRQEGAWAPEHSVTEFSHSQEAKLAEA
QQKAMLKGEAFPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIHDAREKAGELGKEQVM
VPVLNTANIRDGELRRLSTWENNPDALALVDSVYHRIAGISKDDGLITLEDAEGNTRLISPREAVAEGVT
LYTPDTIRVGTGDRMRFTKSDRERGYVANSVWTVTAVSGDSVTLSDGQQTRVIRPGQERAEQHIDLAYAI
TAHGAQGASETFAIALEGTEGGRKQMAGFESAYVALSRMKQHVQVYTDNRQGWTDAINNAVQKGTAHDVL
EPGSDREVMNAERLFSTARELRDVVAGRAVLRQAGLAGGDSPARFIAPGRKYPQPYVALPAFDRNGKSAG
IWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSRNGESLLADNMQDGVRIARDNPDSGVVVRIAGE
GRPWNPGAITGGRVWGDIPDNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLP
DGRTEQAVREIAGQERERAVTSEREAALPESVLREPQREREAVREVVRENLLQERLQQMERDMVRDLQKE
KTLGGD
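A line-wrapped FASTA record like the one above has to be joined back into a single string before its length can be checked. The following is a minimal sketch using only the standard library. `parse_fasta` is a hypothetical helper, and the demo input is deliberately abbreviated to the first and last sequence lines of the record, so it does not represent the full protein.

```python
def parse_fasta(text: str) -> list[tuple[str, str]]:
    """Parse FASTA text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Abbreviated demo: header plus the first and last sequence lines only.
demo = """>WP_000987003.1 conjugative transfer relaxase/helicase TraI [Escherichia coli]
MMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGRGAEQLGLQGSVDKDVFTRLLEGRLPDGADLSRMQ
KTLGGD
"""

header, seq = parse_fasta(demo)[0]
accession = header.split()[0]
print(accession, len(seq))  # WP_000987003.1 76
```

Applied to the complete record, the joined sequence should have the stated length of 1756 a.a.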
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
Host bacterium
| ID | 803 | GenBank | NC_013354 |
| Plasmid name | pO103 | Incompatibility group | IncFIB |
| Plasmid size | 75546 bp | Coordinate of oriT [Strand] | 72592..72881 [-] |
| Host bacterium | Escherichia coli O103:H2 str. 12009 |
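Because the oriT lies on the minus strand, positions within the oriT region run opposite to the plasmid coordinates. The sketch below maps oriT-relative positions back to NC_013354 coordinates. It assumes that the oriT sequence is reported as the reverse complement of the plasmid plus strand, which the record does not state explicitly. `to_plasmid_coord` is a hypothetical helper, and the resulting plasmid coordinates are derived values, not entries from the record.

```python
# Values copied from the host plasmid table above.
ORIT_START, ORIT_END, STRAND = 72592, 72881, "-"
PLASMID_SIZE = 75546


def to_plasmid_coord(pos: int, start: int = ORIT_START,
                     end: int = ORIT_END, strand: str = STRAND) -> int:
    """Map a 1-based oriT-relative position to a 1-based plasmid coordinate.

    Assumption: on the minus strand, oriT position 1 corresponds to the
    highest plasmid coordinate of the region.
    """
    if not 1 <= pos <= end - start + 1:
        raise ValueError("position lies outside the oriT region")
    return start + pos - 1 if strand == "+" else end - pos + 1


# The nic site lies between oriT positions 251 and 252.
nic_flank = sorted(to_plasmid_coord(p) for p in (251, 252))
print(nic_flank)  # [72630, 72631]
```

Under this assumption the nic site falls between plasmid coordinates 72630 and 72631, inside the 75546 bp plasmid.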
Cargo genes
| Drug resistance gene | - |
| Virulence gene | stcE, exeE, exeG, hlyD, hlyB, hlyA, hlyC |
| Metal resistance gene | - |
| Degradation gene | - |
| Symbiosis gene | - |
| Anti-CRISPR | - |