Detailed information of oriT
oriT
The information of the oriT region
| oriTDB ID | 100560 |
| Name | oriT_pACN001-B |
| Organism | Escherichia coli ACN001 |
| Sequence Completeness | intact |
| NCBI accession of oriT (coordinates [strand]) | NC_023327 (109227..109688 [-], 462 nt) |
| oriT length | 462 nt |
| IRs (inverted repeats) | 142..151, 154..163 (ATAGGGTCGT..ACGACCCTAT); 231..238, 241..248 (GCAAAAAC..GTTTTTGC) |
| Location of nic site | 257..258 |
| Conserved sequence flanking the nic site | GTGGGGTGT\|GG |
| Note | Predicted by oriTfinder |
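The annotations in the table above can be sanity-checked with a few lines of Python. This is a minimal sketch, not part of oriTfinder: the helper names `reverse_complement` and `is_inverted_repeat` are illustrative, and it assumes the `|` in the conserved flank marks the nick between positions 257 and 258, as the table states.

```python
# Complement table for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_inverted_repeat(left_arm: str, right_arm: str) -> bool:
    """True when the right arm is the reverse complement of the left arm."""
    return reverse_complement(left_arm) == right_arm.upper()


# IR arms copied from the table (coordinates are local to the oriT region).
IR_PAIRS = [
    ("ATAGGGTCGT", "ACGACCCTAT"),  # 142..151, 154..163
    ("GCAAAAAC", "GTTTTTGC"),      # 231..238, 241..248
]

for left, right in IR_PAIRS:
    print(left, right, is_inverted_repeat(left, right))

# Conserved flank from the table; "|" is assumed to mark the nick 257|258.
FLANK = "GTGGGGTGT|GG"
NIC_LEFT = 257  # last base 5' of the nick, from "Location of nic site"
upstream, downstream = FLANK.split("|")
flank_start = NIC_LEFT - len(upstream) + 1
flank_end = NIC_LEFT + len(downstream)
print(f"flank spans {flank_start}..{flank_end}")
```

Both IR pairs pass the check, and the flank works out to positions 249..259, so it begins immediately after the second inverted repeat (241..248).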
oriT sequence
Length: 462 nt
>oriT_pACN001-B
AAGAAGAGCACCCCTAGCAGCGCCCCTAGGGGCATACTATAAAAAAACGCACGGCGCTGCTAGGGGCGGCCCTAATATATCCAAATGTTTTTCATGAAAATTGTCAGCACTGACCCTAACAAGGGTCGTCATAGGGTCGCTATAGGGTCGTCAACGACCCTATTTAATAATGTAAAATAAATTAAAATACATTATTTAGAACATAAACTAATGATTTAAAAAACAAATCAGCAAAAACTTGTTTTTGCGTGGGGTGTGGTACTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCTTTTTGTGGAATGGGTTAAATTATTTACGGATAAAGTCACCAGAGGTGGAAAAATGAAAAAATGGATGTTAGCCATCTGCCTGATGTTTATAAATGAGATCTGCCATGCCACTGATTGCTTTGATCTTGCAGGCCGGGATTACAAAATAGATCCTGATTTACTGAGA
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
| ID | 632 | GenBank | YP_008998619 |
| Name | TraI_pACN001-B | UniProt ID | A0A140WYN1 |
| Length | 1074 a.a. | PDB ID | |
| Note | putative relaxase | | |
Relaxase protein sequence
Length: 1074 a.a.; Molecular weight: 117013.58 Da; Isoelectric point: 5.8538
>YP_008998619.1 conjugal transfer nickase/helicase TraI (plasmid) [Escherichia coli ACN001]
MTAQSNSLTLRDAQGETQVVRISSLDSSWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASVSED
AMTVVVPGRAEPASLPVSDSPFMALKLENGWVETPGHSVSDSAKVFASVTQMAMDNATLNGLARSGRDVR
LYSSLDETRTAEKLARHPSFTVVSEQIKARAGETLLETAISLQKTGLHTPAQQAIHLALPVVESKNLAFS
MVDLLTEAKSFAAEGTSFTELGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEGKEA
VTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVVQGYAGVGKTTQFRAVMSAVNMLPESERPRVVGL
GPTHRAVGEMRSAGVDAQTLASFLHDTQLLQRSGETPNFSNTLFLLDESSMVGNTDMARAYALIAAGGGR
AVASGDTDQLQAIAPGQPFRLQQTRSAADVAIMKEIVRQTPELREAVYSLINRDVERALSGLESVKPSQV
PRQEGAWVPEHSVTEFSHSQEAKLAEAQQKAMLKGEAFPDIPMTLYEAIVRDYTGRTPEAREQTLIVTHL
NEDRRVLNSMIHDAREKAGELGKEQVMVPVLNTANIRDGELRRLSTWETHRDALALVDNVYHRIAGISKD
DGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVGTGDRMRFTKSDRERGYVANSVWTVTAVSGDSVT
LSDGQQTRVIRPGQERAEQHIGLAYAITAHGAQGASETFAIALEGTEGNRKQMAGFESAYVALSRMKQHV
QVYTDNRQGWTDAINNAVQKGTAHDVLEPKSDREVMNAERLFSTARELRDVVAGRAVLRQAGLAGGDSPA
RFIAPGRKYPQPYVALPAFDRNGKSAGIWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSRNGESL
LADNMQDGVRIARDNPDSGVVVRIAGEGRPWNPRTITGGRVWGDIPDNSVQPGAGNGEPVTAEVLAQRQA
EEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVRDIAGLERDRSAISEREAALPESVLREPQRVREAV
REVARENLLQERLQQMERDMVRDL
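The length and molecular weight reported for the relaxase can be recomputed from a FASTA record such as the one above. The sketch below is an illustration only: `read_fasta` and `molecular_weight` are hypothetical helper names, the masses are commonly tabulated average residue masses, and the database does not state its own method, so small differences from the reported 117013.58 Da would not be surprising. To keep the example short, the demo record holds only the last two lines of the sequence, not the full protein.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def read_fasta(text: str) -> dict:
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified protein."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# Excerpt only: the final two lines of the record, for demonstration.
EXCERPT = """>YP_008998619.1 excerpt (C-terminal 94 residues)
EEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVRDIAGLERDRSAISEREAALPESVLREPQRVREAV
REVARENLLQERLQQMERDMVRDL
"""

for name, seq in read_fasta(EXCERPT).items():
    print(name, len(seq), round(molecular_weight(seq), 2))
```

Run on the complete record, `len(seq)` should match the stated 1074 a.a. (15 full lines of 70 residues plus a final line of 24).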
Protein domains
Predicted by InterProScan.
Protein structure
| Source | ID | Structure |
|---|---|---|
| AlphaFold DB | A0A140WYN1 | |
Host bacterium
| ID | 1020 | GenBank | NC_023327 |
| Plasmid name | pACN001-B | Incompatibility group | IncFIB |
| Plasmid size | 168543 bp | Coordinate of oriT [Strand] | 109227..109688 [-] |
| Host bacterium | Escherichia coli ACN001 |
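The oriT coordinates above are given on the minus strand, so positions inside the oriT region (such as the nic site at 257..258) do not map to plasmid coordinates by simple addition. The sketch below shows one way to convert them. The `Region` class is hypothetical, and it assumes the oriT sequence is reported 5' to 3' along the minus strand, so that local position 1 corresponds to plasmid position 109688.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """A 1-based, inclusive interval on a replicon with a strand."""
    start: int
    end: int
    strand: str  # "+" or "-"

    @property
    def length(self) -> int:
        return self.end - self.start + 1

    def to_replicon(self, local_pos: int) -> int:
        """Map a 1-based position within the region to a replicon coordinate."""
        if not 1 <= local_pos <= self.length:
            raise ValueError(f"position {local_pos} outside 1..{self.length}")
        if self.strand == "+":
            return self.start + local_pos - 1
        # Minus strand: local position 1 sits at the region's end coordinate.
        return self.end - local_pos + 1


# Coordinates from the table: NC_023327, 109227..109688 [-].
ORIT = Region(start=109227, end=109688, strand="-")

print(ORIT.length)                                # 462
print(ORIT.to_replicon(257), ORIT.to_replicon(258))  # 109432 109431
```

Under that assumption, the nick falls between plasmid positions 109432 and 109431 on the minus strand, and the computed region length matches the stated 462 nt.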
Cargo genes
| Drug resistance gene | sitABCD |
| Virulence gene | - |
| Metal resistance gene | - |
| Degradation gene | - |
| Symbiosis gene | - |
| Anti-CRISPR | - |