Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 100560 |
Name | oriT_pACN001-B |
Organism | Escherichia coli ACN001 |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | NC_023327 (109227..109688 [-], 462 nt) |
oriT length | 462 nt |
IRs (inverted repeats) | 142..151, 154..163 (ATAGGGTCGT..ACGACCCTAT); 231..238, 241..248 (GCAAAAAC..GTTTTTGC) |
Location of nic site | 257..258 |
Conserved sequence flanking the nic site | GTGGGGTGT|GG |
Note | predicted by the oriTfinder |
oriT sequence
Length: 462 nt
>oriT_pACN001-B
AAGAAGAGCACCCCTAGCAGCGCCCCTAGGGGCATACTATAAAAAAACGCACGGCGCTGCTAGGGGCGGCCCTAATATATCCAAATGTTTTTCATGAAAATTGTCAGCACTGACCCTAACAAGGGTCGTCATAGGGTCGCTATAGGGTCGTCAACGACCCTATTTAATAATGTAAAATAAATTAAAATACATTATTTAGAACATAAACTAATGATTTAAAAAACAAATCAGCAAAAACTTGTTTTTGCGTGGGGTGTGGTACTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCTTTTTGTGGAATGGGTTAAATTATTTACGGATAAAGTCACCAGAGGTGGAAAAATGAAAAAATGGATGTTAGCCATCTGCCTGATGTTTATAAATGAGATCTGCCATGCCACTGATTGCTTTGATCTTGCAGGCCGGGATTACAAAATAGATCCTGATTTACTGAGA
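The annotated coordinates can be checked directly against the sequence. The following is a minimal sketch, assuming the IR and nic-site positions listed above are 1-based and inclusive relative to the 462 nt oriT sequence; the helper names `region` and `revcomp` are illustrative and not part of oriTDB or oriTfinder.

```python
# oriT_pACN001-B, copied from the FASTA record above (462 nt).
ORIT = (
    "AAGAAGAGCACCCCTAGCAGCGCCCCTAGGGGCATACTATAAAAAAACGCACGGCGCTGC"
    "TAGGGGCGGCCCTAATATATCCAAATGTTTTTCATGAAAATTGTCAGCACTGACCCTAAC"
    "AAGGGTCGTCATAGGGTCGCTATAGGGTCGTCAACGACCCTATTTAATAATGTAAAATAA"
    "ATTAAAATACATTATTTAGAACATAAACTAATGATTTAAAAAACAAATCAGCAAAAACTT"
    "GTTTTTGCGTGGGGTGTGGTACTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCTTTTT"
    "GTGGAATGGGTTAAATTATTTACGGATAAAGTCACCAGAGGTGGAAAAATGAAAAAATGG"
    "ATGTTAGCCATCTGCCTGATGTTTATAAATGAGATCTGCCATGCCACTGATTGCTTTGAT"
    "CTTGCAGGCCGGGATTACAAAATAGATCCTGATTTACTGAGA"
)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def region(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] using 1-based, inclusive coordinates (assumed)."""
    return seq[start - 1:end]


def revcomp(seq: str) -> str:
    """Reverse complement of an unambiguous DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# Each IR pair: (left arm start, left arm end, right arm start, right arm end).
IR_PAIRS = [(142, 151, 154, 163), (231, 238, 241, 248)]

for l_start, l_end, r_start, r_end in IR_PAIRS:
    left = region(ORIT, l_start, l_end)
    right = region(ORIT, r_start, r_end)
    # A perfect inverted repeat has a right arm equal to revcomp(left arm).
    print(left, right, revcomp(left) == right)

# Nic site between positions 257 and 258: 9 nt upstream, 2 nt downstream.
nic_flank = region(ORIT, 249, 257) + "|" + region(ORIT, 258, 259)
print(nic_flank)
```

Under these assumptions, both IR pairs are perfect reverse complements with a 2 nt spacer, and the nic-site flank reproduces the conserved sequence given in the record.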
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 632 | GenBank | YP_008998619 |
Name | TraI_pACN001-B | UniProt ID | A0A140WYN1 |
Length | 1074 a.a. | PDB ID | |
Note | putative relaxase |
Relaxase protein sequence
Length: 1074 a.a.; Molecular weight: 117013.58 Da; Isoelectric point: 5.8538
>YP_008998619.1 conjugal transfer nickase/helicase TraI (plasmid) [Escherichia coli ACN001]
MTAQSNSLTLRDAQGETQVVRISSLDSSWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASVSED
AMTVVVPGRAEPASLPVSDSPFMALKLENGWVETPGHSVSDSAKVFASVTQMAMDNATLNGLARSGRDVR
LYSSLDETRTAEKLARHPSFTVVSEQIKARAGETLLETAISLQKTGLHTPAQQAIHLALPVVESKNLAFS
MVDLLTEAKSFAAEGTSFTELGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEGKEA
VTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVVQGYAGVGKTTQFRAVMSAVNMLPESERPRVVGL
GPTHRAVGEMRSAGVDAQTLASFLHDTQLLQRSGETPNFSNTLFLLDESSMVGNTDMARAYALIAAGGGR
AVASGDTDQLQAIAPGQPFRLQQTRSAADVAIMKEIVRQTPELREAVYSLINRDVERALSGLESVKPSQV
PRQEGAWVPEHSVTEFSHSQEAKLAEAQQKAMLKGEAFPDIPMTLYEAIVRDYTGRTPEAREQTLIVTHL
NEDRRVLNSMIHDAREKAGELGKEQVMVPVLNTANIRDGELRRLSTWETHRDALALVDNVYHRIAGISKD
DGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVGTGDRMRFTKSDRERGYVANSVWTVTAVSGDSVT
LSDGQQTRVIRPGQERAEQHIGLAYAITAHGAQGASETFAIALEGTEGNRKQMAGFESAYVALSRMKQHV
QVYTDNRQGWTDAINNAVQKGTAHDVLEPKSDREVMNAERLFSTARELRDVVAGRAVLRQAGLAGGDSPA
RFIAPGRKYPQPYVALPAFDRNGKSAGIWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSRNGESL
LADNMQDGVRIARDNPDSGVVVRIAGEGRPWNPRTITGGRVWGDIPDNSVQPGAGNGEPVTAEVLAQRQA
EEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVRDIAGLERDRSAISEREAALPESVLREPQRVREAV
REVARENLLQERLQQMERDMVRDL
Protein domains
Predicted by InterProScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A140WYN1 |
Host bacterium
ID | 1020 | GenBank | NC_023327 |
Plasmid name | pACN001-B | Incompatibility group | IncFIB |
Plasmid size | 168543 bp | Coordinate of oriT [Strand] | 109227..109688 [-] |
Host bacterium | Escherichia coli ACN001 |
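Positions within the oriT region (IRs, nic site) are given relative to the 462 nt oriT sequence, while the oriT itself lies on the minus strand of the plasmid. The sketch below converts an oriT-relative position to a plasmid coordinate. It assumes 1-based inclusive coordinates and that the displayed oriT sequence is the reverse complement of the plus-strand interval 109227..109688, which is the usual convention for [-] features but is not stated explicitly in this record. The function name `orit_to_plasmid` is hypothetical.

```python
ORIT_START = 109227  # plasmid coordinate of the oriT interval start (NC_023327)
ORIT_END = 109688    # plasmid coordinate of the oriT interval end
ORIT_STRAND = "-"


def orit_to_plasmid(pos: int,
                    start: int = ORIT_START,
                    end: int = ORIT_END,
                    strand: str = ORIT_STRAND) -> int:
    """Map a 1-based oriT-relative position to a plasmid coordinate.

    Assumption: for a minus-strand feature, oriT position 1 corresponds
    to the interval end and the last oriT position to the interval start.
    """
    length = end - start + 1
    if not 1 <= pos <= length:
        raise ValueError(f"position {pos} outside 1..{length}")
    if strand == "-":
        return end - pos + 1
    return start + pos - 1


# The interval length should equal the reported oriT length.
print(ORIT_END - ORIT_START + 1)

# Nic site between oriT positions 257 and 258, in plasmid coordinates.
print(orit_to_plasmid(257), orit_to_plasmid(258))
```

Under these assumptions the interval spans 462 nt, matching the reported oriT length, and the nic site falls between plasmid coordinates 109432 and 109431 on the minus strand.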
Cargo genes
Drug resistance gene | sitABCD |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |