Detailed information of oriT
oriT
Information on the oriT region
| oriTDB ID | 100976 |
| Name | oriT_unitig_2-pAR_0055 |
| Organism | Escherichia coli strain AR_0055 |
| Sequence Completeness | intact |
| NCBI accession of oriT (coordinates [strand]) | NZ_CP021936 (137512..138062 [+], 551 nt) |
| oriT length | 551 nt |
| IRs (inverted repeats) | IR1: 62..70, 71..79 (AACCCTTTC..GACAGGGTT); IR2: 143..150, 154..161 (TGGCCTGC..GGAGGCCA) |
| Location of nic site | _ |
| Conserved sequence flanking the nic site | _ |
| Note | Predicted by oriTfinder |
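The inverted repeats listed in the table can be checked against the usual definition, in which one arm is the reverse complement of the other. The following is a minimal Python sketch for that check, not the method used by oriTfinder; the function names `reverse_complement` and `ir_mismatches` are hypothetical and introduced only for illustration. It uses only the arm sequences shown in the table.

```python
# Complement table for the four unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def ir_mismatches(left_arm: str, right_arm: str) -> int:
    """Count positions where left_arm differs from the reverse
    complement of right_arm (0 means a perfect inverted repeat)."""
    if len(left_arm) != len(right_arm):
        raise ValueError("IR arms must have equal length")
    expected = reverse_complement(right_arm)
    return sum(a != b for a, b in zip(left_arm.upper(), expected))


# Arm sequences copied from the IR row of the table above.
IRS = {
    "IR1": ("AACCCTTTC", "GACAGGGTT"),
    "IR2": ("TGGCCTGC", "GGAGGCCA"),
}

for name, (left, right) in IRS.items():
    print(name, "mismatches:", ir_mismatches(left, right))
```

Under this definition, each of the two listed repeats differs from a perfect inverted repeat at a single position.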
oriT sequence
Length: 551 nt
>oriT_unitig_2-pAR_0055
GAGCAGAGCTATGTGTGACAAGAAGTATAGAAATTACGAGGTAGCCATCATGGTCGATGTGAACCCTTTCGACAGGGTTATGAATGAATTGAAAAGTCGTGGCCGCAAGAACGCTCACATCCTGAGCATCCTCCAATTCGACTGGCCTGCATCGGAGGCCATCATCGAGAAGCTGAGCTGCTACATCACAGACGGGATTAAGGCTAATCAGGAGCCTGTGATTTACCCGATCATTGAAGAAGCTCTGCATCGCTACAGCCAGCTCGTGTTTCATGAGCAGAGAGAGAAATATGAAGACCCGGCCAGAATTGGGGCATTTCTGGAAACCCTGATCACCGAAACCTGCCGGGCGTTGGAAGTGCAAATTGTCGATAGTGGCGGTGATTCATGGTCTGTCGATTCAGGAGAGTCGTTCTCACTGTGGCTTTCTTCCCATCCAGGAGAACTATCCATTAACCCGCAGCCCCATGAGGATGAGACCTCTTTGCGTGGCTTGCTGTATGAGCTCATCACCTGTGAGAGCGTGAAAACTGTTTTAAGGAGAACCGACT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
| ID | 1046 | GenBank | WP_001326173 |
| Name | TraI_unitig_2-pAR_0055 | UniProt ID | _ |
| Length | 990 a.a. | PDB ID | |
| Note | putative relaxase | | |
Relaxase protein sequence
Length: 990 a.a.; Molecular weight: 109356.90 Da; Isoelectric point: 4.9395
>WP_001326173.1 MULTISPECIES: MobH family relaxase [Gammaproteobacteria]
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPMFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET
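The FASTA records on this page wrap the sequence over several lines, so the stated length (990 a.a. here, 551 nt for the oriT) only matches once the lines are joined. A minimal Python sketch of that check follows; `read_fasta` is a hypothetical helper written for illustration, and the demo record is a shortened toy example rather than the full entry.

```python
def read_fasta(text: str) -> dict:
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {h: "".join(parts) for h, parts in records.items()}


# Toy record (hypothetical header, truncated sequence) for demonstration.
demo = ">example_protein\nMLKALNKLFG\nGRSGVIETAP\nSVR\n"
for header, seq in read_fasta(demo).items():
    print(header, len(seq))
```

Applied to a downloaded copy of a record, the reported length can then be compared with the length stated on the page.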
Protein domains
Predicted by InterProScan.
Protein structure
No structure available.
Host bacterium
| ID | 1434 | GenBank | NZ_CP021936 |
| Plasmid name | unitig_2-pAR_0055 | Incompatibility group | IncA/C2 |
| Plasmid size | 138340 bp | Coordinate of oriT [Strand] | 137512..138062 [+] |
| Host bacterium | Escherichia coli strain AR_0055 |
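The oriT coordinates are 1-based and inclusive, so the span 137512..138062 covers 138062 - 137512 + 1 = 551 nt, which matches the stated oriT length and lies within the 138340 bp plasmid. A minimal Python sketch of this consistency check is shown below; `parse_coords` and `span_length` are hypothetical helper names, and the pattern assumes the `start..end [strand]` notation used in the table.

```python
import re

# Matches strings such as "137512..138062 [+]" (notation used in the table).
_COORDS = re.compile(r"^\s*(\d+)\.\.(\d+)\s*\[([+-])\]\s*$")


def parse_coords(text: str):
    """Return (start, end, strand) from a 'start..end [strand]' string."""
    match = _COORDS.match(text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


start, end, strand = parse_coords("137512..138062 [+]")
plasmid_size = 138340  # bp, from the table above

print(span_length(start, end))     # 551
print(1 <= start <= end <= plasmid_size)  # True
```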
Cargo genes
| Drug resistance gene | blaCMY-6, aac(6')-Ib, qacE, sul1, rmtC, blaNDM-1 |
| Virulence gene | - |
| Metal resistance gene | - |
| Degradation gene | - |
| Symbiosis gene | - |
| Anti-CRISPR | - |