Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 100976 |
Name | oriT_unitig_2-pAR_0055 |
Organism | Escherichia coli strain AR_0055 |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | NZ_CP021936 (137512..138062 [+], 551 nt) |
oriT length | 551 nt |
IRs (inverted repeats) | IR1: 62..70, 71..79 (AACCCTTTC..GACAGGGTT) IR2: 143..150, 154..161 (TGGCCTGC..GGAGGCCA) |
Location of nic site | _ |
Conserved sequence flanking the nic site |
_ |
Note | predicted by the oriTfinder |
oriT sequence
Download Length: 551 nt
>oriT_unitig_2-pAR_0055
GAGCAGAGCTATGTGTGACAAGAAGTATAGAAATTACGAGGTAGCCATCATGGTCGATGTGAACCCTTTCGACAGGGTTATGAATGAATTGAAAAGTCGTGGCCGCAAGAACGCTCACATCCTGAGCATCCTCCAATTCGACTGGCCTGCATCGGAGGCCATCATCGAGAAGCTGAGCTGCTACATCACAGACGGGATTAAGGCTAATCAGGAGCCTGTGATTTACCCGATCATTGAAGAAGCTCTGCATCGCTACAGCCAGCTCGTGTTTCATGAGCAGAGAGAGAAATATGAAGACCCGGCCAGAATTGGGGCATTTCTGGAAACCCTGATCACCGAAACCTGCCGGGCGTTGGAAGTGCAAATTGTCGATAGTGGCGGTGATTCATGGTCTGTCGATTCAGGAGAGTCGTTCTCACTGTGGCTTTCTTCCCATCCAGGAGAACTATCCATTAACCCGCAGCCCCATGAGGATGAGACCTCTTTGCGTGGCTTGCTGTATGAGCTCATCACCTGTGAGAGCGTGAAAACTGTTTTAAGGAGAACCGACT
GAGCAGAGCTATGTGTGACAAGAAGTATAGAAATTACGAGGTAGCCATCATGGTCGATGTGAACCCTTTCGACAGGGTTATGAATGAATTGAAAAGTCGTGGCCGCAAGAACGCTCACATCCTGAGCATCCTCCAATTCGACTGGCCTGCATCGGAGGCCATCATCGAGAAGCTGAGCTGCTACATCACAGACGGGATTAAGGCTAATCAGGAGCCTGTGATTTACCCGATCATTGAAGAAGCTCTGCATCGCTACAGCCAGCTCGTGTTTCATGAGCAGAGAGAGAAATATGAAGACCCGGCCAGAATTGGGGCATTTCTGGAAACCCTGATCACCGAAACCTGCCGGGCGTTGGAAGTGCAAATTGTCGATAGTGGCGGTGATTCATGGTCTGTCGATTCAGGAGAGTCGTTCTCACTGTGGCTTTCTTCCCATCCAGGAGAACTATCCATTAACCCGCAGCCCCATGAGGATGAGACCTCTTTGCGTGGCTTGCTGTATGAGCTCATCACCTGTGAGAGCGTGAAAACTGTTTTAAGGAGAACCGACT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 1046 | GenBank | WP_001326173 |
Name | TraI_unitig_2-pAR_0055 | UniProt ID | _ |
Length | 990 a.a. | PDB ID | |
Note | putative relaxase |
Relaxase protein sequence
Download Length: 990 a.a. Molecular weight: 109356.90 Da Isoelectric Point: 4.9395
>WP_001326173.1 MULTISPECIES: MobH family relaxase [Gammaproteobacteria]
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPMFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPMFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
Host bacterium
ID | 1434 | GenBank | NZ_CP021936 |
Plasmid name | unitig_2-pAR_0055 | Incompatibility group | IncA/C2 |
Plasmid size | 138340 bp | Coordinate of oriT [Strand] | 137512..138062 [+] |
Host baterium | Escherichia coli strain AR_0055 |
Cargo genes
Drug resistance gene | blaCMY-6, aac(6')-Ib, qacE, sul1, rmtC, blaNDM-1 |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |