Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 100944 |
Name | oriT_unitig_7-pAR_0104 |
Organism | Escherichia coli strain AR_0104 |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | NZ_CP020119 (26350..26987 [+], 638 nt) |
oriT length | 638 nt |
IRs (inverted repeats) | IR1: 198..207, 211..220 (ATTGGGTGTT..AATACCTAAT); IR2: 319..324, 330..335 (AAAAAA..TTTTTT); IR3: 393..402, 405..414 (GTATTCATGC..GCATGAATAC) |
Location of nic site | 301..302 |
Conserved sequence flanking the nic site | GGTGT|ATAGC |
Note | predicted by oriTfinder |
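The coordinates and inverted repeats listed above can be sanity-checked with a few lines of Python. This is a minimal sketch, not part of the oriTDB record: the helper names `revcomp`, `span_length`, and `arm_mismatches` are hypothetical. It assumes coordinates are 1-based and inclusive, and that the two arms of an inverted repeat should be reverse complements of each other.

```python
# Sketch only: helper names are illustrative, values are copied from the record.

def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1

def arm_mismatches(left: str, right: str) -> int:
    """Count positions where the right arm differs from revcomp(left arm)."""
    return sum(a != b for a, b in zip(revcomp(left), right))

# Inverted-repeat arms as listed in the IRs row.
irs = {
    "IR1": ("ATTGGGTGTT", "AATACCTAAT"),
    "IR2": ("AAAAAA", "TTTTTT"),
    "IR3": ("GTATTCATGC", "GCATGAATAC"),
}

report = {name: arm_mismatches(left, right) for name, (left, right) in irs.items()}

print(span_length(26350, 26987))  # oriT span on NZ_CP020119
print(report)
```

Under these assumptions the span evaluates to 638 nt, matching the stated oriT length. IR2 and IR3 come out as perfect reverse complements, while IR1 shows two mismatched positions, so it is an imperfect repeat.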
oriT sequence
Length: 638 nt
>oriT_unitig_7-pAR_0104
AGCGCCGCAGATAATCTGACCGATTACCTCCTGAAACCAGGTCTATATAGGCCAAAAGTTCATCTGATACTTTTGCGGTTATTATTGGCATTCAGTCCTCACATTGTGCATTTTTTAAACAAAAAATTGGGATCTAACAAGCTGAAATCTTAGTATTACCAAAGTAATAAAGCAAACTCATTATAAAACAATGAGTTATTGGGTGTTTTTAATACCTAATTATTACCGAATATTGTTGCTATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAGGGGCGATCTACGTAGGTTAAGGACTAACTGGCTAAAAAGCGTTCAATATTCCGTATTCATGCTTGCATGAATACCAGTACAACAAAAGTACATCAAAATTACATCAAAATTACATCACTTGAAGGTTGACAGTACAACAAAATTACATCATTCTTTGGTCATGAGGTAGCCAGTACAACAAAAGTACATCAAAAGTACATCAAAAGTACATCAAAATTACATCAAAATTACATCATTCTAAATGAGGGTACTATGAAGCCCAAAAGTATCAGGGCGGCACTTCAGTTGATGTTGCCGG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 1014 | GenBank | WP_000884351 |
Name | TrwC_unitig_7-pAR_0104 | UniProt ID | _ |
Length | 1076 a.a. | PDB ID | |
Note | putative relaxase |
Relaxase protein sequence
Length: 1076 a.a. | Molecular weight: 119897.16 Da | Isoelectric point: 6.6071
>WP_000884351.1 MULTISPECIES: MobF family relaxase [Enterobacterales]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
ISGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEEGGHEI
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
Host bacterium
ID | 1402 | GenBank | NZ_CP020119 |
Plasmid name | unitig_7-pAR_0104 | Incompatibility group | IncN |
Plasmid size | 51515 bp | Coordinate of oriT [Strand] | 26350..26987 [+] |
Host bacterium | Escherichia coli strain AR_0104 |
Cargo genes
Drug resistance gene | blaKPC-4, blaTEM-1A, aph(3')-Ia, dfrA14 |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |