oriTDB
The information of the oriT region
oriTDB accession number 100751
Name oriT_p2009C-3133-3  insolico
Sequence Completeness intact
oriT length 290 nt
oriT sequence CGCACCGCTAGCAGCGCCCCTGGCGGTATCCTATAAAAAAACACACCGCGCCGCTAGCAG CACCCCTAATATAAAATAATGTTTTTTATAAAAATAGTCAGTACCACCCCTACAAAGCGG TGTCGGCGCGTTGTTGTAGCCGCGCCGACACCGCTTTTTTAAATATCATAAAGAGAGTAA GAGAAACTAATTTTTCATAACTCTCTATTTATAAAGAAAAATCAGCAAAAACTTGTTTTT GCGTGGGGTGTGGTGCTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCT
IR (inverted repeat)[225-232] [235-242] n  (GCAAAAAC..GTTTTTGC)
Location of nic site [251-252] nt
Conserved sequence flanking the
  nic site
GTGGGGTGT|GG
Note predicted by the oriTfinder
L:225-232;R:235-242;D:251-252
Visualization of oriT structure
Reference
 N/A
I. Information of Relaxase
ID 823
Name TraI_p2009C-3133-3 insolico
GenBank accession number WP_000986969
Family MOBF
Length 1756 aa
UniProt ID A0A0P0T0N6
PDB ID _
Pfam TrwC [PF08751.10], Evalue: 3.10E-75, Aligned region: 10..284
TraI [PF07057.10], Evalue: 1.00E-55, Aligned region: 1434..1556
AAA_30 [PF13604.5], Evalue: 9.60E-35, Aligned region: 968..1160
AAA_19 [PF13245.5], Evalue: 6.20E-09, Aligned region: 973..1108
Note putative relaxase
Protein sequence [Download] MMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGRGAEQLGLQGSVDKDVFTRLLEGKL
PDGADLSRMQDGSNKHRPGYDLTFSAPKSVSVMAMLGGDKRLIDAHNQAVDFAVRQVEAL
ASTRVMTDGQSETVLTGNLVMALFNHDTSRDQEPQLHTHAVVANVTQHNGEWKTLSSDKV
GKTGFIENVYANQIAFGRLYREKLKEQVEALGYETEVVGKHGMWEMPGVPVEAFSGRSQA
IREAVGEDASLKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKETGFDIRAYRDAADQRAE
TRTQTPGPASQDGPDVQQAVTQAIAGLSERKVQFTYTDVLARTVGILPPENGVIERARAG
IDEAISREQLIPLDREKGLFTSGIHVLDELSVRALSRDIMKQNRVTVHPEKSVPRTAGYS
DAVSVLAQDRPSLAIVSGQGGAAGQRERVAELVMMAREQGREVQIIAADRRSQMNLKQDE
RLSGELITGRRQLQEGMAFTPGNTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQR
TGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFAASVKAGEESVAQ
VSGVREQAILTQAIRSELKTQGVLGRPEVTMTALSPVWLDSRSRYLRDMYRPGMVMEQWN
PETRSHDRYVIDRVTAQSNSLTLRDAQGETQVVRISSLDSSWSLFRPEKMPVADGERLRV
TGKIPGLRVSGGDRLQVASVSEDAMTVVVPGRAEPATLPVSDSPFMALKLENGWVETPGH
SVSDSAKVFASVTQMAMDNATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQI
KARAGETLLETAISLQKTGLHTPAQQAIHLALPVVESKNLAFSMVDLLTEAKSFAAEGTS
FTELGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEGKEAVTPLMER
VPGELMEKLTSGQRAATRMILETSDRFTVVQGYAGVGKTTQFRAVMSAVNMLPESERPRV
VGLGPTHRAVGEMRSAGVDAQTLASFLHDTQLLQRSGETPNFSNTLFLLDESSMVGNTDM
ARAYALIAAGGGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVAIMKEIVRQTPELREAV
YSLINRDVERALSGLESVKPSQVPRQEGAWVPEHSVTEFSHSQEAKLAEAQQKAMLKGEA
FPDIPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIHDAREKAGELGKEQVM
APVLNTANIRDGELRRLSTWETHRDALALVDNVYHRIAGISKDDGLITLQDAEGNTRLIS
PREAVAEGVTLYTPDTIRVGTGDRMRFTKSDRERGYVANSVWTVTAVSGDSVTLSDGQQT
RVIRPGQERAEQHIDLAYAITAHGAQGASETFAIALEGTEGNRKQMAGFESAYVALSRMK
QHVQVYTDNRQGWTDAINNAVQKGTAHDVLEPKSDREVMNAERLFSTARELRDVVAGRAV
LRQAGLAGGDSPARFIAPGRKYPQPYVALPAFDRNGKSAGIWLNPLTTDDGNGLRGFSGE
GRVKGSGDAQFVALQGSRNGESLLADNMQDGVQIARDNPDSGVVVRIAGEGRPWNPGTIT
GGRVWGDIPDNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLP
DGKTEQAVRDIAGLERDRSAISEREAALPESVLREPQRVREAVREVARENLLQERLQQME
RDMVRDLQKEKTLGGD
Reference
 N/A

II. Auxiliary DNA-binding protein(s) of the relaxosome
 N/A
 N/A
Information of plasmid
ID 1211
Plasmid name p2009C-3133-3
GenBank accession number NZ_CP013027.1
Incompatibility group IncFIB(AP001918)
Genome size 174564 bp
Coordinate of oriT  [Strand] 79783..80072 [-]
Drug resistance insolico tetracyline resistance
Heavy-metal resistance _
Virulence factor _
Xenobiotic degradation _
Host bacterium [NCBI Taxonomy ID] Escherichia coli 2009C-3133 [562]
Reference
[1] Lindsey RL et al (2015) Complete Genome Sequences of Two Shiga Toxin-Producing Escherichia coli Strains from Serotypes O119:H4 and O165:H25. Genome Announc. 3(6). [PMID:26679598]