oriTDB
The information of the oriT region
oriTDB accession number 100750
Name oriT_p2009C-3133-2 in silico
Sequence Completeness intact
oriT length 462 nt
oriT sequence AAGAAGAGCACCCCTAGCAGCGCCCCTAGGGTCATACTATAAAAAAACGCACGGCGCTGC TAGGGGCGGCCCTAATATATCCAAATGTTTTTCATAAAATTTGTCAGTACTGACCCTAAC AAGGGTCGCCATAGGGTCGCCATAGGGTCGTCAACGACCCTATTTAATAATGTAAAATAA ATTAAAATACATTATTTAAAACATAAGCTAATGATTCAAAAAGCAAATCAGCAAAAACTT GTTTTTGCGTGGGGTGTGGTGCTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCTTTTT GTGGAGTGGGTTAAATTATTTACGGATAAAGTCACCAGAGGTGGAAAAATGAAAAAATGG ATGTTAGCCATCTGCCTGATGTTTATAAATGAGATCTGCCATGCCACTGATTGCTTTGAT CTTGCAGGCCGGGATTACAAAATAGATCCTGATTTACTGAGA
IR (inverted repeat) [142-151] [154-163] nt (ATAGGGTCGT..ACGACCCTAT)
[231-238] [241-248] nt (GCAAAAAC..GTTTTTGC)
Location of nic site [257-258] nt
Conserved sequence flanking the nic site GTGGGGTGT|GG
Note predicted by oriTfinder
L:142-151;R:154-163;L:231-238;R:241-248;D:257-258
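The annotations above can be re-checked directly against the sequence. The following is a minimal sketch, not part of the oriTDB record: it assumes the coordinates are 1-based and inclusive, and it assumes that the tags in the compact feature string mean L = left IR arm, R = right IR arm, and D = nic site (an interpretation inferred from the matching coordinates). The helper names `revcomp`, `region`, and `parse_features` are hypothetical.

```python
# oriT sequence from this record, joined from the 60-nt blocks shown above.
ORIT = (
    "AAGAAGAGCACCCCTAGCAGCGCCCCTAGGGTCATACTATAAAAAAACGCACGGCGCTGC"
    "TAGGGGCGGCCCTAATATATCCAAATGTTTTTCATAAAATTTGTCAGTACTGACCCTAAC"
    "AAGGGTCGCCATAGGGTCGCCATAGGGTCGTCAACGACCCTATTTAATAATGTAAAATAA"
    "ATTAAAATACATTATTTAAAACATAAGCTAATGATTCAAAAAGCAAATCAGCAAAAACTT"
    "GTTTTTGCGTGGGGTGTGGTGCTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCTTTTT"
    "GTGGAGTGGGTTAAATTATTTACGGATAAAGTCACCAGAGGTGGAAAAATGAAAAAATGG"
    "ATGTTAGCCATCTGCCTGATGTTTATAAATGAGATCTGCCATGCCACTGATTGCTTTGAT"
    "CTTGCAGGCCGGGATTACAAAATAGATCCTGATTTACTGAGA"
)

# Compact feature string from the record (tag meanings are an assumption).
FEATURES = "L:142-151;R:154-163;L:231-238;R:241-248;D:257-258"


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def region(seq, start, end):
    """Return the 1-based, inclusive region start..end of seq."""
    return seq[start - 1:end]


def parse_features(text):
    """Split 'TAG:start-end;...' into (tag, start, end) tuples."""
    parsed = []
    for item in text.split(";"):
        tag, span = item.split(":")
        start, end = map(int, span.split("-"))
        parsed.append((tag, start, end))
    return parsed


features = parse_features(FEATURES)
lefts = [(s, e) for tag, s, e in features if tag == "L"]
rights = [(s, e) for tag, s, e in features if tag == "R"]

# Pair each left arm with the right arm that follows it in the string.
ir_pairs = [(region(ORIT, *l), region(ORIT, *r)) for l, r in zip(lefts, rights)]
# An inverted repeat holds when the right arm is the reverse complement of the left.
ir_ok = [revcomp(left) == right for left, right in ir_pairs]

# Nic site: the cut falls between the two listed positions.
nic_start, nic_end = next((s, e) for tag, s, e in features if tag == "D")
nic_context = (
    region(ORIT, nic_start - 8, nic_start) + "|" + region(ORIT, nic_end, nic_end + 1)
)

print(len(ORIT), ir_pairs, ir_ok, nic_context)
```

Under these assumptions, the sequence length, both inverted repeats, and the conserved sequence flanking the nic site can each be compared against the values listed in the record.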
Visualization of oriT structure
Reference
 N/A
I. Information of Relaxase
ID 822
Name TraI_p2009C-3133-2 in silico
GenBank accession number WP_060565186
Family MOBF
Length 1756 aa
UniProt ID A0A0P0T0S7
PDB ID _
Pfam TrwC [PF08751.10], Evalue: 5.20E-75, Aligned region: 10..284
TraI [PF07057.10], Evalue: 1.70E-55, Aligned region: 1434..1556
AAA_30 [PF13604.5], Evalue: 3.20E-34, Aligned region: 968..1159
AAA_19 [PF13245.5], Evalue: 9.70E-09, Aligned region: 973..1108
Note putative relaxase
Protein sequence MMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGRGAEQLGLQGSVDKDVFTRLLEGRL
PDGADLSRMQDGSNKHRPGYDLTFSAPKSVSMMAMLGGDKRLIDAHNQAVDFAVRQVEAL
ASTRVMTDGQSETVLTGNLVMALFNHDTSRDQEPQLHTHAVVANVTQHNGEWKTLSSDKV
GKTGFIENVYANQIAFGRLYREKLKEQVEALGYETEVVGKHGMWEMPGVPVEAFSGRSQT
IREAVGEDASLKSRDVAALDTRKSKQHVDPEIKMAEWMQTLKETGFDIRAYRDAADQRAD
LRTQTPRPASQDGPDVQQAVTQAIAGLSERKVQFTYTDVLARTVGILPPENGVIERARAG
IDEAISREQLIPLDREKGLFTSGIHVLDELSVRALSRDIMKQNRVTVHPEKSVPRTAGYS
DAVSVLAQDRPSLAIVSGQGGAAGQRERVAELVMMAREQGREVQIIAADRRSQMNLKQDE
RLSGELITGRRQLLEGMAFTPGSTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQR
TGTGSALMAMKDTGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFAASVKTGEESVAQ
VSGVREQAILTQAIRSELKTQDVLGHPEVTMTALSPVWLDSRSRYLRDMYRPGMVMEQWN
PETRSHDRYVTERVTAQSNSLTLRNAQGETRVVRISSLDSSWSLFRPEKMPVADGERLRV
TGKIPGLRVSGGDRLQVASVSEDAMTVVVPGRAELASLPVADSPFTALKLESGWVETPGH
SVSDSAKVFASVTQMAMDNATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQI
KARAGETSLETAISHQKSALHTPAQQAIHLALPVVESKNLAFSQVDLLTEAKSFAAEGTS
FTELGGEIDAQIKRGDLLHVDVAKGYGTDLLVSRASYEAEKSILRHILEGKEAVTPLMER
VPGELMEMLTSGQRAATRMILETSDRFTVVQGYAGVGKTTQFRAVMSAVNMLPESERPRV
VGLGPTHRAVGEMRSAGVDAQTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDM
ARAYALIAAGGGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVAIMKEIVRQTPELREAV
YSLINRDVERALSGLESVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAMLKGEA
FPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIHDAREKAGELGKEQVM
VPVLNTANIRDGELRRLSTWETHRDALALVDNVYHRIAGISRDDGLITLQDAEGNTRLIS
PREAVAEGVTLYTPDTIRVGTGDRMRFTKSDRERGYVANSVWTVTAVSGDSVTLSDGQQT
RVIRPGQERAEQHIDLAYAITAHGAQGASETFAIALEGTEGNRKLMAGFESAYVALSRMK
QHVQVYTDDRQGWTDAINNAVQKGTAHDVLEPKADREVMNAERLFSTARQLQDVAAGRAV
LRQAGLARGDSPARFIAPGRKYPQPYVALPAFDRNGRSAGIWLNPLTTDDGNGLRGFSGE
GRVKGSGDAQFVALQGSRNGESLLADNMQEGVRIARDNPDSGVVVRIAGEGRPWNPGAIT
GGRVWGDIPDNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLP
DGRTEQAVREIAGLERERAVTSEREAALPESVLREPQREREAVREVVRENLLQERLQQME
RDMVRDLQKEKTLGGD
Reference
 N/A

II. Auxiliary DNA-binding protein(s) of the relaxosome
 N/A
Information of plasmid
ID 1210
Plasmid name p2009C-3133-2
GenBank accession number NZ_CP013026.1
Incompatibility group IncFII(pCoo)
Genome size 63800 bp
Coordinate of oriT  [Strand] 38763..39224 [+]
Drug resistance _
Heavy-metal resistance _
Virulence factor _
Xenobiotic degradation _
Host bacterium [NCBI Taxonomy ID] Escherichia coli 2009C-3133 [562]
Reference
[1] Lindsey RL et al. (2015) Complete Genome Sequences of Two Shiga Toxin-Producing Escherichia coli Strains from Serotypes O119:H4 and O165:H25. Genome Announc. 3(6). [PMID:26679598]