oriTDB
The information of the oriT region
oriTDB accession number 100422
Name oriT_p666 (in silico)
Sequence Completeness intact
oriT length 291 nt
oriT sequence CGCACCGCTAGCAGCGCCCCTAGCGGTATCCTATAAAAAAACACACCGCGCCGCTAGCAG CGCCCCTAATATAAAATGATGTTTTTTTATAAAAATAGTCAGTACCACCCCTACAAAGCG GTGTCGGCGCGTTGTTGTAGCCGCGCCGACACCGCTTTTTTGAATATCATAAAGAGAGTA AGAGAAACTAATTTTTCATAACACTATATTTATAAAGAAAAATCAGCAAAAACTTGTTTT TGCGTAGTGTGTGGTGCTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCT
IR (inverted repeat) [226-233] [236-243] nt (GCAAAAAC..GTTTTTGC)
Location of nic site [252-253] nt
Conserved sequence flanking the nic site GTAGTGTGT|GG
Note predicted by the oriTfinder
Visualization of oriT structure (figure not shown; annotated regions L:226-233; R:236-243; D:252-253)
Reference
 N/A
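The coordinates listed for the inverted repeat and the nic site can be checked against the oriT sequence itself. The following is a minimal sketch, assuming that all positions in this record are 1-based and inclusive and that the "|" in the conserved sequence marks the cut between positions 252 and 253. The helper names `region` and `reverse_complement` are illustrative and are not part of oriTDB or oriTfinder.

```python
# oriT sequence copied from this record, with the block-separating spaces removed.
ORIT = (
    "CGCACCGCTAGCAGCGCCCCTAGCGGTATCCTATAAAAAAACACACCGCGCCGCTAGCAG"
    "CGCCCCTAATATAAAATGATGTTTTTTTATAAAAATAGTCAGTACCACCCCTACAAAGCG"
    "GTGTCGGCGCGTTGTTGTAGCCGCGCCGACACCGCTTTTTTGAATATCATAAAGAGAGTA"
    "AGAGAAACTAATTTTTCATAACACTATATTTATAAAGAAAAATCAGCAAAAACTTGTTTT"
    "TGCGTAGTGTGTGGTGCTTTTGGTGGTGAGAACCACCAACCTGTTGAGCCT"
)


def region(seq, start, end):
    """Return seq[start..end] using 1-based, inclusive coordinates (assumed)."""
    return seq[start - 1:end]


def reverse_complement(seq):
    """Reverse-complement an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


left_arm = region(ORIT, 226, 233)    # left arm of the inverted repeat
right_arm = region(ORIT, 236, 243)   # right arm of the inverted repeat
# Nine bases upstream of the cut, then the two bases downstream of it.
nic_context = region(ORIT, 244, 252) + "|" + region(ORIT, 253, 254)

print(len(ORIT))                                   # 291
print(left_arm, right_arm)                         # GCAAAAAC GTTTTTGC
print(reverse_complement(left_arm) == right_arm)   # True
print(nic_context)                                 # GTAGTGTGT|GG
```

Under these assumptions the two repeat arms are exact reverse complements of each other, and the conserved motif sits at positions 244-254 with the cut after position 252.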
I. Information of Relaxase
ID 493
Name TraI_p666 (in silico)
GenBank accession number WP_000948336
Family MOBF
Length 1756 aa
UniProt ID E3PP28
PDB ID _
Pfam TrwC [PF08751.10], Evalue: 6.70E-76, Aligned region: 10..284
TraI [PF07057.10], Evalue: 5.20E-55, Aligned region: 1434..1556
AAA_30 [PF13604.5], Evalue: 2.40E-34, Aligned region: 968..1159
AAA_19 [PF13245.5], Evalue: 9.70E-09, Aligned region: 973..1108
Note putative relaxase
Protein sequence MLSFSVVKSAGSAGNYYTDKDNYYVLGSMGERWAGQGAEQLGLQGSVDKDIFTRLLEGRL
PDGADLSRMQDGSNKHRPGYDLTFSAPKSVSMMAMLGGDKRLIDAHNQAVDFAVRQVEAL
ASTRVMTDGQSETVLTGNLVMALFNHDTSRDQDPQLHTHVVVANVTQHNGEWKTLSSDKV
GKTGFSENVLANRIAFGKIYQSELRQRVEALGYETEVVGKHGMWEMPGVPVEAFSGRSQA
IREAVGEDASLKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKETGFDIRAYRDAADQRAE
IRTQAPGSASQDGPDVQQAVTQAIAGLSERKVQFTYTDVLARTVGILPPENGVIERARAG
IDEAISREQLIPLDREKGLFTSGIHVLDELSVRALSRDIMKQNRVTVHPEKSVPRTAGYS
DAVSVLAQDRPSLAIVSGQGGAAGQRERVAELVMMAREQGREVQIIAADRRSQMNLKQDE
RLSGELITGRRQLQEGMIFTPGSTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQR
TGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFAASVKAGEESVVQ
VSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLDSRSRYLRDMYRPGMVMEQWN
PETRSHDRYVIDRVTAQSHSLTLRDAQGETQVVRISSLDSSWSLFRPEKMPVADGERLRV
TGKIPGLRVSGGDRLQVASVSEDAMTVVVPGRAEPATLPVADSPFTALKLENGWVETPGH
SVSDSAKVFASVTQMAMDNATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQI
KARAGETSLETAISLQKAGLHTPAQQAIHLALPVLESKNLAFSMVDLLTEAKSFAAEGTG
FTELGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEGKEAVTPLMER
VPGELMEKLTSGQRAATRMILETSDRFTVVQGYAGVGKTTQFRAVMSAVNMLPESERPRV
VGLGPTHRAVGEMRSAGVDAQTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDM
ARAYALIAAGGGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQTPELREAV
YSLINRDVERALSGLESVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAMQNGEA
FPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIHDAREKAGELGKEQVM
VPVLNTANIRDGELRRLSTWENNPDALALVDNVYHRIAGISRDDGLITLQDAEGNTRLIS
PREAVAEGVTLYTPDTIRVGTGDRMRFTKSDRERGYVANSVWTVTAVSGDSVTLSDGQQT
RVIRPGQERAEQHIDLAYAITAHGAQGASETFAIALEGTEGNRKLMAGFESAYVALSRMK
QHVQVYTDNRQGWTDAIHNSVQKGTAHDVFEPKPDREVMNAEWLFSTARELRDVAAGRAV
LRQAGLAGGDSPARFIAPGRKYPQPYVALPAFDRNGKSAGIWLNPLTTDDGNGLRGFSGE
GRVKGSGDAQFVALQGSRNGESLLADNMQDGVRIARDNPDSGVVVRIAGEGRPWNPGAIT
GGRVWGDIPDNSVQPGAGNGEPVTAEVLAQQQAEEAIRRETERRADEIVRKMAENKPDLP
DGKTEQAVREIAGQERDRTATSERETALPESVLRESQREQEAVREVARENLLQERLQQME
RDMVRDLQKEKTLGGD
Reference
 N/A

II. Auxiliary DNA-binding protein(s) of the relaxosome
 N/A
 N/A
Information of plasmid
ID 883
Plasmid name p666
GenBank accession number NC_017722.1
Incompatibility group IncFII(29)
Genome size 66681 bp
Coordinate of oriT  [Strand] 28663..28953 [+]
Drug resistance _
Heavy-metal resistance _
Virulence factor _
Xenobiotic degradation _
Host bacterium [NCBI Taxonomy ID] Escherichia coli ETEC H10407 [316401]
Reference
[1] Crossman LC et al (2010) A commensal gone bad: complete genome sequence of the prototypical enterotoxigenic Escherichia coli strain H10407. J Bacteriol. 192(21):5822-31. [PMID:20802035]
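The plasmid coordinate of the oriT (28663..28953 on the + strand) can be reconciled with the stated oriT length of 291 nt by simple arithmetic. The sketch below assumes GenBank-style 1-based, inclusive coordinates. `span_length` and `extract_region` are hypothetical helpers, and the toy string stands in for the plasmid sequence, which is not reproduced in this record.

```python
def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def extract_region(genome, start, end, strand="+"):
    """Slice a 1-based, inclusive range; reverse-complement it on the '-' strand."""
    sub = genome[start - 1:end]
    if strand == "-":
        sub = sub.translate(str.maketrans("ACGT", "TGCA"))[::-1]
    return sub


# Coordinates from this record: 28663..28953 [+]
print(span_length(28663, 28953))            # 291

# Toy sequence only; the real input would be the NC_017722.1 sequence.
toy = "AAACGTTT"
print(extract_region(toy, 3, 5))            # ACG
print(extract_region(toy, 3, 5, "-"))       # CGT
```

The computed span of 291 matches the oriT length given in the oriT region table.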