oriTDB
The information of the oriT region
oriTDB accession number 100536
Name oriT_pCFSAN000189  in silico
Sequence Completeness intact
oriT length 360 nt
oriT sequence GATTATCAAGAGTCACATGTGAATCGGCCATTTATTTTATTAGAGTCAACCTGGATTCAC CAACTTCATACAGGCGAATCTGTTATACCGAAATTAAGGATTCGAATCCAGATTCGAATC CTTTAATTTGTCTTAATTTAGTGGTCTGATTCGTAGATGAATCTGGATTCAACTCCTTTA CGATTCACTAAAGATTTGCATGTGAATCCAGTATCTATATCGGACATAGAATAACGATAA AAATATTTACATAACACACTGAAATAAAAGGATAATCATGCAAGATTAAATCTTGCGACG GGTGTGGGATTTTGATGTTAAGGAGCAGGCATACAGAAATATGTCAGCCAGGAAGAATTC
IR (inverted repeat) [280-287] [289-296] nt (GCAAGATT..AATCTTGC)
Location of nic site [305-306] nt
Conserved sequence flanking the nic site GACGGGTGT|GG
Note predicted by oriTfinder
Visualization of oriT structure L: 280-287; R: 289-296; D: 305-306
Reference
 N/A
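The annotated features of the oriT region can be cross-checked against the sequence itself. The following is a minimal Python sketch, not part of the oriTDB record: it assumes the coordinates are 1-based and inclusive, that the two IR arms should be reverse complements of each other, and that the "|" in the conserved sequence falls between positions 305 and 306. The helper names `reverse_complement` and `region` are hypothetical.

```python
# oriT sequence copied from this record (six 60-nt blocks, 360 nt in total).
ORIT = (
    "GATTATCAAGAGTCACATGTGAATCGGCCATTTATTTTATTAGAGTCAACCTGGATTCAC"
    "CAACTTCATACAGGCGAATCTGTTATACCGAAATTAAGGATTCGAATCCAGATTCGAATC"
    "CTTTAATTTGTCTTAATTTAGTGGTCTGATTCGTAGATGAATCTGGATTCAACTCCTTTA"
    "CGATTCACTAAAGATTTGCATGTGAATCCAGTATCTATATCGGACATAGAATAACGATAA"
    "AAATATTTACATAACACACTGAAATAAAAGGATAATCATGCAAGATTAAATCTTGCGACG"
    "GGTGTGGGATTTTGATGTTAAGGAGCAGGCATACAGAAATATGTCAGCCAGGAAGAATTC"
)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def region(seq: str, start: int, end: int) -> str:
    """Extract a region given 1-based, inclusive coordinates (assumed convention)."""
    return seq[start - 1:end]


# Inverted repeat arms as annotated in the record.
left_ir = region(ORIT, 280, 287)
right_ir = region(ORIT, 289, 296)

# Nic site annotated at [305-306]; the conserved motif in the record has
# 9 nt before the "|" and 2 nt after it.
nic_left, nic_right = 305, 306
nic_context = (
    region(ORIT, nic_left - 9 + 1, nic_left)
    + "|"
    + region(ORIT, nic_right, nic_right + 2 - 1)
)

print(len(ORIT), left_ir, right_ir, nic_context)
```

Under these assumptions the extracted arms match the `GCAAGATT..AATCTTGC` annotation, and the motif around the nic site matches `GACGGGTGT|GG`.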
I. Information of Relaxase
ID 607
Name TraI_pCFSAN000189 in silico
GenBank accession number WP_020833546
Family MOBF
Length 1767 aa
UniProt ID _
PDB ID _
Pfam TrwC [PF08751.10], Evalue: 9.80E-72, Aligned region: 7..280
TraI [PF07057.10], Evalue: 4.60E-46, Aligned region: 1441..1562
AAA_30 [PF13604.5], Evalue: 1.90E-34, Aligned region: 970..1162
AAA_19 [PF13245.5], Evalue: 7.00E-10, Aligned region: 975..1111
Note putative relaxase
Protein sequence MLSISSIKGDAGYYSHEDNYYASGSLDSRWMGEGAEKLGLKGEVASADMDAVRQGRLPDG
SDLSRMVDGVNKHRSGYDLTFSAPKSVSVMALVGEDRRFIEAHNRAVAVVMKEVEQLVSA
RITQEGKTETVLTGSMVAALYNHDTSRDLDPQVHTHALVFNVTFADEKWRSLASDIRMKT
GFSENLYATKIALGNLYRSALREDIESMGFETVAAGKHGLWELKDVPVDIFSSRSQAIRE
AAGPDASAKSRDVAALDTRQAKAWADPDLLKADWRRRLTDEKFDIGHYISQAQARVEITA
PVVAGQGGMRAPGQPGIASSGEAADELVQKAVSDTISALSDKKVQFTWSEMLAGTVSRLP
SAPGLFEQARAGIEAAIEGQRLIPLDREKGIFTSDIHLLNELSVHQLARTAVAEQTVLVF
PERAQERDIPAGDAVSVLSQDKSPVAIFSGRGGAQTLRERTEDVAMMARSQGREVMVIAA
DGRSGQFLSESPHLAGHVMLRSQMNADTVLPVQGTVIVDRAERLSLKETVLLQEKALSAG
AQLIFMDTENRQGTGNALSVLKEADIPQYRFYGTQLPEVRLVSEADKRSRYGQLAQEYVR
LSAEGRDVVAQVTGTREQQQLTEVIRDTRREAGELGREQVTLRVLEPVWLDSKTRHQRDN
YRPGMVMEQWDAEKKTMTRHTIDRVAEATNSLVLQGEDGTRLTMKVTQLDGSWSLYRSRT
LEVSEGDRVRTLGRELKGAIKAKEQFTVAGLENGAVRLRSGDRELCLPTERAVKLAHDYV
EGTGAGTSASRTVLAAVGPRGLNKQALNALAQSGSDIRIYTPLEGRQAARKVESVSVVRL
ASDQVRQSTGEANLDTAIQASRERLMSDAEQAVNLAIPRAQQGQVYLSELTLLAEAVKSG
QPLADVRAEIARQVDSGALIKLDSVAGAGNRVLVPRVAYEMEKTIIRHIAEGKDAVQPLM
ALTPASVLSGLTAGQRDATRTVLENTDRFMAIQGYAGVGKTTQFRAVMGALNTLSESVRP
QVIGLGPTHRAVHEMREAGVEARTLASFLSETRLAIQAGETPDFRNVLFLTDESSMVGNR
DMAELYQLVAAGGGRMVSSGDTAQLQAISTGLPFRLVQQRSAIDTVVMQEIVRQTPALRP
AIESIIAGQVEISLRQVDDVTPQQVPRQPGAWVPDNSVMEIRAPKKEQDQVQDTPVAGEQ
TLTPEQLALVRTDIIEAIRDDWMGRTPEAQQQTLVVAELNADRHAINDAIHTARHEKGDT
GAEERTFTVLEPLRVPDNALRAAETFAEYTGAVAMMNERYWTVAEVDTQDAVVTLRNADG
ESVLISPQQNTAQDISLFTPRDLTISQGDRVRFTRSDTDRGYVANSLWEVAGFTDDGAIR
FRQGDQEKIVDPQAMTEDRHIDLAYALTVYGVQGASERFAIALTGTEGGRKRMASLESTY
VTLSRAKEHVQVYTDDLAGWSADARHSNAGQTAHDLLHQKSDHESDTGNRLLATASRLDK
TALGRRVLAENGLEGETMARFIAAGKKYPSPYVALPAWTRHGKAAGALLTEIRIEDDGMR
VVLSDESRLRGGEDAQFAGLQASRNGQTLIADDAQTALRLAQENPESGVVIRLHGEERLL
NAARLTGGRITEPDEVARSVRSVAEAESAAKAEDPITLPPDEQQKLAEAQEKAARELAEQ
ARQELLPALPGTEGDKPDPLLSASDERRLRDGTERETRELDEAVREAVAEGREPRQQVCD
QMQRTEREWVVNVPEKEIELEKTLGGD
Reference
 N/A

II. Auxiliary DNA-binding protein(s) of the relaxosome
 N/A
Information of plasmid
ID 997
Plasmid name pCFSAN000189
GenBank accession number NC_021817.1
Incompatibility group IncFII(S)
Genome size 78193 bp
Coordinate of oriT  [Strand] 35811..36170 [+]
Drug resistance _
Heavy-metal resistance _
Virulence factor _
Xenobiotic degradation _
Host bacterium [NCBI Taxonomy ID] Salmonella enterica subsp. enterica serovar Bareilly str. CFSAN000189 [1173427]
Reference
[1] Hoffmann M et al (2016) Tracing Origins of the Salmonella Bareilly Strain Causing a Food-borne Outbreak in the United States. J Infect Dis. 213(4):502-8. [PMID:25995194]
[2] Timme RE et al (2013) Phylogenetic diversity of the enteric pathogen Salmonella enterica subsp enterica inferred from genome-wide reference-free SNP characters. Genome Biol Evol. 5(11):2109-23. [PMID:24158624]
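The oriT coordinate on the plasmid and the oriT-local feature positions can be related by simple arithmetic. The sketch below is illustrative only and makes two assumptions that the record does not state explicitly: oriT-local positions are 1-based relative to the oriT start, and because the oriT lies on the [+] strand, local positions increase in the same direction as plasmid positions. The function name `to_plasmid` is hypothetical, and the derived plasmid positions are not values from oriTDB.

```python
# Values taken from this record.
ORIT_START, ORIT_END = 35811, 36170   # Coordinate of oriT on pCFSAN000189 [+]
PLASMID_SIZE = 78193                  # Genome size in bp


def to_plasmid(local_pos: int, orit_start: int = ORIT_START) -> int:
    """Map a 1-based oriT-local position to a plasmid coordinate.

    Assumes the [+] strand, so both coordinate systems run in the
    same direction.
    """
    return orit_start + local_pos - 1


# Length implied by the inclusive coordinate range.
orit_length = ORIT_END - ORIT_START + 1

# oriT-local annotations from the record, mapped onto the plasmid.
nic_site = (to_plasmid(305), to_plasmid(306))
left_ir = (to_plasmid(280), to_plasmid(287))
right_ir = (to_plasmid(289), to_plasmid(296))

print(orit_length, nic_site, left_ir, right_ir)
```

The length implied by the coordinates agrees with the 360 nt reported for the oriT, which is a quick consistency check on the record.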