Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 114376 |
Name | oriT_pSa1423-50K |
Organism | Salmonella sp. strain Sa1423 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_MK356560 (37675..37775 [+], 101 nt) |
oriT length | 101 nt |
IRs (inverted repeats) | 80..85, 91..96 (AAAAAA..TTTTTT); 20..26, 38..44 (TAAATCA..TGATTTA) |
Location of nic site | 62..63 |
Conserved sequence flanking the nic site | GGTGTATAGC |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Length: 101 nt
>oriT_pSa1423-50K
TATTTATTTTTTTATTTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
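The coordinates listed for the inverted repeats, the nic site, and the conserved flanking sequence can be cross-checked against the oriT sequence itself. The following is a minimal sketch, assuming that all positions are 1-based and inclusive relative to the 101 nt oriT sequence; the helper names `segment` and `reverse_complement` are illustrative and are not part of oriTfinder or oriTDB.

```python
# oriT sequence copied verbatim from the FASTA record above.
ORIT = (
    "TATTTATTTTTTTATTTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGG"
    "TTAAGGGATAAAAAATCATCTTTTTTGGTAG"
)

# Values copied from the record; the 1-based inclusive convention is an assumption.
IR_PAIRS = [((80, 85), (91, 96)), ((20, 26), (38, 44))]
NIC_SITE = (62, 63)
FLANK = "GGTGTATAGC"


def segment(seq, start, end):
    """Return the 1-based, inclusive slice start..end of seq."""
    return seq[start - 1:end]


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Each left arm should equal the reverse complement of its right arm.
ir_checks = []
for left, right in IR_PAIRS:
    left_arm = segment(ORIT, *left)
    right_arm = segment(ORIT, *right)
    ir_checks.append(left_arm == reverse_complement(right_arm))
    print(left, left_arm, right, right_arm, ir_checks[-1])

# Locate the conserved flanking sequence and confirm the nic site lies inside it.
flank_start = ORIT.find(FLANK) + 1          # convert to 1-based
flank_end = flank_start + len(FLANK) - 1
nic_inside_flank = flank_start <= NIC_SITE[0] and NIC_SITE[1] <= flank_end
print("flank:", flank_start, flank_end, "nic inside flank:", nic_inside_flank)
```

Under these assumptions, both repeat pairs are perfect inverted repeats and the nic site falls within the conserved sequence.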
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 9217 | GenBank | WP_000884352 |
Name | mobF_HXJ03_RS00195_pSa1423-50K | UniProt ID | A8R732 |
Length | 1078 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Length: 1078 a.a. | Molecular weight: 120163.41 Da | Isoelectric point: 6.5766
>WP_000884352.1 MULTISPECIES: MobF family relaxase [Enterobacterales]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
ISGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
Protein domains
Predicted by InterProScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A8R732 |
Auxiliary protein
ID | 5403 | GenBank | WP_172694008 |
Name | WP_172694008_pSa1423-50K | UniProt ID | _ |
Length | 138 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Length: 138 a.a. | Molecular weight: 15318.58 Da | Isoelectric point: 5.9670
>WP_172694008.1 MULTISPECIES: traK protein [Enterobacteriaceae]
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCLEAGPGDRESGVKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
Protein domains
No domain identified.
Protein structure
No available structure.
T4CP
ID | 10683 | GenBank | WP_000342688 |
Name | t4cp2_HXJ03_RS00200_pSa1423-50K | UniProt ID | _ |
Length | 509 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 509 a.a. | Molecular weight: 57762.87 Da | Isoelectric point: 9.6551
>WP_000342688.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
T4SS
The T4SS was predicted using oriTfinder 2.0.
Region 1: 20685..37108
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
HXJ03_RS00085 (NNIBIDOC_00261) | 15861..16559 | + | 699 | WP_001390335 | EamA family transporter | - |
HXJ03_RS00090 (NNIBIDOC_00262) | 16591..17790 | - | 1200 | WP_001493765 | tetracycline efflux MFS transporter Tet(A) | - |
HXJ03_RS00095 (NNIBIDOC_00263) | 17896..18546 | + | 651 | WP_001493764 | tetracycline resistance transcriptional repressor TetR(A) | - |
HXJ03_RS00100 | 18578..18820 | - | 243 | WP_000844627 | transposase | - |
HXJ03_RS00105 (NNIBIDOC_00264) | 18878..19378 | - | 501 | Protein_20 | Tn3 family transposase | - |
HXJ03_RS00110 (NNIBIDOC_00265) | 19444..20148 | + | 705 | WP_001067858 | IS6-like element IS26 family transposase | - |
HXJ03_RS00115 (NNIBIDOC_00267) | 20271..20642 | - | 372 | WP_011867773 | H-NS family nucleoid-associated regulatory protein | - |
HXJ03_RS00120 (NNIBIDOC_00268) | 20685..21419 | + | 735 | WP_000033783 | lytic transglycosylase domain-containing protein | virB1 |
HXJ03_RS00125 (NNIBIDOC_00269) | 21428..21709 | + | 282 | WP_000440698 | transcriptional repressor KorA | - |
HXJ03_RS00130 (NNIBIDOC_00270) | 21719..22012 | + | 294 | WP_000209137 | TrbC/VirB2 family protein | virB2 |
HXJ03_RS00135 (NNIBIDOC_00271) | 22062..22379 | + | 318 | WP_000496058 | VirB3 family type IV secretion system protein | virB3 |
HXJ03_RS00140 (NNIBIDOC_00272) | 22379..24979 | + | 2601 | WP_001200711 | VirB4 family type IV secretion/conjugal transfer ATPase | virB4 |
HXJ03_RS00145 (NNIBIDOC_00273) | 24997..25710 | + | 714 | WP_000749362 | type IV secretion system protein | virB5 |
HXJ03_RS00150 (NNIBIDOC_00274) | 25718..25945 | + | 228 | WP_000734973 | IncN-type entry exclusion lipoprotein EexN | - |
HXJ03_RS00155 (NNIBIDOC_00275) | 25961..27001 | + | 1041 | WP_000886022 | type IV secretion system protein | virB6 |
HXJ03_RS00160 | 27072..27230 | + | 159 | WP_012561180 | hypothetical protein | - |
HXJ03_RS00165 (NNIBIDOC_00276) | 27220..27918 | + | 699 | WP_000646594 | virB8 family protein | virB8 |
HXJ03_RS00170 (NNIBIDOC_00277) | 27929..28813 | + | 885 | WP_000735067 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
HXJ03_RS00175 (NNIBIDOC_00278) | 28813..29973 | + | 1161 | WP_000101711 | type IV secretion system protein VirB10 | virB10 |
HXJ03_RS00180 (NNIBIDOC_00279) | 30015..31010 | + | 996 | WP_000128596 | ATPase, T2SS/T4P/T4SS family | virB11 |
HXJ03_RS00185 (NNIBIDOC_00280) | 31010..31543 | + | 534 | WP_000731968 | phospholipase D family protein | - |
HXJ03_RS00190 (NNIBIDOC_00281) | 31717..32343 | - | 627 | WP_000434930 | DUF6710 family protein | - |
HXJ03_RS00195 (NNIBIDOC_00282) | 32343..35579 | - | 3237 | WP_000884352 | MobF family relaxase | - |
HXJ03_RS00200 (NNIBIDOC_00283) | 35579..37108 | - | 1530 | WP_000342688 | type IV secretion system DNA-binding domain-containing protein | virB4 |
HXJ03_RS00205 (NNIBIDOC_00284) | 37110..37526 | - | 417 | WP_172694008 | traK protein | - |
HXJ03_RS00290 (NNIBIDOC_00285) | 38039..38458 | + | 420 | WP_063815362 | plasmid stabilization protein StbA | - |
HXJ03_RS00215 (NNIBIDOC_00286) | 38467..39183 | + | 717 | WP_000861577 | StbB family protein | - |
HXJ03_RS00220 (NNIBIDOC_00287) | 39185..39553 | + | 369 | WP_000414913 | plasmid stabilization protein StbC | - |
HXJ03_RS00225 (NNIBIDOC_00288) | 39735..40079 | + | 345 | WP_000999874 | hypothetical protein | - |
HXJ03_RS00230 (NNIBIDOC_00289) | 40190..40510 | - | 321 | WP_000211832 | protein CcgAII | - |
HXJ03_RS00235 (NNIBIDOC_00290) | 40565..40744 | - | 180 | WP_000214483 | protein CcgAI | - |
HXJ03_RS00240 (NNIBIDOC_00291) | 41576..42085 | - | 510 | WP_000129958 | antirestriction protein ArdA | - |
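The Size (bp) column and the protein lengths reported earlier in this record can be reconciled with the coordinates by simple arithmetic. The sketch below assumes that coordinates are 1-based and inclusive and that each gene span covers the full coding sequence including the stop codon; the function names are illustrative only. It also checks that the oriT (37675..37775) lies in the gap between HXJ03_RS00205 and HXJ03_RS00290.

```python
# (locus tag, start, end, size in bp, protein length in a.a.), copied from this record.
GENES = [
    ("HXJ03_RS00195", 32343, 35579, 3237, 1078),  # MobF family relaxase
    ("HXJ03_RS00200", 35579, 37108, 1530, 509),   # T4CP
    ("HXJ03_RS00205", 37110, 37526, 417, 138),    # traK auxiliary protein
]
ORIT = (37675, 37775)
NEXT_GENE_START = 38039  # start of HXJ03_RS00290 (StbA)


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


def protein_length(size_bp):
    """Residues encoded by a CDS of size_bp, assuming the span includes the stop codon."""
    codons, remainder = divmod(size_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


# Compare the computed values with the size and length stated in the record.
results = []
for locus, start, end, size_bp, length_aa in GENES:
    ok = span_length(start, end) == size_bp and protein_length(size_bp) == length_aa
    results.append(ok)
    print(locus, span_length(start, end), protein_length(size_bp), ok)

# oriT length and its position relative to the neighbouring genes.
orit_length = span_length(*ORIT)
orit_is_intergenic = GENES[-1][2] < ORIT[0] and ORIT[1] < NEXT_GENE_START
print("oriT length:", orit_length, "intergenic:", orit_is_intergenic)
```

With these assumptions, the three conjugation genes listed here are consistent with the 1078, 509, and 138 a.a. protein lengths, and the oriT span reproduces the stated 101 nt.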
Host bacterium
ID | 14811 | GenBank | NZ_MK356560 |
Plasmid name | pSa1423-50K | Incompatibility group | IncN |
Plasmid size | 49279 bp | Coordinate of oriT [Strand] | 37675..37775 [+] |
Host bacterium | Salmonella sp. strain Sa1423 |
Cargo genes
Drug resistance gene | tet(A) |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |