Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 103899 |
Name | oriT_pHCM1 |
Organism | Salmonella enterica subsp. enterica serovar Typhi strain 311189_218186 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_CP029926 (141172..141456 [-], 285 nt) |
oriT length | 285 nt |
IRs (inverted repeats) | 184..189, 191..196 (AAAAGT..ACTTTT) |
Location of nic site | 109..110 |
Conserved sequence flanking the nic site |
TTTGGTTAAA |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 285 nt
>oriT_pHCM1
GGTTATTGCTACTTAATGCCGATAACGACTCAGGCTTTGAGGTTTTTTTATACGGTTCACATTTCGTTAGCAAGGTCAGGGTTTTTTGATAAAATTCTGGTTAGTTTGGTTAAAAAGTGTTACAAGTAAGGGTAATGGCTGAAAGGTTAGTTTTAAGGTTCAAAGCGGCAGTATTAAAATTCCAAAAGTTACTTTTCATCCTTCAGAATCCAGACCTTAATTTCATGTAGAAGATTCGTACAATTGTATTGGCGCAAGGACAATCCGCACATGTCAGAATCAGAT
GGTTATTGCTACTTAATGCCGATAACGACTCAGGCTTTGAGGTTTTTTTATACGGTTCACATTTCGTTAGCAAGGTCAGGGTTTTTTGATAAAATTCTGGTTAGTTTGGTTAAAAAGTGTTACAAGTAAGGGTAATGGCTGAAAGGTTAGTTTTAAGGTTCAAAGCGGCAGTATTAAAATTCCAAAAGTTACTTTTCATCCTTCAGAATCCAGACCTTAATTTCATGTAGAAGATTCGTACAATTGTATTGGCGCAAGGACAATCCGCACATGTCAGAATCAGAT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 2901 | GenBank | WP_001011149 |
Name | mobH_SM218186_RS25300_pHCM1 | UniProt ID | A0A0G3BA46 |
Length | 1011 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 1011 a.a. Molecular weight: 113023.00 Da Isoelectric Point: 4.5108
>WP_001011149.1 MULTISPECIES: MobH family relaxase [Enterobacterales]
MNFRALFLSMQRVFGIFSRRENDVSELMMKDAANFSPFAQIIGEQKYTVPDHPNPEVLKFIEYPTRPAGI
QTFNEQSILSLYRDKLHSISMMLAISDGDIREDAYTFTNLVLKPLIEYIRWIHLLPASENHHHNGIGGLL
SHSLEVAMISLKNANHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLSETLTWAPS
SQSLLDWARENNVVEYEIHWRKRIHNQHNIWSSVFLERILDPVCMSFLDRVKKERVYAKMVTALNVYNDG
NDFLSKCVRTSDYYSTGTDLNVLRDPIMGLRSNDAAARAIGTIKHNFTSININNYKTKPMHIIIVNGEVY
LNENAFLDFVLSDFAAHKFNFPQGDAGKTVLVESLVQRGYVEPYDDERVVHYFIPGTYSENEIASIFRNG
IGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRESVTRVTDTVN
DAIENTPQYGLQLVNGPDADSNNIISSENTESITDSLEESGADISNEIFETQVVTAIDTAETVNADEPEQ
VEEHDDRSQIHLVEQLHEMLLSAPLPHHAVINIDSVPYLDLDAAIALIPGIDEAAFCNGPFFQLTYRDGS
LDGMWIVRDVNNLRLIQLGDNCAGMQVSTSEPRNTSSLKSLFDTSMYQPLDIPEAPSVNEAASPPQTPLE
LPQPRLNAPVAEEASSVAEQTNAHSEPDSVIATEYEQYGHLLEETLDSDGEAYSDLIASDSTEAEYPATD
PQSSDFAQLPRETALSVAPGDLDYSEGAIKPPAPDATGKETILTSPEPAEDVRETVAAVEKASHLSPALA
RLFAVSTHAEKKHEKTQEPSPVKEVKNPTSSTTVKAPISIEPPGAEEKEAVEEFTLLNDGEVTELEYVEI
ATMLHQILTKLSGSFKRKRKNRFMVLTQNTFYLTQSCIEKYGTQLNAPELFNQLPQYQVTSGAVVNTKCI
AFNIPTLVAASDRAKVDIELIINKLKEVGNL
MNFRALFLSMQRVFGIFSRRENDVSELMMKDAANFSPFAQIIGEQKYTVPDHPNPEVLKFIEYPTRPAGI
QTFNEQSILSLYRDKLHSISMMLAISDGDIREDAYTFTNLVLKPLIEYIRWIHLLPASENHHHNGIGGLL
SHSLEVAMISLKNANHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLSETLTWAPS
SQSLLDWARENNVVEYEIHWRKRIHNQHNIWSSVFLERILDPVCMSFLDRVKKERVYAKMVTALNVYNDG
NDFLSKCVRTSDYYSTGTDLNVLRDPIMGLRSNDAAARAIGTIKHNFTSININNYKTKPMHIIIVNGEVY
LNENAFLDFVLSDFAAHKFNFPQGDAGKTVLVESLVQRGYVEPYDDERVVHYFIPGTYSENEIASIFRNG
IGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRESVTRVTDTVN
DAIENTPQYGLQLVNGPDADSNNIISSENTESITDSLEESGADISNEIFETQVVTAIDTAETVNADEPEQ
VEEHDDRSQIHLVEQLHEMLLSAPLPHHAVINIDSVPYLDLDAAIALIPGIDEAAFCNGPFFQLTYRDGS
LDGMWIVRDVNNLRLIQLGDNCAGMQVSTSEPRNTSSLKSLFDTSMYQPLDIPEAPSVNEAASPPQTPLE
LPQPRLNAPVAEEASSVAEQTNAHSEPDSVIATEYEQYGHLLEETLDSDGEAYSDLIASDSTEAEYPATD
PQSSDFAQLPRETALSVAPGDLDYSEGAIKPPAPDATGKETILTSPEPAEDVRETVAAVEKASHLSPALA
RLFAVSTHAEKKHEKTQEPSPVKEVKNPTSSTTVKAPISIEPPGAEEKEAVEEFTLLNDGEVTELEYVEI
ATMLHQILTKLSGSFKRKRKNRFMVLTQNTFYLTQSCIEKYGTQLNAPELFNQLPQYQVTSGAVVNTKCI
AFNIPTLVAASDRAKVDIELIINKLKEVGNL
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A0G3BA46 |
T4CP
ID | 2747 | GenBank | WP_000167418 |
Name | traD_SM218186_RS25305_pHCM1 | UniProt ID | _ |
Length | 694 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 694 a.a. Molecular weight: 77969.74 Da Isoelectric Point: 8.7275
>WP_000167418.1 MULTISPECIES: conjugative transfer system coupling protein TraD [Enterobacterales]
MTKSKRTNLHAQENFYRPILEYRSASVLLICAVIMLVMGFRSDGVNIAPIILYTAVFLLLVCLYRCKTAH
PYLMAHWRVFQRQIMFISLKSLRTINKSNFFSNERKYRQLVQEYKKNNRPVPDRKTYFCNGFEWGPEHAD
RAYQIANLSSDKREIALPFVLSPIARHFETMARSMGGNNAIFAVDRRAPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVIVVIDPKNDAEWRQSLMDEASELGLPFYKFHPAQPSSSVCIDVCNAYTNVSD
LTSRLLSLVSVPGEVNPFVQYAEALVSTVITGLSYTDKKPSIYLIHKNMKSHMSVVNLTIKVMECCFARH
YGPDVWMEKVKYASNDTLQVRFKRLTEWFNAHFLNYEGAEPIEWIDTVGRLVDYSMSDPEHMSKMTAGIM
PLFSRLTEQPLNELLSPSPNTLTSREIVTSDGMFSTGGVLYISLDGLSNPESARAISQLIMSDLTSCAGS
RYNADDGDMSSHSRISIFVDEAHSAINNSMINLLAQGRAAQIALFICTQTISDFIAAANAETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSPISTNMVTYTTGSETTLPHNNFSGSISERKQTTLEESIPKELLGQV
PKFHIVARLQDGRKVVGQIPIAVSEKAMKPNTTLLEMFLKPAGKVTLRQNVGLSYLNKYLRKLH
MTKSKRTNLHAQENFYRPILEYRSASVLLICAVIMLVMGFRSDGVNIAPIILYTAVFLLLVCLYRCKTAH
PYLMAHWRVFQRQIMFISLKSLRTINKSNFFSNERKYRQLVQEYKKNNRPVPDRKTYFCNGFEWGPEHAD
RAYQIANLSSDKREIALPFVLSPIARHFETMARSMGGNNAIFAVDRRAPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVIVVIDPKNDAEWRQSLMDEASELGLPFYKFHPAQPSSSVCIDVCNAYTNVSD
LTSRLLSLVSVPGEVNPFVQYAEALVSTVITGLSYTDKKPSIYLIHKNMKSHMSVVNLTIKVMECCFARH
YGPDVWMEKVKYASNDTLQVRFKRLTEWFNAHFLNYEGAEPIEWIDTVGRLVDYSMSDPEHMSKMTAGIM
PLFSRLTEQPLNELLSPSPNTLTSREIVTSDGMFSTGGVLYISLDGLSNPESARAISQLIMSDLTSCAGS
RYNADDGDMSSHSRISIFVDEAHSAINNSMINLLAQGRAAQIALFICTQTISDFIAAANAETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSPISTNMVTYTTGSETTLPHNNFSGSISERKQTTLEESIPKELLGQV
PKFHIVARLQDGRKVVGQIPIAVSEKAMKPNTTLLEMFLKPAGKVTLRQNVGLSYLNKYLRKLH
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 202083..211251
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
SM218186_RS25635 | 197532..197732 | - | 201 | WP_000357614 | hypothetical protein | - |
SM218186_RS25640 (SM218186_07271) | 197750..198826 | - | 1077 | WP_001015069 | hypothetical protein | - |
SM218186_RS25645 (SM218186_07269) | 199361..200236 | + | 876 | WP_001282731 | RepB family plasmid replication initiator protein | - |
SM218186_RS25875 | 200683..201027 | - | 345 | WP_153253765 | hypothetical protein | - |
SM218186_RS25655 (SM218186_07267) | 201678..202031 | + | 354 | WP_000424121 | hypothetical protein | - |
SM218186_RS25660 (SM218186_07273) | 202083..202400 | + | 318 | WP_000203871 | type IV conjugative transfer system protein TraL | traL |
SM218186_RS25665 (SM218186_07320) | 202412..203197 | + | 786 | WP_000771920 | TraE/TraK family type IV conjugative transfer system protein | traE |
SM218186_RS25670 (SM218186_07266) | 203200..204432 | + | 1233 | WP_000202110 | type-F conjugative transfer system secretin TraK | traK |
SM218186_RS25675 (SM218186_07274) | 204434..204874 | + | 441 | WP_001224093 | plasmid transfer protein HtdO | - |
SM218186_RS25680 (SM218186_07310) | 204864..206222 | + | 1359 | WP_000351840 | IncHI-type conjugal transfer protein TrhB | traB |
SM218186_RS25685 (SM218186_07261) | 206231..206743 | + | 513 | WP_000429933 | hypothetical protein | - |
SM218186_RS25690 (SM218186_07277) | 206849..207601 | + | 753 | WP_010892304 | protein-disulfide isomerase HtdT | - |
SM218186_RS25695 (SM218186_07249) | 207611..208561 | + | 951 | WP_001022266 | TraV family lipoprotein | traV |
SM218186_RS25700 (SM218186_07275) | 208570..211251 | + | 2682 | WP_001255540 | IncHI-type conjugal transfer ATPase TrhC | virb4 |
SM218186_RS25710 (SM218186_07323) | 212295..214292 | + | 1998 | WP_001541890 | choline BCCT transporter BetT | - |
SM218186_RS25715 (SM218186_07279) | 214355..215650 | + | 1296 | WP_000625666 | DUF2254 domain-containing protein | - |
Host bacterium
ID | 4339 | GenBank | NZ_CP029926 |
Plasmid name | pHCM1 | Incompatibility group | IncFIA |
Plasmid size | 217120 bp | Coordinate of oriT [Strand] | 141172..141456 [-] |
Host baterium | Salmonella enterica subsp. enterica serovar Typhi strain 311189_218186 |
Cargo genes
Drug resistance gene | - |
Virulence gene | - |
Metal resistance gene | merR, merT, merP, merA, merD, merE, corA |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |