Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 103194 |
Name | oriT_pHYEC7-IncHI2 |
Organism | Escherichia coli strain HYEC7 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_KX518743 (53850..54103 [+], 254 nt) |
oriT length | 254 nt |
IRs (inverted repeats) | 152..157, 159..164 (AAAAGT..ACTTTT) 121..129, 134..142 (TTAAGGCTT..AAGCCTTAA) 50..55, 59..64 (AATTTT..AAAATT) |
Location of nic site | 77..78 |
Conserved sequence flanking the nic site |
TTTGGTTAAA |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 254 nt
>oriT_pHYEC7-IncHI2
GGATTTAGGTTTTTTTTTAATCGCTTCACATTTCGTTAGCATGCGAAGAAATTTTGATAAAATTCTGGTCAGTTTGGTTAAAAAGTGTTACAAGTAAGCGGTGTGGTTGAAGGGATAGATTTAAGGCTTATTCAAGCCTTAAGAAAATACTAAAAGTTACTTTTCACCCTACCGAACACCTAACAAAAAATCCATGTTGAAGATTTGAACAATTGTAATGGCGCAAGGACAATCAGCACATGTCAGAATCTGAT
GGATTTAGGTTTTTTTTTAATCGCTTCACATTTCGTTAGCATGCGAAGAAATTTTGATAAAATTCTGGTCAGTTTGGTTAAAAAGTGTTACAAGTAAGCGGTGTGGTTGAAGGGATAGATTTAAGGCTTATTCAAGCCTTAAGAAAATACTAAAAGTTACTTTTCACCCTACCGAACACCTAACAAAAAATCCATGTTGAAGATTTGAACAATTGTAATGGCGCAAGGACAATCAGCACATGTCAGAATCTGAT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 2392 | GenBank | WP_001011151 |
Name | TraI_2_HTJ30_RS00290_pHYEC7-IncHI2 | UniProt ID | A0A1P8VU63 |
Length | 1048 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 1048 a.a. Molecular weight: 117773.04 Da Isoelectric Point: 4.6054
>WP_001011151.1 MULTISPECIES: MobH family relaxase [Enterobacterales]
MNFRALYLCIKRILGIFSSQENDATSVMIEDISSLSPFAQILGDQKYTVPDHPNPEVLKFIEYPTRPTGI
QTFNEQSILSLYREKLHSISMMLAISDSDIRDDAYTFTNLVLKPLVEYVRWIHLLPASENHHHNGIGGLL
SHSLEVAILSLKNAHHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLASPIIWTPS
SQSLLDWARENDVVEYEIHWRKRIHNQHNIWSSVFLERILNPVCLAFLDRVNKERVYSKMITALNVYTDG
NDFLSKCVRTADFYSTGTDLNVLRDPIMGLRSNDAAARAISTIKHNFTSININNYNAKPMHIIIVNGEVY
LNENAFLDFVLNDFELHKYNFPQGEAGKTVLVESLVQRGYVEPYDDERVVHYFIPGIYSENEISNIFRNG
IGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRDAITKITDTVE
TAVLKVNDLGRSTASIDVDIHSKKNEGSSDDFEKKAESDNEIDNDTQIVKSEGEEAADPVIPDIEESEDE
SAKDTESHVLVNQLHELLLSAPLSNDYIVCVDAVPYLNIDATMALLPGLDEKAFSEEPYFQLTFREGSLD
GMWIVRDIDDLRLVQLGDNCAGFQLTYHEPRRPTTLKSLFNTSMYQALVINDESSVENSAPRPKQTLELP
PPRVNAVEEHSGDVEYHGTDSASATGPLKTEAVEYEHYQHLFEEEDEEHEIIDYTDFTQLSVSRPEVGSC
ATSSSVHNEKLLPEPSELPELNREQNADPQGTIERSMDVSVGQENSEPDTEGNCPPPAEVVYSQTEAAAT
SVMASEEPALPPVLEESNGEHAPTDAKGHHLSPALARLFAPTAPVEKQNPKRNRNKSSDKAEVQKPASPV
SGHNLNSKVFATTESDQNGEFSLISEGDVTELEFVEIASVLHQILSKMEVAFKRKRKNRFMVSTPNTLYL
TQSCVEKFGSQLEAQDLFNKLPQYLVNSGAVINTKCHAFNMPTLLAASDRAKVDIERIINNLKEAGNL
MNFRALYLCIKRILGIFSSQENDATSVMIEDISSLSPFAQILGDQKYTVPDHPNPEVLKFIEYPTRPTGI
QTFNEQSILSLYREKLHSISMMLAISDSDIRDDAYTFTNLVLKPLVEYVRWIHLLPASENHHHNGIGGLL
SHSLEVAILSLKNAHHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLASPIIWTPS
SQSLLDWARENDVVEYEIHWRKRIHNQHNIWSSVFLERILNPVCLAFLDRVNKERVYSKMITALNVYTDG
NDFLSKCVRTADFYSTGTDLNVLRDPIMGLRSNDAAARAISTIKHNFTSININNYNAKPMHIIIVNGEVY
LNENAFLDFVLNDFELHKYNFPQGEAGKTVLVESLVQRGYVEPYDDERVVHYFIPGIYSENEISNIFRNG
IGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRDAITKITDTVE
TAVLKVNDLGRSTASIDVDIHSKKNEGSSDDFEKKAESDNEIDNDTQIVKSEGEEAADPVIPDIEESEDE
SAKDTESHVLVNQLHELLLSAPLSNDYIVCVDAVPYLNIDATMALLPGLDEKAFSEEPYFQLTFREGSLD
GMWIVRDIDDLRLVQLGDNCAGFQLTYHEPRRPTTLKSLFNTSMYQALVINDESSVENSAPRPKQTLELP
PPRVNAVEEHSGDVEYHGTDSASATGPLKTEAVEYEHYQHLFEEEDEEHEIIDYTDFTQLSVSRPEVGSC
ATSSSVHNEKLLPEPSELPELNREQNADPQGTIERSMDVSVGQENSEPDTEGNCPPPAEVVYSQTEAAAT
SVMASEEPALPPVLEESNGEHAPTDAKGHHLSPALARLFAPTAPVEKQNPKRNRNKSSDKAEVQKPASPV
SGHNLNSKVFATTESDQNGEFSLISEGDVTELEFVEIASVLHQILSKMEVAFKRKRKNRFMVSTPNTLYL
TQSCVEKFGSQLEAQDLFNKLPQYLVNSGAVINTKCHAFNMPTLLAASDRAKVDIERIINNLKEAGNL
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A1P8VU63 |
T4CP
ID | 2146 | GenBank | WP_046788497 |
Name | traD_HTJ30_RS00285_pHYEC7-IncHI2 | UniProt ID | _ |
Length | 694 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 694 a.a. Molecular weight: 78171.95 Da Isoelectric Point: 8.0177
>WP_046788497.1 MULTISPECIES: conjugative transfer system coupling protein TraD [Enterobacterales]
MSDSKRTNLHAQENFYRPILEYRSASILLICSVSMLYMGLSSDGLDIAPIVLFTSILLFLLCLYRCKTAA
PFLMAHWRVFKRHFMFVSLDSLRVINKSNFFSNERKYRQLVQDYQNKNKDIPERKSYFCDGFEWGPEHAD
RAYQIANLSSDKREIELPFVFNPIKRHFDAMARKMGGSNAIFAVERREPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVVVVIDPKNDAEWRESLMEEAKTLGLPFYKFHPGQPASSVCIDVCNTYTNVSD
LTSRLLSLVTVPGEVNPFVQYAKALVSNVISGLSYIEKKPSIYLIHKNMKSHMSIVNLTVKVMESCYARY
YGYDVWTEKVKYVANETLPVRFKRLAEWFTAHFMNYEGSEQIDWLDTVSQLIDYSMSDPEHMAKMTAGIM
PVFDMLIEKPLNELLSPNPNSVSSREIVTSEGMFSTGGVLYISLDGLSNPDTAAAISQLIMSDLTSCAGS
RYNAQDGDMSANSRISIFVDEAHSAINNPMINLLAQGRAAKIALFICTQTISDFIAAASVETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSAISTNMVTYTTGSETSLPHNNFSGSISERKQTTLEESIPKDLLGQV
PMFHIVARLQDGRKVVGQIPIAVAEKQMKPNTTLSEMLFKKAGKVTLRQNLDIKNLNKFLRKFH
MSDSKRTNLHAQENFYRPILEYRSASILLICSVSMLYMGLSSDGLDIAPIVLFTSILLFLLCLYRCKTAA
PFLMAHWRVFKRHFMFVSLDSLRVINKSNFFSNERKYRQLVQDYQNKNKDIPERKSYFCDGFEWGPEHAD
RAYQIANLSSDKREIELPFVFNPIKRHFDAMARKMGGSNAIFAVERREPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVVVVIDPKNDAEWRESLMEEAKTLGLPFYKFHPGQPASSVCIDVCNTYTNVSD
LTSRLLSLVTVPGEVNPFVQYAKALVSNVISGLSYIEKKPSIYLIHKNMKSHMSIVNLTVKVMESCYARY
YGYDVWTEKVKYVANETLPVRFKRLAEWFTAHFMNYEGSEQIDWLDTVSQLIDYSMSDPEHMAKMTAGIM
PVFDMLIEKPLNELLSPNPNSVSSREIVTSEGMFSTGGVLYISLDGLSNPDTAAAISQLIMSDLTSCAGS
RYNAQDGDMSANSRISIFVDEAHSAINNPMINLLAQGRAAKIALFICTQTISDFIAAASVETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSAISTNMVTYTTGSETSLPHNNFSGSISERKQTTLEESIPKDLLGQV
PMFHIVARLQDGRKVVGQIPIAVAEKQMKPNTTLSEMLFKKAGKVTLRQNLDIKNLNKFLRKFH
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 198380..207587
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
HTJ30_RS01065 | 198380..201061 | - | 2682 | WP_000387412 | TraC family protein | virb4 |
HTJ30_RS01070 | 201070..202020 | - | 951 | WP_001022587 | IncHI-type conjugal transfer lipoprotein TrhV | traV |
HTJ30_RS01075 | 202030..202782 | - | 753 | WP_183079335 | protein-disulfide isomerase HtdT | - |
HTJ30_RS01080 | 202901..203383 | - | 483 | WP_000377633 | hypothetical protein | - |
HTJ30_RS01085 | 203391..204746 | - | 1356 | WP_000351841 | IncHI-type conjugal transfer protein TrhB | traB |
HTJ30_RS01090 | 204736..205197 | - | 462 | WP_000521240 | plasmid transfer protein HtdO | - |
HTJ30_RS01095 | 205199..206470 | - | 1272 | WP_000592090 | type-F conjugative transfer system secretin TraK | traK |
HTJ30_RS01100 | 206470..207258 | - | 789 | WP_000783153 | TraE/TraK family type IV conjugative transfer system protein | traE |
HTJ30_RS01105 | 207270..207587 | - | 318 | WP_000043357 | type IV conjugative transfer system protein TraL | traL |
HTJ30_RS01110 | 207638..207991 | - | 354 | WP_000423602 | hypothetical protein | - |
HTJ30_RS01115 | 209140..210015 | - | 876 | WP_000594612 | RepB family plasmid replication initiator protein | - |
HTJ30_RS01120 | 210492..211547 | + | 1056 | WP_001065779 | hypothetical protein | - |
HTJ30_RS01125 | 211565..211765 | + | 201 | WP_000594931 | hypothetical protein | - |
HTJ30_RS01130 | 211762..212478 | + | 717 | WP_046788421 | zinc finger-like domain-containing protein | - |
Host bacterium
ID | 3637 | GenBank | NZ_KX518743 |
Plasmid name | pHYEC7-IncHI2 | Incompatibility group | - |
Plasmid size | 224736 bp | Coordinate of oriT [Strand] | 53850..54103 [+] |
Host baterium | Escherichia coli strain HYEC7 |
Cargo genes
Drug resistance gene | oqxB, oqxA, floR, sul2, aac(3)-IVa, fosA3, blaCTX-M-14 |
Virulence gene | - |
Metal resistance gene | terE, terD, terC, terB, terA, terZ, terW |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |