Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 103242 |
Name | oriT_pEC_G4W1 |
Organism | Escherichia coli strain G4W1 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_ON960343 (68478..68731 [+], 254 nt) |
oriT length | 254 nt |
IRs (inverted repeats) | 152..157, 159..164 (AAAAGT..ACTTTT) 121..129, 134..142 (TTAAGGCTT..AAGCCTTAA) 50..55, 59..64 (AATTTT..AAAATT) |
Location of nic site | 77..78 |
Conserved sequence flanking the nic site |
TTTGGTTAAA |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 254 nt
>oriT_pEC_G4W1
GGATTTAGGTTTTTTTTTAATCGCTTCACATTTCGTTAGCATGCGAAGAAATTTTGATAAAATTCTGGTCAGTTTGGTTAAAAAGTGTTACAAGTAAGCGGTGTGGTTGAAGGGATAGATTTAAGGCTTATTCAAGCCTTAAGAAAATACTAAAAGTTACTTTTCACCCTACCGAACACCTAACAAAAAATCCATGTTGAAGATTTGAACAATTGTAATGGCGCAAGGACAATCAGCACATGTCAGAATCTGAT
GGATTTAGGTTTTTTTTTAATCGCTTCACATTTCGTTAGCATGCGAAGAAATTTTGATAAAATTCTGGTCAGTTTGGTTAAAAAGTGTTACAAGTAAGCGGTGTGGTTGAAGGGATAGATTTAAGGCTTATTCAAGCCTTAAGAAAATACTAAAAGTTACTTTTCACCCTACCGAACACCTAACAAAAAATCCATGTTGAAGATTTGAACAATTGTAATGGCGCAAGGACAATCAGCACATGTCAGAATCTGAT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 2424 | GenBank | WP_000980119 |
Name | TraI_2_PRZ95_RS00390_pEC_G4W1 | UniProt ID | A0A0H3VVI8 |
Length | 1050 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 1050 a.a. Molecular weight: 118035.42 Da Isoelectric Point: 4.6054
>WP_000980119.1 MULTISPECIES: MobH family relaxase [Enterobacteriaceae]
MMMNFRALYLCIKRILGIFSSQENDATSVMIEDISSLSPFAQILGDQKYTVPDHPNPEVLKFIEYPTRPT
GIQTFNEQSILSLYREKLHSISMMLAISDSDIRDDAYTFTNLVLKPLVEYVRWIHLLPASENHHHNGIGG
LLSHSLEVAILSLKNAHHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLASPIIWT
PSSQSLLDWARENDVVEYEIHWRKRIHNQHNIWSSVFLERILNPVCLAFLDRVNKERVYSKMITALNVYT
DGNDFLSKCVRTADFYSTGTDLNVLRDPIMGLRSNDAAARAISTIKHNFTSININNYNAKPMHIIIVNGE
VYLNENAFLDFVLNDFELHKYNFPQGEAGKTVLVESLVQRGYVEPYDDERVVHYFIPGIYSENEISNIFR
NGIGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRDAITKITDT
VETAVLKVNDLGRSTASIDVDIHSKKNEGSSDDFEKKAESDNEIDNDTQIVKSEGEEAADPVIPDIEESE
DESAKDTESHVLVNQLHELLLSAPLSNDYIVCVDAVPYLNIDATMALLPGLDEKAFSEEPYFQLTFREGS
LDGMWIVRDIDDLRLVQLGDNCAGFQLTYHEPRRPTTLKSLFNTSMYQALVINDESSVENSAPRPKQTLE
LPPPRVNAVEEHSGDVEYHGTDSASATGPLKTEAVEYEHYQHLFEEEDEEHEIIDYTDFTQLSVSRPEVG
SCATSSSVHNEKLLPEPSELPELNREQNADPQGTIERSMDVSVGQENSEPDTEGNCPPPAEVVYSQTEAA
ATSVMASEEPALPPVLEESNGEHAPTDAKGHHLSPALARLFAPTAPVEKQNPKRNRNKSSDKAEVQKPAS
PVSGHNLNSKVFATTESDQNGEFSLISEGDVTELEFVEIASVLHQILSKMEVAFKRKRKNRFMVSTPNTL
YLTQSCVEKFGSQLEAQDLFNKLPQYLVNSGAVINTKCHAFNMPTLLAASDRAKVDIERIINNLKEAGNL
MMMNFRALYLCIKRILGIFSSQENDATSVMIEDISSLSPFAQILGDQKYTVPDHPNPEVLKFIEYPTRPT
GIQTFNEQSILSLYREKLHSISMMLAISDSDIRDDAYTFTNLVLKPLVEYVRWIHLLPASENHHHNGIGG
LLSHSLEVAILSLKNAHHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLASPIIWT
PSSQSLLDWARENDVVEYEIHWRKRIHNQHNIWSSVFLERILNPVCLAFLDRVNKERVYSKMITALNVYT
DGNDFLSKCVRTADFYSTGTDLNVLRDPIMGLRSNDAAARAISTIKHNFTSININNYNAKPMHIIIVNGE
VYLNENAFLDFVLNDFELHKYNFPQGEAGKTVLVESLVQRGYVEPYDDERVVHYFIPGIYSENEISNIFR
NGIGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRDAITKITDT
VETAVLKVNDLGRSTASIDVDIHSKKNEGSSDDFEKKAESDNEIDNDTQIVKSEGEEAADPVIPDIEESE
DESAKDTESHVLVNQLHELLLSAPLSNDYIVCVDAVPYLNIDATMALLPGLDEKAFSEEPYFQLTFREGS
LDGMWIVRDIDDLRLVQLGDNCAGFQLTYHEPRRPTTLKSLFNTSMYQALVINDESSVENSAPRPKQTLE
LPPPRVNAVEEHSGDVEYHGTDSASATGPLKTEAVEYEHYQHLFEEEDEEHEIIDYTDFTQLSVSRPEVG
SCATSSSVHNEKLLPEPSELPELNREQNADPQGTIERSMDVSVGQENSEPDTEGNCPPPAEVVYSQTEAA
ATSVMASEEPALPPVLEESNGEHAPTDAKGHHLSPALARLFAPTAPVEKQNPKRNRNKSSDKAEVQKPAS
PVSGHNLNSKVFATTESDQNGEFSLISEGDVTELEFVEIASVLHQILSKMEVAFKRKRKNRFMVSTPNTL
YLTQSCVEKFGSQLEAQDLFNKLPQYLVNSGAVINTKCHAFNMPTLLAASDRAKVDIERIINNLKEAGNL
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A0H3VVI8 |
T4CP
ID | 2186 | GenBank | WP_001284076 |
Name | traD_PRZ95_RS00385_pEC_G4W1 | UniProt ID | _ |
Length | 694 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 694 a.a. Molecular weight: 78137.93 Da Isoelectric Point: 8.0177
>WP_001284076.1 MULTISPECIES: conjugative transfer system coupling protein TraD [Enterobacterales]
MSDSKRTNLHAQENFYRPILEYRSASILLICSVSMLYMGLSSDGLDIAPIVLFTSILLFLLCLYRCKTAA
PFLMAHWRVFKRHFMFVSLDSLRVINKSNFFSNERKYRQLVQDYQNKNKDIPERKSYFCDGFEWGPEHAD
RAYQIANLSSDKREIELPFVFNPIKRHFDAMARKMGGSNAIFAVERREPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVVVVIDPKNDAEWRESLMEEAKTLGLPFYKFHPGQPASSVCIDVCNTYTNVSD
LTSRLLSLVTVPGEVNPFVQYAKALVSNVISGLSYIEKKPSIYLIHKNMKSHMSIVNLTVKVMESCYARY
YGYDVWTEKVKYVANETLPVRFKRLAEWFTAHFMNYEGSEQIDWLDTVSQLIDYSMSDPEHMAKMTAGIM
PVFDMLIEKPLNELLSPNPNSVSSREIVTSEGMFSTGGVLYISLDGLSNPDTAAAISQLIMSDLTSCAGS
RYNAQDGDMSANSRISIFVDEAHSAINNPMINLLAQGRAAKIALFICTQTISDFIAAASVETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSAISTNMVTYTTGSETSLPHNNFSGSISERKQTTLEESIPKDLLGQV
PMFHIVARLQDGRKVVGQIPIAVAEKQMKPNTTLSEMLFKKAGKVTLRQNLDIKNLNKFLRKLH
MSDSKRTNLHAQENFYRPILEYRSASILLICSVSMLYMGLSSDGLDIAPIVLFTSILLFLLCLYRCKTAA
PFLMAHWRVFKRHFMFVSLDSLRVINKSNFFSNERKYRQLVQDYQNKNKDIPERKSYFCDGFEWGPEHAD
RAYQIANLSSDKREIELPFVFNPIKRHFDAMARKMGGSNAIFAVERREPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVVVVIDPKNDAEWRESLMEEAKTLGLPFYKFHPGQPASSVCIDVCNTYTNVSD
LTSRLLSLVTVPGEVNPFVQYAKALVSNVISGLSYIEKKPSIYLIHKNMKSHMSIVNLTVKVMESCYARY
YGYDVWTEKVKYVANETLPVRFKRLAEWFTAHFMNYEGSEQIDWLDTVSQLIDYSMSDPEHMAKMTAGIM
PVFDMLIEKPLNELLSPNPNSVSSREIVTSEGMFSTGGVLYISLDGLSNPDTAAAISQLIMSDLTSCAGS
RYNAQDGDMSANSRISIFVDEAHSAINNPMINLLAQGRAAKIALFICTQTISDFIAAASVETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSAISTNMVTYTTGSETSLPHNNFSGSISERKQTTLEESIPKDLLGQV
PMFHIVARLQDGRKVVGQIPIAVAEKQMKPNTTLSEMLFKKAGKVTLRQNLDIKNLNKFLRKLH
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 214882..224089
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
PRZ95_RS01200 (CPHELBEB_02884) | 214882..217563 | - | 2682 | WP_000387412 | TraC family protein | virb4 |
PRZ95_RS01205 (CPHELBEB_02885) | 217572..218522 | - | 951 | WP_001022587 | IncHI-type conjugal transfer lipoprotein TrhV | traV |
PRZ95_RS01210 (CPHELBEB_02886) | 218532..219284 | - | 753 | WP_183079335 | protein-disulfide isomerase HtdT | - |
PRZ95_RS01215 (CPHELBEB_02887) | 219403..219885 | - | 483 | WP_000377633 | hypothetical protein | - |
PRZ95_RS01220 (CPHELBEB_02888) | 219893..221248 | - | 1356 | WP_000351841 | IncHI-type conjugal transfer protein TrhB | traB |
PRZ95_RS01225 (CPHELBEB_02889) | 221238..221699 | - | 462 | WP_000521240 | plasmid transfer protein HtdO | - |
PRZ95_RS01230 (CPHELBEB_02890) | 221701..222972 | - | 1272 | WP_000592090 | type-F conjugative transfer system secretin TraK | traK |
PRZ95_RS01235 (CPHELBEB_02891) | 222972..223760 | - | 789 | WP_000783153 | TraE/TraK family type IV conjugative transfer system protein | traE |
PRZ95_RS01240 (CPHELBEB_02892) | 223772..224089 | - | 318 | WP_000043357 | type IV conjugative transfer system protein TraL | traL |
PRZ95_RS01245 (CPHELBEB_02893) | 224288..225400 | - | 1113 | WP_001300563 | IS4-like element IS421 family transposase | - |
PRZ95_RS01250 (CPHELBEB_02894) | 225489..225842 | - | 354 | WP_000423602 | hypothetical protein | - |
PRZ95_RS01255 (CPHELBEB_02897) | 226991..227866 | - | 876 | WP_000594612 | RepB family plasmid replication initiator protein | - |
Host bacterium
ID | 3685 | GenBank | NZ_ON960343 |
Plasmid name | pEC_G4W1 | Incompatibility group | - |
Plasmid size | 241276 bp | Coordinate of oriT [Strand] | 68478..68731 [+] |
Host baterium | Escherichia coli strain G4W1 |
Cargo genes
Drug resistance gene | fosA3, blaCTX-M-15, aadA5, floR, aph(3'')-Ib, aph(6)-Id, aph(3')-Ia |
Virulence gene | - |
Metal resistance gene | terE, terD, terC, terB, terA, terZ, terW |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |