Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 102675 |
Name | oriT_pGX1-3 |
Organism | Escherichia coli strain GX1-3T |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_POVQ01000022 (17953..18206 [-], 254 nt) |
oriT length | 254 nt |
IRs (inverted repeats) | 152..157, 159..164 (AAAAGT..ACTTTT) 121..129, 134..142 (TTAAGGCTT..AAGCCTTAA) 50..55, 59..64 (AATTTT..AAAATT) |
Location of nic site | 77..78 |
Conserved sequence flanking the nic site |
TTTGGTTAAA |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 254 nt
>oriT_pGX1-3
GGATTTAGGTTTTTTTTTAATCGCTTCACATTTCGTTAGCATGCGAAGAAATTTTGATAAAATTCTGGTCAGTTTGGTTAAAAAGTGTTACAAGTAAGCGGTGTGGTTGAAGGGATAGATTTAAGGCTTATTCAAGCCTTAAGAAAATACTAAAAGTTACTTTTCACCCTACCGAACACCTAACAAAAAATCCATGTTGAAGATTTGAACAATTGTAATGGCGCAAGGACAATCAGCACATGTCAGAATCTGAT
GGATTTAGGTTTTTTTTTAATCGCTTCACATTTCGTTAGCATGCGAAGAAATTTTGATAAAATTCTGGTCAGTTTGGTTAAAAAGTGTTACAAGTAAGCGGTGTGGTTGAAGGGATAGATTTAAGGCTTATTCAAGCCTTAAGAAAATACTAAAAGTTACTTTTCACCCTACCGAACACCTAACAAAAAATCCATGTTGAAGATTTGAACAATTGTAATGGCGCAAGGACAATCAGCACATGTCAGAATCTGAT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 1976 | GenBank | WP_000980119 |
Name | mobH_C1H52_RS06670_pGX1-3 | UniProt ID | A0A0H3VVI8 |
Length | 1050 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 1050 a.a. Molecular weight: 118035.42 Da Isoelectric Point: 4.6054
>WP_000980119.1 MULTISPECIES: MobH family relaxase [Enterobacteriaceae]
MMMNFRALYLCIKRILGIFSSQENDATSVMIEDISSLSPFAQILGDQKYTVPDHPNPEVLKFIEYPTRPT
GIQTFNEQSILSLYREKLHSISMMLAISDSDIRDDAYTFTNLVLKPLVEYVRWIHLLPASENHHHNGIGG
LLSHSLEVAILSLKNAHHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLASPIIWT
PSSQSLLDWARENDVVEYEIHWRKRIHNQHNIWSSVFLERILNPVCLAFLDRVNKERVYSKMITALNVYT
DGNDFLSKCVRTADFYSTGTDLNVLRDPIMGLRSNDAAARAISTIKHNFTSININNYNAKPMHIIIVNGE
VYLNENAFLDFVLNDFELHKYNFPQGEAGKTVLVESLVQRGYVEPYDDERVVHYFIPGIYSENEISNIFR
NGIGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRDAITKITDT
VETAVLKVNDLGRSTASIDVDIHSKKNEGSSDDFEKKAESDNEIDNDTQIVKSEGEEAADPVIPDIEESE
DESAKDTESHVLVNQLHELLLSAPLSNDYIVCVDAVPYLNIDATMALLPGLDEKAFSEEPYFQLTFREGS
LDGMWIVRDIDDLRLVQLGDNCAGFQLTYHEPRRPTTLKSLFNTSMYQALVINDESSVENSAPRPKQTLE
LPPPRVNAVEEHSGDVEYHGTDSASATGPLKTEAVEYEHYQHLFEEEDEEHEIIDYTDFTQLSVSRPEVG
SCATSSSVHNEKLLPEPSELPELNREQNADPQGTIERSMDVSVGQENSEPDTEGNCPPPAEVVYSQTEAA
ATSVMASEEPALPPVLEESNGEHAPTDAKGHHLSPALARLFAPTAPVEKQNPKRNRNKSSDKAEVQKPAS
PVSGHNLNSKVFATTESDQNGEFSLISEGDVTELEFVEIASVLHQILSKMEVAFKRKRKNRFMVSTPNTL
YLTQSCVEKFGSQLEAQDLFNKLPQYLVNSGAVINTKCHAFNMPTLLAASDRAKVDIERIINNLKEAGNL
MMMNFRALYLCIKRILGIFSSQENDATSVMIEDISSLSPFAQILGDQKYTVPDHPNPEVLKFIEYPTRPT
GIQTFNEQSILSLYREKLHSISMMLAISDSDIRDDAYTFTNLVLKPLVEYVRWIHLLPASENHHHNGIGG
LLSHSLEVAILSLKNAHHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLASPIIWT
PSSQSLLDWARENDVVEYEIHWRKRIHNQHNIWSSVFLERILNPVCLAFLDRVNKERVYSKMITALNVYT
DGNDFLSKCVRTADFYSTGTDLNVLRDPIMGLRSNDAAARAISTIKHNFTSININNYNAKPMHIIIVNGE
VYLNENAFLDFVLNDFELHKYNFPQGEAGKTVLVESLVQRGYVEPYDDERVVHYFIPGIYSENEISNIFR
NGIGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRDAITKITDT
VETAVLKVNDLGRSTASIDVDIHSKKNEGSSDDFEKKAESDNEIDNDTQIVKSEGEEAADPVIPDIEESE
DESAKDTESHVLVNQLHELLLSAPLSNDYIVCVDAVPYLNIDATMALLPGLDEKAFSEEPYFQLTFREGS
LDGMWIVRDIDDLRLVQLGDNCAGFQLTYHEPRRPTTLKSLFNTSMYQALVINDESSVENSAPRPKQTLE
LPPPRVNAVEEHSGDVEYHGTDSASATGPLKTEAVEYEHYQHLFEEEDEEHEIIDYTDFTQLSVSRPEVG
SCATSSSVHNEKLLPEPSELPELNREQNADPQGTIERSMDVSVGQENSEPDTEGNCPPPAEVVYSQTEAA
ATSVMASEEPALPPVLEESNGEHAPTDAKGHHLSPALARLFAPTAPVEKQNPKRNRNKSSDKAEVQKPAS
PVSGHNLNSKVFATTESDQNGEFSLISEGDVTELEFVEIASVLHQILSKMEVAFKRKRKNRFMVSTPNTL
YLTQSCVEKFGSQLEAQDLFNKLPQYLVNSGAVINTKCHAFNMPTLLAASDRAKVDIERIINNLKEAGNL
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A0H3VVI8 |
T4CP
ID | 1614 | GenBank | WP_046788497 |
Name | traD_C1H52_RS06675_pGX1-3 | UniProt ID | _ |
Length | 694 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 694 a.a. Molecular weight: 78171.95 Da Isoelectric Point: 8.0177
>WP_046788497.1 MULTISPECIES: conjugative transfer system coupling protein TraD [Enterobacterales]
MSDSKRTNLHAQENFYRPILEYRSASILLICSVSMLYMGLSSDGLDIAPIVLFTSILLFLLCLYRCKTAA
PFLMAHWRVFKRHFMFVSLDSLRVINKSNFFSNERKYRQLVQDYQNKNKDIPERKSYFCDGFEWGPEHAD
RAYQIANLSSDKREIELPFVFNPIKRHFDAMARKMGGSNAIFAVERREPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVVVVIDPKNDAEWRESLMEEAKTLGLPFYKFHPGQPASSVCIDVCNTYTNVSD
LTSRLLSLVTVPGEVNPFVQYAKALVSNVISGLSYIEKKPSIYLIHKNMKSHMSIVNLTVKVMESCYARY
YGYDVWTEKVKYVANETLPVRFKRLAEWFTAHFMNYEGSEQIDWLDTVSQLIDYSMSDPEHMAKMTAGIM
PVFDMLIEKPLNELLSPNPNSVSSREIVTSEGMFSTGGVLYISLDGLSNPDTAAAISQLIMSDLTSCAGS
RYNAQDGDMSANSRISIFVDEAHSAINNPMINLLAQGRAAKIALFICTQTISDFIAAASVETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSAISTNMVTYTTGSETSLPHNNFSGSISERKQTTLEESIPKDLLGQV
PMFHIVARLQDGRKVVGQIPIAVAEKQMKPNTTLSEMLFKKAGKVTLRQNLDIKNLNKFLRKFH
MSDSKRTNLHAQENFYRPILEYRSASILLICSVSMLYMGLSSDGLDIAPIVLFTSILLFLLCLYRCKTAA
PFLMAHWRVFKRHFMFVSLDSLRVINKSNFFSNERKYRQLVQDYQNKNKDIPERKSYFCDGFEWGPEHAD
RAYQIANLSSDKREIELPFVFNPIKRHFDAMARKMGGSNAIFAVERREPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVVVVIDPKNDAEWRESLMEEAKTLGLPFYKFHPGQPASSVCIDVCNTYTNVSD
LTSRLLSLVTVPGEVNPFVQYAKALVSNVISGLSYIEKKPSIYLIHKNMKSHMSIVNLTVKVMESCYARY
YGYDVWTEKVKYVANETLPVRFKRLAEWFTAHFMNYEGSEQIDWLDTVSQLIDYSMSDPEHMAKMTAGIM
PVFDMLIEKPLNELLSPNPNSVSSREIVTSEGMFSTGGVLYISLDGLSNPDTAAAISQLIMSDLTSCAGS
RYNAQDGDMSANSRISIFVDEAHSAINNPMINLLAQGRAAKIALFICTQTISDFIAAASVETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSAISTNMVTYTTGSETSLPHNNFSGSISERKQTTLEESIPKDLLGQV
PMFHIVARLQDGRKVVGQIPIAVAEKQMKPNTTLSEMLFKKAGKVTLRQNLDIKNLNKFLRKFH
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
Host bacterium
ID | 3119 | GenBank | NZ_POVQ01000022 |
Plasmid name | pGX1-3 | Incompatibility group | - |
Plasmid size | 38877 bp | Coordinate of oriT [Strand] | 17953..18206 [-] |
Host baterium | Escherichia coli strain GX1-3T |
Cargo genes
Drug resistance gene | - |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |