Detailed information of oriT
oriT
The information of the oriT region
| oriTDB ID | 102661 |
| Name | oriT_pC20 |
| Organism | Escherichia coli strain c20 |
| Sequence Completeness | - |
| NCBI accession of oriT (coordinates [strand]) | NZ_NGBR01000046 (20889..21143 [+], 255 nt) |
| oriT length | 255 nt |
| IRs (inverted repeats) | 153..158, 160..165 (AAAAGT..ACTTTT) |
| Location of nic site | _ |
| Conserved sequence flanking the nic site |
_ |
| Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 255 nt
>oriT_pC20
GGATTTAGGTTTTTTTTTAATCGCTTCACAGTTCGTTAGCAAGCTCAGTTTTTTTTGATAAAATTCTGGTCAGTTTGTTTAAAAAGTGTTACAAGTAAGGCGAATGGTTGAATGGTTAGTTTTAAGACTTATTCAAGCCTTAAGAAAATACTAAAAGTTACTTTTCACCCTACCGAACACCTAACAAAAAATCCATGTTGAAGATTTGAACAATTGTAATGGCGCAAGGACAATCAGCACATGTCAGAATCTGAT
GGATTTAGGTTTTTTTTTAATCGCTTCACAGTTCGTTAGCAAGCTCAGTTTTTTTTGATAAAATTCTGGTCAGTTTGTTTAAAAAGTGTTACAAGTAAGGCGAATGGTTGAATGGTTAGTTTTAAGACTTATTCAAGCCTTAAGAAAATACTAAAAGTTACTTTTCACCCTACCGAACACCTAACAAAAAATCCATGTTGAAGATTTGAACAATTGTAATGGCGCAAGGACAATCAGCACATGTCAGAATCTGAT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure file
Relaxase
| ID | 1969 | GenBank | WP_249537831 |
| Name | TraI_2_B6N66_RS24340_pC20 |
UniProt ID | _ |
| Length | 910 a.a. | PDB ID | |
| Note | Predicted by oriTfinder 2.0 | ||
Relaxase protein sequence
Download Length: 910 a.a. Molecular weight: 102054.08 Da Isoelectric Point: 4.5177
>WP_249537831.1 TraI domain-containing protein [Escherichia coli]
MLSHSLEVAMISLKNANHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLASPIIWT
PSSQSLLDWARENDVVEYEIHWRKRIHNQHNIWSSVFLERILNPVCLAFLDRVNKERVYSKMITALNVYT
DGNDFLSKCVRTADFYSTGTDLNVLRDPIMGLRSNDAAARAISTIKHNFTSININNYNAKPMHIIIVNGE
VYLNENAFLDFVLNDFELHKYNFPQGEAGKTVLVESLVQRGYVEPYDDERVVHYFIPGIYSENEISNIFR
NGIGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRDAITKITDT
VETAVLKVNDLGRSTASIDVDIHSKKNEGSSDDFEKKAESDNEIDNDTQIVKSEGEEAADPVIPDIEESE
DESAKDTESHVLVNQLHELLLSAPLSNDYIVCVDAVPYLNIDATMALLPGLDEKAFSEEPYFQLTFREGS
LDGMWIVRDIDDLRLVQLGDNCAGFQLTYHEPRRPTTLKSLFNTSMYQALVINDESSVENSAPRPKQTLE
LPPPRVNAVEEHSGDVEYHGTDSASATGPLKTEAVEYEHYQHLFEEEDEEHEIIDYTDFTQLSVSRPEVG
SCATSSSVHNEKLLPEPSELPELNREQNADPQGTIERSMDVSVGQENSEPDTEGNCPPPAEVVYSQTEAA
ATSVMASEEPALPPVLEESNGEHAPTDAKGHHLSPALARLFAPTAPVEKQNPKRNRNKSSDKAEVQKPAS
PVSGHNLNSKVFATTESDQNGEFSLISEGDVTELEFVEIASVLHQILSKMEVAFKRKRKNRFMVSTPNTL
YLTQSCVEKFGSQLEAQDLFNKLPQYLVNSGAVINTKCHAFNMPTLLAASDRAKVDIERIINNLKEAGNL
MLSHSLEVAMISLKNANHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLASPIIWT
PSSQSLLDWARENDVVEYEIHWRKRIHNQHNIWSSVFLERILNPVCLAFLDRVNKERVYSKMITALNVYT
DGNDFLSKCVRTADFYSTGTDLNVLRDPIMGLRSNDAAARAISTIKHNFTSININNYNAKPMHIIIVNGE
VYLNENAFLDFVLNDFELHKYNFPQGEAGKTVLVESLVQRGYVEPYDDERVVHYFIPGIYSENEISNIFR
NGIGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRDAITKITDT
VETAVLKVNDLGRSTASIDVDIHSKKNEGSSDDFEKKAESDNEIDNDTQIVKSEGEEAADPVIPDIEESE
DESAKDTESHVLVNQLHELLLSAPLSNDYIVCVDAVPYLNIDATMALLPGLDEKAFSEEPYFQLTFREGS
LDGMWIVRDIDDLRLVQLGDNCAGFQLTYHEPRRPTTLKSLFNTSMYQALVINDESSVENSAPRPKQTLE
LPPPRVNAVEEHSGDVEYHGTDSASATGPLKTEAVEYEHYQHLFEEEDEEHEIIDYTDFTQLSVSRPEVG
SCATSSSVHNEKLLPEPSELPELNREQNADPQGTIERSMDVSVGQENSEPDTEGNCPPPAEVVYSQTEAA
ATSVMASEEPALPPVLEESNGEHAPTDAKGHHLSPALARLFAPTAPVEKQNPKRNRNKSSDKAEVQKPAS
PVSGHNLNSKVFATTESDQNGEFSLISEGDVTELEFVEIASVLHQILSKMEVAFKRKRKNRFMVSTPNTL
YLTQSCVEKFGSQLEAQDLFNKLPQYLVNSGAVINTKCHAFNMPTLLAASDRAKVDIERIINNLKEAGNL
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4CP
| ID | 1607 | GenBank | WP_046788497 |
| Name | traD_B6N66_RS24335_pC20 |
UniProt ID | _ |
| Length | 694 a.a. | PDB ID | _ |
| Note | Predicted by oriTfinder 2.0 | ||
T4CP protein sequence
Download Length: 694 a.a. Molecular weight: 78171.95 Da Isoelectric Point: 8.0177
>WP_046788497.1 MULTISPECIES: conjugative transfer system coupling protein TraD [Enterobacterales]
MSDSKRTNLHAQENFYRPILEYRSASILLICSVSMLYMGLSSDGLDIAPIVLFTSILLFLLCLYRCKTAA
PFLMAHWRVFKRHFMFVSLDSLRVINKSNFFSNERKYRQLVQDYQNKNKDIPERKSYFCDGFEWGPEHAD
RAYQIANLSSDKREIELPFVFNPIKRHFDAMARKMGGSNAIFAVERREPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVVVVIDPKNDAEWRESLMEEAKTLGLPFYKFHPGQPASSVCIDVCNTYTNVSD
LTSRLLSLVTVPGEVNPFVQYAKALVSNVISGLSYIEKKPSIYLIHKNMKSHMSIVNLTVKVMESCYARY
YGYDVWTEKVKYVANETLPVRFKRLAEWFTAHFMNYEGSEQIDWLDTVSQLIDYSMSDPEHMAKMTAGIM
PVFDMLIEKPLNELLSPNPNSVSSREIVTSEGMFSTGGVLYISLDGLSNPDTAAAISQLIMSDLTSCAGS
RYNAQDGDMSANSRISIFVDEAHSAINNPMINLLAQGRAAKIALFICTQTISDFIAAASVETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSAISTNMVTYTTGSETSLPHNNFSGSISERKQTTLEESIPKDLLGQV
PMFHIVARLQDGRKVVGQIPIAVAEKQMKPNTTLSEMLFKKAGKVTLRQNLDIKNLNKFLRKFH
MSDSKRTNLHAQENFYRPILEYRSASILLICSVSMLYMGLSSDGLDIAPIVLFTSILLFLLCLYRCKTAA
PFLMAHWRVFKRHFMFVSLDSLRVINKSNFFSNERKYRQLVQDYQNKNKDIPERKSYFCDGFEWGPEHAD
RAYQIANLSSDKREIELPFVFNPIKRHFDAMARKMGGSNAIFAVERREPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVVVVIDPKNDAEWRESLMEEAKTLGLPFYKFHPGQPASSVCIDVCNTYTNVSD
LTSRLLSLVTVPGEVNPFVQYAKALVSNVISGLSYIEKKPSIYLIHKNMKSHMSIVNLTVKVMESCYARY
YGYDVWTEKVKYVANETLPVRFKRLAEWFTAHFMNYEGSEQIDWLDTVSQLIDYSMSDPEHMAKMTAGIM
PVFDMLIEKPLNELLSPNPNSVSSREIVTSEGMFSTGGVLYISLDGLSNPDTAAAISQLIMSDLTSCAGS
RYNAQDGDMSANSRISIFVDEAHSAINNPMINLLAQGRAAKIALFICTQTISDFIAAASVETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSAISTNMVTYTTGSETSLPHNNFSGSISERKQTTLEESIPKDLLGQV
PMFHIVARLQDGRKVVGQIPIAVAEKQMKPNTTLSEMLFKKAGKVTLRQNLDIKNLNKFLRKFH
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
Host bacterium
| ID | 3105 | GenBank | NZ_NGBR01000046 |
| Plasmid name | pC20 | Incompatibility group | - |
| Plasmid size | 76326 bp | Coordinate of oriT [Strand] | 20889..21143 [+] |
| Host baterium | Escherichia coli strain c20 |
Cargo genes
| Drug resistance gene | sul1, dfrA12, aph(3')-Ia, sul3, ant(3'')-Ia, cmlA1, aadA2, sul2, aph(4)-Ia, aac(3)-IVa |
| Virulence gene | - |
| Metal resistance gene | - |
| Degradation gene | - |
| Symbiosis gene | - |
| Anti-CRISPR | - |