Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 102661 |
Name | oriT_pC20 |
Organism | Escherichia coli strain c20 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_NGBR01000046 (20889..21143 [+], 255 nt) |
oriT length | 255 nt |
IRs (inverted repeats) | 153..158, 160..165 (AAAAGT..ACTTTT) |
Location of nic site | _ |
Conserved sequence flanking the nic site |
_ |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 255 nt
>oriT_pC20
GGATTTAGGTTTTTTTTTAATCGCTTCACAGTTCGTTAGCAAGCTCAGTTTTTTTTGATAAAATTCTGGTCAGTTTGTTTAAAAAGTGTTACAAGTAAGGCGAATGGTTGAATGGTTAGTTTTAAGACTTATTCAAGCCTTAAGAAAATACTAAAAGTTACTTTTCACCCTACCGAACACCTAACAAAAAATCCATGTTGAAGATTTGAACAATTGTAATGGCGCAAGGACAATCAGCACATGTCAGAATCTGAT
GGATTTAGGTTTTTTTTTAATCGCTTCACAGTTCGTTAGCAAGCTCAGTTTTTTTTGATAAAATTCTGGTCAGTTTGTTTAAAAAGTGTTACAAGTAAGGCGAATGGTTGAATGGTTAGTTTTAAGACTTATTCAAGCCTTAAGAAAATACTAAAAGTTACTTTTCACCCTACCGAACACCTAACAAAAAATCCATGTTGAAGATTTGAACAATTGTAATGGCGCAAGGACAATCAGCACATGTCAGAATCTGAT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 1969 | GenBank | WP_249537831 |
Name | TraI_2_B6N66_RS24340_pC20 | UniProt ID | _ |
Length | 910 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 910 a.a. Molecular weight: 102054.08 Da Isoelectric Point: 4.5177
>WP_249537831.1 TraI domain-containing protein [Escherichia coli]
MLSHSLEVAMISLKNANHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLASPIIWT
PSSQSLLDWARENDVVEYEIHWRKRIHNQHNIWSSVFLERILNPVCLAFLDRVNKERVYSKMITALNVYT
DGNDFLSKCVRTADFYSTGTDLNVLRDPIMGLRSNDAAARAISTIKHNFTSININNYNAKPMHIIIVNGE
VYLNENAFLDFVLNDFELHKYNFPQGEAGKTVLVESLVQRGYVEPYDDERVVHYFIPGIYSENEISNIFR
NGIGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRDAITKITDT
VETAVLKVNDLGRSTASIDVDIHSKKNEGSSDDFEKKAESDNEIDNDTQIVKSEGEEAADPVIPDIEESE
DESAKDTESHVLVNQLHELLLSAPLSNDYIVCVDAVPYLNIDATMALLPGLDEKAFSEEPYFQLTFREGS
LDGMWIVRDIDDLRLVQLGDNCAGFQLTYHEPRRPTTLKSLFNTSMYQALVINDESSVENSAPRPKQTLE
LPPPRVNAVEEHSGDVEYHGTDSASATGPLKTEAVEYEHYQHLFEEEDEEHEIIDYTDFTQLSVSRPEVG
SCATSSSVHNEKLLPEPSELPELNREQNADPQGTIERSMDVSVGQENSEPDTEGNCPPPAEVVYSQTEAA
ATSVMASEEPALPPVLEESNGEHAPTDAKGHHLSPALARLFAPTAPVEKQNPKRNRNKSSDKAEVQKPAS
PVSGHNLNSKVFATTESDQNGEFSLISEGDVTELEFVEIASVLHQILSKMEVAFKRKRKNRFMVSTPNTL
YLTQSCVEKFGSQLEAQDLFNKLPQYLVNSGAVINTKCHAFNMPTLLAASDRAKVDIERIINNLKEAGNL
MLSHSLEVAMISLKNANHSELRPIGYQDEEVVRRKVYLYAAFICGLVHDAGKVYDLDIVSLNLASPIIWT
PSSQSLLDWARENDVVEYEIHWRKRIHNQHNIWSSVFLERILNPVCLAFLDRVNKERVYSKMITALNVYT
DGNDFLSKCVRTADFYSTGTDLNVLRDPIMGLRSNDAAARAISTIKHNFTSININNYNAKPMHIIIVNGE
VYLNENAFLDFVLNDFELHKYNFPQGEAGKTVLVESLVQRGYVEPYDDERVVHYFIPGIYSENEISNIFR
NGIGKLEFYNLLKLRWIGLIFDSYKIPDSVPGLFSVNANKDFIYIDEQKTVTEYRRPVPGRDAITKITDT
VETAVLKVNDLGRSTASIDVDIHSKKNEGSSDDFEKKAESDNEIDNDTQIVKSEGEEAADPVIPDIEESE
DESAKDTESHVLVNQLHELLLSAPLSNDYIVCVDAVPYLNIDATMALLPGLDEKAFSEEPYFQLTFREGS
LDGMWIVRDIDDLRLVQLGDNCAGFQLTYHEPRRPTTLKSLFNTSMYQALVINDESSVENSAPRPKQTLE
LPPPRVNAVEEHSGDVEYHGTDSASATGPLKTEAVEYEHYQHLFEEEDEEHEIIDYTDFTQLSVSRPEVG
SCATSSSVHNEKLLPEPSELPELNREQNADPQGTIERSMDVSVGQENSEPDTEGNCPPPAEVVYSQTEAA
ATSVMASEEPALPPVLEESNGEHAPTDAKGHHLSPALARLFAPTAPVEKQNPKRNRNKSSDKAEVQKPAS
PVSGHNLNSKVFATTESDQNGEFSLISEGDVTELEFVEIASVLHQILSKMEVAFKRKRKNRFMVSTPNTL
YLTQSCVEKFGSQLEAQDLFNKLPQYLVNSGAVINTKCHAFNMPTLLAASDRAKVDIERIINNLKEAGNL
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4CP
ID | 1607 | GenBank | WP_046788497 |
Name | traD_B6N66_RS24335_pC20 | UniProt ID | _ |
Length | 694 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 694 a.a. Molecular weight: 78171.95 Da Isoelectric Point: 8.0177
>WP_046788497.1 MULTISPECIES: conjugative transfer system coupling protein TraD [Enterobacterales]
MSDSKRTNLHAQENFYRPILEYRSASILLICSVSMLYMGLSSDGLDIAPIVLFTSILLFLLCLYRCKTAA
PFLMAHWRVFKRHFMFVSLDSLRVINKSNFFSNERKYRQLVQDYQNKNKDIPERKSYFCDGFEWGPEHAD
RAYQIANLSSDKREIELPFVFNPIKRHFDAMARKMGGSNAIFAVERREPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVVVVIDPKNDAEWRESLMEEAKTLGLPFYKFHPGQPASSVCIDVCNTYTNVSD
LTSRLLSLVTVPGEVNPFVQYAKALVSNVISGLSYIEKKPSIYLIHKNMKSHMSIVNLTVKVMESCYARY
YGYDVWTEKVKYVANETLPVRFKRLAEWFTAHFMNYEGSEQIDWLDTVSQLIDYSMSDPEHMAKMTAGIM
PVFDMLIEKPLNELLSPNPNSVSSREIVTSEGMFSTGGVLYISLDGLSNPDTAAAISQLIMSDLTSCAGS
RYNAQDGDMSANSRISIFVDEAHSAINNPMINLLAQGRAAKIALFICTQTISDFIAAASVETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSAISTNMVTYTTGSETSLPHNNFSGSISERKQTTLEESIPKDLLGQV
PMFHIVARLQDGRKVVGQIPIAVAEKQMKPNTTLSEMLFKKAGKVTLRQNLDIKNLNKFLRKFH
MSDSKRTNLHAQENFYRPILEYRSASILLICSVSMLYMGLSSDGLDIAPIVLFTSILLFLLCLYRCKTAA
PFLMAHWRVFKRHFMFVSLDSLRVINKSNFFSNERKYRQLVQDYQNKNKDIPERKSYFCDGFEWGPEHAD
RAYQIANLSSDKREIELPFVFNPIKRHFDAMARKMGGSNAIFAVERREPIFVTEDNWFGHTLITGNVGTG
KTVLQRLLSISMLHLGHVVVVIDPKNDAEWRESLMEEAKTLGLPFYKFHPGQPASSVCIDVCNTYTNVSD
LTSRLLSLVTVPGEVNPFVQYAKALVSNVISGLSYIEKKPSIYLIHKNMKSHMSIVNLTVKVMESCYARY
YGYDVWTEKVKYVANETLPVRFKRLAEWFTAHFMNYEGSEQIDWLDTVSQLIDYSMSDPEHMAKMTAGIM
PVFDMLIEKPLNELLSPNPNSVSSREIVTSEGMFSTGGVLYISLDGLSNPDTAAAISQLIMSDLTSCAGS
RYNAQDGDMSANSRISIFVDEAHSAINNPMINLLAQGRAAKIALFICTQTISDFIAAASVETANRITGLC
NNYISLRVNDTPTQTLVVENFGKSAISTNMVTYTTGSETSLPHNNFSGSISERKQTTLEESIPKDLLGQV
PMFHIVARLQDGRKVVGQIPIAVAEKQMKPNTTLSEMLFKKAGKVTLRQNLDIKNLNKFLRKFH
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
Host bacterium
ID | 3105 | GenBank | NZ_NGBR01000046 |
Plasmid name | pC20 | Incompatibility group | - |
Plasmid size | 76326 bp | Coordinate of oriT [Strand] | 20889..21143 [+] |
Host baterium | Escherichia coli strain c20 |
Cargo genes
Drug resistance gene | sul1, dfrA12, aph(3')-Ia, sul3, ant(3'')-Ia, cmlA1, aadA2, sul2, aph(4)-Ia, aac(3)-IVa |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |