Detailed information of oriT
oriT
Information on the oriT region
oriTDB ID | 103056 |
Name | oriT_pIMP-SZ1502 |
Organism | Escherichia coli strain CRE1502 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_KU051707 (13790..13890 [-], 101 nt) |
oriT length | 101 nt |
IRs (inverted repeats) | 80..85, 91..96 (AAAAAA..TTTTTT); 20..26, 38..44 (TAAATCA..TGATTTA) |
Location of nic site | 62..63 |
Conserved sequence flanking the nic site | GGTGTATAGC |
Note | Predicted by oriTfinder 2.0 |
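The annotations in the table above can be cross-checked against the oriT sequence of this record. The following is a minimal Python sketch, not part of oriTfinder; it assumes that all positions are 1-based, inclusive, and counted along the reported 101-nt sequence, and the helper names (`reverse_complement`, `region`, `check_record`) are hypothetical.

```python
# oriT sequence copied from this record (101 nt).
ORIT = "TATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG"

# Annotations copied from the table (assumed 1-based, inclusive).
IR_PAIRS = [((80, 85), (91, 96)), ((20, 26), (38, 44))]
NIC_SITE = (62, 63)
CONSERVED = "GGTGTATAGC"
PLASMID_COORDS = (13790, 13890)

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def region(seq: str, start: int, end: int) -> str:
    """Slice a sequence using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


def check_record() -> dict:
    """Compare the tabulated annotations with the sequence itself."""
    conserved_start = ORIT.find(CONSERVED) + 1  # convert to 1-based
    conserved_end = conserved_start + len(CONSERVED) - 1
    return {
        # Length of the sequence and of the plasmid coordinate span.
        "length": len(ORIT),
        "span": PLASMID_COORDS[1] - PLASMID_COORDS[0] + 1,
        # Each left arm should equal the reverse complement of its right arm.
        "ir_pairs_ok": [
            region(ORIT, *left) == reverse_complement(region(ORIT, *right))
            for left, right in IR_PAIRS
        ],
        # The nic site should fall inside the conserved flanking sequence.
        "conserved_start": conserved_start,
        "nic_inside_conserved": conserved_start <= NIC_SITE[0]
        and NIC_SITE[1] <= conserved_end,
    }


if __name__ == "__main__":
    print(check_record())
```

Under these assumptions, both inverted-repeat pairs are exact reverse complements, the conserved sequence starts at position 58, and the nic site (62..63) lies inside it (GGTGT|ATAGC).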
oriT sequence
Length: 101 nt
>oriT_pIMP-SZ1502
TATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 2287 | GenBank | WP_012561166 |
Name | TrwC_HTH37_RS00110_pIMP-SZ1502 | UniProt ID | D7RU03 |
Length | 1078 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Length: 1078 a.a. | Molecular weight: 120141.30 Da | Isoelectric point: 6.5229
>WP_012561166.1 MULTISPECIES: MobF family relaxase [Gammaproteobacteria]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATMADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKNEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
Protein domains
Predicted by InterProScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | D7RU03 |
Auxiliary protein
ID | 947 | GenBank | WP_001749975 |
Name | WP_001749975_pIMP-SZ1502 | UniProt ID | A0A5C2CVS6 |
Length | 138 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Length: 138 a.a. | Molecular weight: 15332.60 Da | Isoelectric point: 5.9670
>WP_001749975.1 MULTISPECIES: hypothetical protein [Gammaproteobacteria]
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCIEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
Protein domains
No domain identified.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A5C2CVS6 |
T4CP
ID | 1991 | GenBank | WP_000342688 |
Name | t4cp2_HTH37_RS00105_pIMP-SZ1502 | UniProt ID | _ |
Length | 509 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 509 a.a. | Molecular weight: 57762.87 Da | Isoelectric point: 9.6551
>WP_000342688.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
T4SS
The T4SS was predicted using oriTfinder 2.0.
Region 1: 27636..37961
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
HTH37_RS00140 | 22758..23372 | + | 615 | WP_014839983 | recombinase family protein | - |
HTH37_RS00145 | 23691..24347 | - | 657 | WP_001516695 | quinolone resistance pentapeptide repeat protein QnrS1 | - |
HTH37_RS00150 | 24742..25373 | + | 632 | Protein_29 | transposase | - |
HTH37_RS00155 | 25427..26131 | - | 705 | WP_001067855 | IS6-like element IS26 family transposase | - |
HTH37_RS00160 | 26196..26375 | - | 180 | Protein_31 | helix-turn-helix domain-containing protein | - |
HTH37_RS00350 | 26642..26929 | + | 288 | Protein_32 | DUF6710 family protein | - |
HTH37_RS00170 | 27103..27636 | - | 534 | WP_000792636 | phospholipase D family protein | - |
HTH37_RS00175 | 27636..28631 | - | 996 | WP_000128596 | ATPase, T2SS/T4P/T4SS family | virB11 |
HTH37_RS00180 | 28673..29833 | - | 1161 | WP_000101710 | type IV secretion system protein VirB10 | virB10 |
HTH37_RS00185 | 29833..30717 | - | 885 | WP_000735066 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
HTH37_RS00190 | 30728..31426 | - | 699 | WP_000646594 | type IV secretion system protein | virB8 |
HTH37_RS00195 | 31416..31574 | - | 159 | WP_012561180 | hypothetical protein | - |
HTH37_RS00200 | 31645..32685 | - | 1041 | WP_001749958 | type IV secretion system protein | virB6 |
HTH37_RS00205 | 32701..32928 | - | 228 | WP_001749959 | IncN-type entry exclusion lipoprotein EexN | - |
HTH37_RS00210 | 32936..33700 | - | 765 | WP_013149461 | type IV secretion system protein | virB5 |
HTH37_RS00215 | 33667..36267 | - | 2601 | WP_001749961 | VirB4 family type IV secretion/conjugal transfer ATPase | virB4 |
HTH37_RS00220 | 36267..36584 | - | 318 | WP_000496058 | VirB3 family type IV secretion system protein | virB3 |
HTH37_RS00225 | 36634..36927 | - | 294 | WP_001749962 | hypothetical protein | virB2 |
HTH37_RS00230 | 36937..37218 | - | 282 | WP_000440698 | transcriptional repressor KorA | - |
HTH37_RS00235 | 37227..37961 | - | 735 | WP_001749963 | lytic transglycosylase domain-containing protein | virB1 |
HTH37_RS00240 | 38004..38375 | + | 372 | WP_011867773 | H-NS family nucleoid-associated regulatory protein | - |
HTH37_RS00245 | 38391..38735 | + | 345 | WP_001749964 | hypothetical protein | - |
HTH37_RS00250 | 38732..39046 | + | 315 | WP_001749965 | TrbM/KikA/MpfK family conjugal transfer protein | - |
HTH37_RS00255 | 39160..39393 | + | 234 | WP_001191790 | hypothetical protein | - |
HTH37_RS00260 | 39443..40090 | + | 648 | WP_015344958 | restriction endonuclease | - |
HTH37_RS00265 | 40095..40301 | + | 207 | WP_001749967 | hypothetical protein | - |
HTH37_RS00270 | 40312..40584 | - | 273 | Protein_53 | IS1 family transposase | - |
HTH37_RS00275 | 40683..42116 | + | 1434 | WP_001288432 | DNA cytosine methyltransferase | - |
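The coordinates in the table above can be sanity-checked with a short script. The sketch below is illustrative only and assumes inclusive coordinates, so that size = end - start + 1. It uses the ten rows annotated with a vir gene (locus tag, coordinates, and size copied from the table) and checks that they span the reported Region 1. The function names are hypothetical.

```python
# Rows annotated with a vir gene: "locus tag | coordinates | size (bp)".
ROWS = [
    "HTH37_RS00175 | 27636..28631 | 996",
    "HTH37_RS00180 | 28673..29833 | 1161",
    "HTH37_RS00185 | 29833..30717 | 885",
    "HTH37_RS00190 | 30728..31426 | 699",
    "HTH37_RS00200 | 31645..32685 | 1041",
    "HTH37_RS00210 | 32936..33700 | 765",
    "HTH37_RS00215 | 33667..36267 | 2601",
    "HTH37_RS00220 | 36267..36584 | 318",
    "HTH37_RS00225 | 36634..36927 | 294",
    "HTH37_RS00235 | 37227..37961 | 735",
]

# Region 1 boundaries as reported in this record.
REGION_1 = (27636, 37961)


def parse_row(row: str) -> tuple:
    """Split one pipe-delimited row into (locus, start, end, size)."""
    locus, coords, size = (field.strip() for field in row.split("|"))
    start, end = (int(value) for value in coords.split(".."))
    return locus, start, end, int(size)


def size_mismatches(rows: list) -> list:
    """Return locus tags whose size differs from end - start + 1."""
    mismatched = []
    for row in rows:
        locus, start, end, size = parse_row(row)
        if end - start + 1 != size:
            mismatched.append(locus)
    return mismatched


def span(rows: list) -> tuple:
    """Return (smallest start, largest end) over all rows."""
    parsed = [parse_row(row) for row in rows]
    return min(p[1] for p in parsed), max(p[2] for p in parsed)


if __name__ == "__main__":
    print(size_mismatches(ROWS))     # prints []
    print(span(ROWS) == REGION_1)    # prints True
```

With these rows, every size matches its coordinates, and the vir genes run from 27636 (start of virB11) to 37961 (end of virB1), which agrees with the Region 1 boundaries. The remaining rows in the table are flanking genes outside that region.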
Host bacterium
ID | 3499 | GenBank | NZ_KU051707 |
Plasmid name | pIMP-SZ1502 | Incompatibility group | IncN |
Plasmid size | 51362 bp | Coordinate of oriT [Strand] | 13790..13890 [-] |
Host bacterium | Escherichia coli strain CRE1502 |
Cargo genes
Drug resistance gene | qnrS1, blaIMP-4 |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |