Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 103053 |
Name | oriT_pIMP-GZ1517 |
Organism | Escherichia coli strain CRE1517 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_KT982618 (13779..13879 [-], 101 nt) |
oriT length | 101 nt |
IRs (inverted repeats) | 80..85, 91..96 (AAAAAA..TTTTTT); 20..26, 38..44 (TAAATCA..TGATTTA) |
Location of nic site | 62..63 |
Conserved sequence flanking the nic site | GGTGTATAGC |
Note | Predicted by oriTfinder 2.0 |
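The annotations in this table can be cross-checked against the oriT sequence listed in this record. The following is a minimal Python sketch, not part of oriTDB or oriTfinder. The helper names (`revcomp`, `segment`) are illustrative, and it assumes that all positions are 1-based, inclusive, and relative to the 101-nt oriT sequence as displayed.

```python
# oriT sequence copied from this record (oriT_pIMP-GZ1517).
ORIT = "TATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG"


def revcomp(dna: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def segment(seq: str, start: int, end: int) -> str:
    """Extract a 1-based, inclusive interval from seq (assumed convention)."""
    return seq[start - 1:end]


# Length implied by the plasmid coordinates 13779..13879 (inclusive).
orit_len = 13879 - 13779 + 1

# Inverted-repeat arms as listed in the record.
ir_pairs = [((80, 85), (91, 96)), ((20, 26), (38, 44))]
ir_arms = [(segment(ORIT, *a), segment(ORIT, *b)) for a, b in ir_pairs]

# 1-based span of the conserved motif within the oriT sequence.
motif = "GGTGTATAGC"
motif_start = ORIT.find(motif) + 1
motif_end = motif_start + len(motif) - 1

print(orit_len, len(ORIT))
for left, right in ir_arms:
    # A perfect inverted repeat has arms that are reverse complements.
    print(left, right, revcomp(left) == right)
print(motif_start, motif_end, segment(ORIT, 62, 63))
```

Under these assumptions, both repeat pairs come out as perfect reverse complements, and the conserved motif spans positions 58..67, so the annotated nic site (62..63) falls inside it.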
oriT sequence
Length: 101 nt
>oriT_pIMP-GZ1517
TATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 2284 | GenBank | WP_012561166 |
Name | TrwC_HTH21_RS00110_pIMP-GZ1517 | UniProt ID | D7RU03 |
Length | 1078 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Length: 1078 a.a.; Molecular weight: 120141.30 Da; Isoelectric point: 6.5229
>WP_012561166.1 MULTISPECIES: MobF family relaxase [Gammaproteobacteria]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATMADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKNEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
Protein domains
Predicted by InterProScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | D7RU03 |
Auxiliary protein
ID | 944 | GenBank | WP_001749975 |
Name | WP_001749975_pIMP-GZ1517 | UniProt ID | A0A5C2CVS6 |
Length | 138 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Length: 138 a.a.; Molecular weight: 15332.60 Da; Isoelectric point: 5.9670
>WP_001749975.1 MULTISPECIES: hypothetical protein [Gammaproteobacteria]
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCIEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
Protein domains
No domain identified.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A5C2CVS6 |
T4CP
ID | 1988 | GenBank | WP_000342688 |
Name | t4cp2_HTH21_RS00105_pIMP-GZ1517 | UniProt ID | _ |
Length | 509 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 509 a.a.; Molecular weight: 57762.87 Da; Isoelectric point: 9.6551
>WP_000342688.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
T4SS
T4SS components were predicted using oriTfinder 2.0.
Region 1: 27863..38188
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
HTH21_RS00145 | 22985..23599 | + | 615 | WP_014839983 | recombinase family protein | - |
HTH21_RS00150 | 23918..24574 | - | 657 | WP_001516695 | quinolone resistance pentapeptide repeat protein QnrS1 | - |
HTH21_RS00155 | 24969..25600 | + | 632 | Protein_30 | transposase | - |
HTH21_RS00160 | 25654..26358 | - | 705 | WP_001067855 | IS6-like element IS26 family transposase | - |
HTH21_RS00165 | 26423..26602 | - | 180 | Protein_32 | helix-turn-helix domain-containing protein | - |
HTH21_RS00355 | 26869..27156 | + | 288 | Protein_33 | DUF6710 family protein | - |
HTH21_RS00175 | 27330..27863 | - | 534 | WP_000792636 | phospholipase D family protein | - |
HTH21_RS00180 | 27863..28858 | - | 996 | WP_000128596 | ATPase, T2SS/T4P/T4SS family | virB11 |
HTH21_RS00185 | 28900..30060 | - | 1161 | WP_000101710 | type IV secretion system protein VirB10 | virB10 |
HTH21_RS00190 | 30060..30944 | - | 885 | WP_000735066 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
HTH21_RS00195 | 30955..31653 | - | 699 | WP_000646594 | type IV secretion system protein | virB8 |
HTH21_RS00200 | 31643..31801 | - | 159 | WP_012561180 | hypothetical protein | - |
HTH21_RS00205 | 31872..32912 | - | 1041 | WP_001749958 | type IV secretion system protein | virB6 |
HTH21_RS00210 | 32928..33155 | - | 228 | WP_001749959 | IncN-type entry exclusion lipoprotein EexN | - |
HTH21_RS00215 | 33163..33927 | - | 765 | WP_013149461 | type IV secretion system protein | virB5 |
HTH21_RS00220 | 33894..36494 | - | 2601 | WP_001749961 | VirB4 family type IV secretion/conjugal transfer ATPase | virB4 |
HTH21_RS00225 | 36494..36811 | - | 318 | WP_000496058 | VirB3 family type IV secretion system protein | virB3 |
HTH21_RS00230 | 36861..37154 | - | 294 | WP_001749962 | hypothetical protein | virB2 |
HTH21_RS00235 | 37164..37445 | - | 282 | WP_000440698 | transcriptional repressor KorA | - |
HTH21_RS00240 | 37454..38188 | - | 735 | WP_001749963 | lytic transglycosylase domain-containing protein | virB1 |
HTH21_RS00245 | 38231..38602 | + | 372 | WP_011867773 | H-NS family nucleoid-associated regulatory protein | - |
HTH21_RS00250 | 38618..38962 | + | 345 | WP_001749964 | hypothetical protein | - |
HTH21_RS00255 | 38959..39273 | + | 315 | WP_001749965 | TrbM/KikA/MpfK family conjugal transfer protein | - |
HTH21_RS00260 | 39387..39620 | + | 234 | WP_001191790 | hypothetical protein | - |
HTH21_RS00265 | 39670..40317 | + | 648 | WP_015344958 | restriction endonuclease | - |
HTH21_RS00270 | 40322..40528 | + | 207 | WP_001749967 | hypothetical protein | - |
HTH21_RS00275 | 40539..40811 | - | 273 | Protein_54 | IS1 family transposase | - |
HTH21_RS00280 | 40910..42343 | + | 1434 | WP_001288432 | DNA cytosine methyltransferase | - |
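The Region 1 coordinates can be reproduced from the table by taking the span of the rows that carry a virB label in the Description column. A minimal Python sketch follows, assuming 1-based inclusive coordinates and assuming that the region is delimited this way (the record does not state how oriTfinder defines it). Only a subset of rows is transcribed, and the variable names are illustrative.

```python
# A subset of rows transcribed from the table:
# (locus tag, start, end, size in bp, description)
rows = [
    ("HTH21_RS00175", 27330, 27863, 534, "-"),
    ("HTH21_RS00180", 27863, 28858, 996, "virB11"),
    ("HTH21_RS00185", 28900, 30060, 1161, "virB10"),
    ("HTH21_RS00220", 33894, 36494, 2601, "virB4"),
    ("HTH21_RS00235", 37164, 37445, 282, "-"),
    ("HTH21_RS00240", 37454, 38188, 735, "virB1"),
    ("HTH21_RS00245", 38231, 38602, 372, "-"),
]

# For inclusive coordinates, size should equal end - start + 1.
size_ok = all(end - start + 1 == size for _, start, end, size, _ in rows)

# Span of the rows annotated as virB components (case-insensitive match).
vir_rows = [r for r in rows if r[4].lower().startswith("virb")]
region = (min(r[1] for r in vir_rows), max(r[2] for r in vir_rows))

print(size_ok, region)
```

With these rows the span is 27863..38188, matching Region 1. The flanking rows in the table are neighboring genes shown for context and fall outside that span, apart from HTH21_RS00175, whose last base (27863) coincides with the first base of virB11.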
Host bacterium
ID | 3496 | GenBank | NZ_KT982618 |
Plasmid name | pIMP-GZ1517 | Incompatibility group | IncN |
Plasmid size | 51589 bp | Coordinate of oriT [Strand] | 13779..13879 [-] |
Host bacterium | Escherichia coli strain CRE1517 |
Cargo genes
Drug resistance gene | qnrS1, blaIMP-4 |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |