Detailed information of oriT

oriT


Information on the oriT region


oriTDB ID   103063
Name   oriT_pBK32602 in_silico
Organism   Escherichia coli strain BK32602
Sequence Completeness      -
NCBI accession of oriT (coordinates [strand])   NZ_KU295134 (74813..74913 [+], 101 nt)
oriT length   101 nt
IRs (inverted repeats)   80..85, 91..96  (AAAAAA..TTTTTT)
                         20..26, 38..44  (TAAATCA..TGATTTA)
Location of nic site      62..63
Conserved sequence flanking the nic site   GGTGTATAGC
Note   Predicted by oriTfinder 2.0
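The coordinates above can be related to each other with simple arithmetic. The following is a minimal sketch, assuming that all coordinates in this record are 1-based and inclusive and that positions within oriT are counted from the first base of the region on the + strand. The function names `span_length` and `local_to_plasmid` are hypothetical helpers introduced only for illustration.

```python
# Assumption: coordinates are 1-based and inclusive, and the oriT region
# lies on the + strand of NZ_KU295134 (as stated in the record).
ORIT_START, ORIT_END = 74813, 74913


def span_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive interval."""
    return end - start + 1


def local_to_plasmid(pos: int, orit_start: int = ORIT_START) -> int:
    """Map a 1-based position inside oriT to a plasmid coordinate (+ strand)."""
    return orit_start + pos - 1


# The nic site is reported at oriT-local positions 62..63.
nic_local = (62, 63)
nic_plasmid = tuple(local_to_plasmid(p) for p in nic_local)

print(span_length(ORIT_START, ORIT_END))  # 101
print(nic_plasmid)                        # (74874, 74875)
```

Under these assumptions the reported oriT length (101 nt) agrees with the accession coordinates, and the nic site would fall at 74874..74875 on the plasmid.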

  oriT sequence  


Length: 101 nt

>oriT_pBK32602
TATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
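The inverted repeats and the nic site listed in the record can be checked against this sequence. The sketch below assumes 1-based inclusive positions and treats two arms as an inverted repeat when one is the reverse complement of the other. The helpers `reverse_complement` and `region` are hypothetical names used only for this illustration, and this is not the method oriTfinder itself uses.

```python
ORIT = (
    "TATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGG"
    "TGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG"
)

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def region(seq: str, start: int, end: int) -> str:
    """Extract a 1-based inclusive interval from seq."""
    return seq[start - 1:end]


# IR arms as listed in the record: (left arm, right arm).
IRS = [((80, 85), (91, 96)), ((20, 26), (38, 44))]

for left, right in IRS:
    left_seq = region(ORIT, *left)
    right_seq = region(ORIT, *right)
    # True when the right arm is the reverse complement of the left arm.
    print(left, left_seq, right, right_seq,
          reverse_complement(left_seq) == right_seq)

# Locate the conserved nic-flanking sequence and mark the nic site (62..63).
motif = "GGTGTATAGC"
motif_start = ORIT.find(motif) + 1  # convert 0-based index to 1-based
nicked = region(ORIT, motif_start, 62) + "/" + region(ORIT, 63, motif_start + len(motif) - 1)
print(motif_start, nicked)
```

With the sequence as given, both pairs of arms are reverse complements of each other, and the conserved motif starts at position 58, so the nic site at 62..63 falls inside it (GGTGT/ATAGC).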

Visualization of oriT structure

  oriT secondary structure

Predicted by RNAfold.



Relaxase


ID   2293
GenBank   WP_012561166
Name   TrwC_HTG56_RS00400_pBK32602 in_silico
UniProt ID   D7RU03
Length   1078 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  Relaxase protein sequence


Length: 1078 a.a.        Molecular weight: 120141.30 Da        Isoelectric Point: 6.5229

>WP_012561166.1 MULTISPECIES: MobF family relaxase [Gammaproteobacteria]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATMADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKNEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI

  Protein domains


Predicted by InterproScan.

(851-895)

(16-287)

(477-653)


  Protein structure


Source ID Structure
AlphaFold DB D7RU03


Auxiliary protein


ID   954
GenBank   WP_001749975
Name   WP_001749975_pBK32602 in_silico
UniProt ID   A0A5C2CVS6
Length   138 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  Auxiliary protein sequence


Length: 138 a.a.        Molecular weight: 15332.60 Da        Isoelectric Point: 5.9670

>WP_001749975.1 MULTISPECIES: hypothetical protein [Gammaproteobacteria]
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCIEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER

  Protein domains



No domain identified.



  Protein structure


Source ID Structure
AlphaFold DB A0A5C2CVS6


T4CP


ID   2002
GenBank   WP_000342688
Name   t4cp2_HTG56_RS00405_pBK32602 in_silico
UniProt ID   -
Length   509 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Length: 509 a.a.        Molecular weight: 57762.87 Da        Isoelectric Point: 9.6551

>WP_000342688.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI

  Protein domains


Predicted by InterproScan.

(113-489)

  Protein structure



No available structure.




T4SS


The T4SS was predicted using oriTfinder 2.0.

Region 1: 11260..21585

Locus tag   Coordinates   Strand   Size (bp)   Protein ID   Product   Description
HTG56_RS00045   7105..8538   -   1434   WP_001288432   DNA cytosine methyltransferase   -
HTG56_RS00050   8637..8909   +   273   Protein_10   IS1 family transposase   -
HTG56_RS00055   8920..9126   -   207   WP_001749967   hypothetical protein   -
HTG56_RS00060   9131..9778   -   648   WP_012561184   restriction endonuclease   -
HTG56_RS00065   9828..10061   -   234   WP_001191790   hypothetical protein   -
HTG56_RS00070   10175..10489   -   315   WP_001749965   TrbM/KikA/MpfK family conjugal transfer protein   -
HTG56_RS00075   10486..10830   -   345   WP_001749964   hypothetical protein   -
HTG56_RS00080   10846..11217   -   372   WP_011867773   H-NS family nucleoid-associated regulatory protein   -
HTG56_RS00085   11260..11994   +   735   WP_001749963   lytic transglycosylase domain-containing protein   virB1
HTG56_RS00090   12003..12284   +   282   WP_000440698   transcriptional repressor KorA   -
HTG56_RS00095   12294..12587   +   294   WP_001749962   hypothetical protein   virB2
HTG56_RS00100   12637..12954   +   318   WP_000496058   VirB3 family type IV secretion system protein   virB3
HTG56_RS00105   12954..15554   +   2601   WP_001749961   VirB4 family type IV secretion/conjugal transfer ATPase   virB4
HTG56_RS00110   15572..16285   +   714   WP_001749960   type IV secretion system protein   virB5
HTG56_RS00115   16293..16520   +   228   WP_001749959   IncN-type entry exclusion lipoprotein EexN   -
HTG56_RS00120   16536..17576   +   1041   WP_001749958   type IV secretion system protein   virB6
HTG56_RS00125   17647..17805   +   159   WP_012561180   hypothetical protein   -
HTG56_RS00130   17795..18493   +   699   WP_000646594   type IV secretion system protein   virB8
HTG56_RS00135   18504..19388   +   885   WP_000735066   TrbG/VirB9 family P-type conjugative transfer protein   virB9
HTG56_RS00140   19388..20548   +   1161   WP_000101710   type IV secretion system protein VirB10   virB10
HTG56_RS00145   20590..21585   +   996   WP_000128596   ATPase, T2SS/T4P/T4SS family   virB11
HTG56_RS00150   21585..22118   +   534   WP_000792636   phospholipase D family protein   -
HTG56_RS00530   22292..22579   -   288   Protein_31   DUF6710 family protein   -
HTG56_RS00160   22846..23025   +   180   Protein_32   helix-turn-helix domain-containing protein   -
HTG56_RS00165   23090..23462   +   373   Protein_33   DDE-type integrase/transposase/recombinase   -
HTG56_RS00170   23514..25280   +   1767   Protein_34   Tn3 family transposase   -
HTG56_RS00175   25359..26363   +   1005   WP_000427619   IS110-like element IS5075 family transposase   -
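The Size (bp) column and the Region 1 boundaries appear to follow directly from the coordinates of the genes annotated as T4SS components. The sketch below checks this for the virB-annotated rows, assuming 1-based inclusive coordinates and assuming that the region spans from the first to the last annotated component. The names `VIRB_ROWS` and `parse_coords` are hypothetical and used only for this illustration.

```python
# (locus tag, coordinates, reported size in bp, T4SS component) for the
# rows of the table above that carry a virB annotation.
VIRB_ROWS = [
    ("HTG56_RS00085", "11260..11994", 735, "virB1"),
    ("HTG56_RS00095", "12294..12587", 294, "virB2"),
    ("HTG56_RS00100", "12637..12954", 318, "virB3"),
    ("HTG56_RS00105", "12954..15554", 2601, "virB4"),
    ("HTG56_RS00110", "15572..16285", 714, "virB5"),
    ("HTG56_RS00120", "16536..17576", 1041, "virB6"),
    ("HTG56_RS00130", "17795..18493", 699, "virB8"),
    ("HTG56_RS00135", "18504..19388", 885, "virB9"),
    ("HTG56_RS00140", "19388..20548", 1161, "virB10"),
    ("HTG56_RS00145", "20590..21585", 996, "virB11"),
]


def parse_coords(text: str) -> tuple[int, int]:
    """Parse a 'start..end' string into a pair of integers."""
    start, end = text.split("..")
    return int(start), int(end)


# Assumption: size = end - start + 1 for 1-based inclusive coordinates.
sizes_ok = all(
    end - start + 1 == size
    for _, coords, size, _ in VIRB_ROWS
    for start, end in [parse_coords(coords)]
)

# Assumption: the region runs from the first to the last annotated component.
starts, ends = zip(*(parse_coords(coords) for _, coords, _, _ in VIRB_ROWS))
region_1 = (min(starts), max(ends))

print(sizes_ok)   # True
print(region_1)   # (11260, 21585)
```

Under these assumptions the computed span matches the reported Region 1 (11260..21585), that is, from the start of virB1 to the end of virB11.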


Host bacterium


ID   3506
GenBank   NZ_KU295134
Plasmid name   pBK32602
Incompatibility group   IncN
Plasmid size   87982 bp
Coordinate of oriT [Strand]   74813..74913 [+]
Host bacterium   Escherichia coli strain BK32602

Cargo genes


Drug resistance gene   dfrA14, sul2, aph(3'')-Ib, aph(6)-Id, blaTEM-1A, blaOXA-9, ant(3'')-Ia, aac(6')-Ib, blaKPC-3
Virulence gene   -
Metal resistance gene   -
Degradation gene   -
Symbiosis gene   -
Anti-CRISPR   -