Detailed information on oriT

oriT


Information on the oriT region


oriTDB ID   101501
Name   oriT_p2165-5 in_silico
Organism   Escherichia coli strain GN02165
Sequence Completeness      -
NCBI accession of oriT (coordinates [strand])   NZ_JALJQH010000007 (16778..16878 [+], 101 nt)
oriT length   101 nt
IRs (inverted repeats)      80..85, 91..96  (AAAAAA..TTTTTT)
 20..26, 38..44  (TAAATCA..TGATTTA)
Location of nic site      62..63
Conserved sequence flanking the nic site   GGTGTATAGC
Note   Predicted by oriTfinder 2.0

  oriT sequence  


Length: 101 nt

>oriT_p2165-5
TATTTATTTTTTTATCTTTTAAATCAGTACGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
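
The annotated coordinates above can be cross-checked against the sequence itself. The following is a minimal sketch, not part of the oriTfinder 2.0 output: it assumes the IR and nic-site coordinates are 1-based and inclusive relative to the 101 nt oriT sequence, and the helper names (`revcomp`, `region`, `check_irs`, `nic_in_flank`) are hypothetical, chosen for illustration only.

```python
# oriT sequence copied verbatim from the FASTA record above.
ORIT = "TATTTATTTTTTTATCTTTTAAATCAGTACGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG"

# Annotated features from the record (assumed 1-based, inclusive).
IRS = [((20, 26), (38, 44)), ((80, 85), (91, 96))]
NIC = (62, 63)
FLANK = "GGTGTATAGC"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def region(seq, start, end):
    """Extract a 1-based, inclusive region from seq."""
    return seq[start - 1:end]


def check_irs(seq, irs):
    """For each IR pair, test whether the left arm equals the
    reverse complement of the right arm."""
    return [region(seq, *left) == revcomp(region(seq, *right))
            for left, right in irs]


def nic_in_flank(seq, flank, nic):
    """Test whether both nic-site positions fall inside the first
    occurrence of the conserved flanking sequence."""
    start = seq.find(flank) + 1  # 1-based start; 0 if not found
    if start == 0:
        return False
    end = start + len(flank) - 1
    return start <= nic[0] and nic[1] <= end


if __name__ == "__main__":
    print("length:", len(ORIT))
    print("IR pairs are reverse complements:", check_irs(ORIT, IRS))
    print("nic site inside conserved flank:", nic_in_flank(ORIT, FLANK, NIC))
```

The same helpers can be reused for other oriT records by swapping in the sequence and coordinates, provided the coordinates follow the same convention.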

Visualization of oriT structure

  oriT secondary structure

Predicted by RNAfold.



Relaxase


ID   1340 GenBank   WP_012561142
Name   mobF_MWG53_RS26220_p2165-5 in_silico UniProt ID   A0A1S6KKJ9
Length   1078 a.a. PDB ID   _
Note   Predicted by oriTfinder 2.0

  Relaxase protein sequence


Length: 1078 a.a.        Molecular weight: 120195.37 Da        Isoelectric Point: 6.5230

>WP_012561142.1 MULTISPECIES: MobF family relaxase [Enterobacterales]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKADMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI

  Protein domains


Predicted by InterproScan.

(851-895)

(477-653)

(16-287)


  Protein structure


Source ID Structure
AlphaFold DB A0A1S6KKJ9


Auxiliary protein


ID   478 GenBank   WP_000706094
Name   WP_000706094_p2165-5 in_silico UniProt ID   B6UZ32
Length   96 a.a. PDB ID   _
Note   Predicted by oriTfinder 2.0

  Auxiliary protein sequence


Length: 96 a.a.        Molecular weight: 10829.54 Da        Isoelectric Point: 8.5040

>WP_000706094.1 MULTISPECIES: hypothetical protein [Enterobacterales]
MKIVADRLSDVNRKLDYLFDRASDADFGPLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIA
EPGKTKAAKAEVERNGYKVWEPKKER

  Protein domains



No domain identified.



  Protein structure


Source ID Structure
AlphaFold DB B6UZ32


T4CP


ID   965 GenBank   WP_000342688
Name   t4cp2_MWG53_RS26225_p2165-5 in_silico UniProt ID   _
Length   509 a.a. PDB ID   _
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Length: 509 a.a.        Molecular weight: 57762.87 Da        Isoelectric Point: 9.6551

>WP_000342688.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI

  Protein domains


Predicted by InterproScan.

(113-489)

  Protein structure



No available structure.




T4SS


The T4SS was predicted using oriTfinder 2.0.

Region 1: 44..16211

Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description
MWG53_RS26145 (MWG53_26170) | 44..2644 | + | 2601 | WP_012561149 | VirB4 family type IV secretion/conjugal transfer ATPase | virB4
MWG53_RS26150 (MWG53_26175) | 2662..3375 | + | 714 | WP_001749960 | type IV secretion system protein | virB5
MWG53_RS26155 (MWG53_26180) | 3383..3610 | + | 228 | WP_001749959 | IncN-type entry exclusion lipoprotein EexN | -
MWG53_RS26160 (MWG53_26185) | 3626..4666 | + | 1041 | WP_001749958 | type IV secretion system protein | virB6
MWG53_RS26165 (MWG53_26190) | 4749..4895 | + | 147 | WP_001257173 | conjugal transfer protein TraN | -
MWG53_RS26170 (MWG53_26195) | 4885..5583 | + | 699 | WP_000646594 | type IV secretion system protein | virB8
MWG53_RS26175 (MWG53_26200) | 5594..6478 | + | 885 | WP_000735066 | TrbG/VirB9 family P-type conjugative transfer protein | virB9
MWG53_RS26180 (MWG53_26205) | 6478..7638 | + | 1161 | WP_029602879 | type IV secretion system protein VirB10 | virB10
MWG53_RS26185 (MWG53_26210) | 7680..8675 | + | 996 | WP_012561144 | ATPase, T2SS/T4P/T4SS family | virB11
MWG53_RS26190 (MWG53_26215) | 8675..9208 | + | 534 | WP_000792636 | phospholipase D family protein | -
MWG53_RS26195 (MWG53_26220) | 9381..9695 | - | 315 | WP_000091613 | hypothetical protein | -
MWG53_RS26200 (MWG53_26225) | 9950..10306 | - | 357 | WP_000215515 | cupin domain-containing protein | -
MWG53_RS26205 (MWG53_26230) | 10296..10697 | - | 402 | WP_001293886 | DUF86 domain-containing protein | -
MWG53_RS26210 (MWG53_26235) | 10694..10984 | - | 291 | WP_001247892 | nucleotidyltransferase | -
MWG53_RS26215 (MWG53_26240) | 11048..11446 | - | 399 | WP_012561143 | DUF6710 family protein | -
MWG53_RS26220 (MWG53_26245) | 11446..14682 | - | 3237 | WP_012561142 | MobF family relaxase | -
MWG53_RS26225 (MWG53_26250) | 14682..16211 | - | 1530 | WP_000342688 | type IV secretion system DNA-binding domain-containing protein | virB4
MWG53_RS26230 (MWG53_26255) | 16213..16503 | - | 291 | WP_000706094 | hypothetical protein | -
MWG53_RS26235 (MWG53_26260) | 17197..17583 | + | 387 | WP_013279398 | plasmid stabilization protein StbA | -
MWG53_RS26240 (MWG53_26265) | 17592..18308 | + | 717 | WP_012561139 | StbB family protein | -
MWG53_RS26245 (MWG53_26270) | 18310..18678 | + | 369 | WP_012561138 | plasmid stabilization protein StbC | -
MWG53_RS26250 (MWG53_26275) | 18879..19223 | + | 345 | WP_012561137 | hypothetical protein | -
MWG53_RS26255 (MWG53_26280) | 19685..19864 | - | 180 | WP_012561134 | protein CcgAI | -
MWG53_RS26260 (MWG53_26285) | 20748..20987 | - | 240 | WP_000932975 | hypothetical protein | -
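
The coordinates and sizes in the table can be reconciled with the protein lengths reported in the Relaxase, Auxiliary protein, and T4CP sections. The sketch below is illustrative only: it assumes inclusive coordinates and a complete coding sequence that ends in a stop codon, so the protein length is the codon count minus one. The function names (`gene_size_bp`, `protein_length_aa`) are hypothetical.

```python
def gene_size_bp(start, end):
    """Size of a feature with inclusive start..end coordinates."""
    return end - start + 1


def protein_length_aa(start, end):
    """Expected protein length for a complete CDS.

    Assumes the feature spans whole codons and that the final
    codon is a stop codon, which is not translated.
    """
    size = gene_size_bp(start, end)
    if size % 3 != 0:
        raise ValueError("feature size is not a multiple of 3")
    return size // 3 - 1


# (start, end, size in bp from the table, length in a.a. from the record)
GENES = {
    "relaxase WP_012561142": (11446, 14682, 3237, 1078),
    "T4CP WP_000342688": (14682, 16211, 1530, 509),
    "auxiliary protein WP_000706094": (16213, 16503, 291, 96),
}

if __name__ == "__main__":
    for name, (start, end, size_bp, length_aa) in GENES.items():
        size_ok = gene_size_bp(start, end) == size_bp
        length_ok = protein_length_aa(start, end) == length_aa
        print(f"{name}: size matches={size_ok}, length matches={length_ok}")
```

A mismatch from such a check would usually point to a partial gene, a frameshift, or a transcription error in the table rather than to a problem with the record itself.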


Host bacterium


ID   1945 GenBank   NZ_JALJQH010000007
Plasmid name   p2165-5 Incompatibility group   IncN
Plasmid size   44207 bp Coordinate of oriT [Strand]   16778..16878 [+]
Host bacterium   Escherichia coli strain GN02165

Cargo genes


Drug resistance gene   -
Virulence gene   -
Metal resistance gene   nirD, ncrC, ncrB, ncrA
Degradation gene   -
Symbiosis gene   -
Anti-CRISPR   -