Detailed information of oriT

oriT


The information of the oriT region


oriTDB ID   103274
Name   oriT_pEc20/2xEcTOP in_silico
Organism   Escherichia coli strain TcEc20/2xEcTOP
Sequence Completeness      -
NCBI accession of oriT (coordinates [strand])   NZ_MH514861 (41226..41325 [+], 100 nt)
oriT length   100 nt
IRs (inverted repeats)
   77..82, 90..95  (AAAAAA..TTTTTT)
   78..83, 90..95  (AAAAAA..TTTTTT)
   79..84, 90..95  (AAAAAA..TTTTTT)
   20..26, 38..44  (TAAATCA..TGATTTA)
Location of nic site      62..63
Conserved sequence flanking the nic site   GGTGTATAGC
Note   Predicted by oriTfinder 2.0

  oriT sequence  


Length: 100 nt

>oriT_pEc20/2xEcTOP
TATTTATTTTTTTATCTTTTAAATCAGTACGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGAAAAAAAATCATCTTTTTTGGTAG
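
The annotated features above can be cross-checked against the sequence itself. The following is a minimal sketch, not part of the oriTDB record: it assumes the IR and nic-site coordinates are 1-based and inclusive relative to the 100 nt oriT, and the helper names (`subseq`, `revcomp`, `check_irs`, `flank_span`) are hypothetical. It tests whether each IR arm pair is a reverse-complement match and whether the nic site falls inside the conserved flanking sequence.

```python
# oriT sequence copied from the FASTA record above.
ORIT = "TATTTATTTTTTTATCTTTTAAATCAGTACGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGAAAAAAAATCATCTTTTTTGGTAG"

# IR arm pairs, nic site, and flanking motif as listed in the record
# (assumed to be 1-based inclusive coordinates within the oriT).
IRS = [
    ((77, 82), (90, 95)),
    ((78, 83), (90, 95)),
    ((79, 84), (90, 95)),
    ((20, 26), (38, 44)),
]
NIC = (62, 63)
FLANK = "GGTGTATAGC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def subseq(seq, start, end):
    """Return seq[start..end] using 1-based inclusive coordinates."""
    return seq[start - 1:end]


def revcomp(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def check_irs(seq, irs):
    """For each (left, right) arm pair, test revcomp(left) == right."""
    return [revcomp(subseq(seq, *left)) == subseq(seq, *right)
            for left, right in irs]


def flank_span(seq, motif):
    """1-based inclusive span of the first occurrence of motif, or None."""
    i = seq.find(motif)
    return None if i < 0 else (i + 1, i + len(motif))


ir_ok = check_irs(ORIT, IRS)
span = flank_span(ORIT, FLANK)
nic_inside = span is not None and span[0] <= NIC[0] and NIC[1] <= span[1]

print("length:", len(ORIT))
print("IR pairs are reverse complements:", ir_ok)
print("flanking motif span:", span, "contains nic site:", nic_inside)
```

Because the three A-rich left arms overlap (77..84) and share one right arm, they are reported as separate IR entries rather than a single repeat.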


Relaxase


ID   2445
GenBank   WP_012561142
Name   TrwC_HTS63_RS00240_pEc20/2xEcTOP in_silico
UniProt ID   A0A1S6KKJ9
Length   1078 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  Relaxase protein sequence


Length: 1078 a.a.   Molecular weight: 120195.37 Da   Isoelectric point: 6.5230

>WP_012561142.1 MULTISPECIES: MobF family relaxase [Enterobacterales]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKADMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI

  Protein domains


Predicted by InterProScan.

(851-895)

(477-653)

(16-287)


  Protein structure


Source   ID
AlphaFold DB   A0A1S6KKJ9


Auxiliary protein


ID   1044
GenBank   WP_001749975
Name   WP_001749975_pEc20/2xEcTOP in_silico
UniProt ID   A0A5C2CVS6
Length   138 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  Auxiliary protein sequence


Length: 138 a.a.   Molecular weight: 15332.60 Da   Isoelectric point: 5.9670

>WP_001749975.1 MULTISPECIES: hypothetical protein [Gammaproteobacteria]
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCIEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
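
The protein records on this page are FASTA entries whose sequence is wrapped across several lines. As a minimal sketch (the `parse_fasta` helper is hypothetical, not an oriTDB or oriTfinder function), the wrapped lines can be joined and the reported length of 138 a.a. checked like this:

```python
def parse_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    parts = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            parts[header] = []
        elif header is not None:
            parts[header].append(line)
    return {h: "".join(chunks) for h, chunks in parts.items()}


# Auxiliary protein record copied from the entry above.
FASTA = """\
>WP_001749975.1 MULTISPECIES: hypothetical protein [Gammaproteobacteria]
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCIEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
"""

records = parse_fasta(FASTA)
header, aux_seq = next(iter(records.items()))
accession = header.split()[0]  # first whitespace-delimited token of the header

print(accession, len(aux_seq), "a.a.")
```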

  Protein domains



No domain identified.



  Protein structure


Source   ID
AlphaFold DB   A0A5C2CVS6


T4CP


ID   2217
GenBank   WP_000342688
Name   t4cp2_HTS63_RS00245_pEc20/2xEcTOP in_silico
UniProt ID   -
Length   509 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Length: 509 a.a.   Molecular weight: 57762.87 Da   Isoelectric point: 9.6551

>WP_000342688.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI

  Protein domains


Predicted by InterProScan.

(113-489)

  Protein structure



No structure available.




T4SS


The T4SS was predicted using oriTfinder 2.0.

Region 1: 12783..23112

Locus tag   Coordinates   Strand   Size (bp)   Protein ID   Product   Description
HTS63_RS00065   9094..9366   +   273   Protein_13   IS1 family transposase   -
HTS63_RS00070   9377..9583   -   207   WP_001749967   hypothetical protein   -
HTS63_RS00075   9588..9941   -   354   WP_225622145   restriction endonuclease   -
HTS63_RS00360   9876..10007   +   132   WP_255255389   hypothetical protein   -
HTS63_RS00080   10025..10993   +   969   WP_016151349   IS5 family transposase   -
HTS63_RS00355   11035..11295   -   261   WP_016359294   hypothetical protein   -
HTS63_RS00090   11351..11662   -   312   WP_013279382   hypothetical protein   -
HTS63_RS00095   11698..12012   -   315   WP_016338363   TrbM/KikA/MpfK family conjugal transfer protein   -
HTS63_RS00100   12009..12353   -   345   WP_012561155   hypothetical protein   -
HTS63_RS00105   12369..12746   -   378   WP_193567606   H-NS family nucleoid-associated regulatory protein   -
HTS63_RS00110   12783..13520   +   738   WP_013279384   lytic transglycosylase domain-containing protein   virB1
HTS63_RS00115   13529..13810   +   282   WP_016338364   transcriptional repressor KorA   -
HTS63_RS00120   13820..14113   +   294   WP_001749962   hypothetical protein   virB2
HTS63_RS00125   14164..14481   +   318   WP_000496058   VirB3 family type IV secretion system protein   virB3
HTS63_RS00130   14481..17081   +   2601   WP_012561149   VirB4 family type IV secretion/conjugal transfer ATPase   virB4
HTS63_RS00135   17099..17812   +   714   WP_001749960   type IV secretion system protein   virB5
HTS63_RS00140   17820..18047   +   228   WP_001749959   IncN-type entry exclusion lipoprotein EexN   -
HTS63_RS00145   18063..19103   +   1041   WP_001749958   type IV secretion system protein   virB6
HTS63_RS00150   19174..19332   +   159   WP_012561180   hypothetical protein   -
HTS63_RS00155   19322..20020   +   699   WP_000646594   type IV secretion system protein   virB8
HTS63_RS00160   20031..20915   +   885   WP_000735066   TrbG/VirB9 family P-type conjugative transfer protein   virB9
HTS63_RS00165   20915..22075   +   1161   WP_000101710   type IV secretion system protein VirB10   virB10
HTS63_RS00170   22117..23112   +   996   WP_012561144   ATPase, T2SS/T4P/T4SS family   virB11
HTS63_RS00175   23112..23216   +   105   Protein_36   phospholipase D family protein   -
HTS63_RS00180   23568..24887   +   1320   WP_004152397   IS1182-like element ISKpn6 family transposase   -
HTS63_RS00185   25137..26018   -   882   WP_004199234   carbapenem-hydrolyzing class A beta-lactamase KPC-2   -
HTS63_RS00190   26405..27184   -   780   WP_004152394   IS21-like element ISKpn7 family helper ATPase IstB   -
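
In this table, the Region 1 bounds (12783..23112) coincide with the start of the first virB-annotated gene and the end of the last one. The sketch below reproduces that observation from the rows above; it is an illustration of the arithmetic only and makes no claim about how oriTfinder delimits regions internally. The `VIRB_GENES` list and the helper names are hypothetical, and coordinates are assumed to be 1-based inclusive.

```python
# Rows with a virB description, copied from the table above:
# (locus tag, start, end, reported size in bp, description)
VIRB_GENES = [
    ("HTS63_RS00110", 12783, 13520, 738, "virB1"),
    ("HTS63_RS00120", 13820, 14113, 294, "virB2"),
    ("HTS63_RS00125", 14164, 14481, 318, "virB3"),
    ("HTS63_RS00130", 14481, 17081, 2601, "virB4"),
    ("HTS63_RS00135", 17099, 17812, 714, "virB5"),
    ("HTS63_RS00145", 18063, 19103, 1041, "virB6"),
    ("HTS63_RS00155", 19322, 20020, 699, "virB8"),
    ("HTS63_RS00160", 20031, 20915, 885, "virB9"),
    ("HTS63_RS00165", 20915, 22075, 1161, "virB10"),
    ("HTS63_RS00170", 22117, 23112, 996, "virB11"),
]


def span_bp(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1


def region_bounds(genes):
    """Smallest start and largest end over the given gene rows."""
    return min(g[1] for g in genes), max(g[2] for g in genes)


# Each reported size should equal end - start + 1.
sizes_ok = all(span_bp(s, e) == size for _, s, e, size, _ in VIRB_GENES)

region = region_bounds(VIRB_GENES)
print("sizes consistent:", sizes_ok)
print("region: %d..%d (%d bp)" % (region[0], region[1], span_bp(*region)))
```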


Host bacterium


ID   3717
GenBank   NZ_MH514861
Plasmid name   pEc20/2xEcTOP
Incompatibility group   Col440I
Plasmid size   60204 bp
Coordinate of oriT [Strand]   41226..41325 [+]
Host bacterium   Escherichia coli strain TcEc20/2xEcTOP

Cargo genes


Drug resistance gene   blaKPC-2
Virulence gene   -
Metal resistance gene   -
Degradation gene   -
Symbiosis gene   -
Anti-CRISPR   -