Detailed information of oriT

oriT


The information of the oriT region


oriTDB ID   115206
Name   oriT_p2-KQ20786 in_silico
Organism   Klebsiella quasipneumoniae subsp. similipneumoniae strain KQ20786
Sequence Completeness      -
NCBI accession of oriT (coordinates [strand])   NZ_CP133225 (3600..3700 [+], 101 nt)
oriT length   101 nt
IRs (inverted repeats)   80..85, 91..96  (AAAAAA..TTTTTT)
                         20..26, 38..44  (TAAATCA..TGATTTA)
Location of nic site      62..63
Conserved sequence flanking the nic site   GGTGTATAGC
Note   Predicted by oriTfinder 2.0

  oriT sequence  


Length: 101 nt

>oriT_p2-KQ20786
TATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
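
The coordinates listed for this oriT (inverted repeats, nic site, conserved flanking sequence) are 1-based and inclusive relative to the 101 nt sequence above. The following is a minimal sketch of how those annotations could be cross-checked against the sequence; the helper names (`region`, `reverse_complement`, `check_inverted_repeats`, `motif_start`) are hypothetical and are not part of oriTfinder or oriTDB.

```python
# Sequence copied from the oriT_p2-KQ20786 FASTA record above.
ORIT = "TATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG"

# Inverted-repeat arm coordinates as listed in the record (1-based, inclusive).
IR_PAIRS = [((80, 85), (91, 96)), ((20, 26), (38, 44))]

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def region(seq, start, end):
    """Return seq[start..end] using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def check_inverted_repeats(seq, pairs):
    """For each pair of arms, report both arms and whether the right arm
    equals the reverse complement of the left arm."""
    results = []
    for (l_start, l_end), (r_start, r_end) in pairs:
        left = region(seq, l_start, l_end)
        right = region(seq, r_start, r_end)
        results.append((left, right, reverse_complement(left) == right))
    return results


def motif_start(seq, motif):
    """1-based start of the first occurrence of motif, or None if absent."""
    idx = seq.find(motif)
    return None if idx < 0 else idx + 1


if __name__ == "__main__":
    print("length:", len(ORIT))
    for left, right, ok in check_inverted_repeats(ORIT, IR_PAIRS):
        print(left, right, "reverse-complementary" if ok else "mismatch")
    print("conserved motif starts at:", motif_start(ORIT, "GGTGTATAGC"))
```

Under these assumptions, the conserved sequence GGTGTATAGC spans positions 58..67, so the annotated nic site (62..63) falls inside it.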

Visualization of oriT structure

  oriT secondary structure

Predicted by RNAfold.



Relaxase


ID   9719
GenBank   WP_012561166
Name   mobF_RCG66_RS25725_p2-KQ20786 in_silico
UniProt ID   D7RU03
Length   1078 a.a.
PDB ID   _
Note   Predicted by oriTfinder 2.0

  Relaxase protein sequence


Length: 1078 a.a.   Molecular weight: 120141.30 Da   Isoelectric point: 6.5229

>WP_012561166.1 MULTISPECIES: MobF family relaxase [Gammaproteobacteria]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATMADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKNEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI

  Protein domains


Predicted by InterproScan.

(851-895)

(16-287)

(477-653)


  Protein structure


Source ID Structure
AlphaFold DB D7RU03


Auxiliary protein


ID   5710
GenBank   WP_001749975
Name   WP_001749975_p2-KQ20786 in_silico
UniProt ID   A0A5C2CVS6
Length   138 a.a.
PDB ID   _
Note   Predicted by oriTfinder 2.0

  Auxiliary protein sequence


Length: 138 a.a.   Molecular weight: 15332.60 Da   Isoelectric point: 5.9670

>WP_001749975.1 MULTISPECIES: hypothetical protein [Gammaproteobacteria]
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCIEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER

  Protein domains



No domain identified.



  Protein structure


Source ID Structure
AlphaFold DB A0A5C2CVS6


T4CP


ID   11335
GenBank   WP_021740568
Name   t4cp2_RCG66_RS25730_p2-KQ20786 in_silico
UniProt ID   _
Length   509 a.a.
PDB ID   _
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Length: 509 a.a.   Molecular weight: 57738.89 Da   Isoelectric point: 9.6550

>WP_021740568.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacteriaceae]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVILWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI

  Protein domains


Predicted by InterproScan.

(113-489)

  Protein structure



No available structure.




T4SS


The T4SS was predicted using oriTfinder 2.0.

Region 1: 29107..39432

Locus tag   Coordinates   Strand   Size (bp)   Protein ID   Product   Description
RCG66_RS25875   24952..26385   -   1434   WP_001288432   DNA cytosine methyltransferase   -
RCG66_RS25880   26484..26756   +   273   Protein_31   IS1 family transposase   -
RCG66_RS25885   26767..26973   -   207   WP_001749967   hypothetical protein   -
RCG66_RS25890   26978..27625   -   648   WP_015344958   restriction endonuclease   -
RCG66_RS25895   27675..27908   -   234   WP_001191790   hypothetical protein   -
RCG66_RS25900   28022..28336   -   315   WP_001749965   TrbM/KikA/MpfK family conjugal transfer protein   -
RCG66_RS25905   28333..28677   -   345   WP_001749964   hypothetical protein   -
RCG66_RS25910   28693..28998   -   306   WP_000960954   H-NS family nucleoid-associated regulatory protein   -
RCG66_RS25915   29107..29841   +   735   WP_001749963   lytic transglycosylase domain-containing protein   virB1
RCG66_RS25920   29850..30131   +   282   WP_000440698   transcriptional repressor KorA   -
RCG66_RS25925   30141..30434   +   294   WP_001749962   hypothetical protein   virB2
RCG66_RS25930   30484..30801   +   318   WP_000496058   VirB3 family type IV secretion system protein   virB3
RCG66_RS25935   30801..33401   +   2601   WP_001749961   VirB4 family type IV secretion/conjugal transfer ATPase   virB4
RCG66_RS25940   33419..34132   +   714   WP_001749960   type IV secretion system protein   virB5
RCG66_RS25945   34140..34367   +   228   WP_001749959   IncN-type entry exclusion lipoprotein EexN   -
RCG66_RS25950   34383..35423   +   1041   WP_001749958   type IV secretion system protein   virB6
RCG66_RS25955   35642..36340   +   699   WP_000646594   virB8 family protein   virB8
RCG66_RS25960   36351..37235   +   885   WP_000735066   TrbG/VirB9 family P-type conjugative transfer protein   virB9
RCG66_RS25965   37235..38395   +   1161   WP_000101710   type IV secretion system protein VirB10   virB10
RCG66_RS25970   38437..39432   +   996   WP_000128596   ATPase, T2SS/T4P/T4SS family   virB11
RCG66_RS25975   39432..39965   +   534   WP_000792636   phospholipase D family protein   -
RCG66_RS25980   40139..40444   -   306   WP_011742557   DUF6710 family protein   -
RCG66_RS25985   40693..40872   +   180   Protein_52   helix-turn-helix domain-containing protein   -
RCG66_RS25990   40937..41641   +   705   WP_001067855   IS6-like element IS26 family transposase   -
RCG66_RS25995   41695..42326   -   632   Protein_54   transposase   -
RCG66_RS26000   42721..43377   +   657   WP_001516695   quinolone resistance pentapeptide repeat protein QnrS1   -
RCG66_RS26005   43696..44310   -   615   WP_014839983   recombinase family protein   -
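
The table lists flanking genes as well as the genes inside Region 1 (29107..39432), and each size follows from the inclusive coordinates as end - start + 1. The sketch below illustrates both checks on a handful of rows copied from the table; the function names and the tuple layout are assumptions made for illustration, not an oriTfinder output format.

```python
# Region 1 boundaries as reported for the predicted T4SS (1-based, inclusive).
REGION_1 = (29107, 39432)

# (locus tag, start, end, size in bp) copied from selected rows of the table.
ROWS = [
    ("RCG66_RS25910", 28693, 28998, 306),   # H-NS family protein, upstream
    ("RCG66_RS25915", 29107, 29841, 735),   # virB1, first gene of Region 1
    ("RCG66_RS25935", 30801, 33401, 2601),  # virB4
    ("RCG66_RS25970", 38437, 39432, 996),   # virB11, last gene of Region 1
    ("RCG66_RS25975", 39432, 39965, 534),   # phospholipase D, downstream
]


def span_length(start, end):
    """Length of an inclusive 1-based interval start..end."""
    return end - start + 1


def within(start, end, bounds):
    """True if start..end lies entirely inside the inclusive bounds."""
    return bounds[0] <= start and end <= bounds[1]


def summarize(rows, bounds):
    """Return (locus tag, size matches coordinates, inside region) per row."""
    return [
        (tag, span_length(start, end) == size, within(start, end, bounds))
        for tag, start, end, size in rows
    ]


if __name__ == "__main__":
    for tag, size_ok, inside in summarize(ROWS, REGION_1):
        print(tag, "size ok" if size_ok else "size mismatch",
              "inside Region 1" if inside else "outside Region 1")
```

Note that RCG66_RS25975 starts at the last base of Region 1 (39432) but extends beyond it, so a strict containment test places it outside the region.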


Host bacterium


ID   15641
GenBank   NZ_CP133225
Plasmid name   p2-KQ20786
Incompatibility group   IncN
Plasmid size   59761 bp
Coordinate of oriT [Strand]   3600..3700 [+]
Host bacterium   Klebsiella quasipneumoniae subsp. similipneumoniae strain KQ20786

Cargo genes


Drug resistance gene   dfrA14, qnrS1, blaNDM-1
Virulence gene   -
Metal resistance gene   -
Degradation gene   -
Symbiosis gene   -
Anti-CRISPR   -