Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 115206 |
Name | oriT_p2-KQ20786 |
Organism | Klebsiella quasipneumoniae subsp. similipneumoniae strain KQ20786 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_CP133225 (3600..3700 [+], 101 nt) |
oriT length | 101 nt |
IRs (inverted repeats) | 80..85, 91..96 (AAAAAA..TTTTTT) 20..26, 38..44 (TAAATCA..TGATTTA) |
Location of nic site | 62..63 |
Conserved sequence flanking the nic site |
GGTGTATAGC |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 101 nt
>oriT_p2-KQ20786
TATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
TATTTATTTTTTTATCTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 9719 | GenBank | WP_012561166 |
Name | mobF_RCG66_RS25725_p2-KQ20786 | UniProt ID | D7RU03 |
Length | 1078 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 1078 a.a. Molecular weight: 120141.30 Da Isoelectric Point: 6.5229
>WP_012561166.1 MULTISPECIES: MobF family relaxase [Gammaproteobacteria]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATMADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKNEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATMADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKNEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | D7RU03 |
Auxiliary protein
ID | 5710 | GenBank | WP_001749975 |
Name | WP_001749975_p2-KQ20786 | UniProt ID | A0A5C2CVS6 |
Length | 138 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Download Length: 138 a.a. Molecular weight: 15332.60 Da Isoelectric Point: 5.9670
>WP_001749975.1 MULTISPECIES: hypothetical protein [Gammaproteobacteria]
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCIEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCIEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
Protein domains
No domain identified.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A5C2CVS6 |
T4CP
ID | 11335 | GenBank | WP_021740568 |
Name | t4cp2_RCG66_RS25730_p2-KQ20786 | UniProt ID | _ |
Length | 509 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 509 a.a. Molecular weight: 57738.89 Da Isoelectric Point: 9.6550
>WP_021740568.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacteriaceae]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVILWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVILWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 29107..39432
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
RCG66_RS25875 | 24952..26385 | - | 1434 | WP_001288432 | DNA cytosine methyltransferase | - |
RCG66_RS25880 | 26484..26756 | + | 273 | Protein_31 | IS1 family transposase | - |
RCG66_RS25885 | 26767..26973 | - | 207 | WP_001749967 | hypothetical protein | - |
RCG66_RS25890 | 26978..27625 | - | 648 | WP_015344958 | restriction endonuclease | - |
RCG66_RS25895 | 27675..27908 | - | 234 | WP_001191790 | hypothetical protein | - |
RCG66_RS25900 | 28022..28336 | - | 315 | WP_001749965 | TrbM/KikA/MpfK family conjugal transfer protein | - |
RCG66_RS25905 | 28333..28677 | - | 345 | WP_001749964 | hypothetical protein | - |
RCG66_RS25910 | 28693..28998 | - | 306 | WP_000960954 | H-NS family nucleoid-associated regulatory protein | - |
RCG66_RS25915 | 29107..29841 | + | 735 | WP_001749963 | lytic transglycosylase domain-containing protein | virB1 |
RCG66_RS25920 | 29850..30131 | + | 282 | WP_000440698 | transcriptional repressor KorA | - |
RCG66_RS25925 | 30141..30434 | + | 294 | WP_001749962 | hypothetical protein | virB2 |
RCG66_RS25930 | 30484..30801 | + | 318 | WP_000496058 | VirB3 family type IV secretion system protein | virB3 |
RCG66_RS25935 | 30801..33401 | + | 2601 | WP_001749961 | VirB4 family type IV secretion/conjugal transfer ATPase | virb4 |
RCG66_RS25940 | 33419..34132 | + | 714 | WP_001749960 | type IV secretion system protein | virB5 |
RCG66_RS25945 | 34140..34367 | + | 228 | WP_001749959 | IncN-type entry exclusion lipoprotein EexN | - |
RCG66_RS25950 | 34383..35423 | + | 1041 | WP_001749958 | type IV secretion system protein | virB6 |
RCG66_RS25955 | 35642..36340 | + | 699 | WP_000646594 | virB8 family protein | virB8 |
RCG66_RS25960 | 36351..37235 | + | 885 | WP_000735066 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
RCG66_RS25965 | 37235..38395 | + | 1161 | WP_000101710 | type IV secretion system protein VirB10 | virB10 |
RCG66_RS25970 | 38437..39432 | + | 996 | WP_000128596 | ATPase, T2SS/T4P/T4SS family | virB11 |
RCG66_RS25975 | 39432..39965 | + | 534 | WP_000792636 | phospholipase D family protein | - |
RCG66_RS25980 | 40139..40444 | - | 306 | WP_011742557 | DUF6710 family protein | - |
RCG66_RS25985 | 40693..40872 | + | 180 | Protein_52 | helix-turn-helix domain-containing protein | - |
RCG66_RS25990 | 40937..41641 | + | 705 | WP_001067855 | IS6-like element IS26 family transposase | - |
RCG66_RS25995 | 41695..42326 | - | 632 | Protein_54 | transposase | - |
RCG66_RS26000 | 42721..43377 | + | 657 | WP_001516695 | quinolone resistance pentapeptide repeat protein QnrS1 | - |
RCG66_RS26005 | 43696..44310 | - | 615 | WP_014839983 | recombinase family protein | - |
Host bacterium
ID | 15641 | GenBank | NZ_CP133225 |
Plasmid name | p2-KQ20786 | Incompatibility group | IncN |
Plasmid size | 59761 bp | Coordinate of oriT [Strand] | 3600..3700 [+] |
Host baterium | Klebsiella quasipneumoniae subsp. similipneumoniae strain KQ20786 |
Cargo genes
Drug resistance gene | dfrA14, qnrS1, blaNDM-1 |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |