Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 101501 |
Name | oriT_p2165-5 |
Organism | Escherichia coli strain GN02165 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_JALJQH010000007 (16778..16878 [+], 101 nt) |
oriT length | 101 nt |
IRs (inverted repeats) | 80..85, 91..96 (AAAAAA..TTTTTT) 20..26, 38..44 (TAAATCA..TGATTTA) |
Location of nic site | 62..63 |
Conserved sequence flanking the nic site |
GGTGTATAGC |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 101 nt
>oriT_p2165-5
TATTTATTTTTTTATCTTTTAAATCAGTACGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
TATTTATTTTTTTATCTTTTAAATCAGTACGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 1340 | GenBank | WP_012561142 |
Name | mobF_MWG53_RS26220_p2165-5 | UniProt ID | A0A1S6KKJ9 |
Length | 1078 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 1078 a.a. Molecular weight: 120195.37 Da Isoelectric Point: 6.5230
>WP_012561142.1 MULTISPECIES: MobF family relaxase [Enterobacterales]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKADMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKADMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A1S6KKJ9 |
Auxiliary protein
ID | 478 | GenBank | WP_000706094 |
Name | WP_000706094_p2165-5 | UniProt ID | B6UZ32 |
Length | 96 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Download Length: 96 a.a. Molecular weight: 10829.54 Da Isoelectric Point: 8.5040
>WP_000706094.1 MULTISPECIES: hypothetical protein [Enterobacterales]
MKIVADRLSDVNRKLDYLFDRASDADFGPLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIA
EPGKTKAAKAEVERNGYKVWEPKKER
MKIVADRLSDVNRKLDYLFDRASDADFGPLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIA
EPGKTKAAKAEVERNGYKVWEPKKER
Protein domains
No domain identified.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | B6UZ32 |
T4CP
ID | 965 | GenBank | WP_000342688 |
Name | t4cp2_MWG53_RS26225_p2165-5 | UniProt ID | _ |
Length | 509 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 509 a.a. Molecular weight: 57762.87 Da Isoelectric Point: 9.6551
>WP_000342688.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 44..16211
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
MWG53_RS26145 (MWG53_26170) | 44..2644 | + | 2601 | WP_012561149 | VirB4 family type IV secretion/conjugal transfer ATPase | virb4 |
MWG53_RS26150 (MWG53_26175) | 2662..3375 | + | 714 | WP_001749960 | type IV secretion system protein | virB5 |
MWG53_RS26155 (MWG53_26180) | 3383..3610 | + | 228 | WP_001749959 | IncN-type entry exclusion lipoprotein EexN | - |
MWG53_RS26160 (MWG53_26185) | 3626..4666 | + | 1041 | WP_001749958 | type IV secretion system protein | virB6 |
MWG53_RS26165 (MWG53_26190) | 4749..4895 | + | 147 | WP_001257173 | conjugal transfer protein TraN | - |
MWG53_RS26170 (MWG53_26195) | 4885..5583 | + | 699 | WP_000646594 | type IV secretion system protein | virB8 |
MWG53_RS26175 (MWG53_26200) | 5594..6478 | + | 885 | WP_000735066 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
MWG53_RS26180 (MWG53_26205) | 6478..7638 | + | 1161 | WP_029602879 | type IV secretion system protein VirB10 | virB10 |
MWG53_RS26185 (MWG53_26210) | 7680..8675 | + | 996 | WP_012561144 | ATPase, T2SS/T4P/T4SS family | virB11 |
MWG53_RS26190 (MWG53_26215) | 8675..9208 | + | 534 | WP_000792636 | phospholipase D family protein | - |
MWG53_RS26195 (MWG53_26220) | 9381..9695 | - | 315 | WP_000091613 | hypothetical protein | - |
MWG53_RS26200 (MWG53_26225) | 9950..10306 | - | 357 | WP_000215515 | cupin domain-containing protein | - |
MWG53_RS26205 (MWG53_26230) | 10296..10697 | - | 402 | WP_001293886 | DUF86 domain-containing protein | - |
MWG53_RS26210 (MWG53_26235) | 10694..10984 | - | 291 | WP_001247892 | nucleotidyltransferase | - |
MWG53_RS26215 (MWG53_26240) | 11048..11446 | - | 399 | WP_012561143 | DUF6710 family protein | - |
MWG53_RS26220 (MWG53_26245) | 11446..14682 | - | 3237 | WP_012561142 | MobF family relaxase | - |
MWG53_RS26225 (MWG53_26250) | 14682..16211 | - | 1530 | WP_000342688 | type IV secretion system DNA-binding domain-containing protein | virb4 |
MWG53_RS26230 (MWG53_26255) | 16213..16503 | - | 291 | WP_000706094 | hypothetical protein | - |
MWG53_RS26235 (MWG53_26260) | 17197..17583 | + | 387 | WP_013279398 | plasmid stabilization protein StbA | - |
MWG53_RS26240 (MWG53_26265) | 17592..18308 | + | 717 | WP_012561139 | StbB family protein | - |
MWG53_RS26245 (MWG53_26270) | 18310..18678 | + | 369 | WP_012561138 | plasmid stabilization protein StbC | - |
MWG53_RS26250 (MWG53_26275) | 18879..19223 | + | 345 | WP_012561137 | hypothetical protein | - |
MWG53_RS26255 (MWG53_26280) | 19685..19864 | - | 180 | WP_012561134 | protein CcgAI | - |
MWG53_RS26260 (MWG53_26285) | 20748..20987 | - | 240 | WP_000932975 | hypothetical protein | - |
Host bacterium
ID | 1945 | GenBank | NZ_JALJQH010000007 |
Plasmid name | p2165-5 | Incompatibility group | IncN |
Plasmid size | 44207 bp | Coordinate of oriT [Strand] | 16778..16878 [+] |
Host baterium | Escherichia coli strain GN02165 |
Cargo genes
Drug resistance gene | - |
Virulence gene | - |
Metal resistance gene | nirD, ncrC, ncrB, ncrA |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |