Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 102652 |
Name | oriT_ECOR31|unnamed3 |
Organism | Escherichia coli strain ECOR31 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_MIIL01000005 (43710..43810 [+], 101 nt) |
oriT length | 101 nt |
IRs (inverted repeats) | 80..85, 91..96 (AAAAAA..TTTTTT) 20..26, 38..44 (TAAATCA..TGATTTA) |
Location of nic site | 62..63 |
Conserved sequence flanking the nic site |
GGTGTATAGC |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 101 nt
>oriT_ECOR31|unnamed3
TATTTATTTTTTTATTTTTTAAATCAGTGTGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
TATTTATTTTTTTATTTTTTAAATCAGTGTGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 1965 | GenBank | WP_000884352 |
Name | mobF_BHF03_RS26435_ECOR31|unnamed3 | UniProt ID | A8R732 |
Length | 1078 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 1078 a.a. Molecular weight: 120163.41 Da Isoelectric Point: 6.5766
>WP_000884352.1 MULTISPECIES: MobF family relaxase [Enterobacterales]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
ISGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
ISGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A8R732 |
Auxiliary protein
ID | 789 | GenBank | WP_000706094 |
Name | WP_000706094_ECOR31|unnamed3 | UniProt ID | B6UZ32 |
Length | 96 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Download Length: 96 a.a. Molecular weight: 10829.54 Da Isoelectric Point: 8.5040
>WP_000706094.1 MULTISPECIES: hypothetical protein [Enterobacterales]
MKIVADRLSDVNRKLDYLFDRASDADFGPLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIA
EPGKTKAAKAEVERNGYKVWEPKKER
MKIVADRLSDVNRKLDYLFDRASDADFGPLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIA
EPGKTKAAKAEVERNGYKVWEPKKER
Protein domains
No domain identified.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | B6UZ32 |
T4CP
ID | 1601 | GenBank | WP_000342688 |
Name | t4cp2_BHF03_RS26440_ECOR31|unnamed3 | UniProt ID | _ |
Length | 509 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 509 a.a. Molecular weight: 57762.87 Da Isoelectric Point: 9.6551
>WP_000342688.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 21765..43143
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
BHF03_RS26310 (BHF03_26375) | 17308..18437 | + | 1130 | WP_085959879 | IS3-like element IS1133 family transposase | - |
BHF03_RS26315 (BHF03_26380) | 18569..19372 | + | 804 | WP_001082319 | aminoglycoside O-phosphotransferase APH(3'')-Ib | - |
BHF03_RS26320 (BHF03_26385) | 19372..20208 | + | 837 | WP_000480968 | aminoglycoside O-phosphotransferase APH(6)-Id | - |
BHF03_RS26325 (BHF03_26390) | 20334..20645 | - | 312 | WP_001452736 | hypothetical protein | - |
BHF03_RS26330 (BHF03_26395) | 20681..20995 | - | 315 | WP_000734776 | TrbM/KikA/MpfK family conjugal transfer protein | - |
BHF03_RS26335 (BHF03_26400) | 20958..21335 | - | 378 | WP_000504251 | hypothetical protein | - |
BHF03_RS26340 (BHF03_26405) | 21351..21722 | - | 372 | WP_011867773 | H-NS family nucleoid-associated regulatory protein | - |
BHF03_RS26345 (BHF03_26410) | 21765..22499 | + | 735 | WP_000033783 | lytic transglycosylase domain-containing protein | virB1 |
BHF03_RS26350 (BHF03_26415) | 22508..22789 | + | 282 | WP_000440698 | transcriptional repressor KorA | - |
BHF03_RS26355 (BHF03_26420) | 22799..23092 | + | 294 | WP_000209137 | TrbC/VirB2 family protein | virB2 |
BHF03_RS26360 (BHF03_26425) | 23142..23459 | + | 318 | WP_000496058 | VirB3 family type IV secretion system protein | virB3 |
BHF03_RS26365 (BHF03_26430) | 23459..26059 | + | 2601 | WP_001200711 | VirB4 family type IV secretion/conjugal transfer ATPase | virb4 |
BHF03_RS26370 (BHF03_26435) | 26077..26790 | + | 714 | WP_000749362 | type IV secretion system protein | virB5 |
BHF03_RS26375 (BHF03_26440) | 26798..27025 | + | 228 | WP_000734973 | IncN-type entry exclusion lipoprotein EexN | - |
BHF03_RS26380 (BHF03_26445) | 27041..28081 | + | 1041 | WP_000886022 | type IV secretion system protein | virB6 |
BHF03_RS26385 (BHF03_26450) | 28164..28310 | + | 147 | WP_001257173 | conjugal transfer protein TraN | - |
BHF03_RS26390 (BHF03_26455) | 28300..28998 | + | 699 | WP_000646594 | type IV secretion system protein | virB8 |
BHF03_RS26395 (BHF03_26460) | 29009..29893 | + | 885 | WP_000735067 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
BHF03_RS26400 (BHF03_26465) | 29893..31053 | + | 1161 | WP_000101711 | type IV secretion system protein VirB10 | virB10 |
BHF03_RS26405 (BHF03_26470) | 31095..32090 | + | 996 | WP_000128596 | ATPase, T2SS/T4P/T4SS family | virB11 |
BHF03_RS26410 (BHF03_26475) | 32090..32623 | + | 534 | WP_000731968 | phospholipase D family protein | - |
BHF03_RS29995 (BHF03_26480) | 32797..33387 | - | 591 | WP_077890220 | DUF6710 family protein | - |
BHF03_RS26420 (BHF03_26485) | 33519..34379 | - | 861 | WP_000027057 | broad-spectrum class A beta-lactamase TEM-1 | - |
BHF03_RS26425 (BHF03_26490) | 34562..35119 | - | 558 | WP_001235713 | recombinase family protein | - |
BHF03_RS26430 (BHF03_26495) | 35283..38288 | + | 3006 | WP_001143760 | Tn3-like element Tn3 family transposase | - |
BHF03_RS26435 (BHF03_26500) | 38378..41614 | - | 3237 | WP_000884352 | MobF family relaxase | - |
BHF03_RS26440 (BHF03_26505) | 41614..43143 | - | 1530 | WP_000342688 | type IV secretion system DNA-binding domain-containing protein | virb4 |
BHF03_RS26445 (BHF03_26510) | 43145..43435 | - | 291 | WP_000706094 | hypothetical protein | - |
BHF03_RS30000 (BHF03_26515) | 44074..44493 | + | 420 | WP_000802909 | plasmid stabilization protein StbA | - |
BHF03_RS26455 (BHF03_26520) | 44502..45218 | + | 717 | WP_000861577 | StbB family protein | - |
BHF03_RS26460 (BHF03_26525) | 45220..45588 | + | 369 | WP_000414913 | plasmid stabilization protein StbC | - |
BHF03_RS26465 (BHF03_26530) | 45770..46114 | + | 345 | WP_000999874 | hypothetical protein | - |
BHF03_RS26470 (BHF03_26535) | 46225..46545 | - | 321 | WP_000211833 | protein CcgAII | - |
BHF03_RS26475 (BHF03_26540) | 46600..46779 | - | 180 | WP_000214483 | protein CcgAI | - |
BHF03_RS26480 (BHF03_26545) | 47611..48120 | - | 510 | WP_000129958 | antirestriction protein ArdA | - |
Host bacterium
ID | 3096 | GenBank | NZ_MIIL01000005 |
Plasmid name | ECOR31|unnamed3 | Incompatibility group | IncN |
Plasmid size | 63179 bp | Coordinate of oriT [Strand] | 43710..43810 [+] |
Host baterium | Escherichia coli strain ECOR31 |
Cargo genes
Drug resistance gene | aph(3'')-Ib, aph(6)-Id, blaTEM-1B |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |