Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 103274 |
Name | oriT_pEc20/2xEcTOP |
Organism | Escherichia coli strain TcEc20/2xEcTOP |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_MH514861 (41226..41325 [+], 100 nt) |
oriT length | 100 nt |
IRs (inverted repeats) | 77..82, 90..95 (AAAAAA..TTTTTT) 78..83, 90..95 (AAAAAA..TTTTTT) 79..84, 90..95 (AAAAAA..TTTTTT) 20..26, 38..44 (TAAATCA..TGATTTA) |
Location of nic site | 62..63 |
Conserved sequence flanking the nic site |
GGTGTATAGC |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 100 nt
>oriT_pEc20/2xEcTOP
TATTTATTTTTTTATCTTTTAAATCAGTACGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGAAAAAAAATCATCTTTTTTGGTAG
TATTTATTTTTTTATCTTTTAAATCAGTACGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGAAAAAAAATCATCTTTTTTGGTAG
Visualization of oriT structure
Relaxase
ID | 2445 | GenBank | WP_012561142 |
Name | TrwC_HTS63_RS00240_pEc20/2xEcTOP | UniProt ID | A0A1S6KKJ9 |
Length | 1078 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 1078 a.a. Molecular weight: 120195.37 Da Isoelectric Point: 6.5230
>WP_012561142.1 MULTISPECIES: MobF family relaxase [Enterobacterales]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKADMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKADMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
SSGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEHEEGGHEI
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A1S6KKJ9 |
Auxiliary protein
ID | 1044 | GenBank | WP_001749975 |
Name | WP_001749975_pEc20/2xEcTOP | UniProt ID | A0A5C2CVS6 |
Length | 138 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Download Length: 138 a.a. Molecular weight: 15332.60 Da Isoelectric Point: 5.9670
>WP_001749975.1 MULTISPECIES: hypothetical protein [Gammaproteobacteria]
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCIEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCIEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
Protein domains
No domain identified.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A5C2CVS6 |
T4CP
ID | 2217 | GenBank | WP_000342688 |
Name | t4cp2_HTS63_RS00245_pEc20/2xEcTOP | UniProt ID | _ |
Length | 509 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 509 a.a. Molecular weight: 57762.87 Da Isoelectric Point: 9.6551
>WP_000342688.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 12783..23112
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
HTS63_RS00065 | 9094..9366 | + | 273 | Protein_13 | IS1 family transposase | - |
HTS63_RS00070 | 9377..9583 | - | 207 | WP_001749967 | hypothetical protein | - |
HTS63_RS00075 | 9588..9941 | - | 354 | WP_225622145 | restriction endonuclease | - |
HTS63_RS00360 | 9876..10007 | + | 132 | WP_255255389 | hypothetical protein | - |
HTS63_RS00080 | 10025..10993 | + | 969 | WP_016151349 | IS5 family transposase | - |
HTS63_RS00355 | 11035..11295 | - | 261 | WP_016359294 | hypothetical protein | - |
HTS63_RS00090 | 11351..11662 | - | 312 | WP_013279382 | hypothetical protein | - |
HTS63_RS00095 | 11698..12012 | - | 315 | WP_016338363 | TrbM/KikA/MpfK family conjugal transfer protein | - |
HTS63_RS00100 | 12009..12353 | - | 345 | WP_012561155 | hypothetical protein | - |
HTS63_RS00105 | 12369..12746 | - | 378 | WP_193567606 | H-NS family nucleoid-associated regulatory protein | - |
HTS63_RS00110 | 12783..13520 | + | 738 | WP_013279384 | lytic transglycosylase domain-containing protein | virB1 |
HTS63_RS00115 | 13529..13810 | + | 282 | WP_016338364 | transcriptional repressor KorA | - |
HTS63_RS00120 | 13820..14113 | + | 294 | WP_001749962 | hypothetical protein | virB2 |
HTS63_RS00125 | 14164..14481 | + | 318 | WP_000496058 | VirB3 family type IV secretion system protein | virB3 |
HTS63_RS00130 | 14481..17081 | + | 2601 | WP_012561149 | VirB4 family type IV secretion/conjugal transfer ATPase | virb4 |
HTS63_RS00135 | 17099..17812 | + | 714 | WP_001749960 | type IV secretion system protein | virB5 |
HTS63_RS00140 | 17820..18047 | + | 228 | WP_001749959 | IncN-type entry exclusion lipoprotein EexN | - |
HTS63_RS00145 | 18063..19103 | + | 1041 | WP_001749958 | type IV secretion system protein | virB6 |
HTS63_RS00150 | 19174..19332 | + | 159 | WP_012561180 | hypothetical protein | - |
HTS63_RS00155 | 19322..20020 | + | 699 | WP_000646594 | type IV secretion system protein | virB8 |
HTS63_RS00160 | 20031..20915 | + | 885 | WP_000735066 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
HTS63_RS00165 | 20915..22075 | + | 1161 | WP_000101710 | type IV secretion system protein VirB10 | virB10 |
HTS63_RS00170 | 22117..23112 | + | 996 | WP_012561144 | ATPase, T2SS/T4P/T4SS family | virB11 |
HTS63_RS00175 | 23112..23216 | + | 105 | Protein_36 | phospholipase D family protein | - |
HTS63_RS00180 | 23568..24887 | + | 1320 | WP_004152397 | IS1182-like element ISKpn6 family transposase | - |
HTS63_RS00185 | 25137..26018 | - | 882 | WP_004199234 | carbapenem-hydrolyzing class A beta-lactamase KPC-2 | - |
HTS63_RS00190 | 26405..27184 | - | 780 | WP_004152394 | IS21-like element ISKpn7 family helper ATPase IstB | - |
Host bacterium
ID | 3717 | GenBank | NZ_MH514861 |
Plasmid name | pEc20/2xEcTOP | Incompatibility group | Col440I |
Plasmid size | 60204 bp | Coordinate of oriT [Strand] | 41226..41325 [+] |
Host baterium | Escherichia coli strain TcEc20/2xEcTOP |
Cargo genes
Drug resistance gene | blaKPC-2 |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |