Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 101392 |
Name | oriT_pCFS1932-2 |
Organism | Yersinia enterocolitica subsp. palearctica strain CFS1932 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_SJZK02000012 (45398..45498 [+], 101 nt) |
oriT length | 101 nt |
IRs (inverted repeats) | 80..85, 91..96 (AAAAAA..TTTTTT) 20..26, 38..44 (TAAATCA..TGATTTA) |
Location of nic site | 62..63 |
Conserved sequence flanking the nic site |
GGTGTATAGC |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 101 nt
>oriT_pCFS1932-2
TATTTATTTTTTTATTTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
TATTTATTTTTTTATTTTTTAAATCAGTATGATAGCGTGATTTATCGCGCTGCGTTAGGTGTATAGCAGGTTAAGGGATAAAAAATCATCTTTTTTGGTAG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 1275 | GenBank | WP_000884351 |
Name | mobF_E0Z05_RS21340_pCFS1932-2 | UniProt ID | D7RVP6 |
Length | 1076 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 1076 a.a. Molecular weight: 119897.16 Da Isoelectric Point: 6.6071
>WP_000884351.1 MULTISPECIES: MobF family relaxase [Enterobacterales]
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
ISGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEEGGHEI
MLDITTITRQNVTSVVGYYSDAKDDYYSKDSSFTSWQGTGAEALGLSGDVESARFKELLVGEIDTFTHMQ
RHVGDAKKERLGYDLTFSAPKGVSMQALIHGDKTIIEAHEKAVAAAVREAEKLAQARTTRQGKSVTQNTN
NLVVATFRHETSRALDPDLHTHAFVMNMTQREDGQWRALKNDELMRNKMHLGDVYKQELALELTKAGYEL
RYNSKNNTFDMAHFSDEQIRAFSRRSEQIEKGLAAMGLTRETADAQTKSRVSMATREKKTEHSREEIHQE
WASRAKTLGIDFDNREWQGHGKPLEADIARNMAPDFTSPEVKADRAIQFAVKSLSERDASFERQKLIQIA
NKQVLGHATIADVEKAYLKAVQKGAIIEGEARYQSTLKVGASVMAETLTRKEWIDSLTNSGMRADKARFA
VDDGIKNGRLKKTSHRVTTVEGIRLERSILTIESRGRGQMPRQLTAEIAGQLLAGKTLKKEQMRAVTEIV
TSKDRFVAAHGYAGTGKSYMTMAAKELLESQGLKVTALAPYGTQKKALEDDGLPARTVAAFLKAKDKKLD
EKSVVFIDEAGVIPARQMKQLMEVIEKHNARAVFLGDTSQTKAVEAGKPFEQLIKAGMQTSYMKDIQRQK
NEVLLEAVKYAAEGNAARALKNITGVNELKEEAPRLAQLADRYLSLSSEQQDATLIISGTNASRKTLNDY
IRGNLGLAGTGETFTLLDRVDSTQAERRDSRYFSKGQIIIPEQDYKNGMKRGESYQVLDTGPGNKLTVES
ISGEQIAFSPRTHTKLSVYQAVSAELAPGDKVMVTRNDKTLDVANGDRFTVKTVEGEKLTLEDKKGRTVE
LDKKQASYLSYAYATTVHKSQGLTCDRVLFNIDTKSLTTSKDVFYVGISRARHEVEIFTDDKKSLASSVS
RDSPKTTAAEIDRFFGLEARFKDIGRDTSLETRSAEKGLPEATGESMAFNQKPDEHNMTTGTDYQPVSNA
EDAFHLKQNPMDDSVGLRRHEAQQNDAELAHDYAAADDQQWSAQEYADYEHYAEASDYDFDSSIYDDYAM
PQTSQAEQSHTGKEHTHEHEEGGHEI
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | D7RVP6 |
Auxiliary protein
ID | 454 | GenBank | WP_001532083 |
Name | WP_001532083_pCFS1932-2 | UniProt ID | A8R734 |
Length | 138 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Download Length: 138 a.a. Molecular weight: 15332.60 Da Isoelectric Point: 5.9670
>WP_001532083.1 MULTISPECIES: hypothetical protein [Enterobacterales]
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCLEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
MPIITAKVSDELLAYIDLVSGGNRSDYLRRCLEAGPGDRESGLKIVADRLSDVNRKLDYLFDRASDADFG
PLRDELKAITETLSGVKFPPAGQMMLHESLAIETLILLRSIAEPGKTKAAKAEVERNGYKVWEPKKER
Protein domains
No domain identified.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A8R734 |
T4CP
ID | 888 | GenBank | WP_000342688 |
Name | t4cp2_E0Z05_RS21345_pCFS1932-2 | UniProt ID | _ |
Length | 509 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 509 a.a. Molecular weight: 57762.87 Da Isoelectric Point: 9.6551
>WP_000342688.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
MDDRERGLAFLFAITLPPVMVWFLVAKFTYGIDPSTAKYLIPYLVKNTFSLWPLWSALIAGWFIGVGGLI
AFIIYDKSRVFKGERFKKIYRGTELVRARTLADKTRERGVNQLTVANIPIPTYAENLHFSIAGTTGTGKT
TIFNELLFKSIIRGGKNIALDPNGGFLKNFYRPGDVILNAYDKRTEGWVFFNEIRRSYDYERLVNSIVQE
SPDMATEEWFGYGRLIFSEVSKKLHSLYSTVTMEEVIHWACNVDQKKLKEFLMGTPAEAIFSGSEKAVGS
ARFVLSKNLAPHLKMPEGNFSLRDWLDDGKPGTLFITWQEEMKRSLNPLISCWLDSIFSIVLGMGEKESR
INVFIDELESLQFLPNLNDALTKGRKSGLCVYAGYQTYSQLVKVYGRDMAQTILANMRSNIVLGGSRLGD
ETLDQMSRSLGEIEGEVERKESDPQKPWIVRKRRDVKVVRAVTPTEISMLPNLTGYLALPGDMPVAKFKA
KHVKYHRKNPVPGIELREI
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 23614..44831
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
E0Z05_RS21205 (E0Z05_021200) | 19743..20627 | - | 885 | WP_000058717 | EamA family transporter | - |
E0Z05_RS21210 (E0Z05_021205) | 20765..21157 | - | 393 | WP_001351729 | isochorismatase family cysteine hydrolase | - |
E0Z05_RS21215 (E0Z05_021210) | 21161..22912 | + | 1752 | Protein_23 | Tn3-like element TnAs1 family transposase | - |
E0Z05_RS21220 (E0Z05_021215) | 23081..23614 | - | 534 | WP_000731968 | phospholipase D family protein | - |
E0Z05_RS21225 (E0Z05_021220) | 23614..24609 | - | 996 | WP_000128596 | ATPase, T2SS/T4P/T4SS family | virB11 |
E0Z05_RS21230 (E0Z05_021225) | 24651..25811 | - | 1161 | WP_000101712 | type IV secretion system protein VirB10 | virB10 |
E0Z05_RS21235 (E0Z05_021230) | 25811..26695 | - | 885 | WP_000735067 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
E0Z05_RS21240 (E0Z05_021235) | 26706..27404 | - | 699 | WP_000646594 | type IV secretion system protein | virB8 |
E0Z05_RS21795 | 27394..27552 | - | 159 | WP_012561180 | hypothetical protein | - |
E0Z05_RS21245 (E0Z05_021240) | 27623..28663 | - | 1041 | WP_000886022 | type IV secretion system protein | virB6 |
E0Z05_RS21250 (E0Z05_021245) | 28679..28906 | - | 228 | WP_000734973 | IncN-type entry exclusion lipoprotein EexN | - |
E0Z05_RS21255 (E0Z05_021250) | 28914..29627 | - | 714 | WP_000749362 | type IV secretion system protein | virB5 |
E0Z05_RS21260 (E0Z05_021255) | 29645..32245 | - | 2601 | WP_001200711 | VirB4 family type IV secretion/conjugal transfer ATPase | virb4 |
E0Z05_RS21265 (E0Z05_021260) | 32245..32562 | - | 318 | WP_000496058 | VirB3 family type IV secretion system protein | virB3 |
E0Z05_RS21270 (E0Z05_021265) | 32612..32905 | - | 294 | WP_000209137 | TrbC/VirB2 family protein | virB2 |
E0Z05_RS21275 (E0Z05_021270) | 32915..33196 | - | 282 | WP_000440698 | transcriptional repressor KorA | - |
E0Z05_RS21280 (E0Z05_021275) | 33205..33939 | - | 735 | WP_000033783 | lytic transglycosylase domain-containing protein | virB1 |
E0Z05_RS21285 (E0Z05_021280) | 33982..34353 | + | 372 | WP_011867773 | H-NS family nucleoid-associated regulatory protein | - |
E0Z05_RS21290 (E0Z05_021285) | 34369..34746 | + | 378 | WP_000504250 | hypothetical protein | - |
E0Z05_RS21295 (E0Z05_021290) | 34709..35023 | + | 315 | WP_001749965 | TrbM/KikA/MpfK family conjugal transfer protein | - |
E0Z05_RS21300 (E0Z05_021295) | 35059..35370 | + | 312 | WP_001452736 | hypothetical protein | - |
E0Z05_RS21305 (E0Z05_021300) | 35420..36067 | + | 648 | WP_001532074 | restriction endonuclease | - |
E0Z05_RS21310 (E0Z05_021305) | 36072..36269 | + | 198 | WP_013188474 | hypothetical protein | - |
E0Z05_RS21315 (E0Z05_021310) | 36289..36560 | - | 272 | Protein_44 | IS1 family transposase | - |
E0Z05_RS21320 (E0Z05_021315) | 36654..38087 | + | 1434 | WP_001288433 | DNA cytosine methyltransferase | - |
E0Z05_RS21325 (E0Z05_021320) | 38121..38987 | - | 867 | Protein_46 | type II site-specific deoxyribonuclease | - |
E0Z05_RS21330 (E0Z05_021325) | 39042..39746 | - | 705 | WP_001067855 | IS6-like element IS26 family transposase | - |
E0Z05_RS21335 (E0Z05_021330) | 39758..40072 | - | 315 | WP_167710388 | FipA | - |
E0Z05_RS21340 (E0Z05_021335) | 40072..43302 | - | 3231 | WP_000884351 | MobF family relaxase | - |
E0Z05_RS21345 (E0Z05_021340) | 43302..44831 | - | 1530 | WP_000342688 | type IV secretion system DNA-binding domain-containing protein | virb4 |
E0Z05_RS21350 (E0Z05_021345) | 44833..45249 | - | 417 | WP_001532083 | hypothetical protein | - |
E0Z05_RS21720 | 45751..46170 | + | 420 | WP_000802909 | plasmid stabilization protein StbA | - |
E0Z05_RS21360 (E0Z05_021355) | 46179..46895 | + | 717 | WP_000861577 | StbB family protein | - |
E0Z05_RS21365 (E0Z05_021360) | 46897..47265 | + | 369 | WP_000414913 | plasmid stabilization protein StbC | - |
E0Z05_RS21370 (E0Z05_021365) | 47447..47791 | + | 345 | WP_000999874 | hypothetical protein | - |
E0Z05_RS21375 (E0Z05_021370) | 47902..48222 | - | 321 | WP_000211832 | protein CcgAII | - |
E0Z05_RS21380 (E0Z05_021375) | 48277..48456 | - | 180 | WP_000214483 | protein CcgAI | - |
E0Z05_RS21385 (E0Z05_021380) | 49288..49797 | - | 510 | WP_000129958 | antirestriction protein ArdA | - |
Host bacterium
ID | 1836 | GenBank | NZ_SJZK02000012 |
Plasmid name | pCFS1932-2 | Incompatibility group | IncN |
Plasmid size | 56991 bp | Coordinate of oriT [Strand] | 45398..45498 [+] |
Host baterium | Yersinia enterocolitica subsp. palearctica strain CFS1932 |
Cargo genes
Drug resistance gene | sul2, aadA5, tet(A) |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |