Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 104564 |
Name | oriT_SWHEFF_51|unnamed2 |
Organism | Shigella dysenteriae strain SWHEFF_51 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_CP055054 (31775..31875 [+], 101 nt) |
oriT length | 101 nt |
IRs (inverted repeats) | 32..38, 43..49 (TTACGAT..ATCGTAA) 21..26, 38..43 (TTTTCA..TGAAAA) |
Location of nic site | 62..63 |
Conserved sequence flanking the nic site |
GGTGTATAGC |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 101 nt
>oriT_SWHEFF_51|unnamed2
AATTAGAATAATTTGTTTTGTTTTCAAGCATTTACGATGAAAATCGTAATTGCGTATGGTGTATAGCCGTTAAGGGATACCATACCACGCCTTTTTTAAGG
AATTAGAATAATTTGTTTTGTTTTCAAGCATTTACGATGAAAATCGTAATTGCGTATGGTGTATAGCCGTTAAGGGATACCATACCACGCCTTTTTTAAGG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 3290 | GenBank | WP_063448733 |
Name | mobF_HUZ69_RS25065_SWHEFF_51|unnamed2 | UniProt ID | _ |
Length | 969 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 969 a.a. Molecular weight: 108453.52 Da Isoelectric Point: 9.7610
>WP_063448733.1 MULTISPECIES: MobF family relaxase [Enterobacteriaceae]
MLNITTITRQRVDKVVDYYADGADDYYAKDGNAMQWQGAGAEELGLTGEVEQKRFAKLLDGKVTDDISVM
RKTANSESKERLGYDLTFSAPKGVTLQALVNGDKRIIEAHDRAVTKAIEEAERLALGRFTVNKKTHVENT
NKLIVGKFRHETSRELDPDLHTHAFVMNLTKTSDGKWRSLTNDGIVNSLTLLGNVYKSELARELELAGYN
LRYDRNGTFDLAHFSQEQIAQFSQRSQQIEAELAKNGLSRETASHEQKNAAALKTRKSKGEVDRDVLYEG
WKNRAKELQIDFDSREWKGAANDDIARNTQPAALEKPIEYQADKVMTFAIRSLTERQSVITERELMDVAM
KHGYGRLTLDDIRAARERAITSGNLIKEEATYAAATANKKQQNVAMTREQWVTELVKAGRNKDEARKLVD
KGITNGRLVQQKPRFTTQLAQQRERNILKMEREGRGKIQTPYTREFSEGWLASRTLKPEQLKAVMGIIHT
PNQFISVHGFAGTGKSYMTKSAADFLKEQGVHVTSLAPYGSQVKALQAEGLESRTLQSFLRASDKKIGPG
SVVFIDEAGVIPARQMEETMRVIRDAGARAVFLGDTKQTKAVEAGKPFEQLIKAGMETAYMKDIQRQKDP
ELLKAALNAAEGKIKASLTHVTSIVEEKNHDQRYRRIVGDYVAMPPTDRANALIITGTNDSRKKINEYIQ
AELGLKGQGINYPLLNRLDTTQAQRQHSKYYEKGAVIIPERDYSNGLKRGAVYTVLDTGPGNKLTVSDGS
GDTIAFSPARFSKLSVYSVEKTELAVGDQVRITRNDAHLDLANGDRFKVKGVQSGEVLLENEKGRLVKID
ANKPMYLGLAYASTVHSAQGLTCDKVFINMDTRSLTTAKDVYYVAVTRAKHEAVIYTDDEKKLDKAASRE
GFKTAALELEQLKRYAKEQQHDAREKGRDKTEPTQEKDNAKRKGKGHDKNNDERAFTSL
MLNITTITRQRVDKVVDYYADGADDYYAKDGNAMQWQGAGAEELGLTGEVEQKRFAKLLDGKVTDDISVM
RKTANSESKERLGYDLTFSAPKGVTLQALVNGDKRIIEAHDRAVTKAIEEAERLALGRFTVNKKTHVENT
NKLIVGKFRHETSRELDPDLHTHAFVMNLTKTSDGKWRSLTNDGIVNSLTLLGNVYKSELARELELAGYN
LRYDRNGTFDLAHFSQEQIAQFSQRSQQIEAELAKNGLSRETASHEQKNAAALKTRKSKGEVDRDVLYEG
WKNRAKELQIDFDSREWKGAANDDIARNTQPAALEKPIEYQADKVMTFAIRSLTERQSVITERELMDVAM
KHGYGRLTLDDIRAARERAITSGNLIKEEATYAAATANKKQQNVAMTREQWVTELVKAGRNKDEARKLVD
KGITNGRLVQQKPRFTTQLAQQRERNILKMEREGRGKIQTPYTREFSEGWLASRTLKPEQLKAVMGIIHT
PNQFISVHGFAGTGKSYMTKSAADFLKEQGVHVTSLAPYGSQVKALQAEGLESRTLQSFLRASDKKIGPG
SVVFIDEAGVIPARQMEETMRVIRDAGARAVFLGDTKQTKAVEAGKPFEQLIKAGMETAYMKDIQRQKDP
ELLKAALNAAEGKIKASLTHVTSIVEEKNHDQRYRRIVGDYVAMPPTDRANALIITGTNDSRKKINEYIQ
AELGLKGQGINYPLLNRLDTTQAQRQHSKYYEKGAVIIPERDYSNGLKRGAVYTVLDTGPGNKLTVSDGS
GDTIAFSPARFSKLSVYSVEKTELAVGDQVRITRNDAHLDLANGDRFKVKGVQSGEVLLENEKGRLVKID
ANKPMYLGLAYASTVHSAQGLTCDKVFINMDTRSLTTAKDVYYVAVTRAKHEAVIYTDDEKKLDKAASRE
GFKTAALELEQLKRYAKEQQHDAREKGRDKTEPTQEKDNAKRKGKGHDKNNDERAFTSL
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4CP
ID | 3229 | GenBank | WP_001001035 |
Name | t4cp2_HUZ69_RS25070_SWHEFF_51|unnamed2 | UniProt ID | _ |
Length | 525 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 525 a.a. Molecular weight: 59019.93 Da Isoelectric Point: 9.5507
>WP_001001035.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacterales]
MNEADKGKICLGALLFTPLVTWFIAAKTLYGFTPKDARERLVLLIQQTFDLWPLWGALLVGEIIAIIGLI
IFFKFNHQIFKGAPFKKIYRGTRLVSQRALASLTKEREPQITIADIPIPTKAEGTHISIAGATGVGKSTI
FAEMMKGCLMRGKMQRVASKKDRMIILDPDGDFLSRFYKKGDKILNPYDARTEGWVFFNEIKSDFDFERF
GVSIVPTAKDSQTEEWNGYGRLLFTEVSRKVFNTSRNPTMEEVFYWTNEVPLEELEEYVRGTKAQALFAG
SGRAVGSARFVLSNKLAAHLKMPAGNFSIREWLEDPKGGNLYITWTDEMREALKSLISCWTDSIFSIVLG
MSKSNTRKIWTYIDELESLDYLPSLRDALTKGRKKGLRVVTGYQSYSQVISIYGSDVAETLLSNHRTSVV
MAAGRLGERTLEFVSKSLGEIEGEREKTGISRRFGQMGTKNINDDHKRERAVTPTEIATLDDLTGYIAFP
GNLPVAKFETHHVNYTRSNPVPGVVLRESTAFAGM
MNEADKGKICLGALLFTPLVTWFIAAKTLYGFTPKDARERLVLLIQQTFDLWPLWGALLVGEIIAIIGLI
IFFKFNHQIFKGAPFKKIYRGTRLVSQRALASLTKEREPQITIADIPIPTKAEGTHISIAGATGVGKSTI
FAEMMKGCLMRGKMQRVASKKDRMIILDPDGDFLSRFYKKGDKILNPYDARTEGWVFFNEIKSDFDFERF
GVSIVPTAKDSQTEEWNGYGRLLFTEVSRKVFNTSRNPTMEEVFYWTNEVPLEELEEYVRGTKAQALFAG
SGRAVGSARFVLSNKLAAHLKMPAGNFSIREWLEDPKGGNLYITWTDEMREALKSLISCWTDSIFSIVLG
MSKSNTRKIWTYIDELESLDYLPSLRDALTKGRKKGLRVVTGYQSYSQVISIYGSDVAETLLSNHRTSVV
MAAGRLGERTLEFVSKSLGEIEGEREKTGISRRFGQMGTKNINDDHKRERAVTPTEIATLDDLTGYIAFP
GNLPVAKFETHHVNYTRSNPVPGVVLRESTAFAGM
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 13544..31333
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
HUZ69_RS24925 (HUZ69_24670) | 9288..9848 | - | 561 | WP_275094608 | recombinase family protein | - |
HUZ69_RS24930 (HUZ69_24675) | 9888..10592 | - | 705 | WP_001067855 | IS6-like element IS26 family transposase | - |
HUZ69_RS24935 (HUZ69_24680) | 11140..11385 | + | 246 | WP_021536799 | AbrB/MazE/SpoVT family DNA-binding domain-containing protein | - |
HUZ69_RS24940 (HUZ69_24685) | 11397..11654 | + | 258 | WP_016240387 | hypothetical protein | - |
HUZ69_RS24945 (HUZ69_24690) | 11644..11970 | + | 327 | WP_016240388 | endoribonuclease MazF | - |
HUZ69_RS24950 (HUZ69_24695) | 11979..12416 | + | 438 | WP_057517876 | hypothetical protein | - |
HUZ69_RS24955 (HUZ69_24700) | 12441..12677 | - | 237 | WP_042041953 | Hha/YmoA family nucleoid-associated regulatory protein | - |
HUZ69_RS24960 (HUZ69_24710) | 12843..13568 | - | 726 | WP_224132007 | restriction endonuclease | - |
HUZ69_RS24965 (HUZ69_24715) | 13544..13858 | - | 315 | WP_023223516 | hypothetical protein | tfc15 |
HUZ69_RS24970 (HUZ69_24720) | 13861..14178 | - | 318 | WP_023223517 | TrbM/KikA/MpfK family conjugal transfer protein | - |
HUZ69_RS24975 (HUZ69_24725) | 14184..14621 | - | 438 | WP_057517875 | hypothetical protein | - |
HUZ69_RS24980 (HUZ69_24730) | 14633..14869 | - | 237 | WP_000089855 | H-NS histone family protein | - |
HUZ69_RS24985 (HUZ69_24735) | 14932..15672 | + | 741 | WP_000815895 | lytic transglycosylase domain-containing protein | virB1 |
HUZ69_RS24990 (HUZ69_24740) | 15695..16012 | + | 318 | WP_001267114 | hypothetical protein | - |
HUZ69_RS24995 (HUZ69_24745) | 15996..16286 | + | 291 | WP_057517874 | TrbC/VirB2 family protein | virB2 |
HUZ69_RS25000 (HUZ69_24750) | 16336..16650 | + | 315 | WP_001053888 | VirB3 family type IV secretion system protein | virB3 |
HUZ69_RS25005 (HUZ69_24755) | 17007..19280 | + | 2274 | WP_231225962 | VirB4 family type IV secretion/conjugal transfer ATPase | virb4 |
HUZ69_RS25010 (HUZ69_24760) | 19290..19997 | + | 708 | WP_000751606 | type IV secretion system protein | virB5 |
HUZ69_RS25015 (HUZ69_24765) | 20001..20237 | + | 237 | WP_000733633 | EexN family lipoprotein | - |
HUZ69_RS25020 (HUZ69_24770) | 20316..21299 | + | 984 | WP_173671886 | type IV secretion system protein | virB6 |
HUZ69_RS25025 (HUZ69_24775) | 21394..21564 | + | 171 | WP_021536809 | hypothetical protein | - |
HUZ69_RS25030 (HUZ69_24780) | 21567..22283 | + | 717 | WP_000914645 | type IV secretion system protein | virB8 |
HUZ69_RS25035 (HUZ69_24785) | 22286..23173 | + | 888 | WP_000744355 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
HUZ69_RS25040 (HUZ69_24790) | 23170..24324 | + | 1155 | WP_015345225 | type IV secretion system protein VirB10 | virB10 |
HUZ69_RS25045 (HUZ69_24795) | 24317..25348 | + | 1032 | WP_000138004 | P-type DNA transfer ATPase VirB11 | virB11 |
HUZ69_RS25050 (HUZ69_24800) | 25311..25865 | + | 555 | WP_000616063 | phospholipase D family protein | - |
HUZ69_RS25055 (HUZ69_24805) | 25878..26360 | + | 483 | WP_063448738 | GNAT family protein | - |
HUZ69_RS25060 (HUZ69_24810) | 26372..26566 | + | 195 | WP_024249955 | hypothetical protein | - |
HUZ69_RS25065 (HUZ69_24815) | 26844..29753 | - | 2910 | WP_063448733 | MobF family relaxase | - |
HUZ69_RS25070 (HUZ69_24820) | 29756..31333 | - | 1578 | WP_001001035 | type IV secretion system DNA-binding domain-containing protein | virb4 |
HUZ69_RS25075 (HUZ69_24825) | 31326..31673 | - | 348 | WP_001139329 | hypothetical protein | - |
HUZ69_RS25080 (HUZ69_24830) | 32105..32548 | + | 444 | WP_024249956 | hypothetical protein | - |
HUZ69_RS25085 (HUZ69_24835) | 32561..33271 | + | 711 | WP_021544361 | StbB family protein | - |
HUZ69_RS25090 (HUZ69_24840) | 33273..33641 | + | 369 | WP_015345231 | plasmid stabilization protein StbC | - |
HUZ69_RS25095 (HUZ69_24845) | 33846..34268 | - | 423 | WP_021536823 | hypothetical protein | - |
HUZ69_RS25100 (HUZ69_24850) | 34975..35277 | - | 303 | WP_057517860 | TrfB-related DNA-binding protein | - |
HUZ69_RS25105 (HUZ69_24855) | 35346..35825 | - | 480 | WP_000168313 | single-stranded DNA-binding protein | - |
Host bacterium
ID | 5003 | GenBank | NZ_CP055054 |
Plasmid name | SWHEFF_51|unnamed2 | Incompatibility group | IncN |
Plasmid size | 41542 bp | Coordinate of oriT [Strand] | 31775..31875 [+] |
Host baterium | Shigella dysenteriae strain SWHEFF_51 |
Cargo genes
Drug resistance gene | - |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |