Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 104469 |
Name | oriT_SWHEFF_59|unnamed2 |
Organism | Shigella flexneri strain SWHEFF_59 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_CP055043 (18995..19610 [-], 616 nt) |
oriT length | 616 nt |
IRs (inverted repeats) | 599..604, 609..614 (ATGCCC..GGGCAT); 526..532, 538..544 (CCTCCCG..CGGGAGG); 462..467, 471..476 (GTTCGC..GCGAAC); 52..58, 62..68 (CATTATC..GATAATG) |
Location of nic site | 322..323 |
Conserved sequence flanking the nic site | TCCTGCATCG |
Note | Predicted by oriTfinder 2.0 |
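The inverted repeats listed above can be sanity-checked: the two arms of each pair should be reverse complements of one another. The following is a minimal Python sketch of that check. The helper names `reverse_complement` and `is_inverted_repeat` are illustrative assumptions and are not part of oriTfinder or oriTDB; the arm sequences are copied from the IRs row of this record.

```python
# Illustrative helpers; not part of oriTfinder or oriTDB.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_inverted_repeat(left_arm: str, right_arm: str) -> bool:
    """True if the right arm is the reverse complement of the left arm."""
    return reverse_complement(left_arm) == right_arm.upper()


# Arm pairs as listed in the IRs row of this record.
IR_ARMS = [
    ("ATGCCC", "GGGCAT"),
    ("CCTCCCG", "CGGGAGG"),
    ("GTTCGC", "GCGAAC"),
    ("CATTATC", "GATAATG"),
]

for left, right in IR_ARMS:
    print(left, right, is_inverted_repeat(left, right))
```

This only tests the arm sequences as printed; confirming that they sit at the listed coordinates would additionally require slicing the oriT sequence with 1-based, inclusive positions.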
oriT sequence
Length: 616 nt
>oriT_SWHEFF_59|unnamed2
TCTTACTTCTTTGCGTAGCTGTTAAATACAGCGTTGTTTTAATAAAATCATCATTATCATCGATAATGCTTTCTTCAATTTTTTTATCCTTACTCTTTAATAAAGCACTTGCTAATAACTTCATACCTTTTGCAACTGTCAAATTTGGTTCATCAGGGTAAATGCTTTTAAGGCATACTAACAAATAATCATGGTCTTCATCTTCAACTCTAAACTGAATTTTTTTCATCATAACTCCCAACAAGAACCGACTGTAGGTCACCGGGCAAACGCTGAAAAATAACGTCGAATGACGTCATTTTGCGGCGTTTGCCCTATCCTGCATCGCAGTGAATCTGGCGCCGCTCATAAATTGTGCTTGAGCATCAACAAAAATGCAAAAAGGCTGATGTTGTCATGGCTGAACATACGTACGTCTGCGACCGGAAACCCGATCAATGCGGTAAGGACAGTGCCGTCAGGTTCGCTCCGCGAACGGTCTGGCAGGAAGCGCAAGCGCAGTCCCGCGCAATATATCAGAAAGCACCTCCCGAAATACGGGAGGGCTTCGGCTATTCAGCTAAGGTTGCTCTGTTCAGGTTGCTGAGTGTAGCCTTCCATGCCCTGACGGGCATCA
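To relate the reported nic site to the conserved flanking sequence, one option is to locate the motif in the oriT sequence and test whether the nic coordinates fall inside it. The sketch below assumes 1-based, inclusive coordinates relative to the oriT sequence; `locate_motif` and `nic_within` are hypothetical helper names, and the short `window` string is only a demonstration input, not the full record.

```python
from typing import Optional, Tuple


def locate_motif(seq: str, motif: str) -> Optional[Tuple[int, int]]:
    """Return the 1-based inclusive (start, end) of the first match, or None."""
    idx = seq.upper().find(motif.upper())
    if idx < 0:
        return None
    return idx + 1, idx + len(motif)


def nic_within(span: Tuple[int, int], nic: Tuple[int, int]) -> bool:
    """True if both nic-site positions lie inside the motif span."""
    return span[0] <= nic[0] and nic[1] <= span[1]


# Demonstration on a short window; pass the full oriT sequence from the
# FASTA record above to work in record coordinates.
window = "CCTATCCTGCATCGCAGTG"
span = locate_motif(window, "TCCTGCATCG")
print(span)
```

With the full 616 nt sequence as `seq`, `nic_within(span, (322, 323))` would indicate whether the nic site reported in this record lies inside the conserved motif.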
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
T4CP
ID | 3160 | GenBank | WP_000108725 |
Name | t4cp2_HUZ71_RS23130_SWHEFF_59|unnamed2 | UniProt ID | _ |
Length | 919 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 919 a.a. Molecular weight: 104758.78 Da Isoelectric Point: 6.1962
>WP_000108725.1 MULTISPECIES: VirB3 family type IV secretion system protein [Enterobacteriaceae]
MSTVFKGLTRPALIRGLGVPLYPFLAMCVLVVLLGVWIDDLMYFLILPGWFAIKRVTKIDERFFDLLYLR
AVVKGHPLANKRFSAVHYAGSQYDEVDISKVDNFMKLKDQASIEALIPYSSHITDDLVVTKNRDLLATWQ
VDGAYFECVDDEDLTLLTDQLNTLIRSFEGRAITFYTHRVRVRKEVKVGFNSKIPFVNRVMNDYYSSLSE
PEYFENRLYLTICYKPFNIEDKVSHFISKNKKENNIFDEPINDMNEICGRLDTYLSRFHAHRLGLVEENG
RVFSDQLSLFQYLLSGKWQKVRVTNSPFYTYLGGKDLFFGNDAGQITASQNARYFRSIEIKDYFQETDAG
IFDALMYLPVEYVFTSSFTSMDKQAAVKALDDQIDKLELTDDAAKSLLADLRVGLDMVSSGYNSFGKCHQ
TLIVFADTPERLVKDTNIVTTTLEDLGLIVTYSTLSLGAAFFSQLPGNYNLRPRLSMLSSLNFAEMESFH
NFFHGKATGNTWGNSLMALRGSGNDVYHLNYHMTTENINFFGKNPTLGHCEILGTSNVGKTVLMMIMSYA
AQQFGTPESFPENRKVRKLTTVFFDKDRAGEVGIRAMGGAYFRVKGGEPTGWNPAALPPTKRNIAFVKDI
IRLICKLNGNTVDDYQHSLISSAVDRLMQREDRSFPISKLIPLIMEPDDTETKRHGLKARLRAWKQGGEY
GWLLDNASDSFDVAHLDVFGIDGTEFLDDKVVAPVASFYLIYRVTMLADGRRLLIYMDEFWQWINNDAFK
DFVYNKLKTGRKLDMVLVPATQSPDELIKSPIAAAVREQCATHIYLANPKAKRNEYVDELQVRDLYFDKI
KAIDPLSRQFLVVKNPQRQGERDDFAAFAKLDLGKAAYYLPILSASAEQLELFDEIWSEGMAPEEWIDTY
LERANRGVK
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
ID | 3161 | GenBank | WP_000053826 |
Name | t4cp2_HUZ71_RS23170_SWHEFF_59|unnamed2 | UniProt ID | _ |
Length | 611 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 611 a.a. Molecular weight: 69231.07 Da Isoelectric Point: 8.0208
>WP_000053826.1 MULTISPECIES: type IV secretory system conjugative DNA transfer family protein [Enterobacteriaceae]
MSLKLPDKGQWAFIILVMCLIAYYTGSVAIYFFNHKTPLYIWKHYDSLLLWRVIIDSSIKSEVRFTALPS
LLSGALASLAAPAFIIWKLNQKDAPLFGDAKFASDSDLKKSKLLKWEKENDTDILVGRYKGKYLWYTAPD
FVSLGAGTRAGKGAAIGIPNMLVKKHSIIALDPKQELWKITSKVREQILGNKVYLLDPFNTKTHKCNPLF
YVDLKSESGAKDLLKLVEILFPSFGLTGAEAHFNNLAGQYWTGLAKLLHFFINFAPEWIEELRVKPVFSI
GTVVDLYSNIDREQVLSNRETFEELAQGNETALFHLKDALTKIREYHETEDEQRSSIDGSFRKKMSLFYL
PTVRKCTDGNDFDLRQMRREDITVYVGVNAEDMILAYDFLNLFFNLVVEVTLRENPDFDPTLKHDCLLFL
DEFPSIGYMPIIKKGSGYIAGYKLKLLTIYQNISQLNEIYGTEGAKTLLSAHPCRVIYAVSEEDDAKKIS
EKLGYITARSKGSSKTSGKATSRSNSENEAQRALVLPQELGTLDFSEEMIILKGEHPVKAEKALYYLDNY
FMERLMLVSPKLTALTASINRSNKIFGVKGIKYPSKEKMLSVGELESEVLL
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
T4SS
The T4SS was predicted using oriTfinder 2.0.
Region 1: 22164..32054
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
HUZ71_RS23080 (HUZ71_22900) | 17462..17707 | + | 246 | WP_000356546 | hypothetical protein | - |
HUZ71_RS23085 (HUZ71_22905) | 17697..17912 | + | 216 | WP_001180116 | hypothetical protein | - |
HUZ71_RS23090 (HUZ71_22910) | 18005..18334 | + | 330 | WP_000866648 | hypothetical protein | - |
HUZ71_RS23095 (HUZ71_22915) | 18621..18770 | + | 150 | WP_000003880 | hypothetical protein | - |
HUZ71_RS23100 (HUZ71_22920) | 18783..19055 | + | 273 | WP_000160399 | hypothetical protein | - |
HUZ71_RS23105 (HUZ71_22925) | 19382..19927 | + | 546 | WP_038976855 | DNA distortion polypeptide 1 | - |
HUZ71_RS23110 (HUZ71_22930) | 19930..21087 | + | 1158 | WP_000538023 | DNA distortion polypeptide 2 | - |
HUZ71_RS23115 (HUZ71_22935) | 21448..21939 | + | 492 | WP_000872475 | transcription termination/antitermination NusG family protein | - |
HUZ71_RS23120 (HUZ71_22940) | 22164..22808 | + | 645 | WP_000953539 | lytic transglycosylase domain-containing protein | virB1 |
HUZ71_RS23125 (HUZ71_22945) | 22792..23082 | + | 291 | WP_000921916 | TrbC/VirB2 family protein | virB2 |
HUZ71_RS23130 (HUZ71_22950) | 23106..25865 | + | 2760 | WP_000108725 | VirB3 family type IV secretion system protein | virB4 |
HUZ71_RS23135 (HUZ71_22955) | 25867..26604 | + | 738 | WP_000737859 | type IV secretion system protein | - |
HUZ71_RS23140 (HUZ71_22960) | 26614..26874 | + | 261 | WP_001228869 | EexN family lipoprotein | - |
HUZ71_RS23145 (HUZ71_22965) | 26886..27941 | + | 1056 | WP_000235774 | type IV secretion system protein | virB6 |
HUZ71_RS23150 (HUZ71_22970) | 28131..28853 | + | 723 | WP_000394570 | type IV secretion system protein | virB8 |
HUZ71_RS23155 (HUZ71_22975) | 28859..29788 | + | 930 | WP_000776689 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
HUZ71_RS23160 (HUZ71_22980) | 29785..30999 | + | 1215 | WP_001295061 | VirB10/TraB/TrbI family type IV secretion system protein | virB10 |
HUZ71_RS23165 (HUZ71_22985) | 31017..32054 | + | 1038 | WP_000217791 | P-type DNA transfer ATPase VirB11 | virB11 |
HUZ71_RS23170 (HUZ71_22990) | 32059..33894 | + | 1836 | WP_000053826 | type IV secretory system conjugative DNA transfer family protein | - |
HUZ71_RS23175 (HUZ71_22995) | 33891..34286 | + | 396 | WP_000733627 | cag pathogenicity island Cag12 family protein | - |
HUZ71_RS23180 | 34289..34621 | + | 333 | WP_000699980 | hypothetical protein | - |
HUZ71_RS23185 (HUZ71_23005) | 34864..35679 | + | 816 | WP_000018321 | aminoglycoside O-phosphotransferase APH(3')-Ia | - |
HUZ71_RS23190 (HUZ71_23010) | 35833..36741 | - | 909 | WP_000174819 | C45 family peptidase | - |
HUZ71_RS23195 | 36888..36974 | + | 87 | Protein_48 | micrococcal nuclease | - |
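The Size (bp) column follows from the coordinates as end minus start plus one, and a coding sequence of that size corresponds to a protein of size/3 minus one residues if it ends in a single stop codon. A small Python sketch of both calculations, using hypothetical helper names (`interval_length`, `expected_protein_length`) and assuming inclusive `start..end` coordinates:

```python
def interval_length(coords: str) -> int:
    """Length in bp of an inclusive 'start..end' coordinate string."""
    start, end = (int(part) for part in coords.split(".."))
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a CDS, assuming it includes one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates taken from this record: the two T4CP genes and the oriT region.
for label, coords in [
    ("HUZ71_RS23130", "23106..25865"),
    ("HUZ71_RS23170", "32059..33894"),
    ("oriT", "18995..19610"),
]:
    print(label, interval_length(coords))
```

For the two T4CP genes this gives 2760 bp and 1836 bp, which correspond to the 919 a.a. and 611 a.a. lengths listed in the T4CP entries.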
Host bacterium
ID | 4908 | GenBank | NZ_CP055043 |
Plasmid name | SWHEFF_59|unnamed2 | Incompatibility group | IncX1 |
Plasmid size | 47227 bp | Coordinate of oriT [Strand] | 18995..19610 [-] |
Host bacterium | Shigella flexneri strain SWHEFF_59 |
Cargo genes
Drug resistance gene | ant(3'')-Ia, sul3, blaTEM-176, aph(3')-Ia, floR |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |