Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 100636 |
Name | oriT_p972816 |
Organism | Salmonella enterica subsp. enterica strain SA972816 |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | NZ_CP007486 (7240..7790 [-], 551 nt) |
oriT length | 551 nt |
IRs (inverted repeats) | IR1: 62..70, 71..79 (AACCCTTTC..GACAGGGTT) IR2: 143..150, 154..161 (TGGCCTGC..GGAGGCCA) |
Location of nic site | _ |
Conserved sequence flanking the nic site | _ |
Note | predicted by oriTfinder |
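
The inverted-repeat coordinates above are 1-based and inclusive, relative to the oriT sequence of this record. As a minimal sketch (not part of oriTfinder), the snippet below slices the first 161 nt of that sequence at the listed coordinates and compares each left arm with the reverse complement of its right arm. The helper names `revcomp`, `arm`, `mismatches`, and `check_irs` are hypothetical and chosen only for illustration.

```python
# First 161 nt of oriT_p972816, copied from this record in blocks of 10.
ORIT_PREFIX = (
    "GAGCAGAGCT" "ATGTGTGACA" "AGAAGTATAG" "AAATTACGAG"
    "GTAGCCATCA" "TGGTCGATGT" "GAACCCTTTC" "GACAGGGTTA"
    "TGAATGAATT" "GAAAAGTCGT" "GGCCGCAAGA" "ACGCTCACAT"
    "CCTGAGCATC" "CTCCAATTCG" "ACTGGCCTGC" "ATCGGAGGCC"
    "A"
)

# IR arm coordinates as listed in the record (1-based, inclusive).
IRS = {
    "IR1": ((62, 70), (71, 79)),
    "IR2": ((143, 150), (154, 161)),
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def arm(seq, start, end):
    """Slice a 1-based, inclusive coordinate range out of seq."""
    return seq[start - 1:end]


def mismatches(left, right):
    """Positions where left differs from the reverse complement of right."""
    return sum(a != b for a, b in zip(left, revcomp(right)))


def check_irs(seq=ORIT_PREFIX, irs=IRS):
    """Return {name: (left arm, right arm, mismatch count)}."""
    report = {}
    for name, ((l_start, l_end), (r_start, r_end)) in irs.items():
        left = arm(seq, l_start, l_end)
        right = arm(seq, r_start, r_end)
        report[name] = (left, right, mismatches(left, right))
    return report


if __name__ == "__main__":
    for name, (left, right, n) in check_irs().items():
        print(f"{name}: {left}..{right} (mismatches: {n})")
```

Under these assumptions both repeats reproduce the arm sequences listed in the record, and each is an imperfect palindrome with a single mismatched position.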
oriT sequence
Length: 551 nt
>oriT_p972816
GAGCAGAGCTATGTGTGACAAGAAGTATAGAAATTACGAGGTAGCCATCATGGTCGATGTGAACCCTTTCGACAGGGTTATGAATGAATTGAAAAGTCGTGGCCGCAAGAACGCTCACATCCTGAGCATCCTCCAATTCGACTGGCCTGCATCGGAGGCCATCATCGAGAAGCTGAGCTGCTACATCACAGACGGGATTAAGGCTAATCAGGAGCCTGTGATTTACCCGATCATTGAAGAAGCTCTGCATCGCTACAGCCAGCTCGTGTTTCATGAGCAGAGAGAGAAATATGAAGACCCGGCCAGAATTGGGGCATTTCTGGAAACCCTGATCACCGAAACCTGCCGGGCGTTGGAAGTGCAAATTGTCGATAGTGGCGGTGATTCATGGTCTGTCGATTCAGGAGAGTCGTTCTCACTGTGGCTTTCTTCCCATCCAGGAGAACTATCCATTAACCCGCAGCCCCATGAGGATGAGACCTCTTTGCGTGGCTTGCTGTATGAGCTCATCACCTGTGAGAGCGTGAAAACTGTTTTAAGGAGAACCGACT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 708 | GenBank | WP_001326173 |
Name | TraI_p972816 | UniProt ID | _ |
Length | 990 a.a. | PDB ID | |
Note | putative relaxase |
Relaxase protein sequence
Length: 990 a.a. | Molecular weight: 109356.90 Da | Isoelectric point: 4.9395
>WP_001326173.1 MULTISPECIES: MobH family relaxase [Gammaproteobacteria]
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPMFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
T4SS
The T4SS region was predicted using oriTfinder2.
Region 1: 566..9656
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
CO44_RS23860 (CO44_24390) | 170..562 | - | 393 | WP_000479535 | TraA family conjugative transfer protein | - |
CO44_RS23865 (CO44_24395) | 566..1144 | - | 579 | WP_000793435 | type IV conjugative transfer system lipoprotein TraV | traV |
CO44_RS23870 (CO44_24400) | 1141..2454 | - | 1314 | WP_024131605 | TraB/VirB10 family protein | traB |
CO44_RS23875 (CO44_24405) | 2454..3371 | - | 918 | WP_000794249 | type-F conjugative transfer system secretin TraK | traK |
CO44_RS23880 (CO44_24410) | 3355..3981 | - | 627 | WP_001049717 | TraE/TraK family type IV conjugative transfer system protein | traE |
CO44_RS23885 (CO44_24415) | 3978..4259 | - | 282 | WP_000805625 | type IV conjugative transfer system protein TraL | traL |
CO44_RS23890 (CO44_24420) | 4404..4769 | - | 366 | WP_001052530 | hypothetical protein | - |
CO44_RS23895 (CO44_24425) | 5105..5767 | - | 663 | WP_001231464 | hypothetical protein | - |
CO44_RS23900 (CO44_24430) | 5767..6144 | - | 378 | WP_000869297 | hypothetical protein | - |
CO44_RS23905 (CO44_24435) | 6154..6600 | - | 447 | WP_000122507 | hypothetical protein | - |
CO44_RS23910 (CO44_24440) | 6610..7239 | - | 630 | WP_000743449 | DUF4400 domain-containing protein | tfc7 |
CO44_RS23915 (CO44_24445) | 7196..7741 | - | 546 | WP_000228720 | hypothetical protein | - |
CO44_RS23920 (CO44_24450) | 7791..9656 | - | 1866 | WP_000178857 | conjugative transfer system coupling protein TraD | virb4 |
CO44_RS23925 (CO44_24455) | 9653..12625 | - | 2973 | WP_001326173 | MobH family relaxase | - |
CO44_RS23930 (CO44_24460) | 12793..13410 | + | 618 | WP_001249395 | hypothetical protein | - |
CO44_RS23935 (CO44_24465) | 13392..13625 | + | 234 | WP_001191890 | hypothetical protein | - |
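
The Size (bp) column follows from the inclusive coordinates as end - start + 1, and the 2973 bp relaxase gene is consistent with the 990 a.a. protein once one stop codon is subtracted. The sketch below works through that arithmetic for a few rows copied from the table and for the oriT coordinates. `feature_size` and `protein_length` are hypothetical helper names, and the single-stop-codon assumption is an illustration rather than something the record states.

```python
def feature_size(start, end):
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp):
    """Amino acids encoded by a CDS, assuming it ends in one stop codon."""
    if cds_bp % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# (locus tag, start, end, size listed in the table)
ROWS = [
    ("CO44_RS23865", 566, 1144, 579),
    ("CO44_RS23920", 7791, 9656, 1866),
    ("CO44_RS23925", 9653, 12625, 2973),
]

if __name__ == "__main__":
    for tag, start, end, listed in ROWS:
        computed = feature_size(start, end)
        status = "ok" if computed == listed else "differs"
        print(f"{tag}: computed {computed} bp, listed {listed} bp ({status})")
    print("oriT length:", feature_size(7240, 7790), "nt")
    print("relaxase length:", protein_length(2973), "a.a.")
```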
Host bacterium
ID | 1096 | GenBank | NZ_CP007486 |
Plasmid name | p972816 | Incompatibility group | _ |
Plasmid size | 31727 bp | Coordinate of oriT [Strand] | 7240..7790 [-] |
Host bacterium | Salmonella enterica subsp. enterica strain SA972816 |
Cargo genes
Drug resistance gene | sul2, aph(3'')-Ib, aph(6)-Id, tet(A) |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |