Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 100650 |
Name | oriT_pCVMN1543 |
Organism | Salmonella enterica subsp. enterica serovar Newport str. CVM N1543 isolate CFSAN000926 |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | NZ_CP009570 (21401..21951 [-], 551 nt) |
oriT length | 551 nt |
IRs (inverted repeats) | IR1: 62..70, 71..79 (AACCCTTTC..GACAGGGTT) IR2: 143..150, 154..161 (TGGCCTGC..GGAGGCCA) |
Location of nic site | _ |
Conserved sequence flanking the nic site |
_ |
Note | predicted by the oriTfinder |
oriT sequence
Download Length: 551 nt
>oriT_pCVMN1543
TGTAAAGAGGCTGTGAAAGAATAAGAGCATCAAGATTCCAGATAGATAGAGGGAAATTTGACAAATTCCAAAGATGGGTTAGCCTAGTGACAGAACTAGATTCCAGTATTGGAATAATCAGCTTTAAATTCCAGATAGATAGTTATGTGGATAGGAATTGGATAGGAATTGGGAGGGTATTGAGG
TGTAAAGAGGCTGTGAAAGAATAAGAGCATCAAGATTCCAGATAGATAGAGGGAAATTTGACAAATTCCAAAGATGGGTTAGCCTAGTGACAGAACTAGATTCCAGTATTGGAATAATCAGCTTTAAATTCCAGATAGATAGTTATGTGGATAGGAATTGGATAGGAATTGGGAGGGTATTGAGG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 722 | GenBank | WP_001326173 |
Name | TraI_pCVMN1543 | UniProt ID | _ |
Length | 990 a.a. | PDB ID | |
Note | putative relaxase |
Relaxase protein sequence
Download Length: 990 a.a. Molecular weight: 109356.90 Da Isoelectric Point: 4.9395
>WP_001326173.1 MULTISPECIES: MobH family relaxase [Gammaproteobacteria]
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPMFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPMFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 12497..23817
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
SEEN543_RS00065 (SEEN543_26040) | 7860..9047 | + | 1188 | WP_000219627 | hypothetical protein | - |
SEEN543_RS00070 (SEEN543_26045) | 9047..10531 | + | 1485 | WP_000124148 | hypothetical protein | - |
SEEN543_RS00075 (SEEN543_26050) | 10558..11409 | - | 852 | WP_000248802 | phage repressor protein C1 | - |
SEEN543_RS00080 (SEEN543_26055) | 11518..11727 | - | 210 | WP_000864014 | hypothetical protein | - |
SEEN543_RS00090 (SEEN543_26065) | 12623..12829 | + | 207 | WP_000968892 | hypothetical protein | - |
SEEN543_RS00095 (SEEN543_26070) | 12853..13923 | + | 1071 | WP_001533532 | hypothetical protein | - |
SEEN543_RS00100 (SEEN543_26075) | 13920..14135 | + | 216 | WP_000804048 | hypothetical protein | - |
SEEN543_RS00105 (SEEN543_26080) | 14143..15171 | + | 1029 | WP_000185184 | tyrosine-type recombinase/integrase | - |
SEEN543_RS00110 (SEEN543_26085) | 15206..15568 | + | 363 | WP_001533519 | hypothetical protein | - |
SEEN543_RS00115 (SEEN543_26090) | 15565..15723 | + | 159 | WP_001252573 | hypothetical protein | - |
SEEN543_RS00120 (SEEN543_26095) | 15838..16458 | + | 621 | WP_000527000 | hypothetical protein | - |
SEEN543_RS00125 (SEEN543_26100) | 16445..16888 | + | 444 | WP_000169636 | phage protein NinX family protein | - |
SEEN543_RS00130 (SEEN543_26105) | 16891..17202 | + | 312 | WP_000934524 | hypothetical protein | - |
SEEN543_RS00135 (SEEN543_26110) | 17335..17886 | + | 552 | WP_024135640 | Ref family recombination enhancement nuclease | - |
SEEN543_RS00140 (SEEN543_26115) | 17914..18147 | + | 234 | WP_000354002 | hypothetical protein | - |
SEEN543_RS00145 (SEEN543_26120) | 18353..19003 | + | 651 | WP_000933936 | DUF2829 domain-containing protein | - |
SEEN543_RS00150 (SEEN543_26125) | 19101..19679 | + | 579 | WP_023226264 | hypothetical protein | - |
SEEN543_RS00155 (SEEN543_26130) | 19736..20071 | - | 336 | WP_000806443 | HigA family addiction module antitoxin | - |
SEEN543_RS00160 (SEEN543_26135) | 20133..20435 | - | 303 | WP_000037759 | type II toxin-antitoxin system RelE/ParE family toxin | - |
SEEN543_RS00165 (SEEN543_26140) | 20710..21351 | + | 642 | WP_024133281 | hypothetical protein | - |
SEEN543_RS00170 (SEEN543_26145) | 21520..21897 | + | 378 | WP_000235963 | hypothetical protein | - |
SEEN543_RS00175 (SEEN543_26150) | 21910..23127 | + | 1218 | WP_001143860 | hypothetical protein | - |
SEEN543_RS00180 (SEEN543_26155) | 23131..23871 | + | 741 | WP_000896790 | hypothetical protein | - |
SEEN543_RS00185 (SEEN543_26160) | 23855..24646 | + | 792 | WP_000224251 | hypothetical protein | - |
SEEN543_RS00190 (SEEN543_26165) | 24675..25691 | + | 1017 | WP_000212013 | hypothetical protein | - |
SEEN543_RS00195 (SEEN543_26170) | 25684..26292 | + | 609 | WP_000527468 | hypothetical protein | - |
SEEN543_RS00200 (SEEN543_26175) | 26423..26626 | + | 204 | WP_000167440 | hypothetical protein | - |
Host bacterium
ID | 1110 | GenBank | NZ_CP009570 |
Plasmid name | pCVMN1543 | Incompatibility group | IncA/C2 |
Plasmid size | 85837 bp | Coordinate of oriT [Strand] | 21401..21951 [-] |
Host baterium | Salmonella enterica subsp. enterica serovar Newport str. CVM N1543 isolate CFSAN000926 |
Cargo genes
Drug resistance gene | resistance to cetylpyridinium; quaternary ammonium compound-resistance |
Virulence gene | - |
Metal resistance gene | merR, merT, merP, merA, merB, merD, merE |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |