Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 107738 |
Name | oriT_pSC-31-2 |
Organism | Salmonella enterica subsp. enterica serovar Typhimurium var. 5- strain CFSAN067217 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_CP028316 (170531..170681 [-], 151 nt) |
oriT length | 151 nt |
IRs (inverted repeats) | 93..98, 106..111 (TGGCCT..AGGCCA) 12..17, 24..29 (AACCCT..AGGGTT) |
Location of nic site | _ |
Conserved sequence flanking the nic site |
_ |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 151 nt
>oriT_pSC-31-2
TGGTCGATGTGAACCCTTTCGACAGGGTTATGAATGAATTGAAAAGTCGTGGCCGCAAGAACGCTCACATCCTGAGCATCCTCCAATTCGACTGGCCTGCATCGGAGGCCATCATCGAGAAGCTGAGCTGCTACATCACAGACGGGATTAA
TGGTCGATGTGAACCCTTTCGACAGGGTTATGAATGAATTGAAAAGTCGTGGCCGCAAGAACGCTCACATCCTGAGCATCCTCCAATTCGACTGGCCTGCATCGGAGGCCATCATCGAGAAGCTGAGCTGCTACATCACAGACGGGATTAA
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 5197 | GenBank | WP_001326173 |
Name | mobH_C7X12_RS25670_pSC-31-2 | UniProt ID | F5BPR1 |
Length | 990 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 990 a.a. Molecular weight: 109356.90 Da Isoelectric Point: 4.9395
>WP_001326173.1 MULTISPECIES: MobH family relaxase [Gammaproteobacteria]
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPMFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPMFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | F5BPR1 |
T4CP
ID | 5443 | GenBank | WP_000178857 |
Name | traD_C7X12_RS25665_pSC-31-2 | UniProt ID | _ |
Length | 621 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 621 a.a. Molecular weight: 69225.71 Da Isoelectric Point: 6.9403
>WP_000178857.1 MULTISPECIES: conjugative transfer system coupling protein TraD [Gammaproteobacteria]
MTMSYDPLAYEMPWRPNYEKNAVAGWLAASGAALAVEQVSTMPPEPFYWMTGICGVMAMARLPKAIKLHL
LQKHLKGRDLEFISIAELQKYIKDTPDDMWLGSGFLWENRHAQRVFEILKRDWTSIVGRESTVKKVVRKI
QGKKKELPIGQPWIHGVEPKEEKLMQPLKHTEGHSLIVGTTGSGKTRMFDILISQAILRGEAVIIIDPKG
DKEMRDNARRACEAMGQPERFVSFHPAFPEESVRIDPLRNFTRVTEIASRLAALIPSEAGADPFKSFGWQ
ALNNIAQGLVITHDRPNLTKLRRFLEGGAAGLVIKAVQAYSERVMPDWEAEAAAYLEKVKNGSREKIAFA
LMKFYYDIIQPEHPNSDLEGLLSMFQHDQTHFSKMVANLLPIMNMLTSGELGPLLSPDSSDLSDERQITD
SAKIINNAQVAYLGLDSLTDNMVGSAMGSIFLSDLTAVAGDRYNYGVNNRPVNIFVDEAAEVINDPFIQL
LNKGRGAKLRLFVATQTFADFAARLGSKDKALQVLGNINNTFALRIVDGETQEYIADNLPKTRLKYVMRT
QGQNSDGKEPIMHGGNQGERLMEEEADLFPAQLLGMLPNLEYIAKISGGTIVKGRLPILTQ
MTMSYDPLAYEMPWRPNYEKNAVAGWLAASGAALAVEQVSTMPPEPFYWMTGICGVMAMARLPKAIKLHL
LQKHLKGRDLEFISIAELQKYIKDTPDDMWLGSGFLWENRHAQRVFEILKRDWTSIVGRESTVKKVVRKI
QGKKKELPIGQPWIHGVEPKEEKLMQPLKHTEGHSLIVGTTGSGKTRMFDILISQAILRGEAVIIIDPKG
DKEMRDNARRACEAMGQPERFVSFHPAFPEESVRIDPLRNFTRVTEIASRLAALIPSEAGADPFKSFGWQ
ALNNIAQGLVITHDRPNLTKLRRFLEGGAAGLVIKAVQAYSERVMPDWEAEAAAYLEKVKNGSREKIAFA
LMKFYYDIIQPEHPNSDLEGLLSMFQHDQTHFSKMVANLLPIMNMLTSGELGPLLSPDSSDLSDERQITD
SAKIINNAQVAYLGLDSLTDNMVGSAMGSIFLSDLTAVAGDRYNYGVNNRPVNIFVDEAAEVINDPFIQL
LNKGRGAKLRLFVATQTFADFAARLGSKDKALQVLGNINNTFALRIVDGETQEYIADNLPKTRLKYVMRT
QGQNSDGKEPIMHGGNQGERLMEEEADLFPAQLLGMLPNLEYIAKISGGTIVKGRLPILTQ
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
Host bacterium
ID | 8173 | GenBank | NZ_CP028316 |
Plasmid name | pSC-31-2 | Incompatibility group | IncA/C2 |
Plasmid size | 187594 bp | Coordinate of oriT [Strand] | 170531..170681 [-] |
Host baterium | Salmonella enterica subsp. enterica serovar Typhimurium var. 5- strain CFSAN067217 |
Cargo genes
Drug resistance gene | sul2, tet(A) |
Virulence gene | ybtS, ybtX, ybtQ, ybtP, ybtA, irp2, irp1, ybtU, ybtT, ybtE, fyuA |
Metal resistance gene | merP, merT, merR |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |