Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 101357 |
Name | oriT_159230|unnamed4 |
Organism | Salmonella enterica subsp. enterica serovar Kentucky strain 159230 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_VUJD01000023 (16463..17069 [+], 607 nt) |
oriT length | 607 nt |
IRs (inverted repeats) | 590..595, 600..605 (ATGCCC..GGGCAT); 517..523, 529..535 (CCTCCCG..CGGGAGG); 453..458, 462..467 (GTTCGC..GCGAAC); 196..202, 204..210 (TCAATTC..GAATTGA); 175..180, 187..192 (AAATCA..TGATTT); 79..84, 93..98 (GCTTTA..TAAAGC); 29..37, 46..54 (TTTTGATAA..TTATCAAAA) |
Location of nic site | 313..314 |
Conserved sequence flanking the nic site | TCCTGCATCG |
Note | Predicted by oriTfinder 2.0 |
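The inverted-repeat arms and the coordinate span listed above can be sanity-checked with a few lines of Python. This is a minimal sketch, not part of oriTfinder: the helper names (`reverse_complement`, `is_perfect_ir`, `span_length`) are hypothetical, and it assumes the coordinates are 1-based and inclusive, as the stated 607 nt length suggests.

```python
# Translation table for unambiguous DNA bases only (assumption: no IUPAC codes).
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Left and right arms of each inverted repeat, copied from the IRs field.
IR_PAIRS = [
    ("ATGCCC", "GGGCAT"),
    ("CCTCCCG", "CGGGAGG"),
    ("GTTCGC", "GCGAAC"),
    ("TCAATTC", "GAATTGA"),
    ("AAATCA", "TGATTT"),
    ("GCTTTA", "TAAAGC"),
    ("TTTTGATAA", "TTATCAAAA"),
]


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def is_perfect_ir(left: str, right: str) -> bool:
    """True if the right arm is the exact reverse complement of the left arm."""
    return reverse_complement(left) == right


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


if __name__ == "__main__":
    for left, right in IR_PAIRS:
        print(left, right, is_perfect_ir(left, right))
    # oriT coordinates on NZ_VUJD01000023, taken from the record.
    print(span_length(16463, 17069))
```

Under these assumptions, each listed pair should test as a perfect inverted repeat, and the span should agree with the stated oriT length.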
oriT sequence
Length: 607 nt
>oriT_159230|unnamed4
CTTTGTTTACCTGTTAAGTACATCGTTGTTTTGATAAAATCATTATTATCAAAAAACGTATTTATATCCTTACTGCCAGCTTTACTTTTTAATAAAGCATTTGCCAAAAGTTTCATGCCTTTAGCTACTGTTAAAGCTGGTTCATCTGGATAAAGAGTTTTAAGGCAATCCAGCAAATCATCATGCTGATTTTCATCAATTCTGAATTGAACTTTTTTCATAAAATTCTCCAAAAAGAACCGACTGTAGGTCACCGGGCAAACGTTGCGGAATGGCGTCAGAGACGTCATTTTGCGGCGTTTGCCCTATCCTGCATCGCAGTGAATCTGGCGCCGCTCATAAATTGTGCTTGAGCATCAACAAAAATGCAAAAAGGCTGATGTTGTCATGGCTGAACATACGTACGTCTGCGACCGGAAACCCGATCAATGCGGTAAGGACAGTGCCGTCAGGTTCGCTCCGCGAACGGTCTGGCAGAAAGCGCAAGCGCAGCCCCGCGCAATATATCAGAAAGCACCTCCCGAAATACGGGAGGGCTTCGGCTATTCAGCTAAGGTTGCTCTGTTCAGGTTGCTGAGTGTAGCCTTCCATGCCCTGACGGGCATCA
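As a rough cross-check of the annotated features against the sequence itself, the sketch below locates the conserved nic-site motif and slices out one inverted-repeat pair by coordinate. The helper names (`find_1based`, `slice_1based`, `nic_within`) are hypothetical, and the code assumes all positions in the record are 1-based and inclusive relative to the start of the oriT sequence.

```python
# oriT sequence copied from the FASTA record oriT_159230|unnamed4.
ORIT = "CTTTGTTTACCTGTTAAGTACATCGTTGTTTTGATAAAATCATTATTATCAAAAAACGTATTTATATCCTTACTGCCAGCTTTACTTTTTAATAAAGCATTTGCCAAAAGTTTCATGCCTTTAGCTACTGTTAAAGCTGGTTCATCTGGATAAAGAGTTTTAAGGCAATCCAGCAAATCATCATGCTGATTTTCATCAATTCTGAATTGAACTTTTTTCATAAAATTCTCCAAAAAGAACCGACTGTAGGTCACCGGGCAAACGTTGCGGAATGGCGTCAGAGACGTCATTTTGCGGCGTTTGCCCTATCCTGCATCGCAGTGAATCTGGCGCCGCTCATAAATTGTGCTTGAGCATCAACAAAAATGCAAAAAGGCTGATGTTGTCATGGCTGAACATACGTACGTCTGCGACCGGAAACCCGATCAATGCGGTAAGGACAGTGCCGTCAGGTTCGCTCCGCGAACGGTCTGGCAGAAAGCGCAAGCGCAGCCCCGCGCAATATATCAGAAAGCACCTCCCGAAATACGGGAGGGCTTCGGCTATTCAGCTAAGGTTGCTCTGTTCAGGTTGCTGAGTGTAGCCTTCCATGCCCTGACGGGCATCA"

NIC_MOTIF = "TCCTGCATCG"   # conserved sequence flanking the nic site
NIC_SITE = (313, 314)      # nic site location from the record


def find_1based(seq, sub):
    """Return the (start, end) of the first match, 1-based inclusive, or None."""
    idx = seq.find(sub)
    if idx < 0:
        return None
    return (idx + 1, idx + len(sub))


def slice_1based(seq, start, end):
    """Extract a 1-based, inclusive span from a sequence."""
    return seq[start - 1:end]


def nic_within(span, nic):
    """True if both nic-site positions fall inside the given span."""
    return span is not None and span[0] <= nic[0] and nic[1] <= span[1]


if __name__ == "__main__":
    motif_span = find_1based(ORIT, NIC_MOTIF)
    print("motif span:", motif_span)
    print("nic site inside motif:", nic_within(motif_span, NIC_SITE))
    # Arms of the IR annotated at 29..37 and 46..54.
    print(slice_1based(ORIT, 29, 37), slice_1based(ORIT, 46, 54))
```

If the coordinate assumption holds, the nic site should fall inside the span of the conserved motif, and the sliced arms should match the sequences given in the IRs field.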
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 1256 | GenBank | WP_000539538 |
Name | mobP1_F1E40_RS23130_159230|unnamed4 | UniProt ID | A0A5J2A6K9 |
Length | 388 a.a. | PDB ID | - |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Length: 388 a.a. Molecular weight: 44507.83 Da Isoelectric Point: 10.5067
>WP_000539538.1 MULTISPECIES: MobP1 family relaxase [Enterobacterales]
MGVYVDKEYRVKRKSSENGRKSAFAHKVKNGGKNYSRNVQERINRKGASKEVVVKISGGAVTRQGIRNSI
DYMSRESELPVMSESGRVWTGDEILEAKDHMIDRANDPQHVMNDKGKENKKITQNIVFSPPVSAKVKPED
LLESVRKTMQKKYPNHRFVLGYHCDKKEHPHVHVVFRIRDNDGKRADIRKKDLREIRTGFCEELKLKGYD
VKATHKQQHGLNQSVKDAHNTAPKRQKGVYEVVDVGYDHYQNDKTKSKQHFIKLKTLNKGVEKTYWGADF
GDLCSRESVKAGDLVRLKKLGQKEVKIPALDKNGVQHGWKTVHRNEWQLENLGVKGVDRTPSASKELVLN
SPDMLLKQQQRMAQFTQQKASTLQSEQKLKTGIKFWSL
Protein domains
Predicted by InterProScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A5J2A6K9 |
T4CP
ID | 865 | GenBank | WP_000053820 |
Name | t4cp2_F1E40_RS23065_159230|unnamed4 | UniProt ID | - |
Length | 611 a.a. | PDB ID | - |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 611 a.a. Molecular weight: 69389.23 Da Isoelectric Point: 7.1491
>WP_000053820.1 MULTISPECIES: type IV secretory system conjugative DNA transfer family protein [Enterobacteriaceae]
MSLKLPDKGQWAFIGLVMCLVTYYTGSVAVYFLNGKTPLYIWKNFDSMLLWRIITESNIRTDIRLTAIPS
LLSGMVSSLIVPVFIIWQLNKTDVALYGDAKFASDNDLRKSKLLKWEKENDTDILVGAYKGKYLWYTAPD
FVSLGAGTRAGKGAAIGIPNLLVRKHSLIALDPKQELWKITSKVREILLGNKVYLLDPFNSKTHQFNPLF
YIDLKAESGAKDLLKLIEILFPSYGMTGAEAHFNNLAGQYWTGLAKLLHFFINYEPSWLNEFGLKPVFSI
GSVVDLYSNIDRELILSKREELEGTNGLDENALYHLRDALTKIREYHETEDEQRSSIDGSFRKKMSLFYL
PTVRKCTDGNDFDLRQLRREDITVYVGVNAEDISLAYDFLNLFFNFVVEVTLRENPDFDPTLKHDCLMFL
DEFPSIGYMPIIKKGSGYIAGFKLKLLTIYQNISQLNEIYGIEGAKTLMSAHPCRIIYAVSEEDDAAKIS
EKLGYITTTSKSTSKSRGRSTSQGESESEARRALVLPQELGTLDFKEEFIILKGENPVKAEKALYYLDPY
FMDRLMKVSPKLASLTMELNKTKKIFGVKGLKYPSKEKMLSVGELESEVLL
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
T4SS
The T4SS was predicted using oriTfinder 2.0.
Region 1: 3905..13923
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
F1E40_RS23050 (F1E40_23280) | 1..1256 | - | 1256 | WP_154221857 | hypothetical protein | - |
F1E40_RS23055 (F1E40_23285) | 1358..1660 | - | 303 | WP_054446582 | TrbM/KikA/MpfK family conjugal transfer protein | - |
F1E40_RS23060 (F1E40_23290) | 1657..2070 | - | 414 | WP_000722603 | cag pathogenicity island Cag12 family protein | - |
F1E40_RS23065 (F1E40_23295) | 2067..3902 | - | 1836 | WP_000053820 | type IV secretory system conjugative DNA transfer family protein | - |
F1E40_RS23070 (F1E40_23300) | 3905..4936 | - | 1032 | WP_001058465 | P-type DNA transfer ATPase VirB11 | virB11 |
F1E40_RS23075 (F1E40_23305) | 4938..6143 | - | 1206 | WP_000139139 | VirB10/TraB/TrbI family type IV secretion system protein | virB10 |
F1E40_RS23080 (F1E40_23310) | 6140..7069 | - | 930 | WP_000783372 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
F1E40_RS23085 (F1E40_23315) | 7074..7787 | - | 714 | WP_000394614 | type IV secretion system protein | virB8 |
F1E40_RS23090 (F1E40_23320) | 8034..9164 | - | 1131 | WP_154221858 | type IV secretion system protein | virB6 |
F1E40_RS23095 (F1E40_23325) | 9173..9460 | - | 288 | WP_000835347 | EexN family lipoprotein | - |
F1E40_RS23100 (F1E40_23330) | 9461..10219 | - | 759 | WP_000744200 | type IV secretion system protein | - |
F1E40_RS23105 (F1E40_23335) | 10230..12983 | - | 2754 | WP_001352869 | VirB3 family type IV secretion system protein | virB4 |
F1E40_RS23110 (F1E40_23340) | 13002..13298 | - | 297 | WP_000916182 | TrbC/VirB2 family protein | virB2 |
F1E40_RS23115 (F1E40_23345) | 13276..13923 | - | 648 | WP_000953526 | lytic transglycosylase domain-containing protein | virB1 |
F1E40_RS23120 (F1E40_23350) | 13956..14168 | - | 213 | WP_000125172 | hypothetical protein | - |
F1E40_RS23125 (F1E40_23355) | 14098..14616 | - | 519 | WP_001446885 | transcription termination/antitermination NusG family protein | - |
F1E40_RS23130 (F1E40_23360) | 14969..16135 | - | 1167 | WP_000539538 | MobP1 family relaxase | - |
F1E40_RS23135 (F1E40_23365) | 16138..16683 | - | 546 | WP_001426313 | DNA distortion polypeptide 1 | - |
F1E40_RS23140 (F1E40_23370) | 17009..17281 | - | 273 | WP_000160399 | hypothetical protein | - |
F1E40_RS23445 | 17294..17443 | - | 150 | WP_000003880 | hypothetical protein | - |
F1E40_RS23145 (F1E40_23375) | 17513..17692 | - | 180 | WP_000439078 | hypothetical protein | - |
F1E40_RS23150 (F1E40_23380) | 17735..18064 | - | 330 | WP_046623240 | hypothetical protein | - |
F1E40_RS23155 (F1E40_23385) | 18157..18372 | - | 216 | WP_001180116 | hypothetical protein | - |
F1E40_RS23160 (F1E40_23390) | 18362..18607 | - | 246 | WP_000356546 | hypothetical protein | - |
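The coordinates in the table above can be cross-checked against the protein lengths and the oriT position reported elsewhere in this record. The following is a minimal sketch with hypothetical helper names; it assumes 1-based inclusive coordinates and that each complete coding sequence ends in a single stop codon that is not counted in the protein length.

```python
# Selected loci from the table: locus tag -> (start, end) on the plasmid.
LOCI = {
    "F1E40_RS23065": (2067, 3902),     # T4CP
    "F1E40_RS23130": (14969, 16135),   # MobP1 family relaxase
    "F1E40_RS23135": (16138, 16683),   # DNA distortion polypeptide 1
    "F1E40_RS23140": (17009, 17281),   # hypothetical protein
    "F1E40_RS23445": (17294, 17443),   # hypothetical protein
}

ORIT = (16463, 17069)  # oriT coordinates from the record


def gene_size(start, end):
    """Size in bp of a 1-based, inclusive span."""
    return end - start + 1


def protein_length(start, end):
    """Expected protein length in a.a., assuming one terminal stop codon."""
    size = gene_size(start, end)
    if size % 3 != 0:
        # Spans that are not a whole number of codons are rejected here.
        raise ValueError(f"span of {size} bp is not a multiple of 3")
    return size // 3 - 1


def overlaps(a, b):
    """True if two inclusive spans share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


def loci_overlapping(span, loci):
    """Locus tags whose coordinates overlap the given span, sorted by name."""
    return sorted(tag for tag, coords in loci.items() if overlaps(span, coords))


if __name__ == "__main__":
    print(protein_length(*LOCI["F1E40_RS23130"]))  # relaxase
    print(protein_length(*LOCI["F1E40_RS23065"]))  # T4CP
    print(loci_overlapping(ORIT, LOCI))
```

With these assumptions, the relaxase and T4CP spans should reproduce the 388 a.a. and 611 a.a. lengths given in their sections, and the overlap check lists the annotated loci that the predicted oriT region intersects.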
Host bacterium
ID | 1801 | GenBank | NZ_VUJD01000023 |
Plasmid name | 159230|unnamed4 | Incompatibility group | - |
Plasmid size | 19770 bp | Coordinate of oriT [Strand] | 16463..17069 [+] |
Host bacterium | Salmonella enterica subsp. enterica serovar Kentucky strain 159230 |
Cargo genes
Drug resistance gene | - |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |