Detailed information of oriT

oriT


The information of the oriT region


oriTDB ID   103397
Name   oriT_pSX5G-tetX4 in_silico
Organism   Escherichia sp. strain SX5G
Sequence Completeness      -
NCBI accession of oriT (coordinates [strand])   NZ_MW940624 (45189..45804 [-], 616 nt)
oriT length   616 nt
IRs (inverted repeats)      599..604, 609..614  (ATGCCC..GGGCAT)
 526..532, 538..544  (CCTCCCG..CGGGAGG)
 462..467, 471..476  (GTTCGC..GCGAAC)
 52..58, 62..68  (CATTATC..GATAATG)
 40..45, 54..59  (TGATAA..TTATCA)
Location of nic site      322..323
Conserved sequence flanking the
  nic site  
 
 TCCTGCATCG
Note   Predicted by oriTfinder 2.0

  oriT sequence  


Download         Length: 616 nt

>oriT_pSX5G-tetX4
TCTTACTTCTTTGCGTAGCTGTTAAATACAGCGTTGTTTTGATAAAATCATCATTATCATCGATAATGCTTTCTTCAATTTTTTTATCCTTACTCTTTAATAAAGCACTTGCTAATAACTTCATACCTTTTGCAACTGTCAAATTTGGTTCATCAGGGTAAATGCTTTTAAGGCATACTAACAAATAATCATGGTCTTCATCTTCAACTCTAAACTGAATTTTTTTCATCATAACTCCCAACAAGAACCGACTGTAGGTCACCGGGCAAACGCTGAAAAATAACGTCGAATGACGTCATTTTGCGGCGTTTGCCCTATCCTGCATCGCAGTGAATCTGGCGCCGCTCATAAATTGTGCTTGAGCATCAACAAAAATGCAAAAAGGCTGATGTTGTCATGGCTGAACATACGTACGTCTGCGACCGGAAACCCGATCAATGCGGTAAGGACAGTGCCGTCAGGTTCGCTCCGCGAACGGTCTGGCAGGAAGCGCAAGCGCAGTCCCGCGCAATATATCAGAAAGCACCTCCCGAAATACGGGAGGGCTTCGGCTATTCAGCTAAGGTTGCTCTGTTCAGGTTGCTGAGTGTAGCCTTCCATGCCCTGACGGGCATCA

Visualization of oriT structure

  oriT secondary structure

Predicted by RNAfold.

Download structure file


T4CP


ID   2315 GenBank   WP_025755766
Name   t4cp2_KSF41_RS00015_pSX5G-tetX4 insolico UniProt ID   _
Length   611 a.a. PDB ID   _
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Download         Length: 611 a.a.        Molecular weight: 69235.06 Da        Isoelectric Point: 8.0208

>WP_025755766.1 MULTISPECIES: type IV secretory system conjugative DNA transfer family protein [Enterobacteriaceae]
MSLKLPDKGQWAFIILVMCLIAYYTGSVAIYFFNHKTTLYIWKHYDSLLLWRVIIDSSIKSEVRFTALPS
LLSGALASLAAPAFIIWKLNQKDAPLFGDAKFASDSDLKKSKLLKWEKENDTDILVGRYKGKYLWYTAPD
FVSLGAGTRAGKGAAIGIPNMLVKKHSIIALDPKQELWKITSKVREQILGNKVYLLDPFNTKTHKCNPLF
YVDLKSESGAKDLLKLVEILFPSFGLTGAEAHFNNLAGQYWTGLAKLLHFFINFAPEWIEELRVKPVFSI
GTVVDLYSNIDREQVLSNRETFEELAQGNETALFHLKDALTKIREYHETEDEQRSSIDGSFRKKMSLFYL
PTVRKCTDGNDFDLRQMRREDITVYVGVNAEDMILAYDFLNLFFNLVVEVTLRENPDFDPTLKHDCLLFL
DEFPSIGYMPIIKKGSGYIAGYKLKLLTIYQNISQLNEIYGTEGAKTLLSAHPCRVIYAVSEEDDAKKIS
EKLGYITARSKGSSKTSGKATSRSNSENEAQRALVLPQELGTLDFSEEMIILKGEHPVKAEKALYYLDNY
FMERLMLVSPKLTALTASINRSNKIFGVKGIKYPSKEKMLSVGELESEVLL

  Protein domains


Predicted by InterproScan.

(102-581)

  Protein structure



No available structure.



ID   2316 GenBank   WP_000108725
Name   t4cp2_KSF41_RS00305_pSX5G-tetX4 insolico UniProt ID   _
Length   919 a.a. PDB ID   _
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Download         Length: 919 a.a.        Molecular weight: 104758.78 Da        Isoelectric Point: 6.1962

>WP_000108725.1 MULTISPECIES: VirB3 family type IV secretion system protein [Enterobacteriaceae]
MSTVFKGLTRPALIRGLGVPLYPFLAMCVLVVLLGVWIDDLMYFLILPGWFAIKRVTKIDERFFDLLYLR
AVVKGHPLANKRFSAVHYAGSQYDEVDISKVDNFMKLKDQASIEALIPYSSHITDDLVVTKNRDLLATWQ
VDGAYFECVDDEDLTLLTDQLNTLIRSFEGRAITFYTHRVRVRKEVKVGFNSKIPFVNRVMNDYYSSLSE
PEYFENRLYLTICYKPFNIEDKVSHFISKNKKENNIFDEPINDMNEICGRLDTYLSRFHAHRLGLVEENG
RVFSDQLSLFQYLLSGKWQKVRVTNSPFYTYLGGKDLFFGNDAGQITASQNARYFRSIEIKDYFQETDAG
IFDALMYLPVEYVFTSSFTSMDKQAAVKALDDQIDKLELTDDAAKSLLADLRVGLDMVSSGYNSFGKCHQ
TLIVFADTPERLVKDTNIVTTTLEDLGLIVTYSTLSLGAAFFSQLPGNYNLRPRLSMLSSLNFAEMESFH
NFFHGKATGNTWGNSLMALRGSGNDVYHLNYHMTTENINFFGKNPTLGHCEILGTSNVGKTVLMMIMSYA
AQQFGTPESFPENRKVRKLTTVFFDKDRAGEVGIRAMGGAYFRVKGGEPTGWNPAALPPTKRNIAFVKDI
IRLICKLNGNTVDDYQHSLISSAVDRLMQREDRSFPISKLIPLIMEPDDTETKRHGLKARLRAWKQGGEY
GWLLDNASDSFDVAHLDVFGIDGTEFLDDKVVAPVASFYLIYRVTMLADGRRLLIYMDEFWQWINNDAFK
DFVYNKLKTGRKLDMVLVPATQSPDELIKSPIAAAVREQCATHIYLANPKAKRNEYVDELQVRDLYFDKI
KAIDPLSRQFLVVKNPQRQGERDDFAAFAKLDLGKAAYYLPILSASAEQLELFDEIWSEGMAPEEWIDTY
LERANRGVK

  Protein domains


Predicted by InterproScan.

(689-817)

(272-474)

(2-83)

  Protein structure



No available structure.




T4SS


T4SS were predicted by using oriTfinder2.

Region 1: 48358..55982

Locus tag Coordinates Strand Size (bp) Protein ID Product Description
KSF41_RS00255 (CAAMOENC_00049) 43656..43901 + 246 WP_000356546 hypothetical protein -
KSF41_RS00260 (CAAMOENC_00050) 43891..44106 + 216 WP_001180116 hypothetical protein -
KSF41_RS00265 (CAAMOENC_00051) 44199..44528 + 330 WP_000866648 hypothetical protein -
KSF41_RS00270 (CAAMOENC_00052) 44767..44964 + 198 WP_001675595 hypothetical protein -
KSF41_RS00275 (CAAMOENC_00053) 44977..45249 + 273 WP_000160399 hypothetical protein -
KSF41_RS00280 (CAAMOENC_00055) 45576..46121 + 546 WP_038976855 DNA distortion polypeptide 1 -
KSF41_RS00285 (CAAMOENC_00056) 46124..47281 + 1158 WP_000538023 DNA distortion polypeptide 2 -
KSF41_RS00290 (CAAMOENC_00057) 47642..48133 + 492 WP_000872475 transcription termination/antitermination NusG family protein -
KSF41_RS00295 (CAAMOENC_00059) 48358..49002 + 645 WP_000953539 lytic transglycosylase domain-containing protein virB1
KSF41_RS00300 (CAAMOENC_00060) 48986..49276 + 291 WP_000921916 TrbC/VirB2 family protein virB2
KSF41_RS00305 (CAAMOENC_00061) 49300..52059 + 2760 WP_000108725 VirB3 family type IV secretion system protein virb4
KSF41_RS00310 (CAAMOENC_00062) 52061..52798 + 738 WP_000737859 type IV secretion system protein -
KSF41_RS00315 (CAAMOENC_00063) 52808..53068 + 261 WP_001228869 EexN family lipoprotein -
KSF41_RS00320 (CAAMOENC_00064) 53080..54135 + 1056 WP_000235774 type IV secretion system protein virB6
KSF41_RS00325 (CAAMOENC_00065) 54325..55047 + 723 WP_000394570 type IV secretion system protein virB8
KSF41_RS00330 (CAAMOENC_00066) 55053..55982 + 930 WP_025755764 TrbG/VirB9 family P-type conjugative transfer protein virB9


Host bacterium


ID   3840 GenBank   NZ_MW940624
Plasmid name   pSX5G-tetX4 Incompatibility group   IncX1
Plasmid size   57105 bp Coordinate of oriT [Strand]   45189..45804 [-]
Host baterium   Escherichia sp. strain SX5G

Cargo genes


Drug resistance gene   blaSHV-12, tet(A), floR, tet(X4), lnu(F), aadA2
Virulence gene   -
Metal resistance gene   -
Degradation gene   -
Symbiosis gene   -
Anti-CRISPR   -