Detailed information of oriT

oriT


Information on the oriT region


oriTDB ID   103201
Name   oriT_IP40a in_silico
Organism   Escherichia coli strain K-12
Sequence Completeness      -
NCBI accession of oriT (coordinates [strand])   NZ_KX156772 (72764..72914 [+], 151 nt)
oriT length   151 nt
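
The stated oriT length follows from the coordinates, which are 1-based and inclusive. A minimal sketch of that arithmetic, assuming the location string keeps the exact layout shown in this record; `parse_location` is a hypothetical helper, not part of oriTDB or oriTfinder:

```python
import re

def parse_location(text):
    """Parse 'ACCESSION (start..end [strand], N nt)' into a dict.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    pattern = r"(\S+) \((\d+)\.\.(\d+) \[([+-])\], (\d+) nt\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    accession, start, end, strand, stated = match.groups()
    start, end = int(start), int(end)
    return {
        "accession": accession,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,      # computed from the coordinates
        "stated_length": int(stated),   # value printed in the record
    }

loc = parse_location("NZ_KX156772 (72764..72914 [+], 151 nt)")
print(loc["length"], loc["stated_length"])
```
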
IRs (inverted repeats)   93..98, 106..111  (TGGCCT..AGGCCA)
IRs (inverted repeats)   12..17, 24..29  (AACCCT..AGGGTT)
Location of nic site   _
Conserved sequence flanking the nic site   _
Note   Predicted by oriTfinder 2.0

  oriT sequence  


Length: 151 nt

>oriT_IP40a
TGGTCGATGTGAACCCTTTCGACAGGGTTATGAATGAATTGAAAAGTCGTGGCCGCAAGAACGCTCACATCCTGAGCATCCTCCAATTCGACTGGCCTGCATCGGAGGCCATCATCGAGAAGCTGAGCTGCTACATCACAGACGGGATTAA
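
The inverted repeats listed for this oriT can be checked directly against the sequence: each right arm should be the reverse complement of its left arm. The sketch below assumes the IR coordinates are 1-based, inclusive, and relative to the 151-nt oriT sequence rather than to the plasmid; the helper names are illustrative only.

```python
# oriT_IP40a sequence, copied from the FASTA record (151 nt).
ORIT = (
    "TGGTCGATGTGAACCCTTTCGACAGGGTTATGAATGAATTGAAAAGTCG"
    "TGGCCGCAAGAACGCTCACATCCTGAGCATCCTCCAATTCGAC"
    "TGGCCTGCATCGGAGGCCATCATCGAGAAGCTGAGCTGCTACATCACAGACGGGATTAA"
)

# IR arms as ((left_start, left_end), (right_start, right_end)).
IRS = [((93, 98), (106, 111)), ((12, 17), (24, 29))]

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def arm(seq, start, end):
    """Extract a 1-based inclusive interval from seq."""
    return seq[start - 1:end]

def check_irs(seq, irs):
    """Return (left, right, spacer_length, is_inverted_repeat) per IR pair."""
    results = []
    for (l_start, l_end), (r_start, r_end) in irs:
        left = arm(seq, l_start, l_end)
        right = arm(seq, r_start, r_end)
        spacer = r_start - l_end - 1  # nucleotides between the two arms
        results.append((left, right, spacer, reverse_complement(left) == right))
    return results

results = check_irs(ORIT, IRS)
for left, right, spacer, ok in results:
    print(f"{left}..{right}  spacer={spacer} nt  inverted repeat: {ok}")
```
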

Visualization of oriT structure

  oriT secondary structure

Predicted by RNAfold.



Relaxase


ID   2397
GenBank   WP_172688147
Name   TraI_2_HTI41_RS00500_IP40a in_silico
UniProt ID   _
Length   990 a.a.
PDB ID   _
Note   Predicted by oriTfinder 2.0

  Relaxase protein sequence


Length: 990 a.a.        Molecular weight: 109338.87 Da        Isoelectric Point: 4.9395

>WP_172688147.1 MULTISPECIES: MobH family relaxase [Gammaproteobacteria]
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPIFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET

  Protein domains


Predicted by InterProScan.

Domain position (a.a.): 51-357


  Protein structure



No available structure.




T4CP


ID   2153
GenBank   WP_000178857
Name   traD_HTI41_RS00505_IP40a in_silico
UniProt ID   _
Length   621 a.a.
PDB ID   _
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Length: 621 a.a.        Molecular weight: 69225.71 Da        Isoelectric Point: 6.9403

>WP_000178857.1 MULTISPECIES: conjugative transfer system coupling protein TraD [Gammaproteobacteria]
MTMSYDPLAYEMPWRPNYEKNAVAGWLAASGAALAVEQVSTMPPEPFYWMTGICGVMAMARLPKAIKLHL
LQKHLKGRDLEFISIAELQKYIKDTPDDMWLGSGFLWENRHAQRVFEILKRDWTSIVGRESTVKKVVRKI
QGKKKELPIGQPWIHGVEPKEEKLMQPLKHTEGHSLIVGTTGSGKTRMFDILISQAILRGEAVIIIDPKG
DKEMRDNARRACEAMGQPERFVSFHPAFPEESVRIDPLRNFTRVTEIASRLAALIPSEAGADPFKSFGWQ
ALNNIAQGLVITHDRPNLTKLRRFLEGGAAGLVIKAVQAYSERVMPDWEAEAAAYLEKVKNGSREKIAFA
LMKFYYDIIQPEHPNSDLEGLLSMFQHDQTHFSKMVANLLPIMNMLTSGELGPLLSPDSSDLSDERQITD
SAKIINNAQVAYLGLDSLTDNMVGSAMGSIFLSDLTAVAGDRYNYGVNNRPVNIFVDEAAEVINDPFIQL
LNKGRGAKLRLFVATQTFADFAARLGSKDKALQVLGNINNTFALRIVDGETQEYIADNLPKTRLKYVMRT
QGQNSDGKEPIMHGGNQGERLMEEEADLFPAQLLGMLPNLEYIAKISGGTIVKGRLPILTQ

  Protein domains


Predicted by InterProScan.

Domain positions (a.a.): 170-299, 471-598

  Protein structure



No available structure.



ID   2154
GenBank   WP_020833628
Name   traC_HTI41_RS00585_IP40a in_silico
UniProt ID   _
Length   815 a.a.
PDB ID   _
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Length: 815 a.a.        Molecular weight: 92590.83 Da        Isoelectric Point: 5.9216

>WP_020833628.1 MULTISPECIES: type IV secretion system protein TraC [Gammaproteobacteria]
MIVTIKKKLEETLIPEHLRAAGIIPVLAYDEDDHVFLMDDHSAGFGFMCEPLCGADEKVQERMNGFLNQE
FPSKTTLQFVLFRSPDINQEMYRMMGLRDGFRHELLTSVIKERINFLQHHTTDRIFAKTNKGIYDNGLIQ
DLKLFVTCKVPIKNNNPTESELQQLAQLRTKVESSLQTVGLRPRTMTAVNYIRIMSTILNWGPDASWRHD
SVDWEMDKPICEQIFDYGTDVEVSKNGIRLGDYHAKVMSAKKLPDVFYFGDALTYAGDLSGGNSSIKENY
MVVTNVFFPEAESTKNTLERKRQFTVNQAYGPMLKFVPVLADKKESFDTLYESMKEGAKPVKITYSVVLF
GPTKERVEAAAMAARNIWRESRFELMEDKFVALPMFLNCLPFCTDRDAVRDLFRYKTMTTEQAAVVLPVF
GEWKGTGTYHAALISRNGQLMSLSLHDSNTNKNLVIAAESGSGKSFLTNELIFSYLSEGAQVWVIDAGKS
YQKLSEMLNGDFVHFEEGTHVCLNPFELIQNYEDEEDAIVSLVCAMASAKGLLDEWQISALKQVLSRLWE
EKGKEMKVDDIAERCLEEENDQRLKDIGQQLYAFTSQGSYGKYFSRKNNVSFQNQFTVLELDELQGRKHL
RQVVLLQLIYQIQQEVFLGERNRKKVVIVDEAWDLLKEGEVSVFMEHAYRKFRKYGGSVVIATQSINDLY
ENAVGRAIAENSASMYLLGQTEETVESVKRSGRLTLSEGGFHTLKTVHTIQGVYSEIFIKSKSGMGVGRL
IVGDFQKLLYSTDPVDVNAIDQFVKQGMSIPEAIKAVMRSRRQAA

  Protein domains


Predicted by InterProScan.

Domain positions (a.a.): 21-255, 284-434, 449-809

  Protein structure



No available structure.




T4SS


The T4SS was predicted using oriTfinder 2.0.

Region 1: 70848..95994

Locus tag   Coordinates   Strand   Size (bp)   Protein ID   Product   Description
HTI41_RS00490   66879..67112   -   234   WP_001191890   hypothetical protein   -
HTI41_RS00495   67094..67711   -   618   WP_001249395   hypothetical protein   -
HTI41_RS00500   67879..70851   +   2973   WP_172688147   MobH family relaxase   -
HTI41_RS00505   70848..72713   +   1866   WP_000178857   conjugative transfer system coupling protein TraD   virb4
HTI41_RS00510   72724..73308   +   585   WP_000332868   hypothetical protein   -
HTI41_RS00515   73265..73894   +   630   WP_000743449   DUF4400 domain-containing protein   tfc7
HTI41_RS00520   73904..74350   +   447   WP_000122507   hypothetical protein   -
HTI41_RS00525   74360..74737   +   378   WP_000869297   hypothetical protein   -
HTI41_RS00530   74737..75399   +   663   WP_001231464   hypothetical protein   -
HTI41_RS00535   75580..75723   +   144   WP_001275801   hypothetical protein   -
HTI41_RS00540   75735..76100   +   366   WP_001052531   hypothetical protein   -
HTI41_RS00545   76245..76526   +   282   WP_000805625   type IV conjugative transfer system protein TraL   traL
HTI41_RS00550   76523..77149   +   627   WP_001049717   TraE/TraK family type IV conjugative transfer system protein   traE
HTI41_RS00555   77133..78050   +   918   WP_000794249   type-F conjugative transfer system secretin TraK   traK
HTI41_RS00560   78050..79363   +   1314   WP_024131605   TraB/VirB10 family protein   traB
HTI41_RS00565   79360..79938   +   579   WP_000793435   type IV conjugative transfer system lipoprotein TraV   traV
HTI41_RS00570   79942..80334   +   393   WP_000479535   TraA family conjugative transfer protein   -
HTI41_RS00575   80535..86021   +   5487   WP_000606835   Ig-like domain-containing protein   -
HTI41_RS00580   86170..86877   +   708   WP_001259346   DsbC family protein   -
HTI41_RS00585   86874..89321   +   2448   WP_020833628   type IV secretion system protein TraC   virb4
HTI41_RS00590   89336..89653   +   318   WP_000351984   hypothetical protein   -
HTI41_RS00595   89650..90180   +   531   WP_001010740   S26 family signal peptidase   -
HTI41_RS00600   90143..91408   +   1266   WP_001447719   TrbC family F-type conjugative pilus assembly protein   traW
HTI41_RS00605   91405..92076   +   672   WP_000575345   EAL domain-containing protein   -
HTI41_RS00610   92076..93080   +   1005   WP_005507569   TraU family protein   traU
HTI41_RS00615   93196..95994   +   2799   WP_072503043   conjugal transfer mating pair stabilization protein TraN   traN
HTI41_RS00620   96033..96893   -   861   WP_172688148   hypothetical protein   -
HTI41_RS00625   97016..97657   -   642   WP_000796664   hypothetical protein   -
HTI41_RS00630   97951..98271   +   321   WP_000547566   hypothetical protein   -
HTI41_RS00635   98575..98760   +   186   WP_001186917   hypothetical protein   -
HTI41_RS00640   98980..99948   +   969   WP_000085162   AAA family ATPase   -
HTI41_RS00645   99959..100867   +   909   WP_000739139   hypothetical protein   -
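
The coordinates in the table can be cross-checked against the sizes and protein lengths reported elsewhere in this record. The sketch below covers a few rows only and rests on two assumptions: coordinates are 1-based and inclusive, and each gene is a complete CDS whose size includes the stop codon (so protein length = size / 3 - 1). `Feature` and `overlaps` are illustrative names, not oriTfinder output.

```python
from typing import NamedTuple

class Feature(NamedTuple):
    locus_tag: str
    start: int
    end: int
    strand: str

    @property
    def size_bp(self):
        # 1-based inclusive coordinates
        return self.end - self.start + 1

    @property
    def protein_length(self):
        # Assumes a complete CDS that includes its stop codon.
        return self.size_bp // 3 - 1

def overlaps(a_start, a_end, b_start, b_end):
    """True if two inclusive intervals share at least one position."""
    return a_start <= b_end and b_start <= a_end

# Selected rows from the table above.
FEATURES = [
    Feature("HTI41_RS00500", 67879, 70851, "+"),  # MobH family relaxase
    Feature("HTI41_RS00505", 70848, 72713, "+"),  # TraD
    Feature("HTI41_RS00510", 72724, 73308, "+"),  # hypothetical protein
    Feature("HTI41_RS00585", 86874, 89321, "+"),  # TraC
]
ORIT = (72764, 72914)      # oriT coordinates on NZ_KX156772
REGION_1 = (70848, 95994)  # T4SS Region 1

sizes = {f.locus_tag: f.size_bp for f in FEATURES}
protein_lengths = {f.locus_tag: f.protein_length for f in FEATURES}
orit_hits = [f.locus_tag for f in FEATURES if overlaps(f.start, f.end, *ORIT)]
orit_in_region = REGION_1[0] <= ORIT[0] and ORIT[1] <= REGION_1[1]

print(sizes)
print(protein_lengths)
print(orit_hits, orit_in_region)
```

Under these assumptions the computed protein lengths match the 990, 621, and 815 a.a. reported for the relaxase, TraD, and TraC entries, and the oriT interval falls inside Region 1, overlapping HTI41_RS00510.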


Host bacterium


ID   3644
GenBank   NZ_KX156772
Plasmid name   IP40a
Incompatibility group   IncA/C2
Plasmid size   170404 bp
Coordinate of oriT [Strand]   72764..72914 [+]
Host bacterium   Escherichia coli strain K-12

Cargo genes


Drug resistance gene   sul2, blaTEM-1D, aph(3')-Ia
Virulence gene   -
Metal resistance gene   arsH
Degradation gene   -
Symbiosis gene   -
Anti-CRISPR   -