Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 103201 |
Name | oriT_IP40a |
Organism | Escherichia coli strain K-12 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_KX156772 (72764..72914 [+], 151 nt) |
oriT length | 151 nt |
IRs (inverted repeats) | 93..98, 106..111 (TGGCCT..AGGCCA) 12..17, 24..29 (AACCCT..AGGGTT) |
Location of nic site | _ |
Conserved sequence flanking the nic site |
_ |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 151 nt
>oriT_IP40a
TGGTCGATGTGAACCCTTTCGACAGGGTTATGAATGAATTGAAAAGTCGTGGCCGCAAGAACGCTCACATCCTGAGCATCCTCCAATTCGACTGGCCTGCATCGGAGGCCATCATCGAGAAGCTGAGCTGCTACATCACAGACGGGATTAA
TGGTCGATGTGAACCCTTTCGACAGGGTTATGAATGAATTGAAAAGTCGTGGCCGCAAGAACGCTCACATCCTGAGCATCCTCCAATTCGACTGGCCTGCATCGGAGGCCATCATCGAGAAGCTGAGCTGCTACATCACAGACGGGATTAA
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 2397 | GenBank | WP_172688147 |
Name | TraI_2_HTI41_RS00500_IP40a | UniProt ID | _ |
Length | 990 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 990 a.a. Molecular weight: 109338.87 Da Isoelectric Point: 4.9395
>WP_172688147.1 MULTISPECIES: MobH family relaxase [Gammaproteobacteria]
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPIFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET
MLKALNKLFGGRSGVIETAPSVRVLPLKDVEDEEIPRYPPFAKGLPVAPLDKILATQAELIEKVRNSLGF
TVDDFNRLVLPVIQRYAAFVHLLPASESHHHRGAGGLFRHGLEVAFWAAQASESVIFSIEGTPRERRDNE
PRWRLASCFSGLLHDVGKPLSDVSITDKDGSITWNPYSESLHDWAHRHEIDRYFIRWRDKRHKRHEQFSL
LAVDRIIPAETREFLSKSGPSIMEAMLEAISGTSVNQPVTKLMLRADQESVSRDLRQSRLDVDEFSYGVP
VERYVFDAIRRLVKTGKWKVNEPGAKVWHLNQGVFIAWKQLGDLYDLISHDKIPGIPRDPDTLADILIER
GFAVPNTVQEKGERAYYRYWEVLPEMLQEAAGSVKILMLRLESNDLVFTTEPPAAVAAEVVGDVEDAEIE
FVDPEEVDDDQEEDVSALNDDMLAAEQEAEKALAGLGFGDAMEMLKSTSDAVEEKPEQKDAGSTESSKPD
AGKKGKPQSKPGKAKPKSDTEKQPHKPEAKEDLSPQDIAKNAPPLANDNPLQALKDVGGGLGDIDFPFDA
FSASAETASTDATNSEIPDVAMPGKQEKQPKQDFVPQEQNSLQGDDFPIFGSSDEPPSWAIEPLPMLTDA
PEQTTPAPAMPPTDKPNLHEKDAKTLLVEMLAGYGEASALLEQAIMPVLEGKTTLGEVLCLMKGQAVILY
PDGARSLGAPSEVLSKLSHANAIVPDPIMPGRKVRDFSGVKAIVLAEQLSDAVVAAIKDAEASMGGYQDA
FELVSPPGLDASKNKSAPKQQSRKKAQQQKPEVNAGKPSPEQKAKGKDSQPQQKEKKVDVTSPVEEPQRQ
PVQEKQNVARLPKREVQPVAPEPKVEREKELGHVEVREREEPEVREFEPPKAKTNPKDINAEDFLPSGVT
PQKALQMLKDMIQKRSGRWLVTPVLEEDGCLVTSDKAFDMIAGENIGISKHILCGMLSRAQRRPLLKKRQ
GKLYLEVNET
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4CP
ID | 2153 | GenBank | WP_000178857 |
Name | traD_HTI41_RS00505_IP40a | UniProt ID | _ |
Length | 621 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 621 a.a. Molecular weight: 69225.71 Da Isoelectric Point: 6.9403
>WP_000178857.1 MULTISPECIES: conjugative transfer system coupling protein TraD [Gammaproteobacteria]
MTMSYDPLAYEMPWRPNYEKNAVAGWLAASGAALAVEQVSTMPPEPFYWMTGICGVMAMARLPKAIKLHL
LQKHLKGRDLEFISIAELQKYIKDTPDDMWLGSGFLWENRHAQRVFEILKRDWTSIVGRESTVKKVVRKI
QGKKKELPIGQPWIHGVEPKEEKLMQPLKHTEGHSLIVGTTGSGKTRMFDILISQAILRGEAVIIIDPKG
DKEMRDNARRACEAMGQPERFVSFHPAFPEESVRIDPLRNFTRVTEIASRLAALIPSEAGADPFKSFGWQ
ALNNIAQGLVITHDRPNLTKLRRFLEGGAAGLVIKAVQAYSERVMPDWEAEAAAYLEKVKNGSREKIAFA
LMKFYYDIIQPEHPNSDLEGLLSMFQHDQTHFSKMVANLLPIMNMLTSGELGPLLSPDSSDLSDERQITD
SAKIINNAQVAYLGLDSLTDNMVGSAMGSIFLSDLTAVAGDRYNYGVNNRPVNIFVDEAAEVINDPFIQL
LNKGRGAKLRLFVATQTFADFAARLGSKDKALQVLGNINNTFALRIVDGETQEYIADNLPKTRLKYVMRT
QGQNSDGKEPIMHGGNQGERLMEEEADLFPAQLLGMLPNLEYIAKISGGTIVKGRLPILTQ
MTMSYDPLAYEMPWRPNYEKNAVAGWLAASGAALAVEQVSTMPPEPFYWMTGICGVMAMARLPKAIKLHL
LQKHLKGRDLEFISIAELQKYIKDTPDDMWLGSGFLWENRHAQRVFEILKRDWTSIVGRESTVKKVVRKI
QGKKKELPIGQPWIHGVEPKEEKLMQPLKHTEGHSLIVGTTGSGKTRMFDILISQAILRGEAVIIIDPKG
DKEMRDNARRACEAMGQPERFVSFHPAFPEESVRIDPLRNFTRVTEIASRLAALIPSEAGADPFKSFGWQ
ALNNIAQGLVITHDRPNLTKLRRFLEGGAAGLVIKAVQAYSERVMPDWEAEAAAYLEKVKNGSREKIAFA
LMKFYYDIIQPEHPNSDLEGLLSMFQHDQTHFSKMVANLLPIMNMLTSGELGPLLSPDSSDLSDERQITD
SAKIINNAQVAYLGLDSLTDNMVGSAMGSIFLSDLTAVAGDRYNYGVNNRPVNIFVDEAAEVINDPFIQL
LNKGRGAKLRLFVATQTFADFAARLGSKDKALQVLGNINNTFALRIVDGETQEYIADNLPKTRLKYVMRT
QGQNSDGKEPIMHGGNQGERLMEEEADLFPAQLLGMLPNLEYIAKISGGTIVKGRLPILTQ
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
ID | 2154 | GenBank | WP_020833628 |
Name | traC_HTI41_RS00585_IP40a | UniProt ID | _ |
Length | 815 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 815 a.a. Molecular weight: 92590.83 Da Isoelectric Point: 5.9216
>WP_020833628.1 MULTISPECIES: type IV secretion system protein TraC [Gammaproteobacteria]
MIVTIKKKLEETLIPEHLRAAGIIPVLAYDEDDHVFLMDDHSAGFGFMCEPLCGADEKVQERMNGFLNQE
FPSKTTLQFVLFRSPDINQEMYRMMGLRDGFRHELLTSVIKERINFLQHHTTDRIFAKTNKGIYDNGLIQ
DLKLFVTCKVPIKNNNPTESELQQLAQLRTKVESSLQTVGLRPRTMTAVNYIRIMSTILNWGPDASWRHD
SVDWEMDKPICEQIFDYGTDVEVSKNGIRLGDYHAKVMSAKKLPDVFYFGDALTYAGDLSGGNSSIKENY
MVVTNVFFPEAESTKNTLERKRQFTVNQAYGPMLKFVPVLADKKESFDTLYESMKEGAKPVKITYSVVLF
GPTKERVEAAAMAARNIWRESRFELMEDKFVALPMFLNCLPFCTDRDAVRDLFRYKTMTTEQAAVVLPVF
GEWKGTGTYHAALISRNGQLMSLSLHDSNTNKNLVIAAESGSGKSFLTNELIFSYLSEGAQVWVIDAGKS
YQKLSEMLNGDFVHFEEGTHVCLNPFELIQNYEDEEDAIVSLVCAMASAKGLLDEWQISALKQVLSRLWE
EKGKEMKVDDIAERCLEEENDQRLKDIGQQLYAFTSQGSYGKYFSRKNNVSFQNQFTVLELDELQGRKHL
RQVVLLQLIYQIQQEVFLGERNRKKVVIVDEAWDLLKEGEVSVFMEHAYRKFRKYGGSVVIATQSINDLY
ENAVGRAIAENSASMYLLGQTEETVESVKRSGRLTLSEGGFHTLKTVHTIQGVYSEIFIKSKSGMGVGRL
IVGDFQKLLYSTDPVDVNAIDQFVKQGMSIPEAIKAVMRSRRQAA
MIVTIKKKLEETLIPEHLRAAGIIPVLAYDEDDHVFLMDDHSAGFGFMCEPLCGADEKVQERMNGFLNQE
FPSKTTLQFVLFRSPDINQEMYRMMGLRDGFRHELLTSVIKERINFLQHHTTDRIFAKTNKGIYDNGLIQ
DLKLFVTCKVPIKNNNPTESELQQLAQLRTKVESSLQTVGLRPRTMTAVNYIRIMSTILNWGPDASWRHD
SVDWEMDKPICEQIFDYGTDVEVSKNGIRLGDYHAKVMSAKKLPDVFYFGDALTYAGDLSGGNSSIKENY
MVVTNVFFPEAESTKNTLERKRQFTVNQAYGPMLKFVPVLADKKESFDTLYESMKEGAKPVKITYSVVLF
GPTKERVEAAAMAARNIWRESRFELMEDKFVALPMFLNCLPFCTDRDAVRDLFRYKTMTTEQAAVVLPVF
GEWKGTGTYHAALISRNGQLMSLSLHDSNTNKNLVIAAESGSGKSFLTNELIFSYLSEGAQVWVIDAGKS
YQKLSEMLNGDFVHFEEGTHVCLNPFELIQNYEDEEDAIVSLVCAMASAKGLLDEWQISALKQVLSRLWE
EKGKEMKVDDIAERCLEEENDQRLKDIGQQLYAFTSQGSYGKYFSRKNNVSFQNQFTVLELDELQGRKHL
RQVVLLQLIYQIQQEVFLGERNRKKVVIVDEAWDLLKEGEVSVFMEHAYRKFRKYGGSVVIATQSINDLY
ENAVGRAIAENSASMYLLGQTEETVESVKRSGRLTLSEGGFHTLKTVHTIQGVYSEIFIKSKSGMGVGRL
IVGDFQKLLYSTDPVDVNAIDQFVKQGMSIPEAIKAVMRSRRQAA
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 70848..95994
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
HTI41_RS00490 | 66879..67112 | - | 234 | WP_001191890 | hypothetical protein | - |
HTI41_RS00495 | 67094..67711 | - | 618 | WP_001249395 | hypothetical protein | - |
HTI41_RS00500 | 67879..70851 | + | 2973 | WP_172688147 | MobH family relaxase | - |
HTI41_RS00505 | 70848..72713 | + | 1866 | WP_000178857 | conjugative transfer system coupling protein TraD | virb4 |
HTI41_RS00510 | 72724..73308 | + | 585 | WP_000332868 | hypothetical protein | - |
HTI41_RS00515 | 73265..73894 | + | 630 | WP_000743449 | DUF4400 domain-containing protein | tfc7 |
HTI41_RS00520 | 73904..74350 | + | 447 | WP_000122507 | hypothetical protein | - |
HTI41_RS00525 | 74360..74737 | + | 378 | WP_000869297 | hypothetical protein | - |
HTI41_RS00530 | 74737..75399 | + | 663 | WP_001231464 | hypothetical protein | - |
HTI41_RS00535 | 75580..75723 | + | 144 | WP_001275801 | hypothetical protein | - |
HTI41_RS00540 | 75735..76100 | + | 366 | WP_001052531 | hypothetical protein | - |
HTI41_RS00545 | 76245..76526 | + | 282 | WP_000805625 | type IV conjugative transfer system protein TraL | traL |
HTI41_RS00550 | 76523..77149 | + | 627 | WP_001049717 | TraE/TraK family type IV conjugative transfer system protein | traE |
HTI41_RS00555 | 77133..78050 | + | 918 | WP_000794249 | type-F conjugative transfer system secretin TraK | traK |
HTI41_RS00560 | 78050..79363 | + | 1314 | WP_024131605 | TraB/VirB10 family protein | traB |
HTI41_RS00565 | 79360..79938 | + | 579 | WP_000793435 | type IV conjugative transfer system lipoprotein TraV | traV |
HTI41_RS00570 | 79942..80334 | + | 393 | WP_000479535 | TraA family conjugative transfer protein | - |
HTI41_RS00575 | 80535..86021 | + | 5487 | WP_000606835 | Ig-like domain-containing protein | - |
HTI41_RS00580 | 86170..86877 | + | 708 | WP_001259346 | DsbC family protein | - |
HTI41_RS00585 | 86874..89321 | + | 2448 | WP_020833628 | type IV secretion system protein TraC | virb4 |
HTI41_RS00590 | 89336..89653 | + | 318 | WP_000351984 | hypothetical protein | - |
HTI41_RS00595 | 89650..90180 | + | 531 | WP_001010740 | S26 family signal peptidase | - |
HTI41_RS00600 | 90143..91408 | + | 1266 | WP_001447719 | TrbC family F-type conjugative pilus assembly protein | traW |
HTI41_RS00605 | 91405..92076 | + | 672 | WP_000575345 | EAL domain-containing protein | - |
HTI41_RS00610 | 92076..93080 | + | 1005 | WP_005507569 | TraU family protein | traU |
HTI41_RS00615 | 93196..95994 | + | 2799 | WP_072503043 | conjugal transfer mating pair stabilization protein TraN | traN |
HTI41_RS00620 | 96033..96893 | - | 861 | WP_172688148 | hypothetical protein | - |
HTI41_RS00625 | 97016..97657 | - | 642 | WP_000796664 | hypothetical protein | - |
HTI41_RS00630 | 97951..98271 | + | 321 | WP_000547566 | hypothetical protein | - |
HTI41_RS00635 | 98575..98760 | + | 186 | WP_001186917 | hypothetical protein | - |
HTI41_RS00640 | 98980..99948 | + | 969 | WP_000085162 | AAA family ATPase | - |
HTI41_RS00645 | 99959..100867 | + | 909 | WP_000739139 | hypothetical protein | - |
Host bacterium
ID | 3644 | GenBank | NZ_KX156772 |
Plasmid name | IP40a | Incompatibility group | IncA/C2 |
Plasmid size | 170404 bp | Coordinate of oriT [Strand] | 72764..72914 [+] |
Host baterium | Escherichia coli strain K-12 |
Cargo genes
Drug resistance gene | sul2, blaTEM-1D, aph(3')-Ia |
Virulence gene | - |
Metal resistance gene | arsH |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |