Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 103207 |
Name | oriT_pNDM5_WCHEC0215 |
Organism | Escherichia coli strain WCHEC005215 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_KY435936 (41039..41656 [+], 618 nt) |
oriT length | 618 nt |
IRs (inverted repeats) | 589..594, 608..613 (TCCATG..CATGGA) 592..597, 602..607 (ATGCCC..GGGCAT) 520..527, 530..537 (CCTCCCGT..ACGGGAGG) 455..460, 464..469 (ATTCGC..GCGAAT) 197..202, 208..213 (TTCAAT..ATTGAA) 85..90, 96..101 (TGCTTT..AAAGCA) |
Location of nic site | 315..316 |
Conserved sequence flanking the nic site |
TCCTGCATCG |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 618 nt
>oriT_pNDM5_WCHEC0215
TTCTTTGTTTCCCTGTCAAATACATTGTGGTTTTGATAAAATCATCATTGTCTTTTGACACATTAACATCTTCATTTTTAACCTTGCTTTTCAGTAAAGCATTGGCAAGCAGCTTCATTCCTTTTGCCACGGTTAAACCCGGTTCATCTGGATAAAGTGTTTTAAGACAATCAAGTAGATCATCATGTTGGCTTTCTTCAATCCTGAATTGAACTTTTTTCATAAAACTCTCCAACAAGAACCGACTGTAGGTCACCGGGCAAACGTTGCGGAATGGCGTCAGAGACGTCATTTTGCGGCGTTTGCCCTATCCTGCATCGCAGTGAACTACCTCCACACATAAATTGTGCTTCAGCGTCCATCGAAATGCAAAAAATGAAAGGCTATCGTGGCTGTGATCTGGTGCGTCTGCGACCGGAGACAACGGATTCGCGGCTAACTTGGGCGGAAATAAATTCGCTACGCGAATGGCCTGGCAGGGGGCGCGAGCGCTGTTTTACGGAATATACAAAAAAAGCACCTCCCGTAAACGGGAGGGCTTCGGCGATTCAGGAACGGGAATTTTATTCTGCCTCTGGTGTGTCGCCTTCCATGCCCTGCCGGGCATCATGGAGCCAG
TTCTTTGTTTCCCTGTCAAATACATTGTGGTTTTGATAAAATCATCATTGTCTTTTGACACATTAACATCTTCATTTTTAACCTTGCTTTTCAGTAAAGCATTGGCAAGCAGCTTCATTCCTTTTGCCACGGTTAAACCCGGTTCATCTGGATAAAGTGTTTTAAGACAATCAAGTAGATCATCATGTTGGCTTTCTTCAATCCTGAATTGAACTTTTTTCATAAAACTCTCCAACAAGAACCGACTGTAGGTCACCGGGCAAACGTTGCGGAATGGCGTCAGAGACGTCATTTTGCGGCGTTTGCCCTATCCTGCATCGCAGTGAACTACCTCCACACATAAATTGTGCTTCAGCGTCCATCGAAATGCAAAAAATGAAAGGCTATCGTGGCTGTGATCTGGTGCGTCTGCGACCGGAGACAACGGATTCGCGGCTAACTTGGGCGGAAATAAATTCGCTACGCGAATGGCCTGGCAGGGGGCGCGAGCGCTGTTTTACGGAATATACAAAAAAAGCACCTCCCGTAAACGGGAGGGCTTCGGCGATTCAGGAACGGGAATTTTATTCTGCCTCTGGTGTGTCGCCTTCCATGCCCTGCCGGGCATCATGGAGCCAG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 2401 | GenBank | WP_004199101 |
Name | Relaxase_BV871_RS00250_pNDM5_WCHEC0215 | UniProt ID | A0A515HL38 |
Length | 386 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 386 a.a. Molecular weight: 44388.82 Da Isoelectric Point: 10.6017
>WP_004199101.1 MULTISPECIES: MobP1 family relaxase [Enterobacterales]
MGVYVDKEFRVKRKSSEAGRKSAFAHKVKNGGKNYQRNVQERINRKGASKEVVVKISGGAITRQGVRNSI
DYMSRESELPVMSESGQVWKGDEIQEAKEHMIDRANDPQNVFDDKGKENKKVTQNIVFSPPVSAKVKPED
LLESVRKTMNKKYPNHRFVLGYHNDKKEHPHVHVVFRIRDNDGKRADIRKKDLREIRTGFCEELKMKGYD
VKATHKQQHGLNQSIKDAHKTAPKRQKGVYEVVDVGYDHYQNDKTKPKQHFIKLKTLNKGVEKTYWGADF
GELTTRENVKKGDLVKLKKLGQKEVKIPALDKNGVQHGWKTAHRNEWQLENLGVKGIDRISSSSKELVLN
SAEMIKKQQLQMRNFSQIKQSMIQSEQKVKIGIRLG
MGVYVDKEFRVKRKSSEAGRKSAFAHKVKNGGKNYQRNVQERINRKGASKEVVVKISGGAITRQGVRNSI
DYMSRESELPVMSESGQVWKGDEIQEAKEHMIDRANDPQNVFDDKGKENKKVTQNIVFSPPVSAKVKPED
LLESVRKTMNKKYPNHRFVLGYHNDKKEHPHVHVVFRIRDNDGKRADIRKKDLREIRTGFCEELKMKGYD
VKATHKQQHGLNQSIKDAHKTAPKRQKGVYEVVDVGYDHYQNDKTKPKQHFIKLKTLNKGVEKTYWGADF
GELTTRENVKKGDLVKLKKLGQKEVKIPALDKNGVQHGWKTAHRNEWQLENLGVKGIDRISSSSKELVLN
SAEMIKKQQLQMRNFSQIKQSMIQSEQKVKIGIRLG
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | A0A515HL38 |
T4CP
ID | 2159 | GenBank | WP_000053822 |
Name | t4cp2_BV871_RS00180_pNDM5_WCHEC0215 | UniProt ID | _ |
Length | 611 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 611 a.a. Molecular weight: 69204.04 Da Isoelectric Point: 7.4342
>WP_000053822.1 MULTISPECIES: type IV secretory system conjugative DNA transfer family protein [Enterobacterales]
MSLKLPDKGQWAFIGLVMCLVTYYTGSVAVYFLNGKTPLYIWKNFDSMLLWRIITESNIRTDIRLTAIPS
LLSGMVSSLIVPVFIIWQLNKTDVALYGDAKFASDNDLRKSKLLKWEKENDTDILVGAYKGKYLWYTAPD
FVSLGAGTRAGKGAAIGIPNLLVRKHSLIALDPKQELWKITSKVREKLLGNKVYLLDPFNSKTHQFNPLF
YIDLKEESGAKDLLKLIEILFPSYGLTGAEAHFNNLAGQYWTGLAKLLHFFINYDPSWLGEFGLKPVFSI
GSVVDLYSNIDRELILSKREDLEGTKGLDENALYHLRDALTKIREYHETEDEQRSSIDGSFRKKMSLFYL
PTVRKCTDGNDFDLRQLRREDITVYVGVNAEDMSLAYDFLNLFFNFVVEVTLRENPDFDPTLKHDCLMFL
DEFPSIGYMPIIKKGSGYIAGFKLKLLTIYQNISQLNEIYGVEGAKTLMSAHPCRIIYAVSEEDDAKKIS
EKLGYITTKSTGSSKTSGRSTSKSSSESEAQRALVLPQELGTLDFKEEFIILKGENPVKAEKALYFLDPY
FMDRLMMVSPKLTELTASLNKTKKVLGVKGLKYPSKEKMLSVGELESEVLL
MSLKLPDKGQWAFIGLVMCLVTYYTGSVAVYFLNGKTPLYIWKNFDSMLLWRIITESNIRTDIRLTAIPS
LLSGMVSSLIVPVFIIWQLNKTDVALYGDAKFASDNDLRKSKLLKWEKENDTDILVGAYKGKYLWYTAPD
FVSLGAGTRAGKGAAIGIPNLLVRKHSLIALDPKQELWKITSKVREKLLGNKVYLLDPFNSKTHQFNPLF
YIDLKEESGAKDLLKLIEILFPSYGLTGAEAHFNNLAGQYWTGLAKLLHFFINYDPSWLGEFGLKPVFSI
GSVVDLYSNIDRELILSKREDLEGTKGLDENALYHLRDALTKIREYHETEDEQRSSIDGSFRKKMSLFYL
PTVRKCTDGNDFDLRQLRREDITVYVGVNAEDMSLAYDFLNLFFNFVVEVTLRENPDFDPTLKHDCLMFL
DEFPSIGYMPIIKKGSGYIAGFKLKLLTIYQNISQLNEIYGVEGAKTLMSAHPCRIIYAVSEEDDAKKIS
EKLGYITTKSTGSSKTSGRSTSKSSSESEAQRALVLPQELGTLDFKEEFIILKGENPVKAEKALYFLDPY
FMDRLMMVSPKLTELTASLNKTKKVLGVKGLKYPSKEKMLSVGELESEVLL
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
ID | 2160 | GenBank | WP_015060057 |
Name | t4cp2_BV871_RS00225_pNDM5_WCHEC0215 | UniProt ID | _ |
Length | 917 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 917 a.a. Molecular weight: 104788.72 Da Isoelectric Point: 6.2666
>WP_015060057.1 MULTISPECIES: VirB3 family type IV secretion system protein [Enterobacterales]
MSTVFKGLTRPALIRGLGVPLYPFLGMCVICVLLGVWIHEAMYALILPGWYAIKRVTKIDERFFDLLYLR
MQIKGNPLANKRFNAVHYAGSSYDAVDISKVDNFMKLKDQSSLEELIPYSSHITDNLIVTRNHDLLATWQ
IDGAYFECVDEADLALLTDQLNTLIRSFDGKPVTFYTHRIRVRKEVRPVFDSKIPFVNRVMNDYYESLSA
AEYFENKLYLTVCYKPFSAEDKVTHFLSRKKGNKNIFEEPINDMNEICGRLSTYLSRFHSRRLGLYEENN
IVYSEQLTLFQKLLSGRWQKVRVTNSPFYTYLGGKDLFFGNDAGQITASDHARYFRCIEIKDYFQETDAG
IFDALMYLPVEYVQTSSLTPIDKQSAIKALDDQIDKLEMTDDAAKSLLADLKVGLDMVSSGYISFGKSHQ
TLIVYADSPERLVKDTNIVTTTLEDLGLIVTYSTLSLGAAYFAQLPGNYTLRPRLSSISSLNFAEMESFH
NFFTGKEKGNTWGNNLITLGGSGNDIYNLNYHMTTEHQNYFGKNPTLGHTEILGTSNVGKTVVMMTKAFA
AQQFGTPESFPPERKLKKLTTVFFDKDRAAEPGIRSMGGAYFRVKEGEPTGWNPAALPPTKRNISFMKDL
VRLLCTLNSEPLDDYQNRLISDAVERLMQRSNRSYPISKLRPLILEPDDTETRRHGLKARLNAWVQGGEF
GWVFDNPDDTFDVDNLDVFGIDGTEFLDNKVLSSAASFYLIYRVTMLADGRRLLIYMDEFWQWINNEAFR
DFVYNKLKTARKLDMVLVVATQSPDELIKSPIAAAVREQCATHIYLANPKAKRSEYVDELEVRELYFDKI
KAIDPLSRQFLVVKNPQRKGESDDFAAFARLDLGKAAYYLPVLSASKPQLELFDEIWKEGMKPEEWLDTY
LERANLI
MSTVFKGLTRPALIRGLGVPLYPFLGMCVICVLLGVWIHEAMYALILPGWYAIKRVTKIDERFFDLLYLR
MQIKGNPLANKRFNAVHYAGSSYDAVDISKVDNFMKLKDQSSLEELIPYSSHITDNLIVTRNHDLLATWQ
IDGAYFECVDEADLALLTDQLNTLIRSFDGKPVTFYTHRIRVRKEVRPVFDSKIPFVNRVMNDYYESLSA
AEYFENKLYLTVCYKPFSAEDKVTHFLSRKKGNKNIFEEPINDMNEICGRLSTYLSRFHSRRLGLYEENN
IVYSEQLTLFQKLLSGRWQKVRVTNSPFYTYLGGKDLFFGNDAGQITASDHARYFRCIEIKDYFQETDAG
IFDALMYLPVEYVQTSSLTPIDKQSAIKALDDQIDKLEMTDDAAKSLLADLKVGLDMVSSGYISFGKSHQ
TLIVYADSPERLVKDTNIVTTTLEDLGLIVTYSTLSLGAAYFAQLPGNYTLRPRLSSISSLNFAEMESFH
NFFTGKEKGNTWGNNLITLGGSGNDIYNLNYHMTTEHQNYFGKNPTLGHTEILGTSNVGKTVVMMTKAFA
AQQFGTPESFPPERKLKKLTTVFFDKDRAAEPGIRSMGGAYFRVKEGEPTGWNPAALPPTKRNISFMKDL
VRLLCTLNSEPLDDYQNRLISDAVERLMQRSNRSYPISKLRPLILEPDDTETRRHGLKARLNAWVQGGEF
GWVFDNPDDTFDVDNLDVFGIDGTEFLDNKVLSSAASFYLIYRVTMLADGRRLLIYMDEFWQWINNEAFR
DFVYNKLKTARKLDMVLVVATQSPDELIKSPIAAAVREQCATHIYLANPKAKRSEYVDELEVRELYFDKI
KAIDPLSRQFLVVKNPQRKGESDDFAAFARLDLGKAAYYLPVLSASKPQLELFDEIWKEGMKPEEWLDTY
LERANLI
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 28349..38458
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
BV871_RS00155 (pNDM5_WCHEC0215_00030) | 23472..24734 | - | 1263 | WP_000517490 | ATP-binding protein | - |
BV871_RS00160 (pNDM5_WCHEC0215_00031) | 24817..25311 | - | 495 | WP_001215543 | thermonuclease family protein | - |
BV871_RS00165 (pNDM5_WCHEC0215_00032) | 25308..25634 | - | 327 | WP_004208538 | hypothetical protein | - |
BV871_RS00170 (pNDM5_WCHEC0215_00033) | 25720..26034 | - | 315 | WP_000754496 | TrbM/KikA/MpfK family conjugal transfer protein | - |
BV871_RS00175 (pNDM5_WCHEC0215_00034) | 26122..26514 | - | 393 | WP_000738524 | cag pathogenicity island Cag12 family protein | - |
BV871_RS00180 (pNDM5_WCHEC0215_00035) | 26511..28346 | - | 1836 | WP_000053822 | type IV secretory system conjugative DNA transfer family protein | - |
BV871_RS00185 (pNDM5_WCHEC0215_00036) | 28349..29383 | - | 1035 | WP_000650512 | P-type DNA transfer ATPase VirB11 | virB11 |
BV871_RS00190 (pNDM5_WCHEC0215_00037) | 29380..29577 | - | 198 | WP_000629109 | DNA-binding protein | - |
BV871_RS00195 (pNDM5_WCHEC0215_00038) | 29579..30793 | - | 1215 | WP_001295060 | VirB10/TraB/TrbI family type IV secretion system protein | virB10 |
BV871_RS00200 (pNDM5_WCHEC0215_00039) | 30790..31719 | - | 930 | WP_000783386 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
BV871_RS00205 (pNDM5_WCHEC0215_00040) | 31725..32453 | - | 729 | WP_004199085 | type IV secretion system protein | virB8 |
BV871_RS00210 (pNDM5_WCHEC0215_00041) | 32648..33703 | - | 1056 | WP_000235886 | type IV secretion system protein | virB6 |
BV871_RS00215 (pNDM5_WCHEC0215_00042) | 33715..33972 | - | 258 | WP_000759841 | EexN family lipoprotein | - |
BV871_RS00220 (pNDM5_WCHEC0215_00043) | 33982..34752 | - | 771 | WP_000716326 | type IV secretion system protein | - |
BV871_RS00225 (pNDM5_WCHEC0215_00044) | 34762..37515 | - | 2754 | WP_015060057 | VirB3 family type IV secretion system protein | virb4 |
BV871_RS00230 (pNDM5_WCHEC0215_00045) | 37540..37830 | - | 291 | WP_000916156 | TrbC/VirB2 family protein | virB2 |
BV871_RS00235 (pNDM5_WCHEC0215_00046) | 37814..38458 | - | 645 | WP_000953538 | lytic transglycosylase domain-containing protein | virB1 |
BV871_RS00240 (pNDM5_WCHEC0215_00047) | 38504..38740 | - | 237 | WP_001348696 | hypothetical protein | - |
BV871_RS00245 (pNDM5_WCHEC0215_00048) | 38691..39341 | - | 651 | WP_004208545 | transcription termination/antitermination NusG family protein | - |
BV871_RS00250 (pNDM5_WCHEC0215_00049) | 39553..40713 | - | 1161 | WP_004199101 | MobP1 family relaxase | - |
BV871_RS00255 (pNDM5_WCHEC0215_00050) | 40716..41261 | - | 546 | WP_001348698 | DNA distortion polypeptide 1 | - |
BV871_RS00260 (pNDM5_WCHEC0215_00051) | 41602..41859 | - | 258 | WP_004199355 | hypothetical protein | - |
BV871_RS00265 (pNDM5_WCHEC0215_00052) | 41856..42008 | - | 153 | WP_162542717 | hypothetical protein | - |
BV871_RS00270 (pNDM5_WCHEC0215_00053) | 42095..42433 | - | 339 | WP_001099725 | hypothetical protein | - |
BV871_RS00275 (pNDM5_WCHEC0215_00054) | 42527..42769 | - | 243 | WP_000008707 | hypothetical protein | - |
BV871_RS00280 (pNDM5_WCHEC0215_00055) | 42759..43040 | - | 282 | WP_001348701 | hypothetical protein | - |
Host bacterium
ID | 3650 | GenBank | NZ_KY435936 |
Plasmid name | pNDM5_WCHEC0215 | Incompatibility group | IncX3 |
Plasmid size | 47337 bp | Coordinate of oriT [Strand] | 41039..41656 [+] |
Host baterium | Escherichia coli strain WCHEC005215 |
Cargo genes
Drug resistance gene | blaNDM-5 |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |