Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 102994 |
Name | oriT_p203740_35 |
Organism | Escherichia coli strain 203740 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_CP025915 (23983..24082 [+], 100 nt) |
oriT length | 100 nt |
IRs (inverted repeats) | 32..38, 42..48 (TTACGAT..ATCGTAA) |
Location of nic site | 61..62 |
Conserved sequence flanking the nic site | GGTGTATAGC |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Length: 100 nt
>oriT_p203740_35
AATTAGAATAATTTGTTTTGTTTTCAAGCATTTACGATGAAATCGTAATTGCGTATGGTGTATAGCCATTAAGGGATACCATAACACGCCTTTTTTAAGG
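The annotated coordinates in the oriT table can be cross-checked against the FASTA sequence. The following is a minimal sketch, assuming the IR and nic-site coordinates are 1-based and inclusive relative to the 100 nt oriT sequence, and that the nic site lies between positions 61 and 62. The helper names `subseq` and `reverse_complement` are illustrative and are not part of oriTfinder.

```python
# oriT sequence copied from the FASTA record oriT_p203740_35.
ORIT = "AATTAGAATAATTTGTTTTGTTTTCAAGCATTTACGATGAAATCGTAATTGCGTATGGTGTATAGCCATTAAGGGATACCATAACACGCCTTTTTTAAGG"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def subseq(seq, start, end):
    """Return seq[start..end] for 1-based, inclusive coordinates (assumed convention)."""
    return seq[start - 1:end]


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Inverted repeat arms as annotated: 32..38 and 42..48.
left_arm = subseq(ORIT, 32, 38)
right_arm = subseq(ORIT, 42, 48)

# Locate the conserved motif and convert the 0-based index to a 1-based start.
motif = "GGTGTATAGC"
motif_start = ORIT.find(motif) + 1

# Assumed nic position: the bond between oriT positions 61 and 62.
offset = 61 - motif_start + 1
nicked = motif[:offset] + "|" + motif[offset:]

print(len(ORIT))                                  # 100
print(left_arm, right_arm)                        # TTACGAT ATCGTAA
print(reverse_complement(left_arm) == right_arm)  # True
print(motif_start, nicked)                        # 57 GGTGT|ATAGC
```

Under these assumptions the two IR arms are exact reverse complements of each other, and the annotated nic site falls in the middle of the conserved 10 nt motif.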
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 2246 | GenBank | WP_052953523 |
Name | TrwC_CXW72_RS25090_p203740_35 | UniProt ID | _ |
Length | 968 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Length: 968 a.a. Molecular weight: 108373.52 Da Isoelectric point: 9.8255
>WP_052953523.1 MULTISPECIES: MobF family relaxase [Escherichia]
MLNITTITRQRVDRVVDYYADGADDYYAKDGNAMQWQGAGAEELGLTGEVEQKRFAKLLDGKVTDDISVM
RKTANSESKERLGYDLTFSAPKGVSLQALVNGDKRIIEAHDRAVTKAIEEAERLALGRFTVNKKTHVENT
NKLIVGKFRHETSRELDPDLHTHAFVMNLTKTSDGKWRSLTNDGIVNSLTLLGNVYKSELARELELAGYN
LRYDRNGTFDLAHFSQEQIAQFSQRSQQIEAELAKNGLSRETASHELKNAAALKTRKSKGEVDRDVLYEG
WKNRAKELQIDFDSREWKGAANDDIARNTQPAALEKPIEYQADKVMTFAIRSLTERQSVITERELMDVAM
KHGYGRLTLDDIRAARERAITSGNLIKEEATYAAATTNKKQQNVAMTRKQWVTELVKAGRSKDEARKLVD
KGITNGRLVQQKPRFTTQLAQQRERNILKMEREGRGKIQTPYTREFSEGWLASRTLKPEQLKAVMGIIHT
PNQFISVHGFAGTGKSYMTKSAADFLKEQGVHVTSLAPYGSQVKALQAEGLESRTLQSFLRASDKKIGPG
SVVFIDEAGVIPARQMEETMRVIRDAGARAVFLGDTKQTKAVEAGKPFEQLIKAGMETAYMKDIQRQKDP
ELLKAALNAAEGKIKASLTHVTSIVEEKNHDQRYRRIVGDYVAMPPTDRANALIITGTNDSRKKINEYIQ
AELGLKGQGINYPLLNRLDTTQAQRQHSKYYEKGAVIIPERDYSNGLKRGAVYTVLDTGPGNKLTVSDGS
GDTIAFSPARFSKLSVYSVEKTELAVGDQVRITRNDAHLDLANGDRFKVKSVQSGEVLLENEKGRLVKID
ANKPMYLGLAYASTVHSAQGLTCDKVFINMDTRSLTTAKDVYYVAVTRAKHEAVIYTDDEKKLDKAASRE
GFKTAALELEQLKRYAKEQQHDTREKGRDKAEPTQDKGNARPKDKGHEKNNDDRALRL
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
T4CP
ID | 1933 | GenBank | WP_047929022 |
Name | t4cp2_CXW72_RS25095_p203740_35 | UniProt ID | _ |
Length | 525 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 525 a.a. Molecular weight: 58988.82 Da Isoelectric point: 9.4812
>WP_047929022.1 MULTISPECIES: type IV secretion system DNA-binding domain-containing protein [Enterobacteriaceae]
MNEADKGKICLGALLFTPLVTWFISAKALYGFTPKDARERLVLLIQQTFDLWPLWGALLVGEIIAIIGLI
IFFKFNHQIFKGAPFKKIYRGTRLVSQRALASITKEREPQITIADIPIPTKAEGTHISIAGATGVGKSTI
FAEMMKCCLMRGKKQRIASNKDRMIILDPDGDFLSRFYKKGDKILNPYDARTEGWVFFNEIKSDFDFERF
GVSIVPTAKDSQTEEWNGYGRLLFTEVSRKVFNTSRNPTMEEVFYWTNEVPLEELEEYVRGTKAQALFAG
SGRAVGSARFVLSNKLAAHLKMPAGNFSIREWLEDPKGGNLYITWTDEMREALKSLISCWTDSIFSIVLG
MSKSNSRKIWTFIDELESLDYLPSLRDALTKGRKKGLRVVTGYQSYSQVISIYGSDVAETLLSNHRTSVV
MAAGRLGERTLEFVSKSLGEIEGEREKTGISRRFGQLGTKNTNDDHKRERAVTPTEIATLDDLTGYIAFP
GNLPVAKFETHHVNYTRSNPVPGVVLRESTAFAGM
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
T4SS
The T4SS was predicted using oriTfinder 2.0.
Region 1: 6848..23541
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
CXW72_RS24960 | 2876..3121 | + | 246 | WP_033544576 | AbrB/MazE/SpoVT family DNA-binding domain-containing protein | - |
CXW72_RS24965 | 3121..3663 | + | 543 | WP_047929020 | hypothetical protein | - |
CXW72_RS24970 | 3633..3929 | + | 297 | WP_226426392 | hypothetical protein | - |
CXW72_RS24975 | 3919..4245 | + | 327 | WP_000817647 | endoribonuclease MazF | - |
CXW72_RS26155 | 4359..4760 | - | 402 | WP_306415735 | Hha/YmoA family nucleoid-associated regulatory protein | - |
CXW72_RS24985 | 4761..5486 | - | 726 | WP_226426391 | restriction endonuclease | - |
CXW72_RS24990 | 5462..5776 | - | 315 | WP_033544579 | hypothetical protein | - |
CXW72_RS24995 | 5780..6001 | - | 222 | WP_226426390 | TrbM/KikA/MpfK family conjugal transfer protein | - |
CXW72_RS25000 | 6100..6537 | - | 438 | WP_033544581 | hypothetical protein | - |
CXW72_RS25005 | 6552..6785 | - | 234 | WP_033544582 | H-NS histone family protein | - |
CXW72_RS25010 | 6848..7588 | + | 741 | WP_033544583 | lytic transglycosylase domain-containing protein | virB1 |
CXW72_RS25015 | 7611..7928 | + | 318 | WP_050008675 | hypothetical protein | - |
CXW72_RS25020 | 7912..8202 | + | 291 | WP_033544584 | TrbC/VirB2 family protein | virB2 |
CXW72_RS25025 | 8254..8568 | + | 315 | WP_033544585 | VirB3 family type IV secretion system protein | virB3 |
CXW72_RS25030 | 8868..11198 | + | 2331 | WP_226426389 | VirB4 family type IV secretion/conjugal transfer ATPase | - |
CXW72_RS25035 | 11208..11915 | + | 708 | WP_047929021 | type IV secretion system protein | virB5 |
CXW72_RS25040 | 11919..12155 | + | 237 | WP_033544588 | EexN family lipoprotein | - |
CXW72_RS25045 | 12210..13217 | + | 1008 | WP_170913282 | type IV secretion system protein | virB6 |
CXW72_RS25430 | 13312..13482 | + | 171 | WP_021536809 | hypothetical protein | - |
CXW72_RS25050 | 13485..14201 | + | 717 | WP_000914645 | type IV secretion system protein | virB8 |
CXW72_RS25055 | 14204..15091 | + | 888 | WP_033544590 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
CXW72_RS25060 | 15088..16242 | + | 1155 | WP_050008676 | type IV secretion system protein VirB10 | virB10 |
CXW72_RS25065 | 16235..17266 | + | 1032 | WP_033544592 | P-type DNA transfer ATPase VirB11 | virB11 |
CXW72_RS25070 | 17229..17783 | + | 555 | WP_042036424 | phospholipase D family protein | - |
CXW72_RS25075 | 17796..18278 | + | 483 | WP_033544604 | GNAT family protein | - |
CXW72_RS25080 | 18290..18484 | + | 195 | WP_000529710 | hypothetical protein | - |
CXW72_RS25085 | 18758..19045 | - | 288 | WP_033544594 | hypothetical protein | - |
CXW72_RS25090 | 19055..21961 | - | 2907 | WP_052953523 | MobF family relaxase | - |
CXW72_RS25095 | 21964..23541 | - | 1578 | WP_047929022 | type IV secretion system DNA-binding domain-containing protein | virb4 |
CXW72_RS25100 | 23534..23881 | - | 348 | WP_047929023 | hypothetical protein | - |
CXW72_RS25105 | 24366..24800 | + | 435 | WP_053273076 | hypothetical protein | - |
CXW72_RS25110 | 24828..25523 | + | 696 | WP_265737297 | StbB family protein | - |
CXW72_RS25115 | 25525..25893 | + | 369 | WP_015345231 | plasmid stabilization protein StbC | - |
CXW72_RS25120 | 26107..26877 | + | 771 | WP_039026342 | SprT-like domain-containing protein | - |
CXW72_RS25435 | 26997..27152 | - | 156 | WP_021536821 | hypothetical protein | - |
CXW72_RS25125 | 27307..27549 | - | 243 | WP_042036420 | hypothetical protein | - |
CXW72_RS25130 | 27585..27872 | - | 288 | WP_001098466 | hypothetical protein | - |
CXW72_RS25135 | 28023..28244 | + | 222 | WP_180305487 | hypothetical protein | - |
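The Size (bp) column and the Region 1 boundaries can be sanity-checked from the coordinates alone. The sketch below uses a few rows copied from the table above and assumes that coordinates are 1-based and inclusive and that each CDS size includes the stop codon. The function names are hypothetical and chosen for illustration.

```python
# (locus_tag, start, end, strand, size_bp), copied from selected table rows.
GENES = [
    ("CXW72_RS25005", 6552, 6785, "-", 234),
    ("CXW72_RS25010", 6848, 7588, "+", 741),
    ("CXW72_RS25090", 19055, 21961, "-", 2907),   # MobF family relaxase
    ("CXW72_RS25095", 21964, 23541, "-", 1578),   # T4CP
    ("CXW72_RS25100", 23534, 23881, "-", 348),
]

REGION_1 = (6848, 23541)  # Region 1 as listed above


def span_length(start, end):
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1


def protein_length(size_bp):
    """Residue count for a CDS, assuming the size includes one stop codon."""
    return size_bp // 3 - 1


def in_region(start, end, region):
    """True if the gene lies entirely within the region."""
    return region[0] <= start and end <= region[1]


for tag, start, end, strand, size in GENES:
    assert span_length(start, end) == size  # coordinates agree with the Size column
    print(tag, size, protein_length(size), in_region(start, end, REGION_1))
```

With these assumptions, the relaxase gene (2907 bp) corresponds to 968 a.a. and the T4CP gene (1578 bp) to 525 a.a., matching the protein lengths reported in their respective sections. CXW72_RS25005 and CXW72_RS25100 fall outside Region 1 and are flanking genes.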
Host bacterium
ID | 3437 | GenBank | NZ_CP025915 |
Plasmid name | p203740_35 | Incompatibility group | - |
Plasmid size | 35753 bp | Coordinate of oriT [Strand] | 23983..24082 [+] |
Host bacterium | Escherichia coli strain 203740 |
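The coordinate strings in this record follow a `start..end [strand]` pattern. A small parser makes it easy to confirm that the oriT interval is consistent with the stated length and lies within the plasmid. This is a minimal sketch; the regular expression and the `parse_location` helper are assumptions for illustration and are not part of any oriTDB tooling.

```python
import re

# Matches strings such as "23983..24082 [+]" anywhere in a field.
LOCATION = re.compile(r"(\d+)\.\.(\d+) \[([+-])\]")


def parse_location(text):
    """Extract (start, end, strand) from a 'start..end [strand]' field."""
    match = LOCATION.search(text)
    if match is None:
        raise ValueError(f"no location found in {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


PLASMID_SIZE = 35753  # bp, from the Host bacterium table

start, end, strand = parse_location("NZ_CP025915 (23983..24082 [+], 100 nt)")
orit_length = end - start + 1                      # 1-based, inclusive interval
within_plasmid = 1 <= start <= end <= PLASMID_SIZE

print(start, end, strand)   # 23983 24082 +
print(orit_length)          # 100
print(within_plasmid)       # True
```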
Cargo genes
Drug resistance gene | - |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |