Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 114630 |
Name | oriT_CCBC3-3-1|unnamed2 |
Organism | Pantoea sp. CCBC3-3-1 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | NZ_CP034365 (2348..2976 [-], 629 nt) |
oriT length | 629 nt |
IRs (inverted repeats) | 590..595, 600..605 (ATGCCC..GGGCAT); 518..525, 529..536 (CCTCCCGT..ACGGGAGG); 453..458, 462..467 (GTTCGC..GCGAAC) |
Location of nic site | 315..316 |
Conserved sequence flanking the nic site | TCCTGCATCG |
Note | Predicted by oriTfinder 2.0 |
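The coordinates and inverted repeats listed above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of oriTfinder: the helper names `span_length` and `reverse_complement` are hypothetical, and the values are copied from the table. It assumes coordinates are 1-based and inclusive, which is consistent with the stated 629 nt length.

```python
# Sanity checks on the oriT record (values copied from the table above).
# Helper names are illustrative; coordinates are assumed 1-based, inclusive.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


# oriT location on NZ_CP034365 (minus strand).
orit_length = span_length(2348, 2976)

# Inverted-repeat arms as listed: (left arm, right arm).
IR_PAIRS = [
    ("ATGCCC", "GGGCAT"),
    ("CCTCCCGT", "ACGGGAGG"),
    ("GTTCGC", "GCGAAC"),
]

# Each right arm should be the reverse complement of its left arm.
ir_ok = [reverse_complement(left) == right for left, right in IR_PAIRS]

print(orit_length)
print(ir_ok)
```

Under these assumptions the span evaluates to 629 nt, and each right arm is the exact reverse complement of its left arm.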
oriT sequence
Length: 629 nt
>oriT_CCBC3-3-1|unnamed2
TCTTTGCCTCGCTGTTAAATAAAGCGTAGTTTTGATAAAATCTTCATTATGAATATTATTGTTTTCTTCTATATTTTTATCCTTGCTCTTTAATAAAGCACTAGCCAATAACTTCATGCCTTTTGCAACTGTCAAACTTGGTTCGTCAGGGTAAATGCTTTTAAGGCAAACTAACAAATAATCATGATCTTCATCTTCAATTCTAAACTGAACTTTTTTCATAAAAACTCCCAACAAGAACCGACTGTAGGTCACCGGGCAAACGCCGCAAAATAACGTCGAATGACGTCATTTTGCGGCGTTTGCCCTATCCTGCATCGCAGTGCAATCCACCGCTACTGAAATTGTGCTTCAGGATGGAGGGAAATGCAAAAGCTGAGGGAATTTGTAGCTGAAAGATTGCTAGTCTTCGACCGTTAGCATATGAGTTGCTATAGAACCTGTGCGGGGGTGTTCGCTTCGCGAACGGGTCTGGCAGGGGGCGCAAGCGCTGTGCTGTGCTACATGAGGAAAAGCACCTCCCGTAAAACGGGAGGGCTTCGGCGAATTGACTACAGTGATCTACTTGCCGGGCTGATTGTCGCCTTCCATGCCCTCACGGGCATCCTGCAACCAGCGCCGTAATTTAG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
T4CP
ID | 10890 | GenBank | WP_147200741 |
Name | t4cp2_EHV07_RS23760_CCBC3-3-1|unnamed2 | UniProt ID | _ |
Length | 919 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 919 a.a. | Molecular weight: 104668.75 Da | Isoelectric point: 6.3801
>WP_147200741.1 VirB3 family type IV secretion system protein [Pantoea sp. CCBC3-3-1]
MSTVFKGLTRPALVRGLGVPLYPFLGMCMVCVLLGVWIHDLLYFLILPGWFAIKRVTQIDERFFDLLYLR
TVVKGHPLANKRFSAVHYAGSQYDEVDISKVDNFMKLKDQTSIEALIPYSSHITDDLVVTKNRDLVATWQ
VDGAYFECVDDEDLTLLTDQLNTLIRSFEGKSVTFYTHRVRVRKEVKVDFNSKIPFVNRVMNDYYSSLSK
PEYFENKLYLTICYKPFNIEDKVSHLLSKKKGNKKIFDEPINDMNEICGRLDTYLSRFHAHRLGLVEENG
RVFSDQLSLFQYLLSGKWQKVRVTNSPFYTYLGGKDLFFGNDAGQITASENARYFRSIEIKDYFQETDAG
IFDALMYLPVEYVFTSSFTSMDKQAAIKALDDQIEKLEMTDDAAKSQLADLHVGLDMVSSGYNSFGKCHL
TLVIFADSPERLVKDTTIVTTTLEDLGLVVTYSTLSLGAAFFSQLPGNYNLRPRLSILSSLNFAEMESFH
NFFHGKADGNTWGKSLMALRGSGNDVYHLNYHMTTENINFFGKNPTLGHCEILGTSNVGKTVLMMTMSYA
AQQFGTPESFPANRKVRKLTTVFFDKDRAGEVGIRAMGGAYFRVKGGEPTGWNPAALPPTKRNIAFVKDI
IRLICKLNGNTVDDYQNTLISSAVERLMEREDRSYPISKLIPLIMEPDDVETKRHGIKARLRAWKQGGEY
GWLLDNASDSFDVTNLDVFGIDGTEFLDDKVVAPVASFYLIYRVTMLADGRRLLIYMDEFWQWINNDAFK
DFVYNKLKTGRKLDMVLVPATQSPDELIKSPIAAAVREQCATHIYLANPKAKRSEYVDELQVRDLYFDKI
KAIDPLSRQFLVVKNPQRQGERDDFAAFAKLDLGKAAYYLPILSASAEQLELFDEIWSEGMAPEDWIDTY
LERANRGVR
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
ID | 10891 | GenBank | WP_147200749 |
Name | t4cp2_EHV07_RS23800_CCBC3-3-1|unnamed2 | UniProt ID | _ |
Length | 611 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 611 a.a. | Molecular weight: 69308.20 Da | Isoelectric point: 7.7432
>WP_147200749.1 type IV secretory system conjugative DNA transfer family protein [Pantoea sp. CCBC3-3-1]
MSLKLPDKAQWAFIILIMCLVAYYAGSVAIYFFNHKTPLYIWKHYASMLLWRVMIDSSVKSEIRLTSVYS
LLSGLLASLAAPVFIIWKINQDDAPLFGDAKFASDADLKKSKLLKWERENDTDILVGRYKGKYLWYTAPD
FVSLGAGTRAGKGAAIGIPNMLVKKHSIIALDPKQELWKITSKVREIVLRNKVYLLDPFNTKTHQCNPLF
YVDLKAESGAKDLLKLVEILFPSFGLTGAEAHFNNLAGQYWTGLAKLLHFFINFAPEWIDEFRIKPVFSI
GTVVDLYGNIDREQVLSNRETFEELVQGNETARYHLQDALTKIREYHETEDEQRSSVDGSFRKKMSLFYL
PTVRKCTDGNDFDLRQMRREDITVYVGVNAEDMILAYDFLNLFFNLVVEVTLRENPDFDPTLKHDCLLFL
DEFPSIGYMPIIKKGSGYIAGYKLKLLTIYQNISQLNEIYGTEGAKTLMSAHPCRVIYAVSEEDDAKKIS
EKLGYITTKSKGSSKTSGKATSRSNSESEAQRALVLPQELGTLAFSEEMIILKGENPVKAEKALYYLDPY
FMDRLMLVSPKLTALTASINKTNNIFGVKGIKYPSKEKMLSVGELESEVLL
Protein domains
Predicted by InterProScan.
Protein structure
No available structure.
T4SS
The T4SS was predicted using oriTfinder 2.0.
Region 1: 5534..15531
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
EHV07_RS23710 | 1002..1268 | + | 267 | WP_147200732 | hypothetical protein | - |
EHV07_RS23715 | 1258..1476 | + | 219 | WP_147200733 | hypothetical protein | - |
EHV07_RS24725 | 1573..1917 | + | 345 | WP_174822398 | hypothetical protein | - |
EHV07_RS24625 | 1993..2142 | + | 150 | WP_168199686 | hypothetical protein | - |
EHV07_RS23725 | 2158..2415 | + | 258 | WP_147200734 | hypothetical protein | - |
EHV07_RS23730 | 2755..3300 | + | 546 | WP_147200735 | DNA distortion polypeptide 1 | - |
EHV07_RS23735 | 3303..4460 | + | 1158 | WP_147200736 | MobP1 family relaxase | - |
EHV07_RS23740 | 4819..5310 | + | 492 | WP_147200737 | transcription termination/antitermination NusG family protein | - |
EHV07_RS23750 | 5534..6175 | + | 642 | WP_147200739 | lytic transglycosylase domain-containing protein | virB1 |
EHV07_RS23755 | 6162..6452 | + | 291 | WP_147200740 | TrbC/VirB2 family protein | virB2 |
EHV07_RS23760 | 6480..9239 | + | 2760 | WP_147200741 | VirB3 family type IV secretion system protein | virB4 |
EHV07_RS23765 | 9241..9978 | + | 738 | WP_147200742 | type IV secretion system protein | - |
EHV07_RS23770 | 9988..10248 | + | 261 | WP_147200743 | EexN family lipoprotein | - |
EHV07_RS23775 | 10261..11316 | + | 1056 | WP_147200744 | type IV secretion system protein | virB6 |
EHV07_RS23780 | 11599..12327 | + | 729 | WP_147200745 | type IV secretion system protein | virB8 |
EHV07_RS23785 | 12333..13262 | + | 930 | WP_147200746 | TrbG/VirB9 family P-type conjugative transfer protein | virB9 |
EHV07_RS23790 | 13259..14476 | + | 1218 | WP_147200747 | VirB10/TraB/TrbI family type IV secretion system protein | virB10 |
EHV07_RS23795 | 14494..15531 | + | 1038 | WP_147200748 | P-type DNA transfer ATPase VirB11 | virB11 |
EHV07_RS23800 | 15536..17371 | + | 1836 | WP_147200749 | type IV secretory system conjugative DNA transfer family protein | - |
EHV07_RS23805 | 17368..17763 | + | 396 | WP_147200750 | cag pathogenicity island Cag12 family protein | - |
EHV07_RS25030 | 17766..18098 | + | 333 | WP_254446384 | hypothetical protein | - |
EHV07_RS23815 | 18182..18652 | + | 471 | WP_147200751 | thermonuclease family protein | - |
EHV07_RS23820 | 18669..19445 | + | 777 | WP_147200752 | hypothetical protein | - |
EHV07_RS23825 | 19537..19782 | + | 246 | WP_147200753 | hypothetical protein | - |
EHV07_RS23830 | 19798..20496 | + | 699 | WP_147200754 | hypothetical protein | - |
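The Size (bp) column follows from the inclusive coordinates, and for the two T4CP genes it can be reconciled with the protein lengths reported earlier. The sketch below is illustrative only: the function names are hypothetical, and it assumes each coding sequence ends with a single stop codon that is counted in the coordinates but not in the protein length.

```python
# Cross-check gene sizes and T4CP protein lengths (values copied from this record).
# Assumption: each CDS span includes one terminal stop codon.


def cds_size_bp(start: int, end: int) -> int:
    """Size of a gene from inclusive, 1-based coordinates."""
    return end - start + 1


def protein_length_aa(size_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon."""
    if size_bp % 3 != 0:
        raise ValueError("CDS size is not a multiple of 3")
    return size_bp // 3 - 1


# locus tag -> (start, end, reported size in bp, reported protein length in a.a.)
T4CP_GENES = {
    "EHV07_RS23760": (6480, 9239, 2760, 919),
    "EHV07_RS23800": (15536, 17371, 1836, 611),
}

results = {}
for locus, (start, end, size_reported, aa_reported) in T4CP_GENES.items():
    size = cds_size_bp(start, end)
    results[locus] = (size == size_reported, protein_length_aa(size) == aa_reported)

print(results)
```

Both entries are consistent under that assumption: 2760 bp corresponds to 919 a.a. plus a stop codon, and 1836 bp to 611 a.a. plus a stop codon.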
Host bacterium
ID | 15065 | GenBank | NZ_CP034365 |
Plasmid name | CCBC3-3-1|unnamed2 | Incompatibility group | - |
Plasmid size | 29467 bp | Coordinate of oriT [Strand] | 2348..2976 [-] |
Host bacterium | Pantoea sp. CCBC3-3-1 |
Cargo genes
Drug resistance gene | - |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | AcrIIA9 |