Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 200035 |
Name | oriT_ICESpaNUF1049 |
Organism | Streptococcus parauberis |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | AB468159 (5456..5670 [+], 215 nt) |
oriT length | 215 nt |
IRs (inverted repeats) | 61..76, 80..95 (ACTTAACCCCCCGTAT..ACAGGGGGGTACAAAT); 123..134, 139..150 (GAAAATCCTTTG..CAAGGGATTTAC) |
Location of nic site | 135..136 |
Conserved sequence flanking the nic site | TGG|T |
Note | blastn alignment with oriT_Tn916 |
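The coordinates in the table above are 1-based and inclusive, so they can be checked directly against the oriT sequence of this record. The following is a minimal Python sketch of that check, not part of the database itself. The helper names (`region`, `revcomp`, `arm_identity`) are hypothetical, and the identity score is a simple ungapped comparison chosen for illustration; the arms of a reported inverted repeat need not be perfect reverse complements.

```python
# oriT sequence of oriT_ICESpaNUF1049, copied from the FASTA record (215 nt).
ORIT = (
    "AAGCGGAAGTCGCAGGTGTGGACTGATCTTGCTGGCTGGTGTGGCAATAGCCACGCCAGC"
    "ACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACAT"
    "TAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTA"
    "AAAGTTGACATACGCCTTTTTGATTGGAGGGATTT"
)

# Coordinates as listed in the record (1-based, inclusive).
IR_PAIRS = [((61, 76), (80, 95)), ((123, 134), (139, 150))]
NIC_SITE = (135, 136)  # the nick lies between these two positions


def region(seq, start, end):
    """Return seq[start..end] for 1-based inclusive coordinates."""
    return seq[start - 1:end]


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def arm_identity(left, right):
    """Ungapped fraction of positions where left matches revcomp(right).

    Assumption: both arms have the same length, as in this record.
    """
    rc = revcomp(right)
    return sum(a == b for a, b in zip(left, rc)) / len(left)


for left_xy, right_xy in IR_PAIRS:
    left = region(ORIT, *left_xy)
    right = region(ORIT, *right_xy)
    print(left_xy, left, right_xy, right, f"identity={arm_identity(left, right):.2f}")

# Three bases upstream of the nick and one base downstream, joined with "|".
upstream = region(ORIT, NIC_SITE[0] - 2, NIC_SITE[0])
downstream = region(ORIT, NIC_SITE[1], NIC_SITE[1])
nic_context = f"{upstream}|{downstream}"
print("nic context:", nic_context)
```

The slices reproduce the repeat arms and the TGG|T context listed in the record, which confirms that the coordinates are relative to the 215-nt oriT sequence rather than to the GenBank record.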
oriT sequence
Length: 215 nt
>oriT_ICESpaNUF1049
AAGCGGAAGTCGCAGGTGTGGACTGATCTTGCTGGCTGGTGTGGCAATAGCCACGCCAGCACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATACGCCTTTTTGATTGGAGGGATTT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 249 | GenBank | BAG80620 |
Name | Relaxase_ICESpaNUF1049 | UniProt ID | B6F268 |
Length | 329 a.a. | PDB ID | |
Note | similar to Tn916(ORF20); U09422 |
Relaxase protein sequence
Length: 329 a.a. | Molecular weight: 39099.72 Da | Isoelectric point: 7.0754
>BAG80620.1 hypothetical protein [Streptococcus parauberis]
MLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGRGC
RQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRSGE
LVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLLVY
DNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQVAP
TLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK
Protein domains
Predicted by InterProScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | B6F268 |
T4CP
ID | 249 | GenBank | BAG80619 |
Name | T4CP_ICESpaNUF1049 | UniProt ID | B6F267 |
Length | 461 a.a. | PDB ID | - |
Note | similar to Tn916(ORF21); U09422 |
T4CP protein sequence
Length: 461 a.a. | Molecular weight: 53370.27 Da | Isoelectric point: 9.0687
>BAG80619.1 hypothetical protein [Streptococcus parauberis]
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
Protein domains
Predicted by InterProScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | B6F267 |
T4SS
The T4SS was predicted using oriTfinder2.
Region 1: 3351..14760
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
Locus_0 | 842..1381 | + | 540 | BAG80614 | hypothetical protein | - |
Locus_1 | 1390..2379 | + | 990 | BAG80615 | hypothetical protein | - |
Locus_2 | 3209..3328 | + | 120 | BAG80616 | hypothetical protein | - |
Locus_3 | 3351..3665 | + | 315 | BAG80617 | hypothetical protein | orf23 |
Locus_4 | 3681..4067 | + | 387 | BAG80618 | hypothetical protein | orf23 |
Locus_5 | 4096..5481 | + | 1386 | BAG80619 | hypothetical protein | virb4 |
Locus_6 | 5875..6864 | + | 990 | BAG80620 | hypothetical protein | - |
Locus_7 | 6907..7128 | + | 222 | BAG80621 | hypothetical protein | orf19 |
Locus_8 | 7245..7742 | + | 498 | BAG80622 | hypothetical protein | - |
Locus_9 | 7717..8223 | + | 507 | BAG80623 | hypothetical protein | orf17a |
Locus_10 | 8207..10654 | + | 2448 | BAG80624 | hypothetical protein | virb4 |
Locus_11 | 10657..12921 | + | 2265 | BAG80625 | hypothetical protein | orf15 |
Locus_12 | 12830..13831 | + | 1002 | BAG80626 | hypothetical protein | orf14 |
Locus_13 | 13828..14760 | + | 933 | BAG80627 | hypothetical protein | orf13 |
Locus_14 | 15035..15121 | + | 87 | BAG80628 | tet(M) leader peptide | - |
Locus_15 | 15137..17056 | + | 1920 | BAG80629 | Tn916-like_tet(M) | - |
Locus_16 | 17154..17342 | + | 189 | BAG80630 | hypothetical protein | - |
Locus_17 | 17402..17755 | - | 354 | BAG80631 | putative transcriptional regulator | - |
Locus_18 | 17960..18031 | + | 72 | BAG80632 | hypothetical protein | - |
Locus_19 | 18209..18682 | + | 474 | BAG80633 | hypothetical protein | - |
Locus_20 | 18679..18909 | + | 231 | BAG80634 | hypothetical protein | - |
Locus_21 | 19135..19386 | - | 252 | BAG80635 | hypothetical protein | - |
Locus_22 | 19370..19573 | + | 204 | BAG80636 | Tn-excisionase | - |
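The locus table can be cross-checked with a few lines of Python. The sketch below is illustrative only: it parses the first five columns of the table, confirms that each Size (bp) equals end - start + 1, and lists the loci that fall inside Region 1 (3351..14760). The names `Locus`, `parse_row`, and `protein_length` are hypothetical, and `protein_length` assumes the CDS coordinates include the stop codon.

```python
from typing import NamedTuple

# First five columns of the locus table (Product and Description omitted).
TABLE = """
Locus_0 | 842..1381 | + | 540 | BAG80614
Locus_1 | 1390..2379 | + | 990 | BAG80615
Locus_2 | 3209..3328 | + | 120 | BAG80616
Locus_3 | 3351..3665 | + | 315 | BAG80617
Locus_4 | 3681..4067 | + | 387 | BAG80618
Locus_5 | 4096..5481 | + | 1386 | BAG80619
Locus_6 | 5875..6864 | + | 990 | BAG80620
Locus_7 | 6907..7128 | + | 222 | BAG80621
Locus_8 | 7245..7742 | + | 498 | BAG80622
Locus_9 | 7717..8223 | + | 507 | BAG80623
Locus_10 | 8207..10654 | + | 2448 | BAG80624
Locus_11 | 10657..12921 | + | 2265 | BAG80625
Locus_12 | 12830..13831 | + | 1002 | BAG80626
Locus_13 | 13828..14760 | + | 933 | BAG80627
Locus_14 | 15035..15121 | + | 87 | BAG80628
Locus_15 | 15137..17056 | + | 1920 | BAG80629
Locus_16 | 17154..17342 | + | 189 | BAG80630
Locus_17 | 17402..17755 | - | 354 | BAG80631
Locus_18 | 17960..18031 | + | 72 | BAG80632
Locus_19 | 18209..18682 | + | 474 | BAG80633
Locus_20 | 18679..18909 | + | 231 | BAG80634
Locus_21 | 19135..19386 | - | 252 | BAG80635
Locus_22 | 19370..19573 | + | 204 | BAG80636
"""

REGION_1 = (3351, 14760)  # T4SS region reported by oriTfinder2


class Locus(NamedTuple):
    tag: str
    start: int
    end: int
    strand: str
    size: int
    protein_id: str


def parse_row(line: str) -> Locus:
    """Parse one 'tag | start..end | strand | size | protein' row."""
    tag, coords, strand, size, protein_id = (f.strip() for f in line.split("|"))
    start, end = (int(x) for x in coords.split(".."))
    return Locus(tag, start, end, strand, int(size), protein_id)


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the stop codon is included."""
    return cds_bp // 3 - 1


loci = [parse_row(line) for line in TABLE.strip().splitlines()]

# Rows whose Size (bp) disagrees with their 1-based inclusive coordinates.
size_mismatches = [x.tag for x in loci if x.end - x.start + 1 != x.size]
print("size mismatches:", size_mismatches)

# Loci lying entirely inside Region 1.
in_region = [x for x in loci if REGION_1[0] <= x.start and x.end <= REGION_1[1]]
print("Region 1 loci:", [x.tag for x in in_region])

# Compare CDS sizes with the protein lengths given for the relaxase and T4CP.
by_protein = {x.protein_id: x for x in loci}
for pid in ("BAG80620", "BAG80619"):
    print(pid, protein_length(by_protein[pid].size), "a.a.")
```

Under the stop-codon assumption, the 990 bp and 1386 bp coding sequences correspond to the 329 a.a. relaxase and 461 a.a. T4CP lengths listed in this record.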
Host bacterium
ID | 40 | Element type | |
Element name | ICESpaNUF1049 | GenBank | AB468159 |
Element size | 22381 bp | Coordinate of oriT [Strand] | 5456..5670 [+] |
Host bacterium | Streptococcus parauberis | Coordinate of element | 3016..21046 |
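As a small worked example of the coordinate fields above, the Python sketch below parses the two location strings, checks that the oriT lies within the element, and converts the oriT position to element-relative coordinates. It assumes both coordinate pairs refer to the same GenBank record (AB468159) and are 1-based and inclusive; `parse_location` is a hypothetical helper, not part of any oriTDB tool.

```python
import re


def parse_location(text):
    """Parse 'start..end' with an optional ' [+]' or ' [-]' strand suffix.

    Returns (start, end, strand), where strand is None if absent.
    """
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)(?:\s*\[([+-])\])?\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)


orit_start, orit_end, orit_strand = parse_location("5456..5670 [+]")
elem_start, elem_end, _ = parse_location("3016..21046")

# Lengths follow from 1-based inclusive coordinates.
orit_length = orit_end - orit_start + 1
elem_span = elem_end - elem_start + 1

# True when the oriT lies entirely within the element boundaries.
orit_inside = elem_start <= orit_start and orit_end <= elem_end

# oriT position counted from the first base of the element.
orit_relative = (orit_start - elem_start + 1, orit_end - elem_start + 1)

print("oriT length:", orit_length, "strand:", orit_strand)
print("element span:", elem_span)
print("oriT inside element:", orit_inside)
print("oriT relative to element start:", orit_relative)
```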
Cargo genes
Drug resistance gene | tet(M) |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |