Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 200022 |
Name | oriT_Tn5251 |
Organism | Streptococcus pneumoniae |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | FJ711160 (15378..15592 [+], 215 nt) |
oriT length | 215 nt |
IRs (inverted repeats) | 61..76, 80..95 (ACTTAACCCCCCGTAT..ACAGGGGGGTACAAAT) 123..134, 139..150 (GAAAATCCTTTG..CAAGGGATTTAC) |
Location of nic site | 135..136 |
Conserved sequence flanking the nic site |
TGG|T |
Note | blastn alignment with the oriT_Tn916 |
oriT sequence
Download Length: 215 nt
>oriT_Tn5251
AAGCGGAAGTCGCAGGTGTGGACTGATCTTGCTGGCTGGTGTGGCAATAGCCACGCCAGCACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATACGCCTTTTTGATTGGAGGGATTT
AAGCGGAAGTCGCAGGTGTGGACTGATCTTGCTGGCTGGTGTGGCAATAGCCACGCCAGCACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATACGCCTTTTTGATTGGAGGGATTT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 236 | GenBank | ACW84401 |
Name | Orf20_Tn5251 | UniProt ID | D3U1A0 |
Length | 401 a.a. | PDB ID | |
Note | _ |
Relaxase protein sequence
Download Length: 401 a.a. Molecular weight: 47398.10 Da Isoelectric Point: 6.5091
>ACW84401.1 relaxase [Streptococcus pneumoniae]
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAP
LEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGR
GCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRS
GELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLL
VYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAP
LEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGR
GCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRS
GELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLL
VYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | D3U1A0 |
T4CP
ID | 236 | GenBank | ACW84402 |
Name | Orf21_Tn5251 | UniProt ID | B7UTY5 |
Length | 461 a.a. | PDB ID | _ |
Note | FtsK-SpoIIIE family protein |
T4CP protein sequence
Download Length: 461 a.a. Molecular weight: 53370.27 Da Isoelectric Point: 9.0687
>ACW84402.1 FtsK-SpoIIIE family protein [Streptococcus pneumoniae]
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | B7UTY5 |
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 6287..17697
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
Locus_1 | 1474..1677 | - | 204 | ACW84385 | excisionase | - |
Locus_2 | 1661..1912 | + | 252 | ACW84386 | unknown | - |
Locus_3 | 2138..2368 | - | 231 | ACW84387 | unknown | - |
Locus_4 | 2365..2787 | - | 423 | ACW84388 | putative sigma factor | - |
Locus_5 | 3016..3162 | - | 147 | ACW84389 | unknown | - |
Locus_6 | 3292..3645 | + | 354 | ACW84390 | putative transcriptional regulator | - |
Locus_7 | 3705..3893 | - | 189 | ACW84391 | unknown | - |
Locus_8 | 3991..5910 | - | 1920 | ACW84392 | tetracycline resistance protein | - |
Locus_9 | 5926..6012 | - | 87 | ACW84393 | tet(M) leader peptide | - |
Locus_10 | 6287..7219 | - | 933 | ACW84394 | unknown | orf13 |
Locus_11 | 7216..8217 | - | 1002 | ACW84395 | putative cell wall hydrolase | orf14 |
Locus_12 | 8214..10391 | - | 2178 | ACW84396 | unknown | orf15 |
Locus_13 | 10394..12841 | - | 2448 | ACW84397 | type IV secretion protein virB4 | virb4 |
Locus_14 | 12825..13331 | - | 507 | ACW84398 | unknown | orf17a |
Locus_15 | 13306..13803 | - | 498 | ACW84399 | antirestriction protein | - |
Locus_16 | 13920..14141 | - | 222 | ACW84400 | unknown | orf19 |
Locus_17 | 14184..15389 | - | 1206 | ACW84401 | relaxase | - |
Locus_18 | 15567..16952 | - | 1386 | ACW84402 | FtsK-SpoIIIE family protein | virb4 |
Locus_19 | 16981..17364 | - | 384 | ACW84403 | unknown | orf23 |
Locus_20 | 17383..17697 | - | 315 | ACW84404 | unknown | orf23 |
Locus_21 | 17720..17839 | - | 120 | ACW84405 | unknown | - |
Host bacterium
ID | 27 | Element type | |
Element name | Tn5251 | GenBank | FJ711160 |
Element size | 18033 bp | Coordinate of oriT [Strand] | 15378..15592 [+] |
Host bacterium | Streptococcus pneumoniae |
Cargo genes
Drug resistance gene | tet(M) |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |