Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 200049 |
Name | oriT_Tn2010 |
Organism | Streptococcus pneumoniae |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | AB426620 (2441..2655 [+], 215 nt) |
oriT length | 215 nt |
IRs (inverted repeats) | 61..76, 80..95 (ACTTAACCCCCCGTAT..ACAGGGGGGTACAAAT) 123..134, 139..150 (GAAAATCCTTTG..CAAGGGATTTAC) |
Location of nic site | 135..136 |
Conserved sequence flanking the nic site |
TGG|T |
Note | blastn alignment with the oriT_Tn916 |
oriT sequence
Download Length: 215 nt
>oriT_Tn2010
AAGCGGAAGTCGCAGGTGTGGACTGATCTTGCTGGCTGGTGTGGCAATAGCCACGCCAGCACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATACGCCTTTTTGATTGGAGGGATTT
AAGCGGAAGTCGCAGGTGTGGACTGATCTTGCTGGCTGGTGTGGCAATAGCCACGCCAGCACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATACGCCTTTTTGATTGGAGGGATTT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 263 | GenBank | BAG12482 |
Name | ORF20_Tn2010 | UniProt ID | B1B630 |
Length | 348 a.a. | PDB ID | |
Note | similar to ORF20 in Tn6002 |
Relaxase protein sequence
Download Length: 348 a.a. Molecular weight: 41346.28 Da Isoelectric Point: 8.1935
>BAG12482.1 hypothetical protein [Streptococcus pneumoniae]
MLFDYVRIRFPTTDVQHVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGRGC
RQFESYLLAQQRSWYEFFMDALVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRSGE
LVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLLVY
DNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQVAP
TLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKKGYLSTIPVDRYPKKRYNGR
MLFDYVRIRFPTTDVQHVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGRGC
RQFESYLLAQQRSWYEFFMDALVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRSGE
LVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLLVY
DNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQVAP
TLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKKGYLSTIPVDRYPKKRYNGR
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | B1B630 |
T4CP
ID | 263 | GenBank | BAG12481 |
Name | ORF21_Tn2010 | UniProt ID | B1B629 |
Length | 461 a.a. | PDB ID | _ |
Note | similar to ORF21 in Tn6002 |
T4CP protein sequence
Download Length: 461 a.a. Molecular weight: 53369.29 Da Isoelectric Point: 9.1761
>BAG12481.1 hypothetical protein [Streptococcus pneumoniae]
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVQAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVQAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | B1B629 |
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 336..14595
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
Locus_0 | 194..313 | + | 120 | BAG12478 | hypothetical protein | - |
Locus_1 | 336..650 | + | 315 | BAG12479 | hypothetical protein | orf23 |
Locus_2 | 666..1052 | + | 387 | BAG12480 | hypothetical protein | orf23 |
Locus_3 | 1081..2466 | + | 1386 | BAG12481 | hypothetical protein | virb4 |
Locus_4 | 2860..3906 | + | 1047 | BAG12482 | hypothetical protein | - |
Locus_5 | 3896..4063 | + | 168 | BAG12483 | MLS leader peptide | - |
Locus_6 | 4112..4195 | + | 84 | BAG12484 | MLS leader peptide | - |
Locus_7 | 4320..5057 | + | 738 | BAG12485 | MLS methylase | - |
Locus_8 | 5002..5193 | + | 192 | BAG12486 | hypothetical protein | - |
Locus_9 | 5314..6561 | - | 1248 | BAG12487 | hypothetical protein | - |
Locus_10 | 6741..6962 | + | 222 | BAG12488 | hypothetical protein | orf19 |
Locus_11 | 7079..7576 | + | 498 | BAG12489 | hypothetical protein | - |
Locus_12 | 7551..8057 | + | 507 | BAG12490 | hypothetical protein | orf17a |
Locus_13 | 8041..10488 | + | 2448 | BAG12491 | hypothetical protein | virb4 |
Locus_14 | 10491..12668 | + | 2178 | BAG12492 | hypothetical protein | orf15 |
Locus_15 | 12665..13666 | + | 1002 | BAG12493 | hypothetical protein | orf14 |
Locus_16 | 13663..14595 | + | 933 | BAG12494 | hypothetical protein | orf13 |
Locus_17 | 14870..14956 | + | 87 | BAG12495 | hypothetical protein | - |
Locus_18 | 14972..16891 | + | 1920 | BAG12496 | tetracycline resistance protein | - |
Locus_19 | 17070..17258 | + | 189 | BAG12497 | hypothetical protein | - |
Locus_20 | 17497..17832 | + | 336 | BAG12498 | hypothetical protein | - |
Locus_21 | 17845..18213 | + | 369 | BAG12499 | hypothetical protein | - |
Locus_22 | 18200..18499 | + | 300 | BAG12500 | hypothetical protein | - |
Host bacterium
ID | 54 | Element type | Transposon |
Element name | Tn2010 | GenBank | AB426620 |
Element size | 26390 bp | Coordinate of oriT [Strand] | 2441..2655 [+] |
Host bacterium | Streptococcus pneumoniae |
Cargo genes
Drug resistance gene | erm(B), tet(M), msr(D), mef(A) |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |