Detailed information on oriT

oriT


Information on the oriT region


oriTDB ID   200468
Name   oriT_Tn5253 (in silico)
Organism   Streptococcus pneumoniae strain DP1322
Sequence Completeness      -
NCBI accession of oriT (coordinates [strand])   EU351020 (31699..31831 [-], 133 nt)
oriT length   133 nt
IRs (inverted repeats)   65..70, 83..88 (AAATCC..GGATTT); 6..12, 23..29 (ACCCCCC..GGGGGGT)
Location of nic site      75..76
Conserved sequence flanking the nic site   TTTGGTTACA
Note   Predicted by oriTfinder 2.0

  oriT sequence  


Length: 133 nt

>oriT_Tn5253
ACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATAC
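
The annotations listed for this oriT can be cross-checked against the sequence itself. The following is a minimal Python sketch, not part of oriTfinder: it assumes the coordinates are 1-based and inclusive, and that the nic site lies between positions 75 and 76. The helper names `revcomp` and `region` are illustrative. It checks that each inverted-repeat arm is the reverse complement of its partner, reports the bases on either side of the nic site, and locates the conserved flanking sequence.

```python
# oriT_Tn5253 sequence as listed in this record (133 nt)
ORIT = "ACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATAC"

# Annotations copied from the record (assumed 1-based, inclusive)
IRS = [((65, 70), (83, 88)), ((6, 12), (23, 29))]
NIC = (75, 76)
CONSERVED = "TTTGGTTACA"


def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def region(seq: str, start: int, end: int) -> str:
    """Extract a 1-based, inclusive region from seq."""
    return seq[start - 1:end]


for left_pos, right_pos in IRS:
    left = region(ORIT, *left_pos)
    right = region(ORIT, *right_pos)
    print(left_pos, left, right_pos, right,
          "reverse complements:", revcomp(left) == right)

# Bases immediately 5' and 3' of the assumed cleavage position
print("nic site between", region(ORIT, NIC[0], NIC[0]),
      "and", region(ORIT, NIC[1], NIC[1]))

# 1-based start of the conserved flanking sequence
print("conserved sequence starts at", ORIT.find(CONSERVED) + 1)
```

With the values in this record, both IR pairs come out as exact reverse complements, and the conserved sequence spans positions 71..80, so it contains the annotated nic site.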

Visualization of oriT structure

  oriT secondary structure

Predicted by RNAfold.



Relaxase


ID   14494
GenBank   ACC59242
Name   Rep_trans_-_Tn5253 (in silico)
UniProt ID   -
Length   401 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  Relaxase protein sequence


Length: 401 a.a.        Molecular weight: 47398.10 Da        Isoelectric point: 6.5091

>ACC59242.1 relaxase [Streptococcus pneumoniae]
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAP
LEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGR
GCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRS
GELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLL
VYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK

  Protein domains


Predicted by InterProScan. Domain coordinates (a.a.): 15-53, 169-373, 73-161


  Protein structure



No available structure.



ID   14495
GenBank   ACC59279
Name   Relaxase_-_Tn5253 (in silico)
UniProt ID   -
Length   609 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  Relaxase protein sequence


Length: 609 a.a.        Molecular weight: 73069.47 Da        Isoelectric point: 9.0694

>ACC59279.1 relaxase [Streptococcus pneumoniae]
MVITKHFAIHGKNYRSKLIKYILNPSKTKNLTLVSDFGMRNYLDFPSYKELVKMYNDNFLSNDTLYEFRH
DRQEVNQRKIHSHHIIQSFSPDDHLTPEQINRIGYEAAKELTGGRFRFIVATHVDKGHIHNHIILNSIDQ
NSDKKFLWDYKAEHNLRMVSDRLSKIAGAKIIENRYSHRQYEVYRKTNYKYEIKQRVYFLIENSKNFEDL
KKKAKALHLKIDFRHKHVTYFMTDSNMKQVVRDSKLSRKQPYNETYFEKKFVQREIINILEFLLPKMKNM
NELIQRAEVFGLKIIPKEKHVLFEFDGIKLAEQELVKTNLYSVSYFQDYFNNKNETFVLDNKNLVELYNE
EKIIKEKELPSEEMVWKSYQDFKRNRDAVHEFEVELNLNQIEEVVDDGIYIKVQFGIRQEGLIFVPNIQI
NMEEEKVKVFLRETSSYYVYHKDSVEKNRFMKGKTLIRQFNLQYEPQYMYRRIPLSKIKEKIEQLDFLIS
AENSSNDFEDITNDFIAQISYLENMIEQVQNKINDLTNLEEVLLKDTTNSSSNLENSIQGKSSVDTIEKD
LYIYKGKIETLKEQHREAINLFEMFNKTIKKYKEKQNMKSIKENEIHLE

  Protein domains


Predicted by InterProScan. Domain coordinates (a.a.): 12-264


  Protein structure



No available structure.




Auxiliary protein


ID   7915
GenBank   ACC59226
Name   ACC59226_Tn5253 (in silico)
UniProt ID   -
Length   405 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  Auxiliary protein sequence


Length: 405 a.a.        Molecular weight: 47021.92 Da        Isoelectric point: 9.7978

>ACC59226.1 integrase [Streptococcus pneumoniae]
MSEKRRDNKGRILKTGESQRKDGRYLYKYIDSFGEPQFVYSWKLVATDRVPAGKRDCISLREKIAELQKD
IHDGIDVVGKKMTLCQLYAKQNAQRPKVRKNTETGRKYLMDILKKDKLGVRSIDSIKPSDAKEWAIRMSE
NGYAYQTINNYKRSLKASFYIAIQDDCVRKNPFDFQLKAVLDDDTVPKTVLTEEQEEKLLAFAKADKTYS
KNYDEILILLKTGLRISEFGGLTLPDLDFENRLVNIDHQLLRDTEIGYYIETPKTKSGERQVPMVEEAYQ
AFKRVLANRKNDKRVEIDGYSDFLFLNRKNYPKVASDYNGMMKGLVKKYNKYNEDKLPHITPHSLRHTFC
TNYANAGMNPKALQYIMGHANIAMTLNYYAHATFDSAMAEMKRLNKEKQQERLVA

  Protein domains


Predicted by InterProScan. Domain coordinates (a.a.): 190-383, 1-71


  Protein structure



No available structure.




T4CP


ID   17101
GenBank   ACC59218
Name   traG_-_Tn5253 (in silico)
UniProt ID   -
Length   625 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Length: 625 a.a.        Molecular weight: 72444.76 Da        Isoelectric point: 9.5719

>ACC59218.1 TraG protein [Streptococcus pneumoniae]
MYSGKKFLLFSLLGILLGYLFHRLTLLYDSYTGNTLDKWTRLLMEGQEEVLQSPWNISFTGKSSAFFLLG
FVMMLLVYLYLETGKKQYREGVEYGSARFGTLKEKNLFYGKEFSHDTILAQDVRLTLLDKKPPQYDRNKN
IAVIGGSGSGKTFRFVKPNLIQMNSSNIVVDPKDHLAEKTGKLFLEHGYQVKVLDLVNMKNSDGFNPFRY
IETENDLNRMLTVYFNNTKGSGSRSDPFWDEASMTLVRALASYLVDFYNPPKTREQLIEESRLSQKEYQN
LLKRQKKEVEERKKRGRYPSFAEISKLIKHLSKGENQEKSVLEILFENYAKKYGTENFTMRNWADFQNYK
DKTLDSVIAVTTAKFALFNIQSVMDLTKRDTLDMKTWGQEKSMVYLVIPDNDSTFRFLSALFFSTVFQTL
TRQADIDFKGQLPLHVRVYLDEFANIGEIPDFAEQTSTVRSRNMSLVPILQNIAQLQGLYKEKEAWKTIL
GNCDSLVYLGGNDEDTFKFMSGLLGKQTIDVRNTSRSFGQTGSGSLSHQKIARDLMTPDEVGNMKRHECL
VRIANMPVFKSKKYNSTKHPNWKYLANQETDERWWNYQINPLNQRQENHLEGLRIRDLTFESSLK

  Protein domains


Predicted by InterProScan. Domain coordinates (a.a.): 133-574

  Protein structure



No available structure.



ID   17102
GenBank   ACC59243
Name   tcpA_-_Tn5253 (in silico)
UniProt ID   -
Length   461 a.a.
PDB ID   -
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Length: 461 a.a.        Molecular weight: 53370.27 Da        Isoelectric point: 9.0687

>ACC59243.1 putative FtsK-SpoIIIE family protein [Streptococcus pneumoniae]
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD

  Protein domains


Predicted by InterProScan. Domain coordinates (a.a.): 217-301

  Protein structure



No available structure.




T4SS


The T4SS region was predicted using oriTfinder 2.0.

Region 1: 3460..47747

Locus tag   Coordinates   Strand   Size (bp)   Protein ID   Product   Description
Locus_0   1..353   +   353   ACC59205   IgA-specific zinc metalloproteinase   -
Locus_1   1070..1243   +   174   ACC59206   unknown   -
Locus_2   1240..2019   +   780   ACC59207   RepA   -
Locus_3   2020..2121   +   102   ACC59208   unknown   -
Locus_4   2118..3476   +   1359   ACC59209   methyltransferase   -
Locus_5   3460..3909   +   450   ACC59210   unknown   gbs1369
Locus_6   3902..4282   +   381   ACC59211   unknown   -
Locus_7   4295..4528   +   234   ACC59212   unknown   -
Locus_8   4531..5118   +   588   ACC59213   putative protease   -
Locus_9   5246..5836   +   591   ACC59214   abortive infection protein AbiEi   -
Locus_10   5836..6672   +   837   ACC59215   abortive infection protein AbiEii   -
Locus_11   6955..7254   +   300   ACC59216   unknown   -
Locus_12   7244..7732   +   489   ACC59217   unknown   gbs1365
Locus_13   7732..9609   +   1878   ACC59218   TraG protein   virb4
Locus_14   9630..9872   +   243   ACC59219   unknown   prgF
Locus_15   9889..10743   +   855   ACC59220   unknown   prgHb
Locus_16   10797..11156   +   360   ACC59221   unknown   prgIc
Locus_17   11107..13464   +   2358   ACC59222   type IV secretion protein VirB4   virb4
Locus_18   13476..16289   +   2814   ACC59223   putative peptidoglycan hydrolase   prgK
Locus_19   16341..16469   +   129   ACC59224   unknown   -
Locus_20   16591..16785   -   195   ACC59225   truncated ISSth5 transposase   -
Locus_21   17306..18523   -   1218   ACC59226   integrase   -
Locus_22   18605..18808   -   204   ACC59227   excisionase   -
Locus_23   18792..19043   +   252   ACC59228   unknown   -
Locus_24   19269..19499   -   231   ACC59229   unknown   -
Locus_25   19496..19918   -   423   ACC59230   putative sigma factor   -
Locus_26   20147..20293   -   147   ACC59231   unknown   -
Locus_27   20423..20776   +   354   ACC59232   putative transcriptional regulator   -
Locus_28   20836..21024   -   189   ACC59233   unknown   -
Locus_29   21122..23041   -   1920   ACC59234   tetracycline resistance protein Tet(M)   -
Locus_30   23057..23143   -   87   ADB27150   tet(M) leader peptide   -
Locus_31   23418..24350   -   933   ACC59235   unknown   orf13
Locus_32   24347..25348   -   1002   ACC59236   putative cell wall hydrolase   orf14
Locus_33   25345..27522   -   2178   ACC59237   unknown   orf15
Locus_34   27525..29972   -   2448   ACC59238   type IV secretion protein VirB4   virb4
Locus_35   29956..30462   -   507   ACC59239   unknown   orf17a
Locus_36   30437..30934   -   498   ACC59240   antirestriction protein ArdA   -
Locus_37   31051..31272   -   222   ACC59241   unknown   orf19
Locus_38   31315..32520   -   1206   ACC59242   relaxase   -
Locus_39   32698..34083   -   1386   ACC59243   putative FtsK-SpoIIIE family protein   virb4
Locus_40   34112..34495   -   384   ACC59244   unknown   orf23
Locus_41   34514..34828   -   315   ACC59245   unknown   orf23
Locus_42   34851..34970   -   120   ACC59246   unknown   -
Locus_43   35267..35998   +   732   ACC59247   putative ATP-binding protein   -
Locus_44   36016..37332   +   1317   ACC59248   putative permease protein   -
Locus_45   37388..37492   -   105   ACC59249   unknown   -
Locus_46   37763..38173   +   411   ACC59250   unknown   cd424
Locus_47   38221..44307   +   6087   ACC59251   putative restriction-modification protein   -
Locus_48   44382..44666   +   285   ACC59252   unknown   -
Locus_49   44680..44976   +   297   ACC59253   unknown   gbs1350
Locus_50   44990..45187   -   198   ACC59254   unknown   -
Locus_51   45347..45988   +   642   ACC59255   unknown   prgL
Locus_52   45998..47083   +   1086   ACC59256   unknown   traP
Locus_53   47134..47367   +   234   ACC59257   unknown   gbs1347
Locus_54   47364..47747   +   384   ACC59258   unknown   gbs1346
Locus_55   47785..48057   +   273   ACC59259   unknown   -
Locus_56   48125..48601   +   477   ACC59260   antitoxin PezA   -
Locus_57   48601..49359   +   759   ACC59261   toxin PezT   -
Locus_58   49835..50623   +   789   ACC59262   unknown   -
Locus_59   50862..51110   +   249   ACC59263   unknown   -
Locus_60   51281..51931   +   651   ACC59264   chloramphenicol acetyl transferase CAT   -
Locus_61   52236..52325   -   90   ACC59265   unknown   -
Locus_62   52442..52666   -   225   ACC59266   unknown   -
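
The locus table can be read programmatically to check that each listed size matches its coordinates and to find which locus overlaps the oriT. The following is a rough Python sketch under stated assumptions: the first five whitespace-separated tokens of a row are the locus tag, coordinates, strand, size and protein ID; the last token is the description; everything in between is the product. The `Locus` type and `parse_row` helper are hypothetical names, and only four rows from the table are included as sample input.

```python
from typing import NamedTuple


class Locus(NamedTuple):
    tag: str
    start: int
    end: int
    strand: str
    size: int
    protein_id: str
    product: str
    description: str


def parse_row(line: str) -> Locus:
    """Parse one table row; the product may contain spaces."""
    tokens = line.split()
    tag, coords, strand, size, protein_id = tokens[:5]
    start, end = (int(x) for x in coords.split(".."))
    product = " ".join(tokens[5:-1])
    return Locus(tag, start, end, strand, int(size), protein_id,
                 product, tokens[-1])


def overlaps(locus: Locus, start: int, end: int) -> bool:
    """True if the locus shares at least one position with start..end."""
    return locus.start <= end and start <= locus.end


# Sample rows copied from the table above
ROWS = """\
Locus_13 7732..9609 + 1878 ACC59218 TraG protein virb4
Locus_37 31051..31272 - 222 ACC59241 unknown orf19
Locus_38 31315..32520 - 1206 ACC59242 relaxase -
Locus_39 32698..34083 - 1386 ACC59243 putative FtsK-SpoIIIE family protein virb4
""".splitlines()

ORIT = (31699, 31831)  # oriT coordinates from this record

loci = [parse_row(row) for row in ROWS]
for locus in loci:
    size_ok = locus.end - locus.start + 1 == locus.size
    print(locus.tag, locus.product, "size consistent:", size_ok,
          "overlaps oriT:", overlaps(locus, *ORIT))
```

For these sample rows, the listed sizes agree with the inclusive coordinate spans, and the oriT coordinates fall inside Locus_38 (relaxase, ACC59242).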


Host bacterium


ID   413
Element type   ICE (Integrative and conjugative element)
Element name   Tn5253
GenBank   EU351020
Element size   66192 bp
Coordinate of oriT [Strand]   31699..31831 [-]
Host bacterium   Streptococcus pneumoniae strain DP1322
Coordinate of element   833..65360
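
The oriT coordinates are given relative to the GenBank record, while the element itself starts at position 833 of that record. As a small illustration, the Python sketch below converts the oriT position into element-relative coordinates. It assumes all coordinates are 1-based and inclusive; `to_element_coords` is a hypothetical helper name.

```python
def to_element_coords(start: int, end: int, element_start: int) -> tuple[int, int]:
    """Shift 1-based record coordinates so the element starts at position 1."""
    offset = element_start - 1
    return start - offset, end - offset


ELEMENT = (833, 65360)   # coordinate of element in EU351020
ORIT = (31699, 31831)    # coordinate of oriT in EU351020

rel_start, rel_end = to_element_coords(*ORIT, ELEMENT[0])
print(f"oriT within element: {rel_start}..{rel_end}")      # 30867..30999
print(f"oriT length: {rel_end - rel_start + 1} nt")         # 133 nt
```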

Cargo genes


Drug resistance gene   tet(M), cat(pC194)
Virulence gene   -
Metal resistance gene   -
Degradation gene   -
Symbiosis gene   -
Anti-CRISPR   AcrIIA21