Detailed information on oriT
oriT
Information on the oriT region
oriTDB ID | 200468 |
Name | oriT_Tn5253 |
Organism | Streptococcus pneumoniae strain DP1322 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | EU351020 (31699..31831 [-], 133 nt) |
oriT length | 133 nt |
IRs (inverted repeats) | 65..70, 83..88 (AAATCC..GGATTT); 6..12, 23..29 (ACCCCCC..GGGGGGT) |
Location of nic site | 75..76 |
Conserved sequence flanking the nic site | TTTGGTTACA |
Note | Predicted by oriTfinder 2.0 |
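The coordinates listed above can be cross-checked with simple arithmetic. The following is a minimal sketch and not part of oriTDB: the helper names `span_length` and `to_genbank` are hypothetical, and the strand mapping assumes that, for a feature on the minus strand, position 1 of the oriT sequence corresponds to the highest GenBank coordinate.

```python
def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based range such as 31699..31831."""
    return end - start + 1


def to_genbank(pos: int, start: int, end: int, strand: str) -> int:
    """Map a 1-based position inside the oriT to a GenBank coordinate.

    Assumption (not stated in the record): on the '-' strand the oriT
    sequence is read from `end` down to `start`.
    """
    if strand == "-":
        return end - pos + 1
    return start + pos - 1


# Values taken from the record: EU351020 (31699..31831 [-], 133 nt).
ORIT_START, ORIT_END, ORIT_STRAND = 31699, 31831, "-"

orit_length = span_length(ORIT_START, ORIT_END)

# The nic site sits between positions 75 and 76 of the oriT sequence.
nic_genbank = tuple(
    to_genbank(p, ORIT_START, ORIT_END, ORIT_STRAND) for p in (75, 76)
)

print("oriT length:", orit_length)
print("nic site flanked by GenBank positions:", nic_genbank)
```

If the strand assumption does not hold for this entry, only `to_genbank` needs to change; the length check is independent of strand.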
oriT sequence
Length: 133 nt
ACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATAC
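The annotated features can be checked directly against this sequence. The sketch below is only an illustration: it confirms that each pair of inverted-repeat arms is mutually reverse-complementary and that the nic site falls inside the conserved flanking sequence. The helper names `reverse_complement` and `region` are hypothetical, and all coordinates are treated as 1-based and inclusive, which is an assumption about the record's convention.

```python
# oriT sequence copied from the record (stated length: 133 nt).
ORIT = (
    "ACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCC"
    "TTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATAC"
)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return dna.translate(_COMPLEMENT)[::-1]


def region(seq: str, start: int, end: int) -> str:
    """Extract a 1-based, inclusive region (assumed record convention)."""
    return seq[start - 1:end]


# Inverted-repeat arms as listed in the record.
IR_PAIRS = [((65, 70), (83, 88)), ((6, 12), (23, 29))]

ir_report = []
for left, right in IR_PAIRS:
    left_arm = region(ORIT, *left)
    right_arm = region(ORIT, *right)
    # A perfect inverted repeat has arms that are reverse complements.
    ir_report.append(
        (left_arm, right_arm, reverse_complement(left_arm) == right_arm)
    )

# Locate the conserved flanking sequence; a start of 0 means "not found".
FLANK = "TTTGGTTACA"
flank_start = ORIT.find(FLANK) + 1
flank_end = flank_start + len(FLANK) - 1

# The nic site lies between positions 75 and 76.
nic_inside_flank = flank_start <= 75 and 76 <= flank_end

for left_arm, right_arm, ok in ir_report:
    print(left_arm, right_arm, "reverse-complementary" if ok else "mismatch")
print("flank spans", flank_start, "to", flank_end, "| nic inside:", nic_inside_flank)
```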
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 14494 | GenBank | ACC59242 |
Name | Rep_trans_-_Tn5253 | UniProt ID | _ |
Length | 401 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Length: 401 a.a.; Molecular weight: 47398.10 Da; Isoelectric point: 6.5091
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAP
LEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGR
GCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRS
GELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLL
VYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
ID | 14495 | GenBank | ACC59279 |
Name | Relaxase_-_Tn5253 | UniProt ID | _ |
Length | 609 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Length: 609 a.a.; Molecular weight: 73069.47 Da; Isoelectric point: 9.0694
MVITKHFAIHGKNYRSKLIKYILNPSKTKNLTLVSDFGMRNYLDFPSYKELVKMYNDNFLSNDTLYEFRH
DRQEVNQRKIHSHHIIQSFSPDDHLTPEQINRIGYEAAKELTGGRFRFIVATHVDKGHIHNHIILNSIDQ
NSDKKFLWDYKAEHNLRMVSDRLSKIAGAKIIENRYSHRQYEVYRKTNYKYEIKQRVYFLIENSKNFEDL
KKKAKALHLKIDFRHKHVTYFMTDSNMKQVVRDSKLSRKQPYNETYFEKKFVQREIINILEFLLPKMKNM
NELIQRAEVFGLKIIPKEKHVLFEFDGIKLAEQELVKTNLYSVSYFQDYFNNKNETFVLDNKNLVELYNE
EKIIKEKELPSEEMVWKSYQDFKRNRDAVHEFEVELNLNQIEEVVDDGIYIKVQFGIRQEGLIFVPNIQI
NMEEEKVKVFLRETSSYYVYHKDSVEKNRFMKGKTLIRQFNLQYEPQYMYRRIPLSKIKEKIEQLDFLIS
AENSSNDFEDITNDFIAQISYLENMIEQVQNKINDLTNLEEVLLKDTTNSSSNLENSIQGKSSVDTIEKD
LYIYKGKIETLKEQHREAINLFEMFNKTIKKYKEKQNMKSIKENEIHLE
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
Auxiliary protein
ID | 7915 | GenBank | ACC59226 |
Name | ACC59226_Tn5253 | UniProt ID | _ |
Length | 405 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Length: 405 a.a.; Molecular weight: 47021.92 Da; Isoelectric point: 9.7978
MSEKRRDNKGRILKTGESQRKDGRYLYKYIDSFGEPQFVYSWKLVATDRVPAGKRDCISLREKIAELQKD
IHDGIDVVGKKMTLCQLYAKQNAQRPKVRKNTETGRKYLMDILKKDKLGVRSIDSIKPSDAKEWAIRMSE
NGYAYQTINNYKRSLKASFYIAIQDDCVRKNPFDFQLKAVLDDDTVPKTVLTEEQEEKLLAFAKADKTYS
KNYDEILILLKTGLRISEFGGLTLPDLDFENRLVNIDHQLLRDTEIGYYIETPKTKSGERQVPMVEEAYQ
AFKRVLANRKNDKRVEIDGYSDFLFLNRKNYPKVASDYNGMMKGLVKKYNKYNEDKLPHITPHSLRHTFC
TNYANAGMNPKALQYIMGHANIAMTLNYYAHATFDSAMAEMKRLNKEKQQERLVA
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4CP
ID | 17101 | GenBank | ACC59218 |
Name | traG_-_Tn5253 | UniProt ID | _ |
Length | 625 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 625 a.a.; Molecular weight: 72444.76 Da; Isoelectric point: 9.5719
MYSGKKFLLFSLLGILLGYLFHRLTLLYDSYTGNTLDKWTRLLMEGQEEVLQSPWNISFTGKSSAFFLLG
FVMMLLVYLYLETGKKQYREGVEYGSARFGTLKEKNLFYGKEFSHDTILAQDVRLTLLDKKPPQYDRNKN
IAVIGGSGSGKTFRFVKPNLIQMNSSNIVVDPKDHLAEKTGKLFLEHGYQVKVLDLVNMKNSDGFNPFRY
IETENDLNRMLTVYFNNTKGSGSRSDPFWDEASMTLVRALASYLVDFYNPPKTREQLIEESRLSQKEYQN
LLKRQKKEVEERKKRGRYPSFAEISKLIKHLSKGENQEKSVLEILFENYAKKYGTENFTMRNWADFQNYK
DKTLDSVIAVTTAKFALFNIQSVMDLTKRDTLDMKTWGQEKSMVYLVIPDNDSTFRFLSALFFSTVFQTL
TRQADIDFKGQLPLHVRVYLDEFANIGEIPDFAEQTSTVRSRNMSLVPILQNIAQLQGLYKEKEAWKTIL
GNCDSLVYLGGNDEDTFKFMSGLLGKQTIDVRNTSRSFGQTGSGSLSHQKIARDLMTPDEVGNMKRHECL
VRIANMPVFKSKKYNSTKHPNWKYLANQETDERWWNYQINPLNQRQENHLEGLRIRDLTFESSLK
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
ID | 17102 | GenBank | ACC59243 |
Name | tcpA_-_Tn5253 | UniProt ID | _ |
Length | 461 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 461 a.a.; Molecular weight: 53370.27 Da; Isoelectric point: 9.0687
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS components were predicted using oriTfinder 2.0.
Region 1: 3460..47747
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
Locus_0 | 1..353 | + | 353 | ACC59205 | IgA-specific zinc metalloproteinase | - |
Locus_1 | 1070..1243 | + | 174 | ACC59206 | unknown | - |
Locus_2 | 1240..2019 | + | 780 | ACC59207 | RepA | - |
Locus_3 | 2020..2121 | + | 102 | ACC59208 | unknown | - |
Locus_4 | 2118..3476 | + | 1359 | ACC59209 | methyltransferase | - |
Locus_5 | 3460..3909 | + | 450 | ACC59210 | unknown | gbs1369 |
Locus_6 | 3902..4282 | + | 381 | ACC59211 | unknown | - |
Locus_7 | 4295..4528 | + | 234 | ACC59212 | unknown | - |
Locus_8 | 4531..5118 | + | 588 | ACC59213 | putative protease | - |
Locus_9 | 5246..5836 | + | 591 | ACC59214 | abortive infection protein AbiEi | - |
Locus_10 | 5836..6672 | + | 837 | ACC59215 | abortive infection protein AbiEii | - |
Locus_11 | 6955..7254 | + | 300 | ACC59216 | unknown | - |
Locus_12 | 7244..7732 | + | 489 | ACC59217 | unknown | gbs1365 |
Locus_13 | 7732..9609 | + | 1878 | ACC59218 | TraG protein | virb4 |
Locus_14 | 9630..9872 | + | 243 | ACC59219 | unknown | prgF |
Locus_15 | 9889..10743 | + | 855 | ACC59220 | unknown | prgHb |
Locus_16 | 10797..11156 | + | 360 | ACC59221 | unknown | prgIc |
Locus_17 | 11107..13464 | + | 2358 | ACC59222 | type IV secretion protein VirB4 | virb4 |
Locus_18 | 13476..16289 | + | 2814 | ACC59223 | putative peptidoglycan hydrolase | prgK |
Locus_19 | 16341..16469 | + | 129 | ACC59224 | unknown | - |
Locus_20 | 16591..16785 | - | 195 | ACC59225 | truncated ISSth5 transposase | - |
Locus_21 | 17306..18523 | - | 1218 | ACC59226 | integrase | - |
Locus_22 | 18605..18808 | - | 204 | ACC59227 | excisionase | - |
Locus_23 | 18792..19043 | + | 252 | ACC59228 | unknown | - |
Locus_24 | 19269..19499 | - | 231 | ACC59229 | unknown | - |
Locus_25 | 19496..19918 | - | 423 | ACC59230 | putative sigma factor | - |
Locus_26 | 20147..20293 | - | 147 | ACC59231 | unknown | - |
Locus_27 | 20423..20776 | + | 354 | ACC59232 | putative transcriptional regulator | - |
Locus_28 | 20836..21024 | - | 189 | ACC59233 | unknown | - |
Locus_29 | 21122..23041 | - | 1920 | ACC59234 | tetracycline resistance protein Tet(M) | - |
Locus_30 | 23057..23143 | - | 87 | ADB27150 | tet(M) leader peptide | - |
Locus_31 | 23418..24350 | - | 933 | ACC59235 | unknown | orf13 |
Locus_32 | 24347..25348 | - | 1002 | ACC59236 | putative cell wall hydrolase | orf14 |
Locus_33 | 25345..27522 | - | 2178 | ACC59237 | unknown | orf15 |
Locus_34 | 27525..29972 | - | 2448 | ACC59238 | type IV secretion protein VirB4 | virb4 |
Locus_35 | 29956..30462 | - | 507 | ACC59239 | unknown | orf17a |
Locus_36 | 30437..30934 | - | 498 | ACC59240 | antirestriction protein ArdA | - |
Locus_37 | 31051..31272 | - | 222 | ACC59241 | unknown | orf19 |
Locus_38 | 31315..32520 | - | 1206 | ACC59242 | relaxase | - |
Locus_39 | 32698..34083 | - | 1386 | ACC59243 | putative FtsK-SpoIIIE family protein | virb4 |
Locus_40 | 34112..34495 | - | 384 | ACC59244 | unknown | orf23 |
Locus_41 | 34514..34828 | - | 315 | ACC59245 | unknown | orf23 |
Locus_42 | 34851..34970 | - | 120 | ACC59246 | unknown | - |
Locus_43 | 35267..35998 | + | 732 | ACC59247 | putative ATP-binding protein | - |
Locus_44 | 36016..37332 | + | 1317 | ACC59248 | putative permease protein | - |
Locus_45 | 37388..37492 | - | 105 | ACC59249 | unknown | - |
Locus_46 | 37763..38173 | + | 411 | ACC59250 | unknown | cd424 |
Locus_47 | 38221..44307 | + | 6087 | ACC59251 | putative restriction-modification protein | - |
Locus_48 | 44382..44666 | + | 285 | ACC59252 | unknown | - |
Locus_49 | 44680..44976 | + | 297 | ACC59253 | unknown | gbs1350 |
Locus_50 | 44990..45187 | - | 198 | ACC59254 | unknown | - |
Locus_51 | 45347..45988 | + | 642 | ACC59255 | unknown | prgL |
Locus_52 | 45998..47083 | + | 1086 | ACC59256 | unknown | traP |
Locus_53 | 47134..47367 | + | 234 | ACC59257 | unknown | gbs1347 |
Locus_54 | 47364..47747 | + | 384 | ACC59258 | unknown | gbs1346 |
Locus_55 | 47785..48057 | + | 273 | ACC59259 | unknown | - |
Locus_56 | 48125..48601 | + | 477 | ACC59260 | antitoxin PezA | - |
Locus_57 | 48601..49359 | + | 759 | ACC59261 | toxin PezT | - |
Locus_58 | 49835..50623 | + | 789 | ACC59262 | unknown | - |
Locus_59 | 50862..51110 | + | 249 | ACC59263 | unknown | - |
Locus_60 | 51281..51931 | + | 651 | ACC59264 | chloramphenicol acetyl transferase CAT | - |
Locus_61 | 52236..52325 | - | 90 | ACC59265 | unknown | - |
Locus_62 | 52442..52666 | - | 225 | ACC59266 | unknown | - |
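The table can be cross-checked against the protein records listed earlier. The sketch below is illustrative only and uses a handful of rows copied from the table: it verifies that each Size equals end - start + 1, that the gene sizes agree with the reported protein lengths, and that the selected loci span Region 1. The names `parse_row` and `PROTEIN_LENGTHS` are hypothetical, and the length check assumes each gene size includes one stop codon.

```python
# A few rows copied verbatim from the table above.
ROWS = """\
Locus_5 | 3460..3909 | + | 450 | ACC59210 | unknown | gbs1369 |
Locus_13 | 7732..9609 | + | 1878 | ACC59218 | TraG protein | virb4 |
Locus_21 | 17306..18523 | - | 1218 | ACC59226 | integrase | - |
Locus_38 | 31315..32520 | - | 1206 | ACC59242 | relaxase | - |
Locus_39 | 32698..34083 | - | 1386 | ACC59243 | putative FtsK-SpoIIIE family protein | virb4 |
Locus_54 | 47364..47747 | + | 384 | ACC59258 | unknown | gbs1346 |
"""

# Protein lengths (a.a.) reported in the protein sections of the record.
PROTEIN_LENGTHS = {"ACC59218": 625, "ACC59226": 405, "ACC59242": 401, "ACC59243": 461}


def parse_row(line: str) -> dict:
    """Split one pipe-delimited table row into typed fields."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    locus, coords, strand, size, protein_id, product, description = cells
    start, end = (int(x) for x in coords.split(".."))
    return {
        "locus": locus, "start": start, "end": end, "strand": strand,
        "size": int(size), "protein_id": protein_id,
        "product": product, "description": description,
    }


loci = [parse_row(line) for line in ROWS.splitlines() if line.strip()]

# Check 1: Size (bp) should equal the inclusive coordinate span.
size_ok = all(r["size"] == r["end"] - r["start"] + 1 for r in loci)

# Check 2: gene size / 3 minus one stop codon (assumption) = protein length.
length_ok = all(
    r["size"] // 3 - 1 == PROTEIN_LENGTHS[r["protein_id"]]
    for r in loci
    if r["protein_id"] in PROTEIN_LENGTHS
)

# Check 3: the selected loci span Region 1 (3460..47747).
region_span = (min(r["start"] for r in loci), max(r["end"] for r in loci))

# Loci whose coordinates fully contain the oriT (31699..31831).
orit_hosts = [r["locus"] for r in loci if r["start"] <= 31699 and r["end"] >= 31831]

print(size_ok, length_ok, region_span, orit_hosts)
```

By these coordinates the oriT falls within the span of Locus_38, the locus annotated as relaxase (ACC59242).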
Host bacterium
ID | 413 | Element type | ICE (Integrative and conjugative element) |
Element name | Tn5253 | GenBank | EU351020 |
Element size | 66192 bp | Coordinate of oriT [Strand] | 31699..31831 [-] |
Host bacterium | Streptococcus pneumoniae strain DP1322 | Coordinate of element | 833..65360 |
Cargo genes
Drug resistance gene | tet(M), cat(pC194) |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | AcrIIA21 |