Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 200042 |
Name | oriT_ICESpnCGSP14-1 |
Organism | Streptococcus pneumoniae CGSP14 |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | CP001033 (162035..162249 [+], 215 nt) |
oriT length | 215 nt |
IRs (inverted repeats) | 61..76, 80..95 (ACTTAACCCCCCGTAT..ACAGGGGGGTACAAAT) 123..134, 139..150 (GAAAATCCTTTG..CAAGGGATTTAC) |
Location of nic site | 135..136 |
Conserved sequence flanking the nic site |
TGG|T |
Note | blastn alignment with the oriT_Tn916 |
oriT sequence
Download Length: 215 nt
>oriT_ICESpnCGSP14-1
AAGCGGAAGTCGCAGGTGTGGACTGATCTTGCTGGCTGGTGTGGCAATAGCCACGCCAGCACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATACGCCTTTTTGATTGGAGGGATTT
AAGCGGAAGTCGCAGGTGTGGACTGATCTTGCTGGCTGGTGTGGCAATAGCCACGCCAGCACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATACGCCTTTTTGATTGGAGGGATTT
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 256 | GenBank | ACB89414 |
Name | Relaxase_ICESpnCGSP14-1 | UniProt ID | B2IRY4 |
Length | 401 a.a. | PDB ID | |
Note | Tn916, transcriptional regulator, putative |
Relaxase protein sequence
Download Length: 401 a.a. Molecular weight: 47398.10 Da Isoelectric Point: 6.5091
>ACB89414.1 Tn916, transcriptional regulator, putative [Streptococcus pneumoniae CGSP14]
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAP
LEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGR
GCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRS
GELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLL
VYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAP
LEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGR
GCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRS
GELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLL
VYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | B2IRY4 |
T4CP
ID | 256 | GenBank | ACB89413 |
Name | T4CP_ICESpnCGSP14-1 | UniProt ID | B2IQE9 |
Length | 461 a.a. | PDB ID | _ |
Note | _ |
T4CP protein sequence
Download Length: 461 a.a. Molecular weight: 53370.27 Da Isoelectric Point: 9.0687
>ACB89413.1 hypothetical protein SPCG_0161 [Streptococcus pneumoniae CGSP14]
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
Protein domains
Predicted by InterproScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | B2IQE9 |
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 159930..171343
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
SPCG_0154 | 155596..156969 | + | 1374 | ACB89406 | hypothetical protein | - |
SPCG_0155 | 156962..158023 | + | 1062 | ACB89407 | ABC transporter, ATP-binding protein | - |
SPCG_0156 | 158025..158717 | + | 693 | ACB89408 | ABC transporter, permease protein, putative | - |
SPCG_0157 | 158833..159606 | + | 774 | ACB89409 | transcriptional activator, Rgg/GadR/MutR family protein, C-terminal domain | - |
SPCG_0158 | 159788..159907 | + | 120 | ACB89410 | hypothetical protein | - |
SPCG_0159 | 159930..160244 | + | 315 | ACB89411 | hypothetical protein | orf23 |
SPCG_0160 | 160260..160646 | + | 387 | ACB89412 | hypothetical protein | orf23 |
SPCG_0161 | 160675..162060 | + | 1386 | ACB89413 | hypothetical protein | virb4 |
SPCG_0162 | 162238..163443 | + | 1206 | ACB89414 | Tn916, transcriptional regulator, putative | - |
SPCG_0163 | 163486..163707 | + | 222 | ACB89415 | hypothetical protein | orf19 |
SPCG_0164 | 163824..164321 | + | 498 | ACB89416 | hypothetical protein | - |
SPCG_0165 | 164296..164802 | + | 507 | ACB89417 | hypothetical protein | orf17a |
SPCG_0166 | 164735..167233 | + | 2499 | ACB89418 | hypothetical protein | virb4 |
SPCG_0167 | 167236..169413 | + | 2178 | ACB89419 | hypothetical protein | orf15 |
SPCG_0168 | 169410..170411 | + | 1002 | ACB89420 | hypothetical protein | orf14 |
SPCG_0169 | 170408..171343 | + | 936 | ACB89421 | hypothetical protein | orf13 |
SPCG_0170 | 171618..171704 | + | 87 | ACB89422 | tet(M) leader peptide | - |
SPCG_0171 | 171705..173639 | + | 1935 | ACB89423 | tetracycline resistance protein TetM | - |
SPCG_0172 | 174564..175301 | + | 738 | ACB89424 | rRNA methylase | - |
SPCG_0173 | 175656..176210 | + | 555 | ACB89425 | resolvase | - |
Region 2: 1284145..1308821
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
SPCG_1292 | 1279959..1280609 | - | 651 | ACB90544 | chloramphenicol acetyltransferase | - |
SPCG_1293 | 1280780..1281028 | - | 249 | ACB90545 | hypothetical protein | - |
SPCG_1294 | 1281267..1282055 | - | 789 | ACB90546 | hypothetical protein | - |
SPCG_1295 | 1282531..1283289 | - | 759 | ACB90547 | hypothetical protein | - |
SPCG_1296 | 1283289..1283765 | - | 477 | ACB90548 | transcriptional regulator, putative | - |
SPCG_1297 | 1283835..1284107 | - | 273 | ACB90549 | hypothetical protein | - |
SPCG_1298 | 1284145..1284528 | - | 384 | ACB90550 | hypothetical protein | gbs1346 |
SPCG_1299 | 1284525..1284758 | - | 234 | ACB90551 | hypothetical protein | gbs1347 |
SPCG_1300 | 1284809..1285894 | - | 1086 | ACB90552 | hypothetical protein | traP |
SPCG_1301 | 1285904..1286545 | - | 642 | ACB90553 | parvulin-like peptidyl-prolyl isomerase | prgL |
SPCG_1302 | 1286705..1286902 | + | 198 | ACB90554 | hypothetical protein | - |
SPCG_1303 | 1286916..1287233 | - | 318 | ACB90555 | hypothetical protein | gbs1350 |
SPCG_1304 | 1287226..1287510 | - | 285 | ACB90556 | hypothetical protein | - |
SPCG_1305 | 1287585..1293818 | - | 6234 | ACB90557 | SNF2 family protein | - |
SPCG_1306 | 1293851..1294261 | - | 411 | ACB90558 | hypothetical protein | cd424 |
SPCG_1307 | 1294413..1295108 | - | 696 | ACB90559 | hypothetical protein | - |
SPCG_1308 | 1295217..1298030 | - | 2814 | ACB90560 | hypothetical protein | prgK |
SPCG_1309 | 1298042..1300357 | - | 2316 | ACB90561 | hypothetical protein | virb4 |
SPCG_1310 | 1300347..1300709 | - | 363 | ACB90562 | hypothetical protein | prgIc |
SPCG_1311 | 1300763..1301617 | - | 855 | ACB90563 | hypothetical protein | prgHb |
SPCG_1312 | 1301634..1301876 | - | 243 | ACB90564 | hypothetical protein | prgF |
SPCG_1313 | 1301897..1303777 | - | 1881 | ACB90565 | hypothetical protein | virb4 |
SPCG_1314 | 1303774..1304283 | - | 510 | ACB90566 | hypothetical protein | gbs1365 |
SPCG_1315 | 1304252..1304551 | - | 300 | ACB90567 | hypothetical protein | - |
SPCG_1316 | 1304834..1305670 | - | 837 | ACB90568 | hypothetical protein | - |
SPCG_1317 | 1305670..1306314 | - | 645 | ACB90569 | hypothetical protein | - |
SPCG_1318 | 1306549..1306668 | + | 120 | ACB90570 | hypothetical protein | - |
SPCG_1319 | 1306691..1307005 | + | 315 | ACB90571 | hypothetical protein | orf23 |
SPCG_1320 | 1307021..1307407 | + | 387 | ACB90572 | hypothetical protein | orf23 |
SPCG_1321 | 1307436..1308821 | + | 1386 | ACB90573 | hypothetical protein | virb4 |
SPCG_1322 | 1308999..1310417 | + | 1419 | ACB90574 | hypothetical protein | - |
SPCG_1323 | 1310674..1311459 | + | 786 | ACB90575 | erythromycin ribosome methylase | - |
SPCG_1324 | 1311797..1312339 | + | 543 | ACB90576 | streptothricin acetyltransferase | - |
SPCG_1325 | 1312432..1313226 | + | 795 | ACB90577 | aminoglycoside phosphotransferase type III | - |
Host bacterium
ID | 47 | Element type | Transposon |
Element name | ICESpnCGSP14-1 | GenBank | CP001033 |
Element size | 2209198 bp | Coordinate of oriT [Strand] | 162035..162249 [+] |
Host bacterium | Streptococcus pneumoniae CGSP14 | Coordinate of element | 159595..182718 |
Cargo genes
Drug resistance gene | tet(M), erm(B), cat(pC194), aph(3')-III |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | AcrIIA21 |