Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 200036 |
Name | oriT_ICESag2603VR-1 |
Organism | Streptococcus agalactiae 2603V/R |
Sequence Completeness | intact |
NCBI accession of oriT (coordinates [strand]) | AE009948 (938841..939055 [-], 215 nt) |
oriT length | 215 nt |
IRs (inverted repeats) | 61..76, 80..95 (ACTTAACCCCCCGTAT..ACAGGGGGGTACAAAT); 123..134, 139..150 (GAAAATCCTTTG..CAAGGGATTTAC) |
Location of nic site | 135..136 |
Conserved sequence flanking the nic site | TGG|T |
Note | blastn alignment with oriT_Tn916 |
oriT sequence
Length: 215 nt
>oriT_ICESag2603VR-1
AAGCGGAAGTCGCAGGTGTGGACTGATCTTGCTGGCTGGTGTGGCAATAGCCACGCCAGCACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATACGCCTTTTTGATTGGAGGGATTT
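The annotated positions can be cross-checked against the sequence itself. The following is a minimal sketch, not part of the oriTDB record: it slices the inverted repeats and the nic-site context out of the 215 nt sequence using 1-based inclusive coordinates, counts how many positions of each left arm match the reverse complement of the right arm, and maps oriT positions back to AE009948 coordinates. The helper names (`region`, `revcomp`, `arm_matches`, `to_genome`) are hypothetical, and the genome mapping assumes that position 1 of the listed minus-strand sequence corresponds to genome coordinate 939055.

```python
# oriT sequence copied from this record (215 nt, minus strand of AE009948).
ORIT = (
    "AAGCGGAAGTCGCAGGTGTGGACTGATCTTGCTGGCTGGTGTGGCAATAGCCACGCCAGC"
    "ACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACAT"
    "TAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTA"
    "AAAGTTGACATACGCCTTTTTGATTGGAGGGATTT"
)
GENOME_START, GENOME_END = 938841, 939055  # coordinates from the record, strand "-"


def region(seq, start, end):
    """Return seq[start..end] using 1-based inclusive coordinates."""
    return seq[start - 1:end]


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def arm_matches(left, right):
    """Count positions where the left arm equals the reverse complement of the right arm."""
    return sum(a == b for a, b in zip(left, revcomp(right)))


def to_genome(pos):
    """Map a 1-based oriT position to a genome coordinate.

    Assumption: the listed sequence reads along the minus strand,
    so oriT position 1 corresponds to GENOME_END.
    """
    return GENOME_END - pos + 1


ir1 = (region(ORIT, 61, 76), region(ORIT, 80, 95))
ir2 = (region(ORIT, 123, 134), region(ORIT, 139, 150))
# Nic site lies between oriT positions 135 and 136.
nic_context = region(ORIT, 133, 135) + "|" + region(ORIT, 136, 136)

print("length:", len(ORIT), "expected:", GENOME_END - GENOME_START + 1)
print("IR1:", ir1, "matching positions:", arm_matches(*ir1), "of", len(ir1[0]))
print("IR2:", ir2, "matching positions:", arm_matches(*ir2), "of", len(ir2[0]))
print("nic context:", nic_context)
print("nic site in genome coordinates:", to_genome(135), to_genome(136))
```

The arms are compared by exact position only, so the counts indicate that both repeats are imperfect; a structure predictor such as RNAfold would also consider bulges and G-T pairing, which this sketch ignores.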
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 250 | GenBank | AAM99817 |
Name | Relaxase_ICESag2603VR-1 | UniProt ID | Q8E007 |
Length | 401 a.a. | PDB ID | - |
Note | Tn916, transcriptional regulator, putative |
Relaxase protein sequence
Length: 401 a.a. | Molecular weight: 47398.10 Da | Isoelectric point: 6.5091
>AAM99817.1 Tn916, transcriptional regulator, putative [Streptococcus agalactiae 2603V/R]
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAP
LEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGR
GCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRS
GELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLL
VYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK
Protein domains
Predicted by InterProScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | Q8E007 |
T4CP
ID | 250 | GenBank | AAM99818 |
Name | T4CP_ICESag2603VR-1 | UniProt ID | Q8CX12 |
Length | 461 a.a. | PDB ID | - |
Note | Tn916, FtsK/SpoIIIE family protein |
T4CP protein sequence
Length: 461 a.a. | Molecular weight: 53370.27 Da | Isoelectric point: 9.0687
>AAM99818.1 Tn916, FtsK/SpoIIIE family protein [Streptococcus agalactiae 2603V/R]
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
Protein domains
Predicted by InterProScan.
Protein structure
Source | ID | Structure |
---|---|---|
AlphaFold DB | Q8CX12 |
T4SS
T4SS regions were predicted using oriTfinder2.
Region 1: 929751..941160
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
SAG0916 | 924938..925141 | - | 204 | AAM99802 | Tn916, excisionase | - |
SAG0917 | 925125..925376 | + | 252 | AAM99803 | Tn916, hypothetical protein | - |
SAG0918 | 925602..925832 | - | 231 | AAM99804 | Tn916, hypothetical protein | - |
SAG0919 | 925829..926302 | - | 474 | AAM99805 | Tn916, hypothetical protein | - |
SAG0920 | 926480..926551 | - | 72 | AAM99806 | Tn916, hypothetical protein | - |
SAG0921 | 926756..927109 | + | 354 | AAM99807 | Tn916, transcriptional regulator, putative | - |
SAG0922 | 927169..927354 | - | 186 | AAM99808 | Tn916, hypothetical protein | - |
SAG0923 | 927455..929374 | - | 1920 | AAM99809 | Tn916, tetracycline resistance protein | - |
SAG0924 | 929390..929476 | - | 87 | AAM99810 | Tn916, tetM leader peptide | - |
SAG0925 | 929751..930683 | - | 933 | AAM99811 | Tn916, hypothetical protein | orf13 |
SAG0926 | 930680..931681 | - | 1002 | AAM99812 | Tn916, NLP/P60 family protein | orf14 |
SAG0927 | 931678..933855 | - | 2178 | AAM99813 | membrane protein, putative | orf15 |
SAG0929 | 936288..936794 | - | 507 | AAM99814 | Tn916, hypothetical protein | orf17a |
SAG0930 | 936769..937266 | - | 498 | AAM99815 | Tn916, hypothetical protein | - |
SAG0931 | 937383..937604 | - | 222 | AAM99816 | Tn916, hypothetical protein | orf19 |
SAG0932 | 937647..938852 | - | 1206 | AAM99817 | Tn916, transcriptional regulator, putative | - |
SAG0933 | 939030..940415 | - | 1386 | AAM99818 | Tn916, FtsK/SpoIIIE family protein | virB4 |
SAG0934 | 940444..940830 | - | 387 | AAM99819 | Tn916, hypothetical protein | orf23 |
SAG0935 | 940846..941160 | - | 315 | AAM99820 | Tn916, hypothetical protein | orf23 |
SAG0936 | 941183..941302 | - | 120 | AAM99821 | Tn916, hypothetical protein | - |
SAG0938 | 943034..943402 | - | 369 | AAM99822 | transcriptional regulator, GntR family | - |
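The table lists flanking genes as well as those inside the predicted region, so it can help to check which loci actually fall within Region 1 and whether each reported size agrees with its coordinates. The sketch below is illustrative only: it parses a handful of rows copied from the table above, and the helper names (`parse_row`, `within`) are hypothetical rather than part of oriTfinder2 or oriTDB.

```python
# A few rows copied verbatim from the Region 1 table in this record.
ROWS = """\
SAG0924 | 929390..929476 | - | 87 | AAM99810 | Tn916, tetM leader peptide | - |
SAG0925 | 929751..930683 | - | 933 | AAM99811 | Tn916, hypothetical protein | orf13 |
SAG0932 | 937647..938852 | - | 1206 | AAM99817 | Tn916, transcriptional regulator, putative | - |
SAG0934 | 940444..940830 | - | 387 | AAM99819 | Tn916, hypothetical protein | orf23 |
SAG0935 | 940846..941160 | - | 315 | AAM99820 | Tn916, hypothetical protein | orf23 |
SAG0936 | 941183..941302 | - | 120 | AAM99821 | Tn916, hypothetical protein | - |
"""

REGION_1 = (929751, 941160)  # predicted T4SS region from the record
ORIT = (938841, 939055)      # oriT coordinates from the record


def parse_row(line):
    """Split one pipe-delimited table row into a small dict."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    start, end = (int(x) for x in cells[1].split(".."))
    return {
        "locus": cells[0],
        "start": start,
        "end": end,
        "strand": cells[2],
        "size": int(cells[3]),
        "product": cells[5],
    }


def within(inner, outer):
    """True if the inner interval lies entirely inside the outer interval."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


genes = [parse_row(line) for line in ROWS.splitlines() if line.strip()]

# Reported size should equal end - start + 1 for inclusive coordinates.
size_ok = all(g["size"] == g["end"] - g["start"] + 1 for g in genes)

in_region = [g["locus"] for g in genes if within((g["start"], g["end"]), REGION_1)]

print("sizes consistent:", size_ok)
print("loci inside Region 1:", in_region)
print("oriT inside Region 1:", within(ORIT, REGION_1))
```

The same interval test can be applied to Region 2 by swapping in its coordinates and rows.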
Region 2: 1280548..1308722
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
SAG1266 | 1276299..1276757 | + | 459 | AAN00139 | hypothetical protein | - |
SAG1267 | 1276791..1277117 | + | 327 | AAN00140 | hypothetical protein | - |
SAG1268 | 1277301..1277993 | + | 693 | AAN00141 | repressor protein, putative | - |
SAG1269 | 1277996..1278130 | + | 135 | AAN00142 | hypothetical protein | - |
SAG1270 | 1278136..1279551 | + | 1416 | AAN00143 | ImpB/MucB/SamB family protein | - |
SAG1271 | 1279548..1279898 | + | 351 | AAN00144 | conserved hypothetical protein | - |
SAG1272 | 1279885..1280193 | + | 309 | AAN00145 | conserved hypothetical protein | - |
SAG1273 | 1280202..1280558 | - | 357 | AAN00146 | conserved hypothetical protein | - |
SAG1274 | 1280548..1280937 | - | 390 | AAN00147 | conserved hypothetical protein | gbs1346 |
SAG1275 | 1280934..1281161 | - | 228 | AAN00148 | hypothetical protein | gbs1347 |
SAG1276 | 1281215..1282291 | - | 1077 | AAN00149 | conserved hypothetical protein | - |
SAG1277 | 1282330..1282821 | - | 492 | AAN00150 | hypothetical protein | prgL |
SAG1278 | 1283064..1283354 | - | 291 | AAN00151 | hypothetical protein | gbs1350 |
SAG1279 | 1283368..1283667 | - | 300 | AAN00152 | conserved domain protein | - |
SAG1280 | 1283738..1290562 | - | 6825 | AAN00153 | SNF2 family protein | - |
SAG1281 | 1290611..1291162 | - | 552 | AAN00154 | hypothetical protein | gbs1354 |
SAG1282 | 1291146..1291337 | - | 192 | AAN00155 | calcium-binding protein, putative | - |
SAG1283 | 1291338..1296233 | - | 4896 | AAN00156 | agglutinin receptor | prgB |
SAG1284 | 1296512..1297102 | + | 591 | AAN00157 | abortive infection protein AbiGI | - |
SAG1285 | 1297099..1297944 | + | 846 | AAN00158 | abortive infection protein AbiGII | - |
SAG1286 | 1298030..1300831 | - | 2802 | AAN00159 | Tn5252, Orf28 | prgK |
SAG1287 | 1300833..1303163 | - | 2331 | AAN00160 | Tn5252, Orf26 | virB4 |
SAG1289 | 1303556..1304410 | - | 855 | AAN00161 | Tn5252, Orf23 | prgHb |
SAG1290 | 1304429..1304671 | - | 243 | AAN00162 | hypothetical protein | prgF |
SAG1292 | 1306506..1306994 | - | 489 | AAN00163 | hypothetical protein | gbs1365 |
SAG1293 | 1307071..1307655 | - | 585 | AAN00164 | protease, putative | - |
SAG1294 | 1307658..1307891 | - | 234 | AAN00165 | conserved hypothetical protein | - |
SAG1295 | 1307900..1308283 | - | 384 | AAN00166 | conserved hypothetical protein | - |
SAG1296 | 1308294..1308722 | - | 429 | AAN00167 | conserved hypothetical protein | gbs1369 |
SAG1297 | 1308706..1310061 | - | 1356 | AAN00168 | C-5 cytosine-specific DNA methylase | - |
SAG1298 | 1310113..1310208 | - | 96 | AAN00169 | hypothetical protein | - |
SAG1299 | 1310210..1311028 | - | 819 | AAN00170 | conserved hypothetical protein | - |
SAG1300 | 1311025..1311198 | - | 174 | AAN00171 | conserved hypothetical protein | - |
SAG1301 | 1311397..1311762 | - | 366 | AAN00172 | ribosomal protein L7/L12 | - |
SAG1302 | 1311826..1312326 | - | 501 | AAN00173 | ribosomal protein L10 | - |
Host bacterium
ID | 41 | Element type | Transposon |
Element name | ICESag2603VR-1 | GenBank | AE009948 |
Element size | 2160267 bp | Coordinate of oriT [Strand] | 938841..939055 [-] |
Host bacterium | Streptococcus agalactiae 2603V/R | Coordinate of element | 923465..941495 |
Cargo genes
Drug resistance gene | tet(M) |
Virulence gene | - |
Metal resistance gene | cadC, tcrB |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | AcrIIA21, AcrIIA8 |