Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 200471 |
Name | oriT_ICESdy12394-1 |
Organism | Streptococcus dysgalactiae subsp. equisimilis ATCC 12394 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | CP002215 (35063..35103 [+], 41 nt) |
oriT length | 41 nt |
IRs (inverted repeats) | _ |
Location of nic site | _ |
Conserved sequence flanking the nic site |
_ |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 41 nt
>oriT_ICESdy12394-1
CACATTTACGAAGTAAAGTATAATGGGTTATACTTTACATG
CACATTTACGAAGTAAAGTATAATGGGTTATACTTTACATG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 14501 | GenBank | ADX24459 |
Name | Relaxase_SDE12394_04825_ICESdy12394-1 | UniProt ID | _ |
Length | 621 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 621 a.a. Molecular weight: 74154.75 Da Isoelectric Point: 7.3028
>ADX24459.1 Tn5252, relaxase [Streptococcus dysgalactiae subsp. equisimilis ATCC 12394]
MVITKHYAVHGKKYRRQLIKYILDPKKTRNLSLISDFGMSNYLDFPDYVELVKMYQNNFLSNDQLYDSRF
DRQEKKQQKIHAHHIIQSFSPEDKLSPEEINRIGYETIKELIGGQYKFIVATHVDQDHCHNHIIINSINS
QSQKKLKWDYALERNLQMISDRISKVAGAKIIPPKRYSHRDYEVYRRSNHKYELKQRLFFLMEHSIDFND
FMQKAEQLNVKIDFSRKHSRFFMTDRNMKQVIQGDKLNKREPYSKEYFQRYFAKKKIELILEFLLLRSNS
FDDLVEKARLLGLELKSKKKTIDFVLSDGKSCISIPNKSLRKKNLYDTTYFDSYFKEHDVFEVLHNNEVK
IEFEKFETQQLSEILTVEEITEAYETYKTKRDAVHEFEVEITEEQIEKIVLDGLFVKVWMGIGQEGLIFI
PNHQLNILEQENKKQYQVFIRETSSYFIYHKEDSEMNRFMKGRDLIRQLTFDNKSLPYKRRISLVSLQQK
IEEINLLMTLNIQNKSFLELKDELVGDIAQLDIELTNLQDKNTTLNKMAEVVVNLQSDNQDTKQLAKYEC
SKMNLSQNVTIGQIESEIEMIQNQLDNKIEEYENAVRKLDEYVRVLNMDKYKTDDFSIHIE
MVITKHYAVHGKKYRRQLIKYILDPKKTRNLSLISDFGMSNYLDFPDYVELVKMYQNNFLSNDQLYDSRF
DRQEKKQQKIHAHHIIQSFSPEDKLSPEEINRIGYETIKELIGGQYKFIVATHVDQDHCHNHIIINSINS
QSQKKLKWDYALERNLQMISDRISKVAGAKIIPPKRYSHRDYEVYRRSNHKYELKQRLFFLMEHSIDFND
FMQKAEQLNVKIDFSRKHSRFFMTDRNMKQVIQGDKLNKREPYSKEYFQRYFAKKKIELILEFLLLRSNS
FDDLVEKARLLGLELKSKKKTIDFVLSDGKSCISIPNKSLRKKNLYDTTYFDSYFKEHDVFEVLHNNEVK
IEFEKFETQQLSEILTVEEITEAYETYKTKRDAVHEFEVEITEEQIEKIVLDGLFVKVWMGIGQEGLIFI
PNHQLNILEQENKKQYQVFIRETSSYFIYHKEDSEMNRFMKGRDLIRQLTFDNKSLPYKRRISLVSLQQK
IEEINLLMTLNIQNKSFLELKDELVGDIAQLDIELTNLQDKNTTLNKMAEVVVNLQSDNQDTKQLAKYEC
SKMNLSQNVTIGQIESEIEMIQNQLDNKIEEYENAVRKLDEYVRVLNMDKYKTDDFSIHIE
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4CP
ID | 17111 | GenBank | ADX24420 |
Name | t4cp2_SDE12394_04610_ICESdy12394-1 | UniProt ID | _ |
Length | 605 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 605 a.a. Molecular weight: 69440.21 Da Isoelectric Point: 9.0612
>ADX24420.1 Type IV secretory pathway, VirD4 component [Streptococcus dysgalactiae subsp. equisimilis ATCC 12394]
MYSRQKAFVFGLLGLAFGYFCHRLTLLYDSLTNAPPMERIAYLLGEGLNQVFNPLWLFSFTQKSFLAFIL
GVLMMTLVYLYVSTGQKVYREGEEYGSARFGTSKEKRNFYSKNPFNDTILARDVRLTLLEKKKPQFDRNK
NLIVIGGSGAGKTFRFVKPNLIQLNCSNIVVDPKDHLAEKTGKLFLENGYQVKVLDLVNMTNSDGFNPFR
YVETENDLNRMLTVYFNNTKGNGSRSDPFWDEASMTLVRAIASYLVDFYNPPGSSKQEQEARRKRGRYPA
FSEIGKLIKLLSKGDNQDKSILEVLFEDYAKKYGHENFTMRNWADFQNYKDKTLDSVIAVTTAKFALFNI
QSVIDLTQRDTMDLKTWGTQKTMVYLVIPDNDTTFRFLSALFFSTVFSTLTRQADVDFKGQLPIHVRSYL
DEFANVGEIPDFAEQTSTVRSRNMSLVPILQNIAQLQGLYKEKEAWKTILGNCDSLLYLGGNDEETFKFM
SGLLGKQTVDVRSTSRSFGQTGSSSTSHQKIARDLMTADEVGTMKRDECLVRIAGVPVFRTKKYFPLKHK
HWKLLADKETDDRWWNYHINPLAKEEELDLSDYQIRDLSTETSLH
MYSRQKAFVFGLLGLAFGYFCHRLTLLYDSLTNAPPMERIAYLLGEGLNQVFNPLWLFSFTQKSFLAFIL
GVLMMTLVYLYVSTGQKVYREGEEYGSARFGTSKEKRNFYSKNPFNDTILARDVRLTLLEKKKPQFDRNK
NLIVIGGSGAGKTFRFVKPNLIQLNCSNIVVDPKDHLAEKTGKLFLENGYQVKVLDLVNMTNSDGFNPFR
YVETENDLNRMLTVYFNNTKGNGSRSDPFWDEASMTLVRAIASYLVDFYNPPGSSKQEQEARRKRGRYPA
FSEIGKLIKLLSKGDNQDKSILEVLFEDYAKKYGHENFTMRNWADFQNYKDKTLDSVIAVTTAKFALFNI
QSVIDLTQRDTMDLKTWGTQKTMVYLVIPDNDTTFRFLSALFFSTVFSTLTRQADVDFKGQLPIHVRSYL
DEFANVGEIPDFAEQTSTVRSRNMSLVPILQNIAQLQGLYKEKEAWKTILGNCDSLLYLGGNDEETFKFM
SGLLGKQTVDVRSTSRSFGQTGSSSTSHQKIARDLMTADEVGTMKRDECLVRIAGVPVFRTKKYFPLKHK
HWKLLADKETDDRWWNYHINPLAKEEELDLSDYQIRDLSTETSLH
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 888418..916593
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
SDE12394_04555 | 884816..885313 | + | 498 | ADX24409 | 50S ribosomal protein L10 | - |
SDE12394_04560 | 885378..885743 | + | 366 | ADX24410 | 50S ribosomal protein L7/L12 | - |
SDE12394_04565 | 885942..886115 | + | 174 | ADX24411 | hypothetical protein | - |
SDE12394_04570 | 886112..886930 | + | 819 | ADX24412 | hypothetical protein | - |
SDE12394_04575 | 886932..887027 | + | 96 | ADX24413 | hypothetical protein | - |
SDE12394_04580 | 887079..888434 | + | 1356 | ADX24414 | C-5 cytosine-specific DNA methylase | - |
SDE12394_04585 | 888418..888846 | + | 429 | ADX24415 | hypothetical protein | gbs1369 |
SDE12394_04590 | 888857..889240 | + | 384 | ADX24416 | hypothetical protein | - |
SDE12394_04595 | 889249..889482 | + | 234 | ADX24417 | hypothetical protein | - |
SDE12394_04600 | 889581..890069 | + | 489 | ADX24418 | protease, putative | - |
SDE12394_04605 | 890146..890634 | + | 489 | ADX24419 | hypothetical protein | gbs1365 |
SDE12394_04610 | 890634..892451 | + | 1818 | ADX24420 | Type IV secretory pathway, VirD4 component | virb4 |
SDE12394_04615 | 892469..892711 | + | 243 | ADX24421 | hypothetical protein | prgF |
SDE12394_04620 | 892730..893584 | + | 855 | ADX24422 | hypothetical protein | prgHb |
SDE12394_04625 | 893646..893999 | + | 354 | ADX24423 | hypothetical protein | prgIc |
SDE12394_04640 | 896310..899111 | + | 2802 | ADX24424 | hypothetical protein | prgK |
SDE12394_04645 | 899197..900042 | - | 846 | ADX24425 | abortive infection protein AbiGII | - |
SDE12394_04650 | 900039..900629 | - | 591 | ADX24426 | abortive infection protein AbiGI | - |
SDE12394_04655 | 900908..905803 | + | 4896 | ADX24427 | agglutinin receptor | prgB |
SDE12394_04660 | 905804..905995 | + | 192 | ADX24428 | calcium-binding protein, putative | - |
SDE12394_04665 | 905979..906530 | + | 552 | ADX24429 | hypothetical protein | gbs1354 |
SDE12394_04670 | 906579..913403 | + | 6825 | ADX24430 | SNF2 family protein | - |
SDE12394_04675 | 913474..913773 | + | 300 | ADX24431 | hypothetical protein | - |
SDE12394_04680 | 913787..914077 | + | 291 | ADX24432 | hypothetical protein | gbs1350 |
SDE12394_04685 | 914173..914811 | + | 639 | ADX24433 | hypothetical protein | prgL |
SDE12394_04690 | 914850..915926 | + | 1077 | ADX24434 | hypothetical protein | - |
SDE12394_04695 | 915980..916207 | + | 228 | ADX24435 | hypothetical protein | gbs1347 |
SDE12394_04700 | 916204..916593 | + | 390 | ADX24436 | hypothetical protein | gbs1346 |
SDE12394_04705 | 916643..916939 | + | 297 | ADX24437 | hypothetical protein | - |
SDE12394_04710 | 916948..917256 | - | 309 | ADX24438 | hypothetical protein | - |
SDE12394_04715 | 917243..917593 | - | 351 | ADX24439 | hypothetical protein | - |
SDE12394_04720 | 917590..919005 | - | 1416 | ADX24440 | ImpB/MucB/SamB family protein | - |
SDE12394_04725 | 919011..919145 | - | 135 | ADX24441 | hypothetical protein | - |
SDE12394_04730 | 919148..919840 | - | 693 | ADX24442 | repressor protein, putative | - |
SDE12394_04735 | 920024..920350 | - | 327 | ADX24443 | hypothetical protein | - |
SDE12394_04740 | 920384..920842 | - | 459 | ADX24444 | hypothetical protein | - |
Host bacterium
ID | 416 | Element type | ICE (Integrative and conjugative element) |
Element name | ICESdy12394-1 | GenBank | CP002215 |
Element size | 2159491 bp | Coordinate of oriT [Strand] | 35063..35103 [+] |
Host bacterium | Streptococcus dysgalactiae subsp. equisimilis ATCC 12394 | Coordinate of element | 886112..936679 |
Cargo genes
Drug resistance gene | - |
Virulence gene | - |
Metal resistance gene | tcrB, cadC |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | AcrIIA8, AcrIIA21 |