Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 200508 |
Name | oriT_ICESpn11930 |
Organism | Streptococcus pneumoniae integrative and |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | FR671403 (55335..55467 [-], 133 nt) |
oriT length | 133 nt |
IRs (inverted repeats) | 65..70, 83..88 (AAATCC..GGATTT); 6..12, 23..29 (ACCCCCC..GGGGGGT) |
Location of nic site | 75..76 |
Conserved sequence flanking the nic site | TTTGGTTATA |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Length: 133 nt
ACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTATAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATAC
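The inverted repeats, nic site and conserved flanking sequence listed above can be checked directly against this sequence. The following is a minimal Python sketch, assuming the coordinates in the table are 1-based and inclusive relative to the 133-nt oriT sequence as displayed. The helper names `revcomp` and `region` are illustrative only and are not part of oriTfinder.

```python
# oriT sequence exactly as displayed in this record (133 nt).
ORIT = (
    "ACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTC"
    "AAAAAAACATTAGAAAATCCTTTGGTTATAAGGGATTTACAAAATTTCAG"
    "CGTATGTCAAATGGGCTTTAAAAGTTGACATAC"
)


def revcomp(seq):
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def region(seq, start, end):
    """Slice using 1-based inclusive coordinates (assumed convention)."""
    return seq[start - 1:end]


# Inverted-repeat arm coordinates as listed in the record.
ir_pairs = [((65, 70), (83, 88)), ((6, 12), (23, 29))]

# For each pair, the right arm should be the reverse complement of the left.
ir_check = []
for left, right in ir_pairs:
    left_arm = region(ORIT, *left)
    right_arm = region(ORIT, *right)
    ir_check.append((left_arm, right_arm, revcomp(left_arm) == right_arm))

# Locate the conserved motif and the two bases that straddle the nic site.
motif_start = ORIT.find("TTTGGTTATA") + 1  # convert to 1-based
nic_bases = region(ORIT, 75, 76)

for left_arm, right_arm, ok in ir_check:
    print(left_arm, right_arm, "reverse complements" if ok else "mismatch")
print("motif starts at", motif_start, "; nic between", nic_bases[0], "and", nic_bases[1])
```

Under these assumptions, both repeat pairs are exact reverse complements, and the nic site (75..76) falls inside the conserved motif, in the loop between the arms of the 65..70/83..88 repeat.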
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 14561 | GenBank | CBW38771 |
Name | Relaxase_-_ICESpn11930 | UniProt ID | _ |
Length | 609 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Length: 609 a.a.; Molecular weight: 73069.47 Da; Isoelectric point: 9.0694
MVITKHFAIHGKNYRSKLIKYILNPSKTKNLTLVSDFGMRNYLDFPSYKELVKMYNDNFLSNDTLYEFRH
DRQEVNQRKIHSHHIIQSFSPDDHLTPEQINRIGYEAAKELTGGRFRFIVATHVDKGHIHNHIILNSIDQ
NSDKKFLWDYKAEHNLRMVSDRLSKIAGAKIIENRYSHRQYEVYRKTNYKYEIKQRVYFLIENSKNFEDL
KKKAKALHLKIDFRHKHVTYFMTDSNMKQVVRDSKLSRKQPYNETYFEKKFVQREIINILEFLLPKMKNM
NELIQRAEVFGLKIIPKEKHVLFEFDGIKLAEQELVKTNLYSVSYFQDYFNNKNETFVLDNKNLVELYNE
EKIIKEKELPSEEMVWKSYQDFKRNRDAVHEFEVELNLNQIEEVVDDGIYIKVQFGIRQEGLIFVPNIQI
NMEEEKVKVFLRETSSYYVYHKDSVEKNRFMKGKTLIRQFNLQYEPQYMYRRIPLSKIKEKIEQLDFLIS
AENSSNDFEDITNDFIAQISYLENMIEQVQNKINDLTNLEEVLLKDTTNSSSNLENSIQGKSSVDTIEKD
LYIYKGKIETLKEQHREAINLFEMFNKTIKKYKEKQNMKSIKENEIHLE
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
ID | 14562 | GenBank | CBW38817 |
Name | orf20_-_ICESpn11930 | UniProt ID | _ |
Length | 472 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Length: 472 a.a.; Molecular weight: 55639.89 Da; Isoelectric point: 8.3884
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAP
LEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGR
GCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRS
GELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLL
VYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKKGYLSTIPVDRYPKKDIMGD
KTVRVRADLHHIIKIETAKNGGNVKEVMEIRLRSKLKSVLIVHYLKILYNRN
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
Auxiliary protein
ID | 7932 | GenBank | CBW38800 |
Name | CBW38800_ICESpn11930 | UniProt ID | _ |
Length | 405 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Length: 405 a.a.; Molecular weight: 47021.92 Da; Isoelectric point: 9.7978
MSEKRRDNKGRILKTGESQRKDGRYLYKYIDSFGEPQFVYSWKLVATDRVPAGKRDCISLREKIAELQKD
IHDGIDVVGKKMTLCQLYAKQNAQRPKVRKNTETGRKYLMDILKKDKLGVRSIDSIKPSDAKEWAIRMSE
NGYAYQTINNYKRSLKASFYIAIQDDCVRKNPFDFQLKAVLDDDTVPKTVLTEEQEEKLLAFAKADKTYS
KNYDEILILLKTGLRISEFGGLTLPDLDFENRLVNIDHQLLRDTEIGYYIETPKTKSGERQVPMVEEAYQ
AFKRVLANRKNDKRVEIDGYSDFLFLNRKNYPKVASDYNGMMKGLVKKYNKYNEDKLPHITPHSLRHTFC
TNYANAGMNPKALQYIMGHANIAMTLNYYAHATFDSAMAEMKRLNKEKQQERLVA
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4CP
ID | 17178 | GenBank | CBW38818 |
Name | orf21_-_ICESpn11930 | UniProt ID | _ |
Length | 461 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 461 a.a.; Molecular weight: 53370.27 Da; Isoelectric point: 9.0687
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
ID | 17179 | GenBank | CBW38826 |
Name | traG_-_ICESpn11930 | UniProt ID | _ |
Length | 625 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 625 a.a.; Molecular weight: 72333.61 Da; Isoelectric point: 9.5689
MYSGKKFLLFSLLGILLGYLFHRLTLLYDSYTGNSLDKWTHLLMEGQDEVLQSPWNVSFTGKSSAFFLLG
FVMMLLVYLYLETGKKQYREGVEYGSARFGTLKEKKLFYGKEFSHDTILAQDVRLTLLDKKPPQYDRNKN
IAVIGGSGSGKTFRFVKPNLIQMNSSNIVVDPKDHLAEKTGKLFLENGYQVKVLDLVNMKNSDGFNPFRY
IETENDLNRMLTVYFNNTKGSGSRSDPFWDEASMTLVRALASYLVDFYNPPKTREQLIEESRLSQKEYQN
LLKRQKKEVEERKKRGRYPSFAEISKLIKHLSKGENQEKSVLEILFENYAKKYGTENFTMRNWADFQNYK
DKTLDSVIAVTTAKFALFNIQSVMDLTKRDTLDMKTWGQEKSMVYLVIPDNDSTFRFLSALFFSTVFQTL
TRQADIDFKGQLPLHVRVYLDEFANIGEIPDFAEQTSTVRSRNMSLVPILQNIAQLQGLYKEKEAWKTIL
GNCDSLVYLGGNDEDTFKFMSGLLGKQTIDVRNTSRSFGQTGSGSLSHQKIARDLMTPDEVGNMKRHECL
VRIANMPVFKSKKYNSTKHPNWKYLANQETDERWWNYQINPLNRSQENHLEGLRIRDLTFESSLK
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
The T4SS region was predicted using oriTfinder 2.0.
Region 1: 43371..71028
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
Locus_37 | 39231..39968 | - | 738 | CBW38806 | - | - |
Locus_38 | 41075..43009 | - | 1935 | CBW38807 | Tetracycline resistance protein | - |
Locus_39 | 43371..44306 | - | 936 | CBW38808 | putative conjugative transposon exported protein | orf13 |
Locus_40 | 44303..45304 | - | 1002 | CBW38809 | putative cell wall hydrolase | orf14 |
Locus_41 | 45301..47412 | - | 2112 | CBW38810 | conjugative transposon membrane protein | orf15 |
Locus_42 | 47481..49928 | - | 2448 | CBW38811 | conjugative transposon ATP/GTP-binding protein | virb4 |
Locus_43 | 49912..50418 | - | 507 | CBW38812 | putative conjugative transposon membrane protein | orf17a |
Locus_44 | 50393..50890 | - | 498 | CBW38813 | conjugative transposon protein | - |
Locus_45 | 51007..51174 | - | 168 | CBW38814 | conjugative transposon protein | orf19 |
Locus_46 | 51408..52655 | + | 1248 | CBW38815 | Transposase | - |
Locus_47 | 52912..53649 | - | 738 | CBW38816 | erm(B) methylase; SubName: Full=MLS methylase; SubName: Full=rRNA adenine N-6-methyltransferase; EC=2.1.1.48 | - |
Locus_48 | 53906..55324 | - | 1419 | CBW38817 | putative conjugative transposon replication initiation factor | - |
Locus_49 | 55502..56887 | - | 1386 | CBW38818 | conjugative transposon FtsK/SpoIIIE-family protein | virb4 |
Locus_50 | 56916..57302 | - | 387 | CBW38819 | conjugative transposon protein | orf23 |
Locus_51 | 57318..57632 | - | 315 | CBW38820 | conjugative transposon protein | orf23 |
Locus_52 | 58199..61012 | - | 2814 | CBW38821 | putative conjugal transfer protein | prgK |
Locus_53 | 61024..63339 | - | 2316 | CBW38822 | putative conjugal transfer protein | virb4 |
Locus_54 | 63332..63691 | - | 360 | CBW38823 | putative uncharacterized protein | prgIc |
Locus_55 | 63745..64599 | - | 855 | CBW38824 | putative uncharacterized protein | prgHb |
Locus_56 | 64616..64858 | - | 243 | CBW38825 | putative uncharacterized protein | prgF |
Locus_57 | 64879..66756 | - | 1878 | CBW38826 | putative conjugal transfer protein TraG | virb4 |
Locus_58 | 66756..67244 | - | 489 | CBW38827 | putative uncharacterized protein | gbs1365 |
Locus_59 | 67234..67533 | - | 300 | CBW38828 | putative uncharacterized protein | - |
Locus_60 | 67816..68652 | - | 837 | CBW38829 | conserved hypothetical protein | - |
Locus_61 | 68652..69296 | - | 645 | CBW38830 | conserved hypothetical protein | - |
Locus_62 | 69370..69957 | - | 588 | CBW38831 | putative membrane protein | - |
Locus_63 | 69960..70193 | - | 234 | CBW38832 | putative uncharacterized protein | - |
Locus_64 | 70206..70586 | - | 381 | CBW38833 | putative uncharacterized protein | - |
Locus_65 | 70579..71028 | - | 450 | CBW38834 | putative uncharacterized protein | gbs1369 |
Locus_66 | 71012..72370 | - | 1359 | CBW38835 | putative DNA methylase | - |
Locus_67 | 72469..73248 | - | 780 | CBW38836 | putative replication initiator protein | - |
Locus_68 | 73245..73418 | - | 174 | CBW38837 | putative uncharacterized protein | - |
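The coordinates in the table can be cross-checked against the sizes and protein lengths reported elsewhere in this record. The following is a minimal Python sketch over a handful of rows copied from the table, assuming inclusive coordinates (size = end - start + 1) and assuming each gene size includes a single stop codon (protein length = size / 3 - 1). The function names and the choice of rows are illustrative.

```python
# T4SS Region 1 and oriT spans as given in this record.
REGION_1 = (43371, 71028)
ORIT_SPAN = (55335, 55467)

# A subset of table rows: locus -> (start, end, size in bp).
loci = {
    "Locus_38": (41075, 43009, 1935),
    "Locus_39": (43371, 44306, 936),
    "Locus_48": (53906, 55324, 1419),   # CBW38817, listed as 472 a.a.
    "Locus_49": (55502, 56887, 1386),   # CBW38818, listed as 461 a.a.
    "Locus_57": (64879, 66756, 1878),   # CBW38826, listed as 625 a.a.
    "Locus_65": (70579, 71028, 450),
    "Locus_66": (71012, 72370, 1359),
}


def span_length(start, end):
    """Length of an inclusive coordinate span."""
    return end - start + 1


def protein_length(size_bp):
    """Amino-acid count, assuming the size includes one stop codon."""
    return size_bp // 3 - 1


# Every listed size should equal end - start + 1.
sizes_ok = all(span_length(s, e) == size for s, e, size in loci.values())

# Loci lying entirely within Region 1; the others are flanking rows.
in_region = [
    name for name, (s, e, _) in loci.items()
    if REGION_1[0] <= s and e <= REGION_1[1]
]

# The oriT should sit in the gap between Locus_48 and Locus_49.
orit_intergenic = loci["Locus_48"][1] < ORIT_SPAN[0] and ORIT_SPAN[1] < loci["Locus_49"][0]

print(sizes_ok, in_region, orit_intergenic)
print({name: protein_length(size) for name, (_, _, size) in loci.items()})
```

With these rows, the sizes agree with the coordinates, the protein lengths derived for Locus_48, Locus_49 and Locus_57 match the 472, 461 and 625 a.a. entries above, and the oriT lies in the intergenic gap between Locus_48 and Locus_49.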
Host bacterium
ID | 452 | Element type | ICE (Integrative and conjugative element) |
Element name | ICESpn11930 | GenBank | FR671403 |
Element size | 73716 bp | Coordinate of oriT [Strand] | 55335..55467 [-] |
Host bacterium | Streptococcus pneumoniae integrative and | Coordinate of element | 1..73716 |
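Because the oriT is annotated on the minus strand, positions given relative to the oriT sequence (such as the nic site at 75..76) run in the opposite direction to the element coordinates. The following Python sketch converts between the two. It rests on an assumption this record does not state: that the displayed oriT sequence is the reverse complement of element positions 55335..55467, so that oriT position 1 corresponds to element position 55467. The function name `to_element_coord` is hypothetical.

```python
# oriT span on the element (FR671403), minus strand, as given in this record.
ORIT_START, ORIT_END = 55335, 55467


def to_element_coord(local_pos, start=ORIT_START, end=ORIT_END):
    """Map a 1-based oriT position to an element coordinate.

    Assumes the oriT sequence is read 5'->3' on the minus strand, so
    local position 1 corresponds to the highest element coordinate.
    """
    length = end - start + 1
    if not 1 <= local_pos <= length:
        raise ValueError(f"position {local_pos} outside 1..{length}")
    return end - local_pos + 1


# Nic site flanking positions (75..76) expressed in element coordinates.
nic_on_element = (to_element_coord(75), to_element_coord(76))
print(nic_on_element)
```

Under that assumption, the nic site would lie between element positions 55393 and 55392; if the displayed sequence were instead taken from the plus strand, the mapping would be `start + local_pos - 1`.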
Cargo genes
Drug resistance gene | cat(pC194), erm(B), tet(M) |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | AcrIIA21 |