Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 200469 |
Name | oriT_ICESpnP1031-1 |
Organism | Streptococcus pneumoniae P1031 |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | CP000920 (19692..19824 [-], 133 nt) |
oriT length | 133 nt |
IRs (inverted repeats) | 65..70, 83..88 (AAATCC..GGATTT) 6..12, 23..29 (ACCCCCC..GGGGGGT) |
Location of nic site | 75..76 |
Conserved sequence flanking the nic site |
TTTGGTTACA |
Note | Predicted by oriTfinder 2.0 |
oriT sequence
Download Length: 133 nt
ACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATAC
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Download structure fileRelaxase
ID | 14496 | GenBank | ACO21742 |
Name | Rep_trans_SPP_1164_ICESpnP1031-1 | UniProt ID | _ |
Length | 401 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 401 a.a. Molecular weight: 47398.10 Da Isoelectric Point: 6.5091
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAP
LEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGR
GCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRS
GELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLL
VYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
ID | 14497 | GenBank | ACO21051 |
Name | Relaxase_SPP_1196_ICESpnP1031-1 | UniProt ID | _ |
Length | 609 a.a. | PDB ID | |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Download Length: 609 a.a. Molecular weight: 73273.60 Da Isoelectric Point: 9.3359
MVITKHFAIHGKNYRSKLIKYILNPSKTKNLTLVSDFGMRNYLDFPSYEELVKMYNDNFLSNDTLYEFRH
DRREVNQRKIHSHHIIQSFSPDDHLTPEQINQIGYETVKELTGGRFRFIVATHMDKDHIHNHIILNSIDQ
NSDKKFLWDYKSERNLRMVSDRLSKIAGAKIIENRYSHRQYEVYRKTNYKYEIKQRVYFLIENSKNFEDF
KKKAKALHLKIDFRHKHVTFFMTDSNMKQVVRDSKLSRKQPYNETYFKKKFVQREIINILEFLLPKMKNM
NELIQRAEFFGLKIIPKEKHVQFEFDEIKISEQELVKANRYSVSYFQNYFNNKNETVVLDNKNLVELYNE
EKLIKEKELPTEEVVWKSYQDFKRNRDAVHEFEVELNLNQIEEVVDDGIYIKVQFGIRQEGLIFVPNIQI
NMEEEKVKVFLRETSSYYVYHKDSTEKNRFMKGKTLIRQFNLQYEPQYMYRRIPLSKIKEKIEQLDFLVS
AENSQNAFEDITKDFIAQISYLENMIDHVQNKIDDLSNLEEVLLNDATNSSSNLENSIQGKSSVDTIEKD
LYIYKGKIETLKEQHREAINLFEMFNKTIKKYKKKQNTKSIKENEIHLE
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
Auxiliary protein
ID | 7916 | GenBank | ACO21540 |
Name | ACO21540_ICESpnP1031-1 | UniProt ID | _ |
Length | 405 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Auxiliary protein sequence
Download Length: 405 a.a. Molecular weight: 47021.92 Da Isoelectric Point: 9.7978
MSEKRRDNKGRILKTGESQRKDGRYLYKYIDSFGEPQFVYSWKLVATDRVPAGKRDCISLREKIAELQKD
IHDGIDVVGKKMTLCQLYAKQNAQRPKVRKNTETGRKYLMDILKKDKLGVRSIDSIKPSDAKEWAIRMSE
NGYAYQTINNYKRSLKASFYIAIQDDCVRKNPFDFQLKAVLDDDTVPKTVLTEEQEEKLLAFAKADKTYS
KNYDEILILLKTGLRISEFGGLTLPDLDFENRLVNIDHQLLRDTEIGYYIETPKTKSGERQVPMVEEAYQ
AFKRVLANRKNDKRVEIDGYSDFLFLNRKNYPKVASDYNGMMKGLVKKYNKYNEDKLPHITPHSLRHTFC
TNYANAGMNPKALQYIMGHANIAMTLNYYAHATFDSAMAEMKRLNKEKQQERLVA
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4CP
ID | 17105 | GenBank | ACO22165 |
Name | tcpA_SPP_1165_ICESpnP1031-1 | UniProt ID | _ |
Length | 461 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 461 a.a. Molecular weight: 53428.31 Da Isoelectric Point: 8.9454
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESDLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
ID | 17106 | GenBank | ACO21231 |
Name | t4cp2_SPP_1171_ICESpnP1031-1 | UniProt ID | _ |
Length | 625 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Download Length: 625 a.a. Molecular weight: 72467.79 Da Isoelectric Point: 9.5534
MYSGKKFLLFSLLGILLGYLFHRLTLLYDSYTGNTLDKWTHLLMEGQDEVLQSPWNVSFTGKSSAFFLLG
FVMMLLVYLYLETGKKQYREGVEYGSARFGTLKEKKLFYGKEFSHDTILAQDVRLTLLDKKPPQYDRNKN
IAVIGGSGSGKTFRFVKPNLIQMNSSNIVVDPKDHLAEKTGKLFLEHGYQVKVLDLVNMKNSDGFNPFRY
IETENDLNRMLTVYFNNTKGSGSRSDPFWDEASMTLVRALASYLVDFYNPPKTREQLIEESRLSQKEYQN
LLKRQKKEVEERKKRGRYPSFAEISKLIKHLSKGENQEKSVLEILFENYAKKYGTENFTMRNWADFQNYK
DKTLDSVIAVTTAKFALFNIQSVMDLTKRDTLDMKTWGKEKSMVYLVIPDNDSTFRFLSALFFSTVFQTL
TRQADIDFKGQLPLHVRVYLDEFANIGEIPDFAEQTSTVRSRNMSLVPILQNIAQLQGLYKEKEAWKTIL
GNCDSLVYLGGNDEDTFKFMSGLLGKQTIDVRNTSRSFGQTGSGSLSHQKIARDLMTPDEVGNMKRHECL
VRIANMPVFKSKKYNSTKHPNWKYLANQESDERWWNYQINPLNQRQENYLDDLRIRDLTFESSLK
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
T4SS were predicted by using oriTfinder2.
Region 1: 1066920..1107371
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
SPP_1142 | 1064005..1064190 | - | 186 | ACO22149 | hypothetical protein | - |
SPP_1143 | 1064530..1064703 | + | 174 | ACO21714 | conserved hypothetical protein | - |
SPP_1144 | 1064700..1065479 | + | 780 | ACO21883 | replication initiator protein A (RepA) N-terminus | - |
SPP_1145 | 1065578..1066936 | + | 1359 | ACO20297 | methyl transferase | - |
SPP_1146 | 1066920..1067369 | + | 450 | ACO20907 | conserved hypothetical protein | gbs1369 |
SPP_1147 | 1067362..1067742 | + | 381 | ACO21572 | conserved hypothetical protein | - |
SPP_1148 | 1067755..1067988 | + | 234 | ACO20785 | conserved hypothetical protein | - |
SPP_1149 | 1068132..1068593 | + | 462 | ACO21348 | caax amino protease family | - |
SPP_1150 | 1068755..1069972 | - | 1218 | ACO21540 | transposase from transposon (Integrase) | - |
SPP_1151 | 1070054..1070257 | - | 204 | ACO22031 | conserved domain protein | - |
SPP_1152 | 1070945..1071367 | - | 423 | ACO20943 | sigma-70, region 4 family | - |
SPP_1153 | 1071400..1071516 | + | 117 | ACO21620 | hypothetical protein | - |
SPP_1154 | 1071872..1072225 | + | 354 | ACO21567 | transcriptional regulator, putative | - |
SPP_1155 | 1072571..1074490 | - | 1920 | ACO22036 | tetracycline resistance protein TetM | - |
SPP_1156 | 1074506..1074622 | - | 117 | ACO21110 | Tetracycline resistance determinant leader peptide | - |
SPP_1157 | 1074867..1075784 | - | 918 | ACO20340 | conjugative transposon protein | orf13 |
SPP_1158 | 1075799..1076800 | - | 1002 | ACO20860 | NLP/P60 family protein | orf14 |
SPP_1159 | 1076797..1078974 | - | 2178 | ACO21330 | conjugative transposon membrane protein | orf15 |
SPP_1160 | 1078977..1081424 | - | 2448 | ACO21795 | conjugative transposon protein | virb4 |
SPP_1161 | 1081408..1081914 | - | 507 | ACO20347 | conjugative transposon membrane protein | orf17a |
SPP_1162 | 1081889..1082386 | - | 498 | ACO20815 | conjugative transposon protein | - |
SPP_1163 | 1082503..1082724 | - | 222 | ACO21324 | conserved domain protein | orf19 |
SPP_1164 | 1082767..1083972 | - | 1206 | ACO21742 | transcriptional regulator, Cro/CI family | - |
SPP_1165 | 1084150..1085535 | - | 1386 | ACO22165 | ftsk/spoiiie family protein | virb4 |
SPP_1166 | 1085564..1085947 | - | 384 | ACO20643 | conjugative transposon protein | orf23 |
SPP_1167 | 1085966..1086280 | - | 315 | ACO20990 | conjugative transposon protein | orf23 |
SPP_1168 | 1086303..1086422 | - | 120 | ACO21220 | conserved hypothetical protein | - |
SPP_1169 | 1086953..1087252 | + | 300 | ACO21559 | conserved hypothetical protein | - |
SPP_1170 | 1087242..1087730 | + | 489 | ACO20790 | conserved hypothetical protein | gbs1365 |
SPP_1171 | 1087730..1089607 | + | 1878 | ACO21231 | TraG/TraD family protein | virb4 |
SPP_1172 | 1089628..1089870 | + | 243 | ACO22011 | conserved hypothetical protein | prgF |
SPP_1173 | 1089887..1090741 | + | 855 | ACO20364 | membrane protein, putative | prgHb |
SPP_1174 | 1090795..1091154 | + | 360 | ACO20530 | conserved hypothetical protein | prgIc |
SPP_1175 | 1091147..1093462 | + | 2316 | ACO20939 | conserved hypothetical protein | virb4 |
SPP_1176 | 1093474..1096287 | + | 2814 | ACO22206 | M23 peptidase domain protein | prgK |
SPP_1177 | 1096525..1097091 | + | 567 | ACO21609 | conserved hypothetical protein | - |
SPP_1178 | 1097243..1097653 | + | 411 | ACO21580 | Ig domain protein, group 2 domain protein | cd424 |
SPP_1179 | 1097701..1103931 | + | 6231 | ACO20977 | SNF2 family protein | - |
SPP_1180 | 1104006..1104290 | + | 285 | ACO20529 | conserved hypothetical protein | - |
SPP_1181 | 1104304..1104600 | + | 297 | ACO21385 | conserved hypothetical protein | gbs1350 |
SPP_1182 | 1104614..1104811 | - | 198 | ACO20276 | hypothetical protein | - |
SPP_1183 | 1104971..1105612 | + | 642 | ACO21767 | conserved hypothetical protein | prgL |
SPP_1184 | 1105622..1106707 | + | 1086 | ACO21629 | DNA primase | traP |
SPP_1185 | 1106758..1106991 | + | 234 | ACO21022 | conserved hypothetical protein | gbs1347 |
SPP_1186 | 1106988..1107371 | + | 384 | ACO21673 | methyl-accepting chemotaxis protein | gbs1346 |
SPP_1187 | 1107409..1107681 | + | 273 | ACO21534 | ATPases with chaperone activity, ATP-binding subunit | - |
SPP_1188 | 1107751..1108227 | + | 477 | ACO20956 | helix-turn-helix domain protein | - |
SPP_1189 | 1108227..1108997 | + | 771 | ACO21923 | signal recognition particle GTPase | - |
SPP_1190 | 1109225..1109908 | + | 684 | ACO22113 | transcriptional regulator | - |
SPP_1191 | 1109911..1111326 | + | 1416 | ACO21798 | SOS responce UmuC protein | - |
SPP_1192 | 1111326..1111688 | + | 363 | ACO20400 | conserved hypothetical protein | - |
SPP_1193 | 1111678..1111968 | + | 291 | ACO20889 | conserved hypothetical protein | - |
Host bacterium
ID | 414 | Element type | ICE (Integrative and conjugative element) |
Element name | ICESpnP1031-1 | GenBank | CP000920 |
Element size | 2111882 bp | Coordinate of oriT [Strand] | 19692..19824 [-] |
Host bacterium | Streptococcus pneumoniae P1031 | Coordinate of element | 1064292..1117331 |
Cargo genes
Drug resistance gene | tet(M) |
Virulence gene | - |
Metal resistance gene | - |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | AcrIIA21, AcrIIA1 |