Detailed information of oriT

oriT


The information of the oriT region


oriTDB ID   200469
Name   oriT_ICESpnP1031-1 in_silico
Organism   Streptococcus pneumoniae P1031
Sequence Completeness      -
NCBI accession of oriT (coordinates [strand])   CP000920 (19692..19824 [-], 133 nt)
oriT length   133 nt
IRs (inverted repeats)      65..70, 83..88  (AAATCC..GGATTT)
 6..12, 23..29  (ACCCCCC..GGGGGGT)
Location of nic site      75..76
Conserved sequence flanking the
  nic site  
 
 TTTGGTTACA
Note   Predicted by oriTfinder 2.0

  oriT sequence  


Download         Length: 133 nt

>oriT_ICESpnP1031-1
ACTTAACCCCCCGTATCTAACAGGGGGGTACAAATCGACAGGAAACAGTCAAAAAAACATTAGAAAATCCTTTGGTTACAAGGGATTTACAAAATTTCAGCGTATGTCAAATGGGCTTTAAAAGTTGACATAC

Visualization of oriT structure

  oriT secondary structure

Predicted by RNAfold.

Download structure file


Relaxase


ID   14496 GenBank   ACO21742
Name   Rep_trans_SPP_1164_ICESpnP1031-1 insolico UniProt ID   _
Length   401 a.a. PDB ID   
Note   Predicted by oriTfinder 2.0

  Relaxase protein sequence


Download         Length: 401 a.a.        Molecular weight: 47398.10 Da        Isoelectric Point: 6.5091

>ACO21742.1 transcriptional regulator, Cro/CI family [Streptococcus pneumoniae P1031]
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAP
LEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGR
GCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRS
GELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLL
VYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK

  Protein domains


Predicted by InterproScan.

(15-53)

(169-373)

(73-161)


  Protein structure



No available structure.



ID   14497 GenBank   ACO21051
Name   Relaxase_SPP_1196_ICESpnP1031-1 insolico UniProt ID   _
Length   609 a.a. PDB ID   
Note   Predicted by oriTfinder 2.0

  Relaxase protein sequence


Download         Length: 609 a.a.        Molecular weight: 73273.60 Da        Isoelectric Point: 9.3359

>ACO21051.1 relaxase [Streptococcus pneumoniae P1031]
MVITKHFAIHGKNYRSKLIKYILNPSKTKNLTLVSDFGMRNYLDFPSYEELVKMYNDNFLSNDTLYEFRH
DRREVNQRKIHSHHIIQSFSPDDHLTPEQINQIGYETVKELTGGRFRFIVATHMDKDHIHNHIILNSIDQ
NSDKKFLWDYKSERNLRMVSDRLSKIAGAKIIENRYSHRQYEVYRKTNYKYEIKQRVYFLIENSKNFEDF
KKKAKALHLKIDFRHKHVTFFMTDSNMKQVVRDSKLSRKQPYNETYFKKKFVQREIINILEFLLPKMKNM
NELIQRAEFFGLKIIPKEKHVQFEFDEIKISEQELVKANRYSVSYFQNYFNNKNETVVLDNKNLVELYNE
EKLIKEKELPTEEVVWKSYQDFKRNRDAVHEFEVELNLNQIEEVVDDGIYIKVQFGIRQEGLIFVPNIQI
NMEEEKVKVFLRETSSYYVYHKDSTEKNRFMKGKTLIRQFNLQYEPQYMYRRIPLSKIKEKIEQLDFLVS
AENSQNAFEDITKDFIAQISYLENMIDHVQNKIDDLSNLEEVLLNDATNSSSNLENSIQGKSSVDTIEKD
LYIYKGKIETLKEQHREAINLFEMFNKTIKKYKKKQNTKSIKENEIHLE

  Protein domains


Predicted by InterproScan.

(12-264)


  Protein structure



No available structure.




Auxiliary protein


ID   7916 GenBank   ACO21540
Name   ACO21540_ICESpnP1031-1 insolico UniProt ID   _
Length   405 a.a. PDB ID   _
Note   Predicted by oriTfinder 2.0

  Auxiliary protein sequence


Download         Length: 405 a.a.        Molecular weight: 47021.92 Da        Isoelectric Point: 9.7978

>ACO21540.1 transposase from transposon (Integrase) [Streptococcus pneumoniae P1031]
MSEKRRDNKGRILKTGESQRKDGRYLYKYIDSFGEPQFVYSWKLVATDRVPAGKRDCISLREKIAELQKD
IHDGIDVVGKKMTLCQLYAKQNAQRPKVRKNTETGRKYLMDILKKDKLGVRSIDSIKPSDAKEWAIRMSE
NGYAYQTINNYKRSLKASFYIAIQDDCVRKNPFDFQLKAVLDDDTVPKTVLTEEQEEKLLAFAKADKTYS
KNYDEILILLKTGLRISEFGGLTLPDLDFENRLVNIDHQLLRDTEIGYYIETPKTKSGERQVPMVEEAYQ
AFKRVLANRKNDKRVEIDGYSDFLFLNRKNYPKVASDYNGMMKGLVKKYNKYNEDKLPHITPHSLRHTFC
TNYANAGMNPKALQYIMGHANIAMTLNYYAHATFDSAMAEMKRLNKEKQQERLVA

  Protein domains


Predicted by InterproScan.

(190-383)

(1-71)


  Protein structure



No available structure.




T4CP


ID   17105 GenBank   ACO22165
Name   tcpA_SPP_1165_ICESpnP1031-1 insolico UniProt ID   _
Length   461 a.a. PDB ID   _
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Download         Length: 461 a.a.        Molecular weight: 53428.31 Da        Isoelectric Point: 8.9454

>ACO22165.1 ftsk/spoiiie family protein [Streptococcus pneumoniae P1031]
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAIL
ICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKN
GLIQIRVEITLGKYQDQLLHLEKKLESDLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRL
MKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLL
SCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLG
RQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSV
ISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD

  Protein domains


Predicted by InterproScan.

(217-301)

  Protein structure



No available structure.



ID   17106 GenBank   ACO21231
Name   t4cp2_SPP_1171_ICESpnP1031-1 insolico UniProt ID   _
Length   625 a.a. PDB ID   _
Note   Predicted by oriTfinder 2.0

  T4CP protein sequence


Download         Length: 625 a.a.        Molecular weight: 72467.79 Da        Isoelectric Point: 9.5534

>ACO21231.1 TraG/TraD family protein [Streptococcus pneumoniae P1031]
MYSGKKFLLFSLLGILLGYLFHRLTLLYDSYTGNTLDKWTHLLMEGQDEVLQSPWNVSFTGKSSAFFLLG
FVMMLLVYLYLETGKKQYREGVEYGSARFGTLKEKKLFYGKEFSHDTILAQDVRLTLLDKKPPQYDRNKN
IAVIGGSGSGKTFRFVKPNLIQMNSSNIVVDPKDHLAEKTGKLFLEHGYQVKVLDLVNMKNSDGFNPFRY
IETENDLNRMLTVYFNNTKGSGSRSDPFWDEASMTLVRALASYLVDFYNPPKTREQLIEESRLSQKEYQN
LLKRQKKEVEERKKRGRYPSFAEISKLIKHLSKGENQEKSVLEILFENYAKKYGTENFTMRNWADFQNYK
DKTLDSVIAVTTAKFALFNIQSVMDLTKRDTLDMKTWGKEKSMVYLVIPDNDSTFRFLSALFFSTVFQTL
TRQADIDFKGQLPLHVRVYLDEFANIGEIPDFAEQTSTVRSRNMSLVPILQNIAQLQGLYKEKEAWKTIL
GNCDSLVYLGGNDEDTFKFMSGLLGKQTIDVRNTSRSFGQTGSGSLSHQKIARDLMTPDEVGNMKRHECL
VRIANMPVFKSKKYNSTKHPNWKYLANQESDERWWNYQINPLNQRQENYLDDLRIRDLTFESSLK

  Protein domains


Predicted by InterproScan.

(131-574)

  Protein structure



No available structure.




T4SS


T4SS were predicted by using oriTfinder2.

Region 1: 1066920..1107371

Locus tag Coordinates Strand Size (bp) Protein ID Product Description
SPP_1142 1064005..1064190 - 186 ACO22149 hypothetical protein -
SPP_1143 1064530..1064703 + 174 ACO21714 conserved hypothetical protein -
SPP_1144 1064700..1065479 + 780 ACO21883 replication initiator protein A (RepA) N-terminus -
SPP_1145 1065578..1066936 + 1359 ACO20297 methyl transferase -
SPP_1146 1066920..1067369 + 450 ACO20907 conserved hypothetical protein gbs1369
SPP_1147 1067362..1067742 + 381 ACO21572 conserved hypothetical protein -
SPP_1148 1067755..1067988 + 234 ACO20785 conserved hypothetical protein -
SPP_1149 1068132..1068593 + 462 ACO21348 caax amino protease family -
SPP_1150 1068755..1069972 - 1218 ACO21540 transposase from transposon (Integrase) -
SPP_1151 1070054..1070257 - 204 ACO22031 conserved domain protein -
SPP_1152 1070945..1071367 - 423 ACO20943 sigma-70, region 4 family -
SPP_1153 1071400..1071516 + 117 ACO21620 hypothetical protein -
SPP_1154 1071872..1072225 + 354 ACO21567 transcriptional regulator, putative -
SPP_1155 1072571..1074490 - 1920 ACO22036 tetracycline resistance protein TetM -
SPP_1156 1074506..1074622 - 117 ACO21110 Tetracycline resistance determinant leader peptide -
SPP_1157 1074867..1075784 - 918 ACO20340 conjugative transposon protein orf13
SPP_1158 1075799..1076800 - 1002 ACO20860 NLP/P60 family protein orf14
SPP_1159 1076797..1078974 - 2178 ACO21330 conjugative transposon membrane protein orf15
SPP_1160 1078977..1081424 - 2448 ACO21795 conjugative transposon protein virb4
SPP_1161 1081408..1081914 - 507 ACO20347 conjugative transposon membrane protein orf17a
SPP_1162 1081889..1082386 - 498 ACO20815 conjugative transposon protein -
SPP_1163 1082503..1082724 - 222 ACO21324 conserved domain protein orf19
SPP_1164 1082767..1083972 - 1206 ACO21742 transcriptional regulator, Cro/CI family -
SPP_1165 1084150..1085535 - 1386 ACO22165 ftsk/spoiiie family protein virb4
SPP_1166 1085564..1085947 - 384 ACO20643 conjugative transposon protein orf23
SPP_1167 1085966..1086280 - 315 ACO20990 conjugative transposon protein orf23
SPP_1168 1086303..1086422 - 120 ACO21220 conserved hypothetical protein -
SPP_1169 1086953..1087252 + 300 ACO21559 conserved hypothetical protein -
SPP_1170 1087242..1087730 + 489 ACO20790 conserved hypothetical protein gbs1365
SPP_1171 1087730..1089607 + 1878 ACO21231 TraG/TraD family protein virb4
SPP_1172 1089628..1089870 + 243 ACO22011 conserved hypothetical protein prgF
SPP_1173 1089887..1090741 + 855 ACO20364 membrane protein, putative prgHb
SPP_1174 1090795..1091154 + 360 ACO20530 conserved hypothetical protein prgIc
SPP_1175 1091147..1093462 + 2316 ACO20939 conserved hypothetical protein virb4
SPP_1176 1093474..1096287 + 2814 ACO22206 M23 peptidase domain protein prgK
SPP_1177 1096525..1097091 + 567 ACO21609 conserved hypothetical protein -
SPP_1178 1097243..1097653 + 411 ACO21580 Ig domain protein, group 2 domain protein cd424
SPP_1179 1097701..1103931 + 6231 ACO20977 SNF2 family protein -
SPP_1180 1104006..1104290 + 285 ACO20529 conserved hypothetical protein -
SPP_1181 1104304..1104600 + 297 ACO21385 conserved hypothetical protein gbs1350
SPP_1182 1104614..1104811 - 198 ACO20276 hypothetical protein -
SPP_1183 1104971..1105612 + 642 ACO21767 conserved hypothetical protein prgL
SPP_1184 1105622..1106707 + 1086 ACO21629 DNA primase traP
SPP_1185 1106758..1106991 + 234 ACO21022 conserved hypothetical protein gbs1347
SPP_1186 1106988..1107371 + 384 ACO21673 methyl-accepting chemotaxis protein gbs1346
SPP_1187 1107409..1107681 + 273 ACO21534 ATPases with chaperone activity, ATP-binding subunit -
SPP_1188 1107751..1108227 + 477 ACO20956 helix-turn-helix domain protein -
SPP_1189 1108227..1108997 + 771 ACO21923 signal recognition particle GTPase -
SPP_1190 1109225..1109908 + 684 ACO22113 transcriptional regulator -
SPP_1191 1109911..1111326 + 1416 ACO21798 SOS responce UmuC protein -
SPP_1192 1111326..1111688 + 363 ACO20400 conserved hypothetical protein -
SPP_1193 1111678..1111968 + 291 ACO20889 conserved hypothetical protein -


Host bacterium


ID   414 Element type   ICE (Integrative and conjugative element)
Element name   ICESpnP1031-1 GenBank   CP000920
Element size   2111882 bp Coordinate of oriT [Strand]   19692..19824 [-]
Host bacterium   Streptococcus pneumoniae P1031 Coordinate of element   1064292..1117331

Cargo genes


Drug resistance gene   tet(M)
Virulence gene   -
Metal resistance gene   -
Degradation gene   -
Symbiosis gene   -
Anti-CRISPR   AcrIIA21, AcrIIA1