Detailed information    

insolico Bioinformatically predicted

Overview


Name   dprA   Type   Machinery gene
Locus tag   I6K34_RS20740 Genome accession   NZ_CP069990
Coordinates   4267825..4268949 (+) Length   374 a.a.
NCBI ID   WP_000228509.1    Uniprot ID   -
Organism   Escherichia coli strain FDAARGOS_1304     
Function   ssDNA binding; loading RecA onto ssDNA (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 4245978..4268949 4267825..4268949 within 0


Gene organization within MGE regions


Location: 4245978..4268949
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I6K34_RS20550 (I6K34_20550) gspA 4245978..4247447 (+) 1470 WP_000107586.1 type II secretion system protein GspA -
  I6K34_RS20555 (I6K34_20555) gspB 4247449..4247868 (+) 420 WP_000461443.1 putative general secretion pathway protein GspB -
  I6K34_RS20560 (I6K34_20560) rpsJ 4248106..4248417 (+) 312 WP_001181004.1 30S ribosomal protein S10 -
  I6K34_RS20565 (I6K34_20565) rplC 4248450..4249079 (+) 630 WP_000579833.1 50S ribosomal protein L3 -
  I6K34_RS20570 (I6K34_20570) rplD 4249090..4249695 (+) 606 WP_000424395.1 50S ribosomal protein L4 -
  I6K34_RS20575 (I6K34_20575) rplW 4249692..4249994 (+) 303 WP_000617544.1 50S ribosomal protein L23 -
  I6K34_RS20580 (I6K34_20580) rplB 4250012..4250833 (+) 822 WP_000301864.1 50S ribosomal protein L2 -
  I6K34_RS20585 (I6K34_20585) rpsS 4250850..4251128 (+) 279 WP_001138117.1 30S ribosomal protein S19 -
  I6K34_RS20590 (I6K34_20590) rplV 4251143..4251475 (+) 333 WP_000447529.1 50S ribosomal protein L22 -
  I6K34_RS20595 (I6K34_20595) rpsC 4251493..4252194 (+) 702 WP_000529945.1 30S ribosomal protein S3 -
  I6K34_RS20600 (I6K34_20600) rplP 4252207..4252617 (+) 411 WP_000941212.1 50S ribosomal protein L16 -
  I6K34_RS20605 (I6K34_20605) rpmC 4252617..4252808 (+) 192 WP_000644741.1 50S ribosomal protein L29 -
  I6K34_RS20610 (I6K34_20610) rpsQ 4252808..4253062 (+) 255 WP_000130100.1 30S ribosomal protein S17 -
  I6K34_RS20615 (I6K34_20615) rplN 4253227..4253598 (+) 372 WP_000613955.1 50S ribosomal protein L14 -
  I6K34_RS20620 (I6K34_20620) rplX 4253609..4253923 (+) 315 WP_000729186.1 50S ribosomal protein L24 -
  I6K34_RS20625 (I6K34_20625) rplE 4253938..4254477 (+) 540 WP_001096200.1 50S ribosomal protein L5 -
  I6K34_RS20630 (I6K34_20630) rpsN 4254492..4254797 (+) 306 WP_001118930.1 30S ribosomal protein S14 -
  I6K34_RS20635 (I6K34_20635) rpsH 4254831..4255223 (+) 393 WP_000062611.1 30S ribosomal protein S8 -
  I6K34_RS20640 (I6K34_20640) rplF 4255236..4255769 (+) 534 WP_000091945.1 50S ribosomal protein L6 -
  I6K34_RS20645 (I6K34_20645) rplR 4255779..4256132 (+) 354 WP_000358960.1 50S ribosomal protein L18 -
  I6K34_RS20650 (I6K34_20650) rpsE 4256147..4256650 (+) 504 WP_000940121.1 30S ribosomal protein S5 -
  I6K34_RS20655 (I6K34_20655) rpmD 4256654..4256833 (+) 180 WP_001140433.1 50S ribosomal protein L30 -
  I6K34_RS20660 (I6K34_20660) rplO 4256837..4257271 (+) 435 WP_001238917.1 50S ribosomal protein L15 -
  I6K34_RS20665 (I6K34_20665) secY 4257279..4258610 (+) 1332 WP_001118861.1 preprotein translocase subunit SecY -
  I6K34_RS20670 (I6K34_20670) rpmJ 4258642..4258758 (+) 117 WP_000868187.1 50S ribosomal protein L36 -
  I6K34_RS20675 (I6K34_20675) rpsM 4258905..4259261 (+) 357 WP_000090775.1 30S ribosomal protein S13 -
  I6K34_RS20680 (I6K34_20680) rpsK 4259278..4259667 (+) 390 WP_001029684.1 30S ribosomal protein S11 -
  I6K34_RS20685 (I6K34_20685) rpsD 4259701..4260321 (+) 621 WP_000135224.1 30S ribosomal protein S4 -
  I6K34_RS20690 (I6K34_20690) rpoA 4260347..4261336 (+) 990 WP_001162094.1 DNA-directed RNA polymerase subunit alpha -
  I6K34_RS20695 (I6K34_20695) rplQ 4261377..4261760 (+) 384 WP_001216368.1 50S ribosomal protein L17 -
  I6K34_RS20700 (I6K34_20700) yhdN 4261867..4262235 (+) 369 WP_000266504.1 DUF1992 domain-containing protein -
  I6K34_RS20705 (I6K34_20705) zntR 4262246..4262671 (+) 426 WP_000285610.1 Zn(2+)-responsive transcriptional regulator -
  I6K34_RS20710 (I6K34_20710) arfA 4262727..4262945 (+) 219 WP_000092695.1 alternative ribosome-rescue factor ArfA -
  I6K34_RS20715 (I6K34_20715) mscL 4262942..4263355 (-) 414 WP_000022450.1 large-conductance mechanosensitive channel protein MscL -
  I6K34_RS20720 (I6K34_20720) trkA 4263485..4264861 (-) 1377 WP_000691382.1 Trk system potassium transporter TrkA -
  I6K34_RS20725 (I6K34_20725) rsmB 4264883..4266172 (-) 1290 WP_000744766.1 16S rRNA (cytosine(967)-C(5))-methyltransferase RsmB -
  I6K34_RS20730 (I6K34_20730) fmt 4266224..4267171 (-) 948 WP_000004421.1 methionyl-tRNA formyltransferase -
  I6K34_RS20735 (I6K34_20735) def 4267186..4267695 (-) 510 WP_000114984.1 peptide deformylase -
  I6K34_RS20740 (I6K34_20740) dprA 4267825..4268949 (+) 1125 WP_000228509.1 DNA-protecting protein DprA Machinery gene

Sequence


Protein


Download         Length: 374 a.a.        Molecular weight: 40991.96 Da        Isoelectric Point: 6.1815

>NTDB_id=537751 I6K34_RS20740 WP_000228509.1 4267825..4268949(+) (dprA) [Escherichia coli strain FDAARGOS_1304]
MVDTEIWLRLISISSLYGDDMVRIAHWLAKQSHIDAVVLQQTGLTLRQAQRFLSFPRKSIESSLCWLEQPNHHLIPADSE
FYPPQLLATTDYPGALFVEGELHALHSFQLAVVGSRAHSWYGERWRRLFCETLATRGVTITSGLARGIDGVAHKAALQVN
GVSIAVLGNGLNTIHPRRHAPLAASLLEQGGALVSEFPLDVPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCA
LEQGREVFALPGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPDAPENSFYSPDQEDVALPFPELLANVGDE
VTPVDVVAERAGQPVPEVVTQLLELELAGWIAAVPGGYVRLRRACHVRRTNVFV

Nucleotide


Download         Length: 1125 bp        

>NTDB_id=537751 I6K34_RS20740 WP_000228509.1 4267825..4268949(+) (dprA) [Escherichia coli strain FDAARGOS_1304]
ATGGTTGATACAGAAATTTGGCTGCGTTTAATTAGCATCAGCAGCTTGTATGGCGATGATATGGTCCGTATAGCTCACTG
GCTGGCAAAACAGTCGCATATTGATGCGGTTGTATTGCAGCAAACAGGGCTTACATTGCGGCAGGCACAACGCTTTCTTT
CATTTCCGCGAAAGAGTATCGAAAGCTCACTTTGTTGGTTGGAGCAACCCAACCATCATTTAATCCCTGCGGACAGCGAA
TTTTATCCTCCTCAACTTCTGGCGACGACAGATTACCCCGGCGCACTGTTTGTTGAAGGAGAACTGCACGCGTTGCATTC
ATTTCAGCTTGCCGTAGTGGGGAGTCGGGCGCATTCATGGTATGGCGAGCGATGGAGACGATTATTTTGCGAAACTCTGG
CGACGCGTGGAGTGACAATTACGAGTGGACTGGCGCGTGGAATCGATGGTGTGGCGCATAAAGCGGCCTTACAGGTAAAT
GGCGTTAGCATTGCTGTATTGGGGAATGGACTTAATACCATTCATCCCCGCCGCCATGCTCCACTGGCTGCCAGTCTACT
TGAGCAAGGTGGTGCTCTCGTCTCGGAATTTCCCCTCGATGTTCCACCCCTTGCTTACAATTTCCCACGAAGAAATCGCA
TTATCAGTGGTCTAAGTAAAGGTGTACTGGTGGTGGAAGCGGCTTTGCGCAGTGGTTCGCTGGTGACAGCACGTTGTGCG
CTTGAGCAGGGGCGTGAAGTTTTTGCTTTGCCAGGTCCAATAGGGAATCCGGGAAGCGAAGGGCCTCACTGGTTAATAAA
ACAAGGTGCGATTCTTGTGACGGAACCGGAAGAAATTCTGGAAAACTTGCAATTTGGATTGCACTGGTTGCCAGACGCCC
CTGAAAATTCATTTTATTCACCAGATCAGGAAGACGTGGCATTGCCATTTCCTGAGCTCCTGGCTAACGTAGGAGATGAG
GTAACACCTGTTGACGTCGTCGCTGAACGTGCCGGCCAACCTGTGCCAGAGGTAGTTACTCAACTACTCGAACTGGAGTT
AGCAGGATGGATCGCAGCTGTACCCGGCGGCTATGTCCGATTGAGGAGGGCATGCCATGTTCGACGTACTAATGTATTTG
TTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  dprA Vibrio campbellii strain DS40M4

48.78

98.663

0.481

  dprA Vibrio cholerae strain A1552

49.032

82.888

0.406

  dprA Glaesserella parasuis strain SC1401

44.807

90.107

0.404

  dprA Haemophilus influenzae Rd KW20

43.917

90.107

0.396

  dprA Legionella pneumophila strain ERS1305867

44.242

88.235

0.39