Detailed information    

in silico (bioinformatically predicted)

Overview


Name               dprA
Type               Machinery gene
Locus tag          MWM20_RS02235
Genome accession   NZ_AP025677
Coordinates        457589..458713 (+)
Length             374 a.a.
NCBI ID            WP_000228517.1
UniProt ID         E2QFE2
Organism           Escherichia coli strain Rb-23
Function           ssDNA binding; loading RecA onto ssDNA (predicted from homology)
DNA processing
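
The coordinates and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Protein length for a CDS that includes one terminal stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = gene_length_bp(457589, 458713)
print(cds_bp, protein_length_aa(cds_bp))
```

For this record, the computed values agree with the 1125 bp nucleotide length and the 374 a.a. protein length listed on the page.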

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type         MGE coordinates   Gene coordinates   Relative position   Distance (bp)
Genomic island   435748..458713    457589..458713     within              0
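
The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. This is a minimal sketch, assuming 1-based inclusive intervals; only the label "within" appears in this record, so the other labels ("upstream", "downstream", "overlapping") and the function name are hypothetical and may not match how VRprofile2 or the database report such cases.

```python
def relative_position(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene interval against an MGE interval and return (label, distance in bp).

    Only "within" is taken from the record; the other labels are illustrative.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return ("within", 0)
    if g_end < m_start:
        return ("upstream", m_start - g_end)
    if g_start > m_end:
        return ("downstream", g_start - m_end)
    return ("overlapping", 0)  # partial overlap at either MGE boundary


print(relative_position((457589, 458713), (435748, 458713)))
```

Here dprA ends exactly at the right boundary of the genomic island (458713), so it still counts as inside the island with a distance of 0.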


Gene organization within MGE regions


Location: 435748..458713
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MWM20_RS02045 (TRECRb23_04020) gspA 435748..437217 (+) 1470 WP_000107580.1 type II secretion system protein GspA -
  MWM20_RS02050 (TRECRb23_04030) gspB 437219..437638 (+) 420 WP_000461439.1 putative general secretion pathway protein GspB -
  MWM20_RS02055 (TRECRb23_04040) rpsJ 437876..438187 (+) 312 WP_001181004.1 30S ribosomal protein S10 -
  MWM20_RS02060 (TRECRb23_04050) rplC 438220..438849 (+) 630 WP_000579833.1 50S ribosomal protein L3 -
  MWM20_RS02065 (TRECRb23_04060) rplD 438860..439465 (+) 606 WP_000424395.1 50S ribosomal protein L4 -
  MWM20_RS02070 (TRECRb23_04070) rplW 439462..439764 (+) 303 WP_000617544.1 50S ribosomal protein L23 -
  MWM20_RS02075 (TRECRb23_04080) rplB 439782..440603 (+) 822 WP_000301864.1 50S ribosomal protein L2 -
  MWM20_RS02080 (TRECRb23_04090) rpsS 440620..440898 (+) 279 WP_001138117.1 30S ribosomal protein S19 -
  MWM20_RS02085 (TRECRb23_04100) rplV 440913..441245 (+) 333 WP_000447529.1 50S ribosomal protein L22 -
  MWM20_RS02090 (TRECRb23_04110) rpsC 441263..441964 (+) 702 WP_000529945.1 30S ribosomal protein S3 -
  MWM20_RS02095 (TRECRb23_04120) rplP 441977..442387 (+) 411 WP_000941212.1 50S ribosomal protein L16 -
  MWM20_RS02100 (TRECRb23_04130) rpmC 442387..442578 (+) 192 WP_000644741.1 50S ribosomal protein L29 -
  MWM20_RS02105 (TRECRb23_04140) rpsQ 442578..442832 (+) 255 WP_000130100.1 30S ribosomal protein S17 -
  MWM20_RS02110 (TRECRb23_04150) rplN 442997..443368 (+) 372 WP_000613955.1 50S ribosomal protein L14 -
  MWM20_RS02115 (TRECRb23_04160) rplX 443379..443693 (+) 315 WP_000729186.1 50S ribosomal protein L24 -
  MWM20_RS02120 (TRECRb23_04170) rplE 443708..444247 (+) 540 WP_001096200.1 50S ribosomal protein L5 -
  MWM20_RS02125 (TRECRb23_04180) rpsN 444262..444567 (+) 306 WP_001118930.1 30S ribosomal protein S14 -
  MWM20_RS02130 (TRECRb23_04190) rpsH 444601..444993 (+) 393 WP_000062611.1 30S ribosomal protein S8 -
  MWM20_RS02135 (TRECRb23_04200) rplF 445006..445539 (+) 534 WP_000091945.1 50S ribosomal protein L6 -
  MWM20_RS02140 (TRECRb23_04210) rplR 445549..445902 (+) 354 WP_000358960.1 50S ribosomal protein L18 -
  MWM20_RS02145 (TRECRb23_04220) rpsE 445917..446420 (+) 504 WP_000940121.1 30S ribosomal protein S5 -
  MWM20_RS02150 (TRECRb23_04230) rpmD 446424..446603 (+) 180 WP_001140433.1 50S ribosomal protein L30 -
  MWM20_RS02155 (TRECRb23_04240) rplO 446607..447041 (+) 435 WP_001238917.1 50S ribosomal protein L15 -
  MWM20_RS02160 (TRECRb23_04250) secY 447049..448380 (+) 1332 WP_001118861.1 preprotein translocase subunit SecY -
  MWM20_RS02165 rpmJ 448412..448528 (+) 117 WP_000868187.1 50S ribosomal protein L36 -
  MWM20_RS02170 (TRECRb23_04260) rpsM 448675..449031 (+) 357 WP_000090775.1 30S ribosomal protein S13 -
  MWM20_RS02175 (TRECRb23_04270) rpsK 449048..449437 (+) 390 WP_001029684.1 30S ribosomal protein S11 -
  MWM20_RS02180 (TRECRb23_04280) rpsD 449471..450091 (+) 621 WP_000135224.1 30S ribosomal protein S4 -
  MWM20_RS02185 (TRECRb23_04290) rpoA 450117..451106 (+) 990 WP_001162094.1 DNA-directed RNA polymerase subunit alpha -
  MWM20_RS02190 (TRECRb23_04300) rplQ 451147..451530 (+) 384 WP_001216368.1 50S ribosomal protein L17 -
  MWM20_RS02195 (TRECRb23_04310) yhdN 451637..452005 (+) 369 WP_000266505.1 DUF1992 domain-containing protein -
  MWM20_RS02200 (TRECRb23_04320) zntR 452016..452441 (+) 426 WP_000285607.1 Zn(2+)-responsive transcriptional regulator -
  MWM20_RS02205 (TRECRb23_04330) arfA 452497..452715 (+) 219 WP_000092695.1 alternative ribosome-rescue factor ArfA -
  MWM20_RS02210 (TRECRb23_04340) mscL 452712..453125 (-) 414 WP_000022450.1 large-conductance mechanosensitive channel protein MscL -
  MWM20_RS02215 (TRECRb23_04350) trkA 453255..454631 (-) 1377 WP_000691382.1 Trk system potassium transporter TrkA -
  MWM20_RS02220 (TRECRb23_04360) rsmB 454653..455942 (-) 1290 WP_000744788.1 16S rRNA (cytosine(967)-C(5))-methyltransferase RsmB -
  MWM20_RS02225 (TRECRb23_04370) fmt 455988..456935 (-) 948 WP_000004422.1 methionyl-tRNA formyltransferase -
  MWM20_RS02230 (TRECRb23_04380) def 456950..457459 (-) 510 WP_000114984.1 peptide deformylase -
  MWM20_RS02235 (TRECRb23_04390) dprA 457589..458713 (+) 1125 WP_000228517.1 DNA-protecting protein DprA Machinery gene

Sequence


Protein


Length: 374 a.a.        Molecular weight: 40910.86 Da        Isoelectric point: 6.0395

>NTDB_id=93517 MWM20_RS02235 WP_000228517.1 457589..458713(+) (dprA) [Escherichia coli strain Rb-23]
MVDTEIWLRLMSISSLYGDDMVRIAHWLAKQSHIDAVVLQQTGLTLRQAQRFLSFPRKSIESSLCWLEQPNHHLIPADSE
FYPPQLLATTDYPGALFVEGELHALHSFQLAVVGSRAHSWYGERWGRLFCETLATRGVTITSGLARGIDGVAHKAALQVN
GVSIAVLGNGLNTIHPRRHAPLAASLLEQGGALVSEFPLDVPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCA
LEQGREVFALPGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPDAPENSFYSPDQEDVALPFPELLANVGDE
VTPVDVVAERAGQPVPEVVTQLLELELAGWIAAVPGGYVRLRRACHVRRTNVFV

Nucleotide


Length: 1125 bp

>NTDB_id=93517 MWM20_RS02235 WP_000228517.1 457589..458713(+) (dprA) [Escherichia coli strain Rb-23]
ATGGTCGATACAGAAATTTGGCTGCGTTTAATGAGCATCAGCAGCTTGTATGGCGATGATATGGTCCGTATAGCTCACTG
GCTGGCAAAACAGTCGCATATTGATGCGGTTGTATTGCAGCAAACAGGGCTTACATTGCGGCAGGCACAACGCTTTCTTT
CATTTCCGCGAAAGAGTATCGAAAGCTCACTTTGTTGGTTGGAGCAACCCAACCATCATTTAATCCCTGCGGACAGCGAA
TTTTATCCTCCTCAACTTCTGGCGACGACAGATTACCCCGGCGCACTGTTTGTTGAAGGAGAACTGCACGCGTTGCATTC
ATTTCAGCTTGCCGTAGTGGGGAGTCGGGCGCATTCATGGTATGGCGAGCGATGGGGACGGTTATTTTGCGAAACTCTGG
CGACGCGTGGAGTGACAATTACGAGTGGACTGGCGCGTGGAATCGATGGTGTGGCGCATAAAGCGGCCTTACAGGTAAAT
GGCGTTAGCATTGCTGTATTGGGGAATGGACTTAATACCATTCATCCCCGCCGCCATGCTCCACTGGCTGCCAGTCTACT
TGAGCAAGGTGGTGCTCTCGTCTCGGAATTTCCCCTCGATGTTCCACCCCTTGCTTACAATTTCCCACGAAGAAATCGCA
TTATCAGTGGTCTAAGTAAGGGTGTACTGGTGGTGGAAGCGGCTTTGCGCAGTGGTTCGCTGGTGACAGCACGTTGTGCG
CTTGAGCAGGGGCGTGAAGTTTTTGCTTTGCCAGGTCCAATAGGGAATCCGGGAAGCGAAGGGCCTCACTGGTTAATAAA
ACAAGGTGCGATTCTTGTGACGGAACCGGAAGAAATTCTGGAAAACTTGCAATTTGGATTGCACTGGTTGCCAGACGCCC
CTGAAAATTCATTTTATTCACCAGATCAGGAAGACGTGGCATTGCCATTTCCTGAGCTCCTGGCTAACGTAGGAGATGAG
GTAACACCTGTTGACGTCGTCGCTGAACGTGCCGGCCAACCTGTGCCAGAGGTAGTTACTCAACTACTCGAACTGGAGTT
AGCAGGATGGATCGCAGCTGTACCCGGCGGCTATGTCCGATTGAGGAGGGCATGCCATGTTCGACGTACTAATGTATTTG
TTTGA
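
As a sanity check, the nucleotide record should translate to the protein record. The sketch below builds the standard genetic code (NCBI translation table 1) in plain Python and translates only the first and last 18 nucleotides of the sequence above; the helper name `translate` is illustrative, and using table 1 rather than the bacterial table 11 is an assumption that does not affect these codons.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first_18 = "ATGGTCGATACAGAAATT"  # start of the nucleotide record
last_18 = "ACTAATGTATTTGTTTGA"   # end of the nucleotide record, including the stop codon

print(translate(first_18))  # MVDTEI
print(translate(last_18))   # TNVFV*
```

The two fragments match the start (MVDTEI) and end (TNVFV) of the protein sequence, with TGA as the stop codon.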


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB E2QFE2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                                    Identities (%)   Coverage (%)   Ha-value
dprA      Vibrio campbellii strain DS40M4             48.78            98.663         0.481
dprA      Vibrio cholerae strain A1552                49.355           82.888         0.409
dprA      Glaesserella parasuis strain SC1401         44.807           90.107         0.404
dprA      Haemophilus influenzae Rd KW20              44.214           90.107         0.398
dprA      Legionella pneumophila strain ERS1305867    44.242           88.235         0.39
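
The page does not define the Ha-value. In the rows listed here it is numerically consistent with the product of identity and coverage expressed as fractions, which the sketch below checks. This is an observation about these five rows, not a documented formula, and the function name is illustrative.

```python
# (organism, identities %, coverage %, listed Ha-value), copied from the table.
ROWS = [
    ("Vibrio campbellii strain DS40M4", 48.78, 98.663, 0.481),
    ("Vibrio cholerae strain A1552", 49.355, 82.888, 0.409),
    ("Glaesserella parasuis strain SC1401", 44.807, 90.107, 0.404),
    ("Haemophilus influenzae Rd KW20", 44.214, 90.107, 0.398),
    ("Legionella pneumophila strain ERS1305867", 44.242, 88.235, 0.39),
]


def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage as fractions (assumed reading of Ha-value)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


for organism, identity, coverage, listed in ROWS:
    computed = identity_times_coverage(identity, coverage)
    # Listed values are rounded to three decimals, so allow half a unit in the last place.
    agrees = abs(computed - listed) < 0.0005
    print(f"{organism}: computed {computed:.4f}, listed {listed}, agrees={agrees}")
```

Read this way, the value rewards hits that are both similar and cover most of the query, which would explain why the Vibrio campbellii hit ranks above the Vibrio cholerae hit despite a slightly lower identity.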


Multiple sequence alignment