Detailed information    

insolico Bioinformatically predicted

Overview


Name   dprA   Type   Machinery gene
Locus tag   I6J97_RS01975 Genome accession   NZ_CP069522
Coordinates   390934..392058 (+) Length   374 a.a.
NCBI ID   WP_000228517.1    Uniprot ID   E2QFE2
Organism   Escherichia coli strain FDAARGOS_1267     
Function   ssDNA binding; loading RecA onto ssDNA (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 369087..392058 390934..392058 within 0


Gene organization within MGE regions


Location: 369087..392058
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I6J97_RS01785 (I6J97_01785) gspA 369087..370556 (+) 1470 WP_000107592.1 type II secretion system protein GspA -
  I6J97_RS01790 (I6J97_01790) gspB 370558..370977 (+) 420 WP_000461443.1 putative general secretion pathway protein GspB -
  I6J97_RS01795 (I6J97_01795) rpsJ 371215..371526 (+) 312 WP_001181004.1 30S ribosomal protein S10 -
  I6J97_RS01800 (I6J97_01800) rplC 371559..372188 (+) 630 WP_000579833.1 50S ribosomal protein L3 -
  I6J97_RS01805 (I6J97_01805) rplD 372199..372804 (+) 606 WP_000424395.1 50S ribosomal protein L4 -
  I6J97_RS01810 (I6J97_01810) rplW 372801..373103 (+) 303 WP_000617544.1 50S ribosomal protein L23 -
  I6J97_RS01815 (I6J97_01815) rplB 373121..373942 (+) 822 WP_000301864.1 50S ribosomal protein L2 -
  I6J97_RS01820 (I6J97_01820) rpsS 373959..374237 (+) 279 WP_001138117.1 30S ribosomal protein S19 -
  I6J97_RS01825 (I6J97_01825) rplV 374252..374584 (+) 333 WP_000447529.1 50S ribosomal protein L22 -
  I6J97_RS01830 (I6J97_01830) rpsC 374602..375303 (+) 702 WP_000529945.1 30S ribosomal protein S3 -
  I6J97_RS01835 (I6J97_01835) rplP 375316..375726 (+) 411 WP_000941212.1 50S ribosomal protein L16 -
  I6J97_RS01840 (I6J97_01840) rpmC 375726..375917 (+) 192 WP_000644741.1 50S ribosomal protein L29 -
  I6J97_RS01845 (I6J97_01845) rpsQ 375917..376171 (+) 255 WP_000130100.1 30S ribosomal protein S17 -
  I6J97_RS01850 (I6J97_01850) rplN 376336..376707 (+) 372 WP_000613955.1 50S ribosomal protein L14 -
  I6J97_RS01855 (I6J97_01855) rplX 376718..377032 (+) 315 WP_000729186.1 50S ribosomal protein L24 -
  I6J97_RS01860 (I6J97_01860) rplE 377047..377586 (+) 540 WP_001096200.1 50S ribosomal protein L5 -
  I6J97_RS01865 (I6J97_01865) rpsN 377601..377906 (+) 306 WP_001118930.1 30S ribosomal protein S14 -
  I6J97_RS01870 (I6J97_01870) rpsH 377940..378332 (+) 393 WP_000062611.1 30S ribosomal protein S8 -
  I6J97_RS01875 (I6J97_01875) rplF 378345..378878 (+) 534 WP_000091945.1 50S ribosomal protein L6 -
  I6J97_RS01880 (I6J97_01880) rplR 378888..379241 (+) 354 WP_000358960.1 50S ribosomal protein L18 -
  I6J97_RS01885 (I6J97_01885) rpsE 379256..379759 (+) 504 WP_000940121.1 30S ribosomal protein S5 -
  I6J97_RS01890 (I6J97_01890) rpmD 379763..379942 (+) 180 WP_001140433.1 50S ribosomal protein L30 -
  I6J97_RS01895 (I6J97_01895) rplO 379946..380380 (+) 435 WP_001238917.1 50S ribosomal protein L15 -
  I6J97_RS01900 (I6J97_01900) secY 380388..381719 (+) 1332 WP_001118861.1 preprotein translocase subunit SecY -
  I6J97_RS01905 (I6J97_01905) rpmJ 381751..381867 (+) 117 WP_000868187.1 50S ribosomal protein L36 -
  I6J97_RS01910 (I6J97_01910) rpsM 382014..382370 (+) 357 WP_000090775.1 30S ribosomal protein S13 -
  I6J97_RS01915 (I6J97_01915) rpsK 382387..382776 (+) 390 WP_001029684.1 30S ribosomal protein S11 -
  I6J97_RS01920 (I6J97_01920) rpsD 382810..383430 (+) 621 WP_000135224.1 30S ribosomal protein S4 -
  I6J97_RS01925 (I6J97_01925) rpoA 383456..384445 (+) 990 WP_001162094.1 DNA-directed RNA polymerase subunit alpha -
  I6J97_RS01930 (I6J97_01930) rplQ 384486..384869 (+) 384 WP_001216368.1 50S ribosomal protein L17 -
  I6J97_RS01935 (I6J97_01935) yhdN 384976..385344 (+) 369 WP_000266504.1 DUF1992 domain-containing protein -
  I6J97_RS01940 (I6J97_01940) zntR 385355..385780 (+) 426 WP_000285610.1 Zn(2+)-responsive transcriptional regulator -
  I6J97_RS01945 (I6J97_01945) arfA 385836..386054 (+) 219 WP_000092695.1 alternative ribosome-rescue factor ArfA -
  I6J97_RS01950 (I6J97_01950) mscL 386051..386464 (-) 414 WP_000022450.1 large-conductance mechanosensitive channel protein MscL -
  I6J97_RS01955 (I6J97_01955) trkA 386594..387970 (-) 1377 WP_000691382.1 Trk system potassium transporter TrkA -
  I6J97_RS01960 (I6J97_01960) rsmB 387992..389281 (-) 1290 WP_000744767.1 16S rRNA (cytosine(967)-C(5))-methyltransferase RsmB -
  I6J97_RS01965 (I6J97_01965) fmt 389333..390280 (-) 948 WP_000004421.1 methionyl-tRNA formyltransferase -
  I6J97_RS01970 (I6J97_01970) def 390295..390804 (-) 510 WP_000114984.1 peptide deformylase -
  I6J97_RS01975 (I6J97_01975) dprA 390934..392058 (+) 1125 WP_000228517.1 DNA-protecting protein DprA Machinery gene

Sequence


Protein


Download         Length: 374 a.a.        Molecular weight: 40910.86 Da        Isoelectric Point: 6.0395

>NTDB_id=535310 I6J97_RS01975 WP_000228517.1 390934..392058(+) (dprA) [Escherichia coli strain FDAARGOS_1267]
MVDTEIWLRLMSISSLYGDDMVRIAHWLAKQSHIDAVVLQQTGLTLRQAQRFLSFPRKSIESSLCWLEQPNHHLIPADSE
FYPPQLLATTDYPGALFVEGELHALHSFQLAVVGSRAHSWYGERWGRLFCETLATRGVTITSGLARGIDGVAHKAALQVN
GVSIAVLGNGLNTIHPRRHAPLAASLLEQGGALVSEFPLDVPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCA
LEQGREVFALPGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPDAPENSFYSPDQEDVALPFPELLANVGDE
VTPVDVVAERAGQPVPEVVTQLLELELAGWIAAVPGGYVRLRRACHVRRTNVFV

Nucleotide


Download         Length: 1125 bp        

>NTDB_id=535310 I6J97_RS01975 WP_000228517.1 390934..392058(+) (dprA) [Escherichia coli strain FDAARGOS_1267]
ATGGTCGATACAGAAATTTGGCTGCGTTTAATGAGCATCAGCAGCTTGTATGGCGATGATATGGTCCGTATAGCTCACTG
GCTGGCAAAACAGTCGCATATTGATGCGGTTGTATTGCAGCAAACAGGGCTTACATTGCGGCAGGCACAACGCTTTCTTT
CATTTCCGCGAAAGAGTATCGAAAGCTCACTTTGTTGGTTGGAGCAACCCAACCATCATTTAATCCCTGCGGACAGCGAA
TTTTATCCTCCTCAACTTCTGGCGACGACAGATTACCCCGGCGCACTGTTTGTTGAAGGAGAACTGCACGCGTTGCATTC
ATTTCAGCTTGCCGTAGTGGGGAGTCGGGCGCATTCATGGTATGGCGAGCGATGGGGACGATTATTTTGCGAAACTCTGG
CGACGCGTGGAGTGACAATTACGAGTGGACTGGCGCGTGGAATCGATGGTGTGGCGCATAAAGCGGCCTTACAGGTAAAT
GGCGTTAGCATTGCTGTATTGGGGAATGGACTTAATACCATTCATCCCCGCCGCCATGCTCCACTGGCTGCCAGTCTACT
TGAGCAAGGTGGTGCTCTCGTCTCGGAATTTCCCCTCGATGTTCCACCCCTTGCTTACAATTTCCCACGAAGAAATCGCA
TTATCAGTGGTCTAAGTAAAGGTGTACTGGTGGTGGAAGCGGCTTTGCGCAGTGGTTCGCTGGTGACAGCACGTTGTGCG
CTTGAGCAGGGGCGTGAAGTTTTTGCTTTGCCAGGTCCAATAGGGAATCCGGGAAGCGAAGGGCCTCACTGGTTAATAAA
ACAAGGTGCGATTCTTGTGACGGAACCGGAAGAAATTCTGGAAAACTTGCAATTTGGATTGCACTGGTTGCCAGACGCCC
CTGAAAATTCATTTTATTCACCAGATCAGGAAGACGTGGCATTGCCATTTCCTGAGCTCCTGGCTAACGTAGGAGATGAG
GTAACACCTGTTGACGTCGTCGCTGAACGTGCCGGCCAACCTGTGCCAGAGGTAGTTACTCAACTACTCGAACTGGAGTT
AGCAGGATGGATCGCAGCTGTACCCGGCGGCTATGTCCGATTGAGGAGGGCATGCCATGTTCGACGTACTAATGTATTTG
TTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB E2QFE2

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  dprA Vibrio campbellii strain DS40M4

48.78

98.663

0.481

  dprA Vibrio cholerae strain A1552

49.355

82.888

0.409

  dprA Glaesserella parasuis strain SC1401

44.807

90.107

0.404

  dprA Haemophilus influenzae Rd KW20

44.214

90.107

0.398

  dprA Legionella pneumophila strain ERS1305867

44.242

88.235

0.39