Detailed information    

insolico Bioinformatically predicted

Overview


Name   dprA   Type   Machinery gene
Locus tag   ETT50_RS04650 Genome accession   NZ_CP035449
Coordinates   898753..899589 (-) Length   278 a.a.
NCBI ID   WP_032465023.1    Uniprot ID   -
Organism   Streptococcus pyogenes strain emm56     
Function   ssDNA binding; loading RecA onto ssDNA (predicted from homology)   
DNA processing

Genomic Context


Location: 893753..904589
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ETT50_RS04640 (ETT50_04640) trmFO 895036..896382 (-) 1347 WP_002995015.1 methylenetetrahydrofolate--tRNA-(uracil(54)- C(5))-methyltransferase (FADH(2)-oxidizing) TrmFO -
  ETT50_RS04645 (ETT50_04645) topA 896517..898646 (-) 2130 WP_063632497.1 type I DNA topoisomerase -
  ETT50_RS04650 (ETT50_04650) dprA 898753..899589 (-) 837 WP_032465023.1 DNA-processing protein DprA Machinery gene
  ETT50_RS04655 (ETT50_04655) - 899654..900445 (-) 792 WP_023611055.1 ribonuclease HII -
  ETT50_RS04660 (ETT50_04660) ylqF 900435..901283 (-) 849 WP_032465022.1 ribosome biogenesis GTPase YlqF -
  ETT50_RS04665 (ETT50_04665) - 901502..901942 (+) 441 WP_002989774.1 DUF1836 domain-containing protein -
  ETT50_RS04670 (ETT50_04670) - 901939..902589 (+) 651 WP_002984629.1 hemolysin III family protein -
  ETT50_RS04675 (ETT50_04675) - 902706..903617 (+) 912 WP_032465021.1 diacylglycerol kinase family protein -
  ETT50_RS04680 (ETT50_04680) - 903667..904104 (-) 438 WP_032465020.1 hypothetical protein -

Sequence


Protein


Download         Length: 278 a.a.        Molecular weight: 31031.01 Da        Isoelectric Point: 8.8230

>NTDB_id=342192 ETT50_RS04650 WP_032465023.1 898753..899589(-) (dprA) [Streptococcus pyogenes strain emm56]
MNHFELYKLKKAGLTNKNILNILDYQEHQEKSLSLRDMAVVSGCKHPSHFIEAYKQLDIQKLKIEFKQFPSISILDKHYP
MALKEIYNPPVLLFFQGNLDLLEKPKLAIVGFRRSSDTGVKSVRKILKELGNRFVIVSGLARGIDTSAHLACLKNGGQTI
AIIGTGLDRFYPKENRELQTFLGKNHLVLTEYGPGEEALSYHFPERNRIIAGLSRGILVVEAKNRSGSLITCQIGIEEGR
DIFAVPGNILDGKSEGCLQLIKEGATCVTSGMDILSEY

Nucleotide


Download         Length: 837 bp        

>NTDB_id=342192 ETT50_RS04650 WP_032465023.1 898753..899589(-) (dprA) [Streptococcus pyogenes strain emm56]
GTGAATCATTTTGAACTTTATAAGCTAAAGAAAGCTGGACTGACTAATAAAAACATTCTCAATATTCTTGACTACCAAGA
ACACCAGGAAAAATCGTTGTCACTTCGAGATATGGCCGTTGTTTCTGGTTGTAAGCATCCGTCTCACTTTATAGAAGCCT
ATAAGCAGTTAGATATTCAAAAATTAAAAATAGAATTTAAACAATTTCCTAGTATTTCTATCTTAGATAAGCATTACCCA
ATGGCATTGAAAGAAATATACAATCCACCTGTCCTCTTGTTTTTTCAGGGAAATTTAGACCTTCTTGAGAAACCTAAATT
AGCCATTGTCGGCTTCAGACGCTCAAGCGATACCGGAGTAAAGTCTGTCCGTAAAATTCTTAAAGAACTCGGGAACCGTT
TTGTGATTGTTAGCGGACTTGCTCGAGGTATCGACACTAGTGCTCATTTAGCCTGCCTTAAAAATGGGGGACAAACCATC
GCTATTATTGGAACAGGGTTGGATCGCTTTTACCCTAAAGAAAATAGGGAGTTGCAAACTTTCTTAGGGAAAAATCATCT
TGTGCTAACAGAATACGGTCCAGGAGAAGAAGCTTTATCTTATCACTTTCCAGAACGGAATCGTATTATCGCAGGCCTTA
GCCGAGGTATACTTGTCGTTGAAGCAAAAAATCGTTCAGGTTCCTTGATTACTTGTCAAATTGGGATAGAAGAAGGCCGA
GACATTTTTGCTGTCCCAGGAAACATTTTGGACGGGAAATCCGAAGGTTGCCTTCAGTTAATTAAAGAGGGAGCAACATG
CGTCACATCGGGCATGGATATCCTTTCAGAGTACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  dprA Streptococcus mutans UA159

67.266

100

0.673

  dprA/cilB/dalA Streptococcus pneumoniae Rx1

60.432

100

0.604

  dprA/cilB/dalA Streptococcus pneumoniae D39

60.432

100

0.604

  dprA/cilB/dalA Streptococcus pneumoniae R6

60.432

100

0.604

  dprA/cilB/dalA Streptococcus pneumoniae TIGR4

60.432

100

0.604

  dprA/cilB/dalA Streptococcus mitis SK321

60.432

100

0.604

  dprA/cilB/dalA Streptococcus mitis NCTC 12261

60.072

100

0.601

  dprA Lactococcus lactis subsp. cremoris KW2

57.194

100

0.572

  dprA Legionella pneumophila strain ERS1305867

42.437

85.612

0.363


Multiple sequence alignment