Detailed information    

insolico Bioinformatically predicted

Overview


Name   dprA   Type   Machinery gene
Locus tag   ETT49_RS04420 Genome accession   NZ_CP035450
Coordinates   858086..858922 (+) Length   278 a.a.
NCBI ID   WP_002995009.1    Uniprot ID   -
Organism   Streptococcus pyogenes strain emm93.4     
Function   ssDNA binding; loading RecA onto ssDNA (predicted from homology)   
DNA processing

Genomic Context


Location: 853086..863922
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ETT49_RS04390 (ETT49_04385) - 853634..854008 (+) 375 WP_109828759.1 hypothetical protein -
  ETT49_RS04395 (ETT49_04390) - 854058..854969 (-) 912 WP_002984631.1 diacylglycerol kinase family protein -
  ETT49_RS04400 (ETT49_04395) - 855086..855736 (-) 651 WP_002984629.1 hemolysin III family protein -
  ETT49_RS04405 (ETT49_04400) - 855733..856173 (-) 441 WP_002989774.1 DUF1836 domain-containing protein -
  ETT49_RS04410 (ETT49_04405) ylqF 856392..857240 (+) 849 WP_136119026.1 ribosome biogenesis GTPase YlqF -
  ETT49_RS04415 (ETT49_04410) - 857230..858021 (+) 792 WP_136269129.1 ribonuclease HII -
  ETT49_RS04420 (ETT49_04415) dprA 858086..858922 (+) 837 WP_002995009.1 DNA-processing protein DprA Machinery gene
  ETT49_RS04425 (ETT49_04420) topA 859029..861158 (+) 2130 WP_002984618.1 type I DNA topoisomerase -
  ETT49_RS04430 (ETT49_04425) trmFO 861293..862639 (+) 1347 WP_021733693.1 methylenetetrahydrofolate--tRNA-(uracil(54)- C(5))-methyltransferase (FADH(2)-oxidizing) TrmFO -

Sequence


Protein


Download         Length: 278 a.a.        Molecular weight: 30970.91 Da        Isoelectric Point: 8.8230

>NTDB_id=342246 ETT49_RS04420 WP_002995009.1 858086..858922(+) (dprA) [Streptococcus pyogenes strain emm93.4]
MNHFELYKLKKAGLTNKNILNILDYQEHQEKSLSLRDMAVVSGCKHPSHFIEAYKQLDIQKLKIEFKQFPSISILDKHYP
MALKEIYNPPVLLFFQGNLDLLEKPKLAIVGSRRSSDTGVKSVRKILKELGNRFVIVSGLARGIDTSAHLACLKNGGQTI
AIIGTGLDRFYPKENRELQTFLGKNHLVLTEYGPGEEALSYHFPERNRIIAGLSRGILVVEAKNRSGSLITCQIGIEEGR
DIFAVPGNILDGKSEGCLQLIKEGATCVTSGMDILSEY

Nucleotide


Download         Length: 837 bp        

>NTDB_id=342246 ETT49_RS04420 WP_002995009.1 858086..858922(+) (dprA) [Streptococcus pyogenes strain emm93.4]
GTGAATCATTTTGAACTTTATAAGCTAAAGAAAGCTGGACTGACTAATAAAAACATTCTCAATATTCTTGACTACCAAGA
ACACCAGGAAAAATCGTTGTCACTTCGAGATATGGCCGTTGTTTCTGGTTGTAAGCATCCGTCTCACTTTATAGAAGCCT
ATAAGCAGTTAGATATTCAAAAATTAAAAATAGAATTTAAACAATTTCCTAGTATTTCTATCTTAGATAAGCATTACCCA
ATGGCATTGAAAGAAATATACAATCCACCTGTCCTCTTGTTTTTTCAGGGAAATTTAGACCTTCTTGAGAAACCTAAATT
AGCCATTGTCGGCTCCAGACGCTCAAGCGATACCGGAGTAAAGTCTGTCCGTAAAATTCTTAAAGAACTCGGGAATCGTT
TTGTGATTGTTAGCGGACTTGCTCGAGGTATCGACACTAGTGCTCATTTAGCCTGCCTTAAAAATGGGGGACAAACCATC
GCTATTATTGGAACAGGGTTGGATCGCTTTTACCCTAAAGAAAATAGGGAGTTGCAAACTTTCTTAGGGAAAAATCATCT
TGTGCTAACAGAATACGGTCCAGGAGAAGAAGCTTTATCTTATCACTTTCCAGAACGGAATCGTATTATCGCAGGTCTTA
GCCGAGGTATTCTTGTCGTTGAAGCAAAAAATCGTTCAGGTTCCTTGATTACTTGTCAAATTGGGATAGAAGAAGGCCGA
GACATTTTTGCTGTCCCAGGAAACATTTTGGACGGGAAATCCGAAGGTTGCCTTCAGTTAATTAAAGAGGGAGCAACATG
CGTCACATCGGGCATGGATATCCTTTCAGAGTACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  dprA Streptococcus mutans UA159

67.626

100

0.676

  dprA/cilB/dalA Streptococcus pneumoniae Rx1

60.791

100

0.608

  dprA/cilB/dalA Streptococcus pneumoniae D39

60.791

100

0.608

  dprA/cilB/dalA Streptococcus pneumoniae R6

60.791

100

0.608

  dprA/cilB/dalA Streptococcus pneumoniae TIGR4

60.791

100

0.608

  dprA/cilB/dalA Streptococcus mitis SK321

60.791

100

0.608

  dprA/cilB/dalA Streptococcus mitis NCTC 12261

60.432

100

0.604

  dprA Lactococcus lactis subsp. cremoris KW2

57.554

100

0.576

  dprA Legionella pneumophila strain ERS1305867

42.857

85.612

0.367


Multiple sequence alignment