Detailed information    

In silico (bioinformatically predicted)

Overview


Name: prx
Type: Regulator
Locus tag: EHF40_RS04700
Genome accession: NZ_CP033907
Coordinates: 866860..867048 (+)
Length: 62 a.a.
NCBI ID: WP_011528571.1
UniProt ID: A0A660A3N3
Organism: Streptococcus pyogenes strain Duke-Large
Function: Inhibits ComR activation (predicted from homology)
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 834825..873640 866860..867048 within 0
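
The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. The following is a minimal sketch, assuming 1-based inclusive coordinates; the function name `relative_position` and the labels other than "within" are hypothetical and are not taken from VRprofile2.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples, 1-based and inclusive.
    Returns (relation, distance_bp). Only the "within" label appears on
    the page; the other labels are illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0


prx = (866860, 867048)        # gene coordinates from the summary table
prophage = (834825, 873640)   # prophage coordinates from the summary table

print(relative_position(prx, prophage))
```

Because the prx interval lies entirely inside the prophage interval, the sketch reports the gene as within the MGE at a distance of 0 bp, in agreement with the table.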


Gene organization within MGE regions


Location: 834825..873640
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EHF40_RS04500 (EHF40_04500) - 834825..835460 (+) 636 WP_002990114.1 cystathionine beta-lyase -
  EHF40_RS04505 (EHF40_04505) rnz 835475..836404 (+) 930 WP_009881223.1 ribonuclease Z -
  EHF40_RS04510 (EHF40_04510) - 836404..837168 (+) 765 WP_002984893.1 SDR family oxidoreductase -
  EHF40_RS04515 (EHF40_04515) recJ 837165..839375 (+) 2211 WP_011528535.1 single-stranded-DNA-specific exonuclease RecJ -
  EHF40_RS04520 (EHF40_04520) - 839526..840044 (+) 519 WP_002990109.1 adenine phosphoribosyltransferase -
  EHF40_RS04525 (EHF40_04525) - 840125..840808 (+) 684 WP_011184446.1 DnaD domain-containing protein -
  EHF40_RS04530 (EHF40_04530) nth 840805..841461 (+) 657 WP_002990106.1 endonuclease III -
  EHF40_RS04535 (EHF40_04535) - 841533..842219 (+) 687 WP_002990104.1 tRNA (adenine(22)-N(1))-methyltransferase TrmK -
  EHF40_RS04540 (EHF40_04540) - 842209..842997 (+) 789 WP_021299315.1 Nif3-like dinuclear metal center hexameric protein -
  EHF40_RS04545 (EHF40_04545) - 843037..844143 (+) 1107 WP_011528537.1 FAD-dependent oxidoreductase -
  EHF40_RS04550 (EHF40_04550) rfbA 844201..845070 (+) 870 WP_002992970.1 glucose-1-phosphate thymidylyltransferase RfbA -
  EHF40_RS04555 (EHF40_04555) - 845070..845663 (+) 594 WP_002990099.1 dTDP-4-dehydrorhamnose 3,5-epimerase family protein -
  EHF40_RS04560 (EHF40_04560) rfbB 845907..846947 (+) 1041 WP_002984881.1 dTDP-glucose 4,6-dehydratase -
  EHF40_RS04565 (EHF40_04565) - 847030..847965 (-) 936 WP_060388510.1 site-specific integrase -
  EHF40_RS04570 (EHF40_04570) - 848112..848432 (+) 321 WP_002995960.1 VRR-NUC domain-containing protein -
  EHF40_RS04575 (EHF40_04575) - 848416..848772 (+) 357 WP_011018138.1 hypothetical protein -
  EHF40_RS10160 (EHF40_04580) - 848769..849020 (+) 252 WP_011528549.1 hypothetical protein -
  EHF40_RS04585 (EHF40_04585) - 849029..849238 (+) 210 Protein_835 DUF4355 domain-containing protein -
  EHF40_RS04590 (EHF40_04590) - 849257..850147 (+) 891 WP_011528556.1 hypothetical protein -
  EHF40_RS04595 (EHF40_04595) - 850159..850452 (+) 294 WP_011528557.1 HeH/LEM domain-containing protein -
  EHF40_RS04600 (EHF40_04600) - 850466..850810 (+) 345 WP_060388512.1 hypothetical protein -
  EHF40_RS04605 (EHF40_04605) - 850807..851118 (+) 312 WP_011528559.1 hypothetical protein -
  EHF40_RS04610 (EHF40_04610) - 851115..851510 (+) 396 WP_011528560.1 hypothetical protein -
  EHF40_RS04615 (EHF40_04615) - 851512..851922 (+) 411 WP_011528561.1 DUF5072 family protein -
  EHF40_RS04620 (EHF40_04620) - 851934..852191 (+) 258 WP_121158086.1 phage major tail protein, TP901-1 family -
  EHF40_RS04625 (EHF40_04625) - 852204..852494 (+) 291 WP_060388513.1 hypothetical protein -
  EHF40_RS04630 (EHF40_04630) - 852451..854028 (+) 1578 WP_231494234.1 phage tail protein -
  EHF40_RS04635 (EHF40_04635) - 854029..855513 (+) 1485 WP_011528566.1 distal tail protein Dit -
  EHF40_RS04640 (EHF40_04640) - 855514..858963 (+) 3450 WP_011528567.1 glucosaminidase domain-containing protein -
  EHF40_RS04645 (EHF40_04645) - 858968..860830 (+) 1863 WP_011528568.1 DUF859 family phage minor structural protein -
  EHF40_RS04650 (EHF40_04650) - 860841..861188 (+) 348 WP_009880247.1 DUF1366 domain-containing protein -
  EHF40_RS10125 - 861202..861324 (+) 123 WP_015055953.1 hypothetical protein -
  EHF40_RS04655 (EHF40_04655) - 861338..861661 (+) 324 WP_015055952.1 hypothetical protein -
  EHF40_RS04660 (EHF40_04660) - 861661..861993 (+) 333 WP_011054798.1 phage holin -
  EHF40_RS04665 (EHF40_04665) - 861995..862759 (+) 765 WP_011054797.1 CHAP domain-containing protein -
  EHF40_RS04670 (EHF40_04670) - 862771..863373 (+) 603 WP_011054796.1 hypothetical protein -
  EHF40_RS04675 (EHF40_04675) - 863384..864157 (+) 774 WP_011528569.1 hypothetical protein -
  EHF40_RS04680 (EHF40_04680) - 864167..864388 (+) 222 WP_009880241.1 hypothetical protein -
  EHF40_RS04685 (EHF40_04685) - 864388..865047 (+) 660 WP_011528570.1 hypothetical protein -
  EHF40_RS04690 (EHF40_04690) - 865116..865550 (-) 435 WP_011017966.1 hypothetical protein -
  EHF40_RS04695 (EHF40_04695) sda3 865822..866622 (-) 801 WP_011285611.1 streptodornase Sda3 -
  EHF40_RS04700 (EHF40_04700) prx 866860..867048 (+) 189 WP_011528571.1 hypothetical protein Regulator
  EHF40_RS04705 (EHF40_04705) - 867456..867932 (+) 477 WP_002984880.1 8-oxo-dGTP diphosphatase -
  EHF40_RS04710 (EHF40_04710) - 867990..869171 (+) 1182 WP_002984879.1 AI-2E family transporter -
  EHF40_RS04715 (EHF40_04715) - 869161..870408 (+) 1248 WP_002984878.1 tetratricopeptide repeat protein -
  EHF40_RS04720 (EHF40_04720) fbp54 870467..872119 (-) 1653 WP_021299312.1 Rqc2 family fibronectin-binding protein Fbp54 -
  EHF40_RS04725 (EHF40_04725) trpX 872473..873471 (+) 999 WP_011184452.1 tryptophan ABC transporter substrate-binding protein -

Sequence


Protein


Length: 62 a.a.; Molecular weight: 7224.11 Da; Isoelectric point: 4.0606

>NTDB_id=326917 EHF40_RS04700 WP_011528571.1 866860..867048(+) (prx) [Streptococcus pyogenes strain Duke-Large]
MLTYDEFKQAIDREYITGDTVMIVRKNGQIFDYVLPHEEVRNGEVVTIERISDVMAELSESE

Nucleotide


Length: 189 bp

>NTDB_id=326917 EHF40_RS04700 WP_011528571.1 866860..867048(+) (prx) [Streptococcus pyogenes strain Duke-Large]
ATGCTAACATATGACGAGTTTAAGCAAGCAATCGACCGTGAATATATCACAGGAGACACAGTTATGATCGTGCGCAAGAA
CGGACAGATTTTTGATTATGTGTTGCCGCATGAAGAAGTGAGAAATGGGGAAGTTGTGACAATCGAGCGGATATCAGATG
TTATGGCAGAACTTTCTGAGTCTGAATAA
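
The nucleotide and protein records can be cross-checked against each other and against the coordinates. The sketch below is a plain-Python illustration rather than the database's own pipeline. It assumes the standard codon assignments, which bacterial translation table 11 shares with table 1 for elongation. The names `CODON_TABLE`, `translate`, `CDS` and `PROTEIN` are illustrative.

```python
# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds):
    """Translate a coding sequence and drop a single terminal stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# Nucleotide record, copied from the FASTA block above.
CDS = (
    "ATGCTAACATATGACGAGTTTAAGCAAGCAATCGACCGTGAATATATCACAGGAGACACAGTTATGATCGTGCGCAAGAA"
    "CGGACAGATTTTTGATTATGTGTTGCCGCATGAAGAAGTGAGAAATGGGGAAGTTGTGACAATCGAGCGGATATCAGATG"
    "TTATGGCAGAACTTTCTGAGTCTGAATAA"
)
# Protein record, copied from the protein FASTA block.
PROTEIN = "MLTYDEFKQAIDREYITGDTVMIVRKNGQIFDYVLPHEEVRNGEVVTIERISDVMAELSESE"

span = 867048 - 866860 + 1  # inclusive length implied by the coordinates
print(span, len(CDS), len(translate(CDS)), translate(CDS) == PROTEIN)
```

The coordinate span, the nucleotide length and the protein length should agree: 189 bp corresponds to 62 codons plus one stop codon.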

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A660A3N3

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  prx Streptococcus pyogenes MGAS315 76.667 96.774 0.742
  prx Streptococcus pyogenes MGAS315 76.271 95.161 0.726
  prx Streptococcus pyogenes MGAS8232 76.271 95.161 0.726
  prx Streptococcus pyogenes MGAS315 71.667 96.774 0.694
  prx Streptococcus pyogenes MGAS315 90.698 69.355 0.629
  prx Streptococcus pyogenes MGAS315 85.366 66.129 0.565
  prx Streptococcus pyogenes MGAS315 76.19 67.742 0.516
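
The page does not define the Ha-value, but the listed numbers are consistent with the product of identity and coverage, each expressed as a fraction. The sketch below checks that relationship for the hits above. The formula is an inference from these rows rather than a documented definition, and the names `ha_value` and `HITS` are hypothetical.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed relationship: identity fraction times coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (organism, identities %, coverage %, listed Ha-value) from the hits above.
HITS = [
    ("Streptococcus pyogenes MGAS315", 76.667, 96.774, 0.742),
    ("Streptococcus pyogenes MGAS315", 76.271, 95.161, 0.726),
    ("Streptococcus pyogenes MGAS8232", 76.271, 95.161, 0.726),
    ("Streptococcus pyogenes MGAS315", 71.667, 96.774, 0.694),
    ("Streptococcus pyogenes MGAS315", 90.698, 69.355, 0.629),
    ("Streptococcus pyogenes MGAS315", 85.366, 66.129, 0.565),
    ("Streptococcus pyogenes MGAS315", 76.19, 67.742, 0.516),
]

for organism, identity, coverage, listed in HITS:
    computed = ha_value(identity, coverage)
    # Listed values carry three decimals, so allow a tolerance of 0.001.
    agrees = abs(computed - listed) < 1e-3
    print(f"{organism}: computed {computed:.4f}, listed {listed}, agrees={agrees}")
```

Under this reading, a hit with high identity over a short aligned region (for example 90.698% identity at 69.355% coverage) ranks below a hit with lower identity over nearly the full length.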


Multiple sequence alignment