Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Regulator
Locus tag   EQH43_RS10750 Genome accession   NZ_CP035237
Coordinates   2106807..2107559 (-) Length   250 a.a.
NCBI ID   WP_000866065.1    Uniprot ID   -
Organism   Streptococcus pneumoniae strain TVO_Taiwan19F-14     
Function   activate transcription of early competence genes (predicted from homology)   
Competence regulation

Genomic Context


Location: 2101807..2112559
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH43_RS10720 (EQH43_11265) - 2102186..2104738 (+) 2553 WP_000834680.1 YfhO family protein -
  EQH43_RS11020 (EQH43_11270) - 2104768..2105915 (-) 1148 Protein_2098 YhgE/Pip family protein -
  EQH43_RS10735 (EQH43_11275) - 2106022..2106564 (+) 543 WP_001158266.1 TetR/AcrR family transcriptional regulator -
  EQH43_RS10750 (EQH43_11290) comE 2106807..2107559 (-) 753 WP_000866065.1 competence system response regulator transcription factor ComE Regulator
  EQH43_RS10755 (EQH43_11295) comD/comD1 2107556..2108881 (-) 1326 WP_000362880.1 competence system sensor histidine kinase ComD Regulator
  EQH43_RS10760 (EQH43_11300) comC/comC1 2108902..2109027 (-) 126 WP_000799689.1 competence-stimulating peptide ComC Regulator
  EQH43_RS10770 (EQH43_11310) rlmH 2109309..2109788 (-) 480 WP_000695929.1 23S rRNA (pseudouridine(1915)-N(3))-methyltransferase RlmH -
  EQH43_RS10775 (EQH43_11315) htrA 2109971..2111152 (+) 1182 WP_000681597.1 S1C family serine protease Regulator
  EQH43_RS10780 (EQH43_11320) spo0J 2111210..2111968 (+) 759 WP_000410376.1 ParB/RepB/Spo0J family partition protein Regulator

Sequence


Protein


Download         Length: 250 a.a.        Molecular weight: 29958.31 Da        Isoelectric Point: 6.5073

>NTDB_id=336863 EQH43_RS10750 WP_000866065.1 2106807..2107559(-) (comE) [Streptococcus pneumoniae strain TVO_Taiwan19F-14]
MKVLILEDVIEHQVRLERILDEISKESNIPISYKTTGKVREFEEYIENDEVNQLYFLDIDIHGIEKKGFEVAQLIRHYNP
YAIIVFITSRSEFATLTYKYQVSALDFVDKDINDEMFKKRIEQNIFYTKSMLLENEDVVDYFDYNYKGNDLKIPYHDILY
IETTGVSHKLRIIGKNFAKEFYGTMTDIQEKDKHTQRFYSPHKSFLVNIGNIREIDRKNLEIVFYEDHRCPISRLKIRKL
KDILEKKSQK

Nucleotide


Download         Length: 753 bp        

>NTDB_id=336863 EQH43_RS10750 WP_000866065.1 2106807..2107559(-) (comE) [Streptococcus pneumoniae strain TVO_Taiwan19F-14]
ATGAAAGTTTTAATTTTAGAAGATGTTATTGAACATCAAGTGAGACTAGAGAGAATATTGGATGAAATTTCGAAAGAATC
GAATATTCCAATATCATACAAGACAACGGGAAAAGTCCGTGAATTTGAAGAATACATTGAAAATGATGAAGTAAATCAGC
TTTATTTCCTAGATATCGATATTCATGGAATTGAGAAAAAGGGATTTGAAGTGGCTCAGCTCATTCGTCATTACAATCCT
TACGCTATTATCGTCTTTATCACTAGTCGATCAGAGTTTGCGACTCTAACCTATAAATACCAGGTATCAGCCCTAGATTT
TGTTGATAAGGATATCAATGATGAGATGTTTAAGAAGAGAATTGAGCAAAATATCTTCTACACGAAGAGTATGTTACTTG
AAAATGAAGATGTTGTAGATTATTTCGACTACAATTACAAGGGAAATGATTTAAAAATTCCTTACCATGATATTTTGTAT
ATTGAAACAACAGGGGTCTCTCATAAATTGCGCATTATTGGTAAGAATTTTGCAAAAGAGTTTTATGGTACCATGACAGA
TATTCAGGAAAAGGACAAACATACTCAGCGATTTTATTCTCCTCACAAGTCATTTTTGGTAAATATAGGCAATATCAGAG
AAATTGATCGAAAAAACTTAGAAATTGTTTTCTATGAAGACCATCGTTGTCCTATTTCAAGATTAAAAATTAGAAAATTA
AAAGATATTTTAGAGAAAAAATCTCAAAAGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Streptococcus pneumoniae Rx1

100

100

1

  comE Streptococcus pneumoniae D39

100

100

1

  comE Streptococcus pneumoniae R6

100

100

1

  comE Streptococcus pneumoniae TIGR4

100

100

1

  comE Streptococcus mitis SK321

99.2

100

0.992

  comE Streptococcus mitis NCTC 12261

98.4

100

0.984

  comE Streptococcus infantis strain Atu-4

91.2

100

0.912

  comE/comE2 Streptococcus gordonii strain NCTC7865

62.8

100

0.628

  comE/comE1 Streptococcus gordonii str. Challis substr. CH1

62.8

100

0.628

  comE/blpR Streptococcus mutans UA159

41.296

98.8

0.408


Multiple sequence alignment