Detailed information    

In silico: bioinformatically predicted

Overview


Name: comE
Type: Regulator
Locus tag: L3475_RS10860
Genome accession: NZ_CP091451
Coordinates: 2097771..2098523 (-)
Length: 250 a.a.
NCBI ID: WP_000866065.1
UniProt ID: -
Organism: Streptococcus pneumoniae strain 3238/09
Function: activates transcription of early competence genes (predicted from homology)
Competence regulation

Genomic Context


Location: 2092771..2103523
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
L3475_RS10830 (L3475_10750) | - | 2093150..2095702 (+) | 2553 | WP_000834680.1 | YfhO family protein | -
L3475_RS11030 | - | 2095732..2096806 (-) | 1075 | Protein_2102 | YhgE/Pip family protein | -
L3475_RS10845 (L3475_10765) | - | 2096986..2097528 (+) | 543 | WP_001158266.1 | TetR/AcrR family transcriptional regulator | -
L3475_RS10860 (L3475_10780) | comE | 2097771..2098523 (-) | 753 | WP_000866065.1 | competence system response regulator transcription factor ComE | Regulator
L3475_RS10865 (L3475_10785) | comD/comD1 | 2098520..2099845 (-) | 1326 | WP_000362880.1 | competence system sensor histidine kinase ComD | Regulator
L3475_RS10870 (L3475_10790) | comC/comC1 | 2099866..2099991 (-) | 126 | WP_000799689.1 | competence-stimulating peptide ComC | Regulator
L3475_RS10880 (L3475_10800) | rlmH | 2100273..2100752 (-) | 480 | WP_000695929.1 | 23S rRNA (pseudouridine(1915)-N(3))-methyltransferase RlmH | -
L3475_RS10885 (L3475_10805) | htrA | 2100935..2102116 (+) | 1182 | WP_000681597.1 | S1C family serine protease | Regulator
L3475_RS10890 (L3475_10810) | spo0J | 2102174..2102932 (+) | 759 | WP_000410376.1 | ParB/RepB/Spo0J family partition protein | Regulator
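
The coordinates listed for comE and its neighbours can be cross-checked with simple interval arithmetic. The sketch below assumes 1-based, inclusive coordinates (which agrees with the sizes listed for each gene) and infers a 5000 bp flank on each side of comE from the Location line. The helper names `feature_length` and `overlap_bp` are hypothetical and not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1


def overlap_bp(a: tuple, b: tuple) -> int:
    """Number of bases shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Coordinates copied from the record.
COM_E = (2097771, 2098523)
COM_D = (2098520, 2099845)

# Assumption: the Genomic Context window is the gene plus a fixed flank.
FLANK = 5000

come_bp = feature_length(*COM_E)
# The coding sequence includes one stop codon, which is not translated.
come_aa = come_bp // 3 - 1
window = (COM_E[0] - FLANK, COM_E[1] + FLANK)
shared = overlap_bp(COM_E, COM_D)

print(f"comE: {come_bp} bp, {come_aa} a.a.")
print(f"context window: {window[0]}..{window[1]}")
print(f"comE/comD overlap: {shared} bp")
```

Under these assumptions the script reports 753 bp and 250 a.a. for comE, a context window of 2092771..2103523, and a 4 bp overlap between the comE and comD coordinates, all of which match the values in the record.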

Sequence


Protein


Length: 250 a.a.        Molecular weight: 29958.31 Da        Isoelectric Point: 6.5073

>NTDB_id=651370 L3475_RS10860 WP_000866065.1 2097771..2098523(-) (comE) [Streptococcus pneumoniae strain 3238/09]
MKVLILEDVIEHQVRLERILDEISKESNIPISYKTTGKVREFEEYIENDEVNQLYFLDIDIHGIEKKGFEVAQLIRHYNP
YAIIVFITSRSEFATLTYKYQVSALDFVDKDINDEMFKKRIEQNIFYTKSMLLENEDVVDYFDYNYKGNDLKIPYHDILY
IETTGVSHKLRIIGKNFAKEFYGTMTDIQEKDKHTQRFYSPHKSFLVNIGNIREIDRKNLEIVFYEDHRCPISRLKIRKL
KDILEKKSQK

Nucleotide


Length: 753 bp

>NTDB_id=651370 L3475_RS10860 WP_000866065.1 2097771..2098523(-) (comE) [Streptococcus pneumoniae strain 3238/09]
ATGAAAGTTTTAATTTTAGAAGATGTTATTGAACATCAAGTGAGACTAGAGAGAATATTGGATGAAATTTCGAAAGAATC
GAATATTCCAATATCATACAAGACAACGGGAAAAGTCCGTGAATTTGAAGAATACATTGAAAATGATGAAGTAAATCAGC
TTTATTTCCTAGATATCGATATTCATGGAATTGAGAAAAAGGGATTTGAAGTGGCTCAGCTCATTCGTCATTACAATCCT
TACGCTATTATCGTCTTTATCACTAGTCGATCAGAGTTTGCGACTCTAACCTATAAATACCAGGTATCAGCCCTAGATTT
TGTTGATAAGGATATCAATGATGAGATGTTTAAGAAGAGAATTGAGCAAAATATCTTCTACACGAAGAGTATGTTACTTG
AAAATGAAGATGTTGTAGATTATTTCGACTACAATTACAAGGGAAATGATTTAAAAATTCCTTACCATGATATTTTGTAT
ATTGAAACAACAGGGGTCTCTCATAAATTGCGCATTATTGGTAAGAATTTTGCAAAAGAGTTTTATGGTACCATGACAGA
TATTCAGGAAAAGGACAAACATACTCAGCGATTTTATTCTCCTCACAAGTCATTTTTGGTAAATATAGGCAATATCAGAG
AAATTGATCGAAAAAACTTAGAAATTGTTTTCTATGAAGACCATCGTTGTCCTATTTCAAGATTAAAAATTAGAAAATTA
AAAGATATTTTAGAGAAAAAATCTCAAAAGTGA
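
The nucleotide and protein records above can be checked against each other by translating the coding sequence. The following is a minimal sketch that builds the standard genetic code from its conventional TCAG-ordered amino acid string and translates until the first stop codon. The function name `translate` is hypothetical, and only the first and last stretches of the sequence are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 and last 33 nucleotides of the comE record above.
head = "ATGAAAGTTTTAATTTTAGAAGATGTTATTGAACATCAAGTG"
tail = "AAAGATATTTTAGAGAAAAAATCTCAAAAGTGA"

print(translate(head))
print(translate(tail))
```

The two fragments translate to `MKVLILEDVIEHQV` and `KDILEKKSQK`, matching the start and end of the protein sequence listed above. Passing the full 753 bp sequence to the same function should reproduce the 250-residue protein.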


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comE | Streptococcus pneumoniae Rx1 | 100 | 100 | 1
comE | Streptococcus pneumoniae D39 | 100 | 100 | 1
comE | Streptococcus pneumoniae R6 | 100 | 100 | 1
comE | Streptococcus pneumoniae TIGR4 | 100 | 100 | 1
comE | Streptococcus mitis SK321 | 99.2 | 100 | 0.992
comE | Streptococcus mitis NCTC 12261 | 98.4 | 100 | 0.984
comE | Streptococcus infantis strain Atu-4 | 91.2 | 100 | 0.912
comE/comE2 | Streptococcus gordonii strain NCTC7865 | 62.8 | 100 | 0.628
comE/comE1 | Streptococcus gordonii str. Challis substr. CH1 | 62.8 | 100 | 0.628
comE/blpR | Streptococcus mutans UA159 | 41.296 | 98.8 | 0.408
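
The page does not define the Ha-value. The listed values are, however, consistent with the product of the identity and coverage fractions, rounded to three decimals. The sketch below checks that assumed definition against three rows of the table; the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (organism, identities %, coverage %, Ha-value as listed in the table)
rows = [
    ("Streptococcus mitis SK321", 99.2, 100, 0.992),
    ("Streptococcus gordonii strain NCTC7865", 62.8, 100, 0.628),
    ("Streptococcus mutans UA159", 41.296, 98.8, 0.408),
]

for organism, identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    status = "ok" if abs(computed - listed) < 1e-9 else "mismatch"
    print(f"{organism}: computed {computed}, listed {listed} ({status})")
```

The Streptococcus mutans row is the informative one, since it is the only hit with coverage below 100%: 41.296% identity over 98.8% coverage gives 0.408, as listed.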