Detailed information    

in silico   Bioinformatically predicted

Overview


Name   comE/comE2
Type   Regulator
Locus tag   DQM54_RS10770
Genome accession   NZ_LS483375
Coordinates   2149310..2150077 (-)
Length   255 a.a.
NCBI ID   WP_008808031.1
Uniprot ID   P75031
Organism   Streptococcus gordonii strain NCTC3165
Function   activate transcription of early competence genes (predicted from homology)
Competence regulation

Genomic Context


Location: 2144310..2155077
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  DQM54_RS10750 (NCTC3165_02140)   pth   2147231..2147800 (-)   570   WP_111724267.1   aminoacyl-tRNA hydrolase   -
  DQM54_RS10755 (NCTC3165_02141)   ychF   2147873..2148988 (-)   1116   WP_008808030.1   redox-regulated ATPase YchF   -
  DQM54_RS10770 (NCTC3165_02144)   comE/comE2   2149310..2150077 (-)   768   WP_008808031.1   competence system response regulator transcription factor ComE   Regulator
  DQM54_RS10775 (NCTC3165_02145)   comD/comD1   2150074..2151432 (-)   1359   WP_111724324.1   competence system sensor histidine kinase ComD   Regulator
  DQM54_RS10780 (NCTC3165_02146)   comC/comC1   2151448..2151612 (-)   165   WP_080998577.1   bacteriocin   Regulator
  DQM54_RS10790 (NCTC3165_02148)   rlmH   2151898..2152377 (-)   480   WP_008808034.1   23S rRNA (pseudouridine(1915)-N(3))-methyltransferase RlmH   -
  DQM54_RS10795 (NCTC3165_02149)   htrA   2152571..2153764 (+)   1194   WP_111724268.1   S1C family serine protease   Regulator
  DQM54_RS10800 (NCTC3165_02150)   spo0J   2153830..2154594 (+)   765   WP_111724269.1   ParB/RepB/Spo0J family partition protein   Regulator
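
The coordinates in this record can be cross-checked with simple arithmetic. The sketch below assumes the ranges are 1-based and inclusive (the usual GenBank convention, which matches the sizes listed in the table); the variable and function names are illustrative only and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


# Coordinates copied from the record above.
come = (2149310, 2150077)    # comE/comE2, minus strand
comd = (2150074, 2151432)    # comD/comD1, minus strand
window = (2144310, 2155077)  # "Location" shown for the genomic context

gene_bp = span_length(*come)        # size of the comE coding region
protein_aa = gene_bp // 3 - 1       # codons minus the stop codon
comd_bp = span_length(*comd)

# Flanking sequence shown on each side of comE in the context window.
flank_left = come[0] - window[0]
flank_right = window[1] - come[1]

# Number of bases shared by the comE and comD ranges (0 if they do not overlap).
overlap_bp = max(0, min(come[1], comd[1]) - max(come[0], comd[0]) + 1)

print(gene_bp, protein_aa, comd_bp, flank_left, flank_right, overlap_bp)
```

Under that assumption the comE range spans 768 bp (255 residues plus a stop codon), the context window extends 5000 bp on either side of the gene, and the comE and comD ranges share 4 bp.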

Sequence


Protein


Length: 255 a.a.   Molecular weight: 30225.48 Da   Isoelectric Point: 7.0651

>NTDB_id=1139163 DQM54_RS10770 WP_008808031.1 2149310..2150077(-) (comE/comE2) [Streptococcus gordonii strain NCTC3165]
MKVLVLEDTIEHQVRIENVFEEISRELNLEIKAKVTGKIHEFKEYVESDEVNQLYFLDIDIKGEEQKGLEMAQFIRQHNP
YAIIVFVTSHSEFATLTFKYKVSALDFIDKDINDNSFKKRVKDCIVYTRNNLIENTDIVDFFEYSFRGNEIRIPFRDILY
IETTGSPYKLRVVGKNFFKEFYGTIAEIQEQDKELGYFFSPHKSYLSNISNISGYDKKTKDVLYYDGHRSPISRLRVRYL
KEILKAKSKKEENKK

Nucleotide


Length: 768 bp

>NTDB_id=1139163 DQM54_RS10770 WP_008808031.1 2149310..2150077(-) (comE/comE2) [Streptococcus gordonii strain NCTC3165]
ATGAAAGTATTAGTGCTAGAGGATACTATTGAACATCAAGTGAGAATTGAAAATGTCTTCGAAGAAATTTCTCGCGAACT
AAATCTTGAAATTAAAGCAAAAGTAACAGGAAAAATTCATGAATTTAAAGAATATGTTGAGTCAGACGAGGTGAATCAAC
TCTATTTCTTAGATATAGATATCAAAGGAGAAGAACAAAAAGGTCTAGAAATGGCGCAATTTATCCGCCAACATAATCCT
TATGCTATCATTGTTTTTGTTACGAGTCATTCAGAATTTGCGACCTTAACTTTTAAGTATAAAGTATCAGCGTTAGATTT
TATAGATAAAGACATAAATGATAATTCTTTTAAGAAAAGAGTTAAAGATTGCATCGTTTATACACGAAATAATTTAATTG
AAAATACAGATATTGTTGACTTTTTTGAGTATAGTTTTAGAGGAAACGAAATTCGAATTCCTTTTAGAGATATTCTATAT
ATTGAGACGACAGGAAGCCCTTATAAATTAAGAGTTGTTGGAAAAAATTTCTTTAAGGAATTTTATGGGACAATAGCAGA
AATTCAAGAACAAGATAAAGAACTTGGATATTTTTTCTCACCACATAAGTCATACCTATCTAATATTAGCAACATTAGTG
GATACGACAAAAAGACAAAAGATGTTTTGTATTATGATGGTCATCGTTCACCAATTTCTCGATTACGTGTCAGATACTTG
AAAGAAATACTGAAAGCCAAAAGTAAAAAAGAAGAAAATAAAAAATAA
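
As a quick consistency check between the nucleotide and protein records, the coding sequence can be translated codon by codon. The following is a minimal sketch using only the Python standard library; it builds the standard genetic code table (elongation codons are the same in the bacterial table 11) and translates two short fragments copied from the sequence above. The `translate` helper is written for this illustration and is not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 18 nt of the comE coding sequence shown above.
n_terminus = translate("ATGAAAGTATTAGTGCTAGAGGATACTATT")
c_terminus = translate("GAAGAAAATAAAAAATAA")
print(n_terminus, c_terminus)
```

The two fragments translate to MKVLVLEDTI and EENKK, matching the start and end of the protein sequence listed above. Passing the full 768 nt sequence to the same function should reproduce the whole protein record.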


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB P75031

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



[Figure: visualization of predicted probability]


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comE/comE2   Streptococcus gordonii strain NCTC7865   100   100   1
  comE/comE1   Streptococcus gordonii str. Challis substr. CH1   100   100   1
  comE   Streptococcus infantis strain Atu-4   64   98.039   0.627
  comE   Streptococcus mitis NCTC 12261   64   98.039   0.627
  comE   Streptococcus mitis SK321   63.2   98.039   0.62
  comE   Streptococcus pneumoniae Rx1   62.8   98.039   0.616
  comE   Streptococcus pneumoniae D39   62.8   98.039   0.616
  comE   Streptococcus pneumoniae R6   62.8   98.039   0.616
  comE   Streptococcus pneumoniae TIGR4   62.8   98.039   0.616
  comE/blpR   Streptococcus mutans UA159   37.551   96.078   0.361
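
The page does not define the Ha-value. The listed numbers are, however, consistent with the product of fractional identity and fractional coverage, rounded to three decimals. The sketch below checks that assumed relationship against the rows above; the formula is an inference from the data shown here, not a documented definition, and `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


# (identity %, coverage %, listed Ha-value), one entry per distinct row above.
hits = [
    (100.0, 100.0, 1.0),       # S. gordonii NCTC7865 and Challis CH1
    (64.0, 98.039, 0.627),     # S. infantis Atu-4, S. mitis NCTC 12261
    (63.2, 98.039, 0.62),      # S. mitis SK321
    (62.8, 98.039, 0.616),     # S. pneumoniae Rx1, D39, R6, TIGR4
    (37.551, 96.078, 0.361),   # S. mutans UA159 (comE/blpR)
]

# True only if every listed value matches the assumed formula.
all_consistent = all(
    abs(ha_value(identity, coverage) - listed) < 1e-9
    for identity, coverage, listed in hits
)
print(all_consistent)
```

If the relationship holds, a hit is ranked highly only when it is both similar and aligned over most of the query, which is why the S. mutans comE/blpR entry scores well below the mitis-group homologs despite comparable coverage.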


Multiple sequence alignment