Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Regulator
Locus tag   NBW44_RS07635 Genome accession   NZ_OW724079
Coordinates   1504127..1504879 (+) Length   250 a.a.
NCBI ID   WP_250307841.1    Uniprot ID   -
Organism   Streptococcus sp. Marseille-Q3533     
Function   activate transcription of early competence genes (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1489387..1529522 1504127..1504879 within 0


Gene organization within MGE regions


Location: 1489387..1529522
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NBW44_RS07545 - 1489387..1489581 (-) 195 WP_006150549.1 DUF951 family protein -
  NBW44_RS07550 dnaN 1489648..1490784 (-) 1137 WP_250307828.1 DNA polymerase III subunit beta -
  NBW44_RS07555 dnaA 1490943..1492301 (-) 1359 WP_250307993.1 chromosomal replication initiator protein DnaA -
  NBW44_RS07560 spo0J 1492514..1493272 (-) 759 WP_250307829.1 ParB/RepB/Spo0J family partition protein Regulator
  NBW44_RS07565 htrA 1493329..1494513 (-) 1185 WP_250307830.1 trypsin-like peptidase domain-containing protein Regulator
  NBW44_RS07570 rlmH 1494698..1495177 (+) 480 WP_250307831.1 23S rRNA (pseudouridine(1915)-N(3))-methyltransferase RlmH -
  NBW44_RS07580 comD 1495461..1496561 (+) 1101 WP_284346641.1 histidine kinase Regulator
  NBW44_RS07585 - 1496601..1497764 (-) 1164 WP_250307832.1 tyrosine-type recombinase/integrase -
  NBW44_RS07590 - 1497820..1498359 (-) 540 WP_250307833.1 helix-turn-helix transcriptional regulator -
  NBW44_RS07595 - 1498522..1498812 (+) 291 WP_250307834.1 hypothetical protein -
  NBW44_RS07600 - 1498963..1500330 (+) 1368 WP_250307835.1 virulence-associated E family protein -
  NBW44_RS07605 - 1501214..1501630 (-) 417 WP_250307836.1 hypothetical protein -
  NBW44_RS07610 - 1501639..1502394 (-) 756 WP_250307837.1 hypothetical protein -
  NBW44_RS07615 - 1502406..1502678 (-) 273 WP_250307838.1 hypothetical protein -
  NBW44_RS07620 - 1502696..1503157 (-) 462 WP_250307839.1 hypothetical protein -
  NBW44_RS07625 - 1503331..1503657 (+) 327 WP_250307840.1 helix-turn-helix transcriptional regulator -
  NBW44_RS07630 - 1503825..1504130 (+) 306 Protein_1469 GHKL domain-containing protein -
  NBW44_RS07635 comE 1504127..1504879 (+) 753 WP_250307841.1 competence system response regulator transcription factor ComE Regulator
  NBW44_RS07650 - 1505121..1505663 (-) 543 WP_247922302.1 TetR/AcrR family transcriptional regulator -
  NBW44_RS07655 - 1505792..1508446 (+) 2655 WP_250307842.1 YhgE/Pip domain-containing protein -
  NBW44_RS07660 - 1508640..1510262 (-) 1623 WP_250307843.1 ATP-binding cassette domain-containing protein -
  NBW44_RS07665 trpS 1510458..1511483 (+) 1026 WP_250307844.1 tryptophan--tRNA ligase -
  NBW44_RS07670 guaB 1511646..1513124 (+) 1479 WP_250307845.1 IMP dehydrogenase -
  NBW44_RS07675 recF 1513189..1514280 (-) 1092 WP_250307846.1 DNA replication/repair protein RecF -
  NBW44_RS07680 yaaA 1514283..1514642 (-) 360 WP_250307847.1 S4 domain-containing protein YaaA -
  NBW44_RS07685 - 1514805..1516055 (+) 1251 WP_250307848.1 pitrilysin family protein -
  NBW44_RS07690 - 1516052..1517335 (+) 1284 WP_250307849.1 pitrilysin family protein -
  NBW44_RS07695 rodZ 1517375..1518202 (+) 828 WP_250307850.1 cytoskeleton protein RodZ -
  NBW44_RS07700 pgsA 1518214..1518753 (+) 540 WP_075565198.1 CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase -
  NBW44_RS07705 - 1518756..1519583 (+) 828 WP_250307851.1 energy-coupling factor ABC transporter ATP-binding protein -
  NBW44_RS07710 - 1519568..1520407 (+) 840 WP_250307852.1 energy-coupling factor transporter ATPase -
  NBW44_RS07715 - 1520400..1521194 (+) 795 WP_250307853.1 energy-coupling factor transporter transmembrane component T -
  NBW44_RS07720 mreC 1521223..1522044 (+) 822 WP_250307854.1 rod shape-determining protein MreC -
  NBW44_RS07725 mreD 1522041..1522535 (+) 495 WP_250307855.1 rod shape-determining protein MreD -
  NBW44_RS07730 - 1522627..1523832 (+) 1206 WP_250307856.1 peptidoglycan hydrolase PcsB -
  NBW44_RS07735 rpsB 1524063..1524845 (+) 783 WP_250307857.1 30S ribosomal protein S2 -
  NBW44_RS07740 tsf 1524924..1525964 (+) 1041 WP_250307858.1 translation elongation factor Ts -
  NBW44_RS07745 cysK 1526053..1526979 (-) 927 WP_250307859.1 cysteine synthase A -
  NBW44_RS07750 - 1527076..1527519 (-) 444 WP_250307860.1 PH domain-containing protein -
  NBW44_RS07755 - 1527533..1528168 (-) 636 WP_250307861.1 YigZ family protein -
  NBW44_RS07760 comFA/cflA 1528224..1529522 (+) 1299 WP_250307862.1 DEAD/DEAH box helicase Machinery gene

Sequence


Protein


Download         Length: 250 a.a.        Molecular weight: 29577.12 Da        Isoelectric Point: 8.2148

>NTDB_id=1152620 NBW44_RS07635 WP_250307841.1 1504127..1504879(+) (comE) [Streptococcus sp. Marseille-Q3533]
MKVLILEDIIEHQVRLEKTLNEISKEMNIPISYKTTGKVREFMEYVENEEVNQLYFLDIDINGIEKKGFEVAQFIRHRNP
YAIIVFITSRSEFATLTYKYKVSALDFIDKDINDRLFKERVAECIVYTKSTLLENQSVVDYFEYSFKGNDICLPYHDILY
IETTGTSHKLRIIGKNFSKEFYGTITDIQEKDKGSQRFYLAHKSFLVNVGNIKDIDRKTMEVVFYEEHRCPISRLKTKKL
KQILTEKTKK

Nucleotide


Download         Length: 753 bp        

>NTDB_id=1152620 NBW44_RS07635 WP_250307841.1 1504127..1504879(+) (comE) [Streptococcus sp. Marseille-Q3533]
ATGAAAGTATTAATTTTAGAAGACATTATAGAACATCAGGTAAGATTGGAAAAAACTTTGAATGAGATTTCCAAAGAAAT
GAATATTCCTATATCGTATAAAACAACAGGCAAAGTTAGAGAATTTATGGAATATGTTGAAAATGAAGAAGTGAATCAGC
TGTATTTTTTAGATATCGATATTAATGGAATTGAGAAAAAAGGATTTGAAGTAGCCCAGTTTATTCGTCATCGTAATCCT
TATGCGATTATCGTTTTTATAACGAGTCGCTCTGAATTCGCAACACTAACTTATAAATACAAAGTATCTGCTCTAGATTT
TATTGATAAGGACATCAATGATCGCTTATTCAAAGAACGAGTTGCGGAGTGTATTGTGTATACTAAAAGTACATTATTAG
AAAATCAATCCGTAGTTGATTACTTTGAATATAGTTTTAAAGGAAATGATATTTGTCTTCCTTACCATGATATTTTATAT
ATTGAAACAACAGGAACTTCCCATAAACTAAGAATTATTGGTAAGAATTTTTCAAAAGAGTTTTATGGAACAATAACAGA
TATTCAAGAAAAAGATAAGGGATCTCAACGTTTCTATTTAGCTCATAAATCATTTTTAGTGAATGTTGGAAATATTAAAG
ATATAGATCGTAAGACTATGGAAGTTGTTTTTTACGAAGAACATCGTTGTCCTATTTCACGATTAAAAACGAAAAAACTC
AAACAGATTTTAACTGAAAAGACTAAAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Streptococcus mitis NCTC 12261

80.4

100

0.804

  comE Streptococcus mitis SK321

80

100

0.8

  comE Streptococcus pneumoniae Rx1

79.6

100

0.796

  comE Streptococcus pneumoniae D39

79.6

100

0.796

  comE Streptococcus pneumoniae TIGR4

79.6

100

0.796

  comE Streptococcus pneumoniae R6

79.6

100

0.796

  comE Streptococcus infantis strain Atu-4

79.6

100

0.796

  comE/comE2 Streptococcus gordonii strain NCTC7865

65.2

100

0.652

  comE/comE1 Streptococcus gordonii str. Challis substr. CH1

65.2

100

0.652

  comE/blpR Streptococcus mutans UA159

39.286

100

0.396


Multiple sequence alignment