Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEA
Type: Machinery gene
Locus tag: CP942_RS20070
Genome accession: NZ_CP023666
Coordinates: 4018805..4019434 (+)
Length: 209 a.a.
NCBI ID: WP_096748238.1
UniProt ID: -
Organism: Bacillus paralicheniformis strain Bac48
Function: dsDNA binding to the cell surface (predicted from homology)
DNA binding and uptake
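The coordinates and the protein length in this record can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as is usual for GenBank-style locations) and that the coding sequence ends in a single stop codon; `parse_location` is a hypothetical helper, not part of any database API.

```python
def parse_location(location: str) -> tuple[int, int, str]:
    """Parse a string like '4018805..4019434 (+)' into (start, end, strand)."""
    span, strand = location.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


start, end, strand = parse_location("4018805..4019434 (+)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end - start + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa, strand)
```

Under these assumptions the gene spans 630 bp and encodes 209 residues, which agrees with the lengths listed in the Sequence section.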

Genomic Context


Location: 4013805..4024434
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
CP942_RS20035 | aroE | 4014448..4015283 (+) | 836 | Protein_3901 | shikimate dehydrogenase | -
CP942_RS20040 | yhbY | 4015277..4015567 (+) | 291 | WP_003183704.1 | ribosome assembly RNA-binding protein YhbY | -
CP942_RS20045 | - | 4015586..4016155 (+) | 570 | WP_025811125.1 | nicotinate-nucleotide adenylyltransferase | -
CP942_RS20050 | yqeK | 4016145..4016708 (+) | 564 | WP_026579268.1 | bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK | -
CP942_RS20055 | rsfS | 4016722..4017078 (+) | 357 | WP_023855256.1 | ribosome silencing factor | -
CP942_RS20060 | - | 4017075..4017830 (+) | 756 | WP_023855255.1 | class I SAM-dependent DNA methyltransferase | -
CP942_RS20065 | comER | 4017901..4018722 (-) | 822 | WP_229128976.1 | late competence protein ComER | -
CP942_RS20070 | comEA | 4018805..4019434 (+) | 630 | WP_096748238.1 | helix-hairpin-helix domain-containing protein | Machinery gene
CP942_RS20075 | - | 4019503..4020072 (+) | 570 | WP_020452273.1 | ComE operon protein 2 | -
CP942_RS20080 | - | 4020077..4022464 (+) | 2388 | WP_105980292.1 | DNA internalization-related competence protein ComEC/Rec2 | -
CP942_RS20085 | - | 4022489..4022620 (-) | 132 | WP_003183686.1 | YqzM family protein | -
CP942_RS20090 | holA | 4022850..4023875 (+) | 1026 | WP_199792213.1 | DNA polymerase III subunit delta | -
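The Location window listed for this genomic context matches the comEA coordinates extended by 5000 bp on each side. This is an observation from the numbers in this record, not a documented rule of the database. The sketch below reproduces that window and also computes the intergenic gaps between comEA and the two genes that follow it on the same strand; the `FLANK_BP` constant and the `gap_bp` helper are hypothetical names introduced for illustration.

```python
FLANK_BP = 5000  # assumption: inferred from this record, not documented

comea = (4018805, 4019434)            # comEA, CP942_RS20070 (+)
downstream = [
    ("CP942_RS20075", 4019503, 4020072),  # ComE operon protein 2 (+)
    ("CP942_RS20080", 4020077, 4022464),  # ComEC/Rec2 (+)
]

# Context window: gene coordinates padded on both sides.
window = (comea[0] - FLANK_BP, comea[1] + FLANK_BP)


def gap_bp(prev_end: int, next_start: int) -> int:
    """Number of bases strictly between two features (negative if they overlap)."""
    return next_start - prev_end - 1


gaps = []
prev_end = comea[1]
for locus_tag, start, end in downstream:
    gaps.append((locus_tag, gap_bp(prev_end, start)))
    prev_end = end

print(window)
print(gaps)
```

With the coordinates in the table, the window comes out as 4013805..4024434, and the gaps to the next two genes are 68 bp and 4 bp.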

Sequence


Protein


Length: 209 a.a.; Molecular weight: 22628.43 Da; Isoelectric point: 4.6492

>NTDB_id=249133 CP942_RS20070 WP_096748238.1 4018805..4019434(+) (comEA) [Bacillus paralicheniformis strain Bac48]
MTDWLKQYKWHAAGGVALVLIISAAFMLLSGKRETSSGFSIPEEASAQTFDKKEEVKREKNAGEEEVIIDLKGAVKNPGV
YQMKEGDRVHDALKKAGGTEKKADQKQINLAAVLRDGMVLYIPFEGEEAAGSLSEADSRADGSSSDIVNINTASSEELQT
IPGIGPSKAEAVVEYREENGMFQTIEDITNVSGIGEKSFERIKSSITVK
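The reported isoelectric point (4.6492) indicates an acidic protein. A rough sanity check, sketched below, is to count acidic (D, E) and basic (K, R, H) residues in the sequence above. This is only a residue tally, not a pI calculation, and it does not reproduce the database's method, which is not stated here.

```python
# Protein sequence copied from the FASTA record above.
protein = (
    "MTDWLKQYKWHAAGGVALVLIISAAFMLLSGKRETSSGFSIPEEASAQTFDKKEEVKREKNAGEEEVIIDLKGAVKNPGV"
    "YQMKEGDRVHDALKKAGGTEKKADQKQINLAAVLRDGMVLYIPFEGEEAAGSLSEADSRADGSSSDIVNINTASSEELQT"
    "IPGIGPSKAEAVVEYREENGMFQTIEDITNVSGIGEKSFERIKSSITVK"
)

# Side chains that are typically negatively charged near neutral pH.
acidic = sum(protein.count(aa) for aa in "DE")
# Side chains that can carry a positive charge (His only partially).
basic = sum(protein.count(aa) for aa in "KRH")

print(len(protein), acidic, basic)
```

An excess of acidic over basic residues is consistent with a pI below 7, although the exact value depends on the pKa set used.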

Nucleotide


Length: 630 bp

>NTDB_id=249133 CP942_RS20070 WP_096748238.1 4018805..4019434(+) (comEA) [Bacillus paralicheniformis strain Bac48]
TTGACAGATTGGCTAAAACAATATAAATGGCATGCGGCGGGAGGTGTGGCACTCGTCTTGATCATCAGCGCTGCCTTCAT
GCTGTTAAGCGGAAAGCGCGAAACTTCATCCGGCTTCTCCATTCCTGAAGAGGCTTCTGCACAGACTTTTGACAAGAAAG
AGGAGGTGAAGCGGGAAAAAAACGCGGGGGAAGAGGAGGTTATCATCGATTTGAAAGGCGCTGTGAAAAATCCGGGCGTC
TATCAAATGAAGGAGGGAGACAGGGTGCACGATGCATTGAAAAAAGCAGGCGGCACCGAGAAAAAAGCGGACCAAAAGCA
GATTAATCTGGCAGCCGTTTTGCGGGACGGTATGGTTCTTTACATTCCATTTGAAGGGGAAGAGGCCGCTGGCTCTCTTT
CGGAAGCGGACTCAAGAGCAGACGGCAGCTCAAGTGATATCGTCAATATCAATACGGCTTCCTCTGAGGAGCTTCAGACG
ATTCCCGGGATCGGTCCTTCAAAAGCGGAAGCGGTAGTCGAATACCGCGAGGAGAACGGAATGTTTCAGACGATTGAAGA
CATAACAAACGTTTCGGGAATTGGAGAAAAGTCTTTTGAAAGAATAAAATCTTCCATCACGGTAAAGTAA
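The nucleotide record starts with TTG rather than ATG, while the protein record starts with M. TTG is one of the alternative start codons in the bacterial genetic code (NCBI translation table 11), and an initiator codon is read as methionine. The sketch below illustrates this with a hand-built standard codon table; `translate_cds` and its `is_start` parameter are hypothetical names, and only short fragments from the two ends of the sequence above are translated.

```python
from itertools import product

# Standard genetic code in TCAG order (sense codons match NCBI table 11).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(cds: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon.

    If is_start is True, the first codon is treated as an initiator and
    read as Met regardless of its usual meaning (assumption: the fragment
    begins at the true start of the coding sequence).
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3          # ignore a trailing partial codon
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3)]
    if is_start and residues:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


five_prime = "TTGACAGATTGGCTAAAACAATAT"   # first 24 nt of the record
three_prime = "ATCACGGTAAAGTAA"           # last 15 nt, including the stop codon

print(translate_cds(five_prime))                   # MTDWLKQY
print(translate_cds(three_prime, is_start=False))  # ITVK
```

Both fragments agree with the corresponding ends of the protein sequence listed above.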

Domains


Predicted by InterProScan. Two domains were predicted, at residues 69-122 and 145-207.


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comEA | Bacillus subtilis subsp. subtilis str. 168 | 52.336 | 100 | 0.536
comEA | Staphylococcus aureus MW2 | 36.364 | 100 | 0.383
comEA | Staphylococcus aureus N315 | 35.909 | 100 | 0.378
comEA | Lactococcus lactis subsp. cremoris KW2 | 35.023 | 100 | 0.364


Multiple sequence alignment