Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   S101395_RS08785 Genome accession   NZ_CP021920
Coordinates   1663942..1664565 (+) Length   207 a.a.
NCBI ID   WP_373926326.1    Uniprot ID   -
Organism   Bacillus sonorensis strain SRCM101395     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1658942..1669565
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S101395_RS08750 (S101395_01774) aroE 1659583..1660419 (+) 837 WP_006637639.1 shikimate dehydrogenase -
  S101395_RS08755 (S101395_01775) yhbY 1660413..1660703 (+) 291 WP_006637638.1 ribosome assembly RNA-binding protein YhbY -
  S101395_RS08760 (S101395_01776) - 1660722..1661291 (+) 570 WP_006637637.1 nicotinate-nucleotide adenylyltransferase -
  S101395_RS08765 (S101395_01777) yqeK 1661281..1661844 (+) 564 WP_006637636.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  S101395_RS08770 (S101395_01778) rsfS 1661858..1662211 (+) 354 WP_006637635.1 ribosome silencing factor -
  S101395_RS08775 (S101395_01779) - 1662211..1662966 (+) 756 WP_006637634.1 class I SAM-dependent DNA methyltransferase -
  S101395_RS08780 (S101395_01780) comER 1663034..1663855 (-) 822 WP_006637633.1 late competence protein ComER -
  S101395_RS08785 (S101395_01781) comEA 1663942..1664565 (+) 624 WP_373926326.1 helix-hairpin-helix domain-containing protein Machinery gene
  S101395_RS08790 (S101395_01782) - 1664633..1665202 (+) 570 WP_006637631.1 ComE operon protein 2 -
  S101395_RS08795 (S101395_01783) comEC 1665209..1667539 (+) 2331 WP_006637630.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  S101395_RS08800 (S101395_01784) - 1667630..1667761 (-) 132 WP_006637629.1 YqzM family protein -
  S101395_RS08805 (S101395_01785) holA 1667994..1669034 (+) 1041 WP_006637628.1 DNA polymerase III subunit delta -
  S101395_RS08810 (S101395_01786) rpsT 1669082..1669348 (-) 267 WP_006637627.1 30S ribosomal protein S20 -

Sequence


Protein


Download         Length: 207 a.a.        Molecular weight: 21720.63 Da        Isoelectric Point: 4.9695

>NTDB_id=234882 S101395_RS08785 WP_373926326.1 1663942..1664565(+) (comEA) [Bacillus sonorensis strain SRCM101395]
MNSLKRYKWAAAGVFIAALSVISIMVMPKLHHESAGSDSLPADAASVQAADLAEKKESEEPDRIVVDLKGAVKKPGVYEM
KTGERVHQLLKKAGGTVKNAEGKQINLAAVLQDGMVIYIPFEGEETVQAGTGTAAPSSAEGGTETVNINTASPEELQAIP
GVGPSKAEAIAAYREENGPFQGIEDITNVSGIGEKTFEKIKSSISVK

Nucleotide


Download         Length: 624 bp        

>NTDB_id=234882 S101395_RS08785 WP_373926326.1 1663942..1664565(+) (comEA) [Bacillus sonorensis strain SRCM101395]
GTGAATAGTCTGAAACGTTACAAATGGGCTGCGGCGGGAGTCTTTATTGCGGCGCTCTCTGTCATTTCGATCATGGTAAT
GCCAAAACTCCACCATGAATCAGCCGGAAGCGACTCTTTGCCGGCTGATGCCGCATCCGTTCAAGCGGCCGATCTAGCGG
AGAAAAAGGAAAGCGAAGAACCGGACAGAATTGTCGTAGATCTGAAGGGGGCTGTTAAAAAACCGGGCGTTTATGAGATG
AAGACGGGAGAAAGAGTGCACCAATTGCTGAAAAAAGCCGGGGGCACCGTGAAAAACGCAGAGGGAAAACAAATCAATCT
GGCTGCTGTCCTTCAGGACGGCATGGTGATTTATATTCCGTTTGAAGGCGAAGAAACCGTTCAAGCCGGTACAGGAACGG
CAGCACCATCAAGTGCTGAAGGCGGAACGGAAACGGTGAATATCAATACCGCTTCTCCCGAGGAGCTTCAAGCGATCCCC
GGCGTCGGACCTTCAAAAGCGGAGGCGATCGCCGCATACCGTGAAGAGAACGGCCCTTTTCAGGGGATTGAAGACATTAC
AAACGTATCAGGGATTGGCGAAAAAACGTTTGAGAAAATAAAATCGTCAATATCAGTAAAGTAA

Domains


Predicted by InterproScan.

(144-204)

(65-120)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Bacillus subtilis subsp. subtilis str. 168

53.081

100

0.541

  comEA Lactococcus lactis subsp. cremoris KW2

35.514

100

0.367


Multiple sequence alignment