Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   S100141_RS10715 Genome accession   NZ_CP021669
Coordinates   1982594..1983223 (-) Length   209 a.a.
NCBI ID   WP_003183693.1    Uniprot ID   Q8VQ77
Organism   Bacillus licheniformis strain SRCM100141     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1977594..1988223
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S100141_RS10690 (S100141_02160) rpsT 1977808..1978074 (+) 267 WP_003183682.1 30S ribosomal protein S20 -
  S100141_RS10695 (S100141_02161) holA 1978136..1979179 (-) 1044 WP_025804972.1 DNA polymerase III subunit delta -
  S100141_RS10700 (S100141_02162) - 1979409..1979540 (+) 132 WP_003183686.1 YqzM family protein -
  S100141_RS10705 (S100141_02163) - 1979565..1981952 (-) 2388 WP_009327865.1 DNA internalization-related competence protein ComEC/Rec2 -
  S100141_RS10710 (S100141_02164) - 1981956..1982525 (-) 570 WP_003183690.1 ComE operon protein 2 -
  S100141_RS10715 (S100141_02165) comEA 1982594..1983223 (-) 630 WP_003183693.1 helix-hairpin-helix domain-containing protein Machinery gene
  S100141_RS10720 (S100141_02166) comER 1983307..1984128 (+) 822 WP_003183696.1 late competence protein ComER -
  S100141_RS10725 (S100141_02167) - 1984198..1984953 (-) 756 WP_011198132.1 class I SAM-dependent DNA methyltransferase -
  S100141_RS10730 (S100141_02168) rsfS 1984950..1985306 (-) 357 WP_003183699.1 ribosome silencing factor -
  S100141_RS10735 (S100141_02169) yqeK 1985320..1985883 (-) 564 WP_003183701.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  S100141_RS10740 (S100141_02170) - 1985873..1986442 (-) 570 WP_003183702.1 nicotinate-nucleotide adenylyltransferase -
  S100141_RS10745 (S100141_02171) yhbY 1986461..1986751 (-) 291 WP_003183704.1 ribosome assembly RNA-binding protein YhbY -
  S100141_RS10750 (S100141_02172) aroE 1986745..1987581 (-) 837 WP_003183706.1 shikimate dehydrogenase -

Sequence


Protein


Download         Length: 209 a.a.        Molecular weight: 22468.42 Da        Isoelectric Point: 5.1890

>NTDB_id=232367 S100141_RS10715 WP_003183693.1 1982594..1983223(-) (comEA) [Bacillus licheniformis strain SRCM100141]
MTDWLKQYKWHAAGGVALVLIISAAFMLLSGKRETSSGLSIPEEASAQTYDKKEEVKREKSAGKEAVIIDLKGAVKNPGV
YQMKEGDRVHDVLKKAGGTEKKADQKQINLAAVLQDGMVVYIPFEGEEAADSFSKAGSRADGASVDIVNINTASSEELQA
IPGIGPSKAEAVIEYREENGPFHTVEDITNVSGIGEKSFERIKSAITVK

Nucleotide


Download         Length: 630 bp        

>NTDB_id=232367 S100141_RS10715 WP_003183693.1 1982594..1983223(-) (comEA) [Bacillus licheniformis strain SRCM100141]
TTGACAGATTGGCTAAAACAATATAAATGGCATGCTGCGGGAGGCGTTGCACTCGTCTTGATCATCAGCGCTGCCTTCAT
GCTGTTAAGCGGAAAGCGTGAAACTTCATCCGGCCTCTCCATTCCCGAGGAGGCTTCTGCACAGACGTATGACAAGAAGG
AGGAGGTGAAACGGGAAAAAAGCGCGGGAAAAGAGGCGGTTATCATCGATTTGAAAGGCGCTGTGAAAAATCCGGGCGTC
TATCAAATGAAAGAGGGAGACAGGGTCCACGATGTATTGAAAAAAGCAGGCGGCACCGAGAAAAAAGCGGATCAAAAGCA
AATTAACCTTGCAGCCGTTTTGCAGGACGGTATGGTGGTTTACATTCCATTTGAGGGGGAAGAGGCTGCTGATTCTTTCT
CGAAAGCGGGTTCAAGGGCAGATGGCGCTTCCGTCGATATCGTCAATATCAATACGGCTTCCTCTGAGGAGCTTCAGGCG
ATTCCCGGCATCGGCCCTTCAAAAGCGGAAGCGGTTATCGAATACCGCGAGGAGAACGGACCGTTTCACACGGTAGAAGA
CATAACAAACGTTTCGGGAATTGGAGAAAAGTCTTTTGAAAGAATAAAATCTGCAATCACGGTAAAGTAA

Domains


Predicted by InterproScan.

(147-207)

(69-123)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q8VQ77

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Bacillus subtilis subsp. subtilis str. 168

51.402

100

0.526


Multiple sequence alignment