Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   HMPREF9425_RS02235 Genome accession   NZ_GL831112
Coordinates   420630..421325 (+) Length   231 a.a.
NCBI ID   WP_003093743.1    Uniprot ID   A0AAW7QFH2
Organism   Streptococcus vestibularis ATCC 49124     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 415630..426325
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HMPREF9425_RS02210 (HMPREF9425_0462) - 416726..417235 (+) 510 WP_037621639.1 hypothetical protein -
  HMPREF9425_RS10655 - 417447..417566 (+) 120 Protein_446 IS30 family transposase -
  HMPREF9425_RS02215 (HMPREF9425_0463) - 417616..417885 (-) 270 WP_003096419.1 GIY-YIG nuclease family protein -
  HMPREF9425_RS02220 (HMPREF9425_0464) - 417889..418653 (-) 765 WP_003096421.1 tRNA1(Val) (adenine(37)-N6)-methyltransferase -
  HMPREF9425_RS02225 (HMPREF9425_0465) - 418703..419638 (-) 936 WP_003096423.1 polysaccharide deacetylase family protein -
  HMPREF9425_RS02230 (HMPREF9425_0466) - 419768..420523 (+) 756 WP_003096426.1 lysophospholipid acyltransferase family protein -
  HMPREF9425_RS02235 (HMPREF9425_0467) comEA 420630..421325 (+) 696 WP_003093743.1 helix-hairpin-helix domain-containing protein Machinery gene
  HMPREF9425_RS02240 (HMPREF9425_0468) - 421315..423555 (+) 2241 WP_003096429.1 DNA internalization-related competence protein ComEC/Rec2 -
  HMPREF9425_RS02245 (HMPREF9425_0469) - 423641..424903 (+) 1263 WP_003093180.1 UDP-N-acetylglucosamine 1-carboxyvinyltransferase -
  HMPREF9425_RS02250 (HMPREF9425_0470) - 424969..425514 (+) 546 WP_003093731.1 GNAT family N-acetyltransferase -

Sequence


Protein


Download         Length: 231 a.a.        Molecular weight: 24588.56 Da        Isoelectric Point: 4.7187

>NTDB_id=1111734 HMPREF9425_RS02235 WP_003093743.1 420630..421325(+) (comEA) [Streptococcus vestibularis ATCC 49124]
MKEKIIDYVNNNRLFVSIIGVLMMVFCFFLWMTCGAGNSMEAETSYTDVTTLSTSSSKEGSKSLTEVSSQSQTEGSEKVK
SKVTVDVKGAVVKPGVYTLKVSARVTDAIQEAGGMTEDADAKSVNLAASLSDEEVIYVANKDENVSVLDQSDTGQVSNKG
GQAVSKNGKINLNTATSEQLQTISGIGAKRAEDIVAYRESHGGFQSVGDLKNVSGIGDKTLDKIRESIYVA

Nucleotide


Download         Length: 696 bp        

>NTDB_id=1111734 HMPREF9425_RS02235 WP_003093743.1 420630..421325(+) (comEA) [Streptococcus vestibularis ATCC 49124]
GTGAAGGAAAAGATTATAGACTATGTTAACAATAATCGTCTATTTGTGAGTATCATTGGTGTGCTGATGATGGTTTTCTG
CTTCTTTCTGTGGATGACTTGTGGTGCCGGCAACAGCATGGAGGCAGAGACGTCTTATACAGATGTGACAACATTGTCAA
CCTCCTCATCCAAAGAAGGGTCAAAATCACTTACGGAAGTGTCTTCTCAGTCACAAACTGAAGGAAGTGAAAAAGTTAAG
TCAAAAGTAACGGTAGATGTTAAGGGGGCTGTGGTCAAGCCAGGTGTATATACGCTAAAAGTAAGCGCTAGGGTGACAGA
TGCCATTCAGGAAGCTGGAGGAATGACAGAGGATGCAGACGCTAAGAGTGTTAATTTGGCCGCAAGCTTGTCAGATGAAG
AGGTTATTTATGTAGCCAATAAAGACGAGAATGTTTCTGTTCTTGATCAATCAGATACTGGTCAGGTCTCTAACAAAGGA
GGGCAGGCTGTATCTAAGAATGGCAAAATTAACTTAAATACAGCGACCTCAGAGCAGTTACAAACTATTTCTGGAATTGG
TGCCAAGAGGGCAGAGGATATTGTCGCTTATCGTGAAAGTCATGGCGGCTTTCAGTCCGTAGGTGACCTGAAAAATGTTT
CAGGCATTGGTGATAAAACGTTAGATAAAATCAGAGAGTCCATCTATGTGGCTTAA

Domains


Predicted by InterproScan.

(85-138)

(166-228)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Streptococcus thermophilus LMD-9

88.312

100

0.883

  comEA/celA/cilE Streptococcus pneumoniae TIGR4

40.175

99.134

0.398

  comEA/celA/cilE Streptococcus pneumoniae D39

39.056

100

0.394

  comEA/celA/cilE Streptococcus pneumoniae Rx1

39.056

100

0.394

  comEA/celA/cilE Streptococcus pneumoniae R6

39.056

100

0.394

  comEA/celA/cilE Streptococcus mitis SK321

39.301

99.134

0.39

  comEA Latilactobacillus sakei subsp. sakei 23K

35.246

100

0.372

  comEA/celA/cilE Streptococcus mitis NCTC 12261

37.281

98.701

0.368


Multiple sequence alignment