Detailed information    

in silico: Bioinformatically predicted

Overview


Name: comEA/celA/cilE
Type: Machinery gene
Locus tag: EL079_RS02575
Genome accession: NZ_LR134283
Coordinates: 514535..515239 (+)
Length: 234 a.a.
NCBI ID: WP_003031685.1
UniProt ID: A0A448AGW3
Organism: Streptococcus anginosus strain NCTC10713
Function: dsDNA binding to the cell surface (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 509535..520239
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL079_RS02550 (NCTC10713_00551) - 509888..511093 (+) 1206 WP_003032823.1 multidrug efflux MFS transporter -
  EL079_RS02555 rpmG 511077..511226 (+) 150 WP_003024746.1 50S ribosomal protein L33 -
  EL079_RS02560 (NCTC10713_00552) secG 511266..511499 (+) 234 WP_003024744.1 preprotein translocase subunit SecG -
  EL079_RS02565 (NCTC10713_00553) rnr 511591..513924 (+) 2334 WP_018543505.1 ribonuclease R -
  EL079_RS02570 (NCTC10713_00554) smpB 513887..514354 (+) 468 WP_003031704.1 SsrA-binding protein SmpB -
  EL079_RS02575 (NCTC10713_00555) comEA/celA/cilE 514535..515239 (+) 705 WP_003031685.1 helix-hairpin-helix domain-containing protein Machinery gene
  EL079_RS02580 (NCTC10713_00556) comEC/celB 515223..517451 (+) 2229 WP_003031687.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  EL079_RS02585 (NCTC10713_00557) - 517534..518100 (+) 567 WP_223349562.1 DUF805 domain-containing protein -
  EL079_RS02590 (NCTC10713_00559) - 518731..519717 (+) 987 WP_003024723.1 Gfo/Idh/MocA family protein -
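
The coordinates in this record are consistent with 1-based, inclusive intervals: 514535..515239 spans 705 bp, which is 234 codons plus a stop codon. The Location line also matches the gene interval extended by 5,000 bp on each side. The sketch below reproduces that arithmetic; the 5,000 bp flank is an assumption inferred from the numbers above, not a documented setting, and the function names are hypothetical.

```python
# Coordinates copied from the record; treated as 1-based and inclusive.
GENE_START, GENE_END = 514535, 515239
FLANK = 5000  # assumption: inferred from "Location: 509535..520239"


def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1


def context_window(start: int, end: int, flank: int = FLANK) -> tuple:
    """Gene interval widened by `flank` bp on each side, clamped at position 1."""
    return max(1, start - flank), end + flank


gene_bp = span_length(GENE_START, GENE_END)
residues = gene_bp // 3 - 1  # subtract the stop codon
window = context_window(GENE_START, GENE_END)

print(gene_bp, residues, window)  # 705 234 (509535, 520239)
```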

Sequence


Protein


Length: 234 a.a.        Molecular weight: 25081.12 Da        Isoelectric point: 4.7769

>NTDB_id=1120009 EL079_RS02575 WP_003031685.1 514535..515239(+) (comEA/celA/cilE) [Streptococcus anginosus strain NCTC10713]
MLETIIEKMKEYKILIGLSLIGLIIAGFFMINGQSSRRSNVAELAQETVTSSEAELEEISTGTKKNSQKEKAEPQTSSSE
ESEFLTVDVKGAVKNPGIYQLKKTSRINDAIQKAGGLMTDADSKSINLAQKLTDEAVVYVATMGENAASVSSNTGQSSTS
GTSEVASQKGNKVNLNTADLSELQTISGIGQKRAQDILDYREANGKFNSVDDLKNVSGVGAKTLEKLKEYVTVD
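
The FASTA header above packs the database ID, locus tag, protein ID, coordinates, strand, gene name, and organism into one line. A minimal parsing sketch follows, assuming every header uses exactly the layout seen in this record; the regular expression and the `parse_header` helper are illustrative, not part of any published tool.

```python
import re

# Header copied from the record above.
HEADER = (
    ">NTDB_id=1120009 EL079_RS02575 WP_003031685.1 "
    "514535..515239(+) (comEA/celA/cilE) "
    "[Streptococcus anginosus strain NCTC10713]"
)

# Assumed field layout: id, locus tag, protein ID, start..end(strand), (gene), [organism]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) \[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split one header line into named fields; raise if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header layout: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


record = parse_header(HEADER)
print(record["locus_tag"], record["start"], record["end"], record["strand"])
```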

Nucleotide


Length: 705 bp

>NTDB_id=1120009 EL079_RS02575 WP_003031685.1 514535..515239(+) (comEA/celA/cilE) [Streptococcus anginosus strain NCTC10713]
ATGTTAGAGACAATCATTGAAAAGATGAAAGAGTATAAAATTTTGATTGGTTTAAGTTTGATTGGTTTGATAATAGCAGG
ATTTTTTATGATAAATGGTCAATCTAGTAGACGATCAAATGTAGCAGAGCTCGCACAGGAAACAGTTACGAGTTCGGAGG
CAGAATTAGAAGAAATTTCAACTGGAACGAAGAAAAACTCACAGAAAGAAAAAGCAGAGCCTCAAACGAGTTCCAGTGAA
GAATCAGAATTTTTAACCGTAGATGTCAAGGGTGCAGTCAAAAATCCAGGTATTTATCAACTGAAAAAGACTAGCCGTAT
CAATGATGCGATTCAAAAAGCAGGTGGTTTAATGACAGACGCTGACAGCAAGTCTATCAATTTGGCGCAGAAACTAACGG
ATGAAGCTGTTGTTTATGTGGCAACTATGGGTGAGAATGCGGCGAGTGTTTCAAGCAATACAGGACAATCTTCAACGTCA
GGAACTAGTGAAGTCGCATCGCAAAAAGGGAATAAGGTCAATCTAAATACAGCTGATTTATCTGAGTTGCAGACCATTTC
TGGTATTGGTCAAAAACGCGCGCAAGATATTTTAGATTATCGGGAGGCAAACGGGAAATTTAATTCCGTTGATGACCTGA
AAAATGTATCAGGAGTAGGTGCTAAAACACTAGAGAAATTGAAAGAATATGTCACAGTGGATTAA
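
The nucleotide and protein records can be cross-checked by translating the coding sequence. The sketch below uses the standard codon assignments and translates only the first 30 and last 15 nucleotides of the sequence above, which should give the first ten and last four residues of the protein. The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


five_prime = "ATGTTAGAGACAATCATTGAAAAGATGAAA"  # first 30 nt of the record
three_prime = "GTCACAGTGGATTAA"                # last 15 nt, including the stop

print(translate(five_prime))   # MLETIIEKMK
print(translate(three_prime))  # VTVD
```

Passing the full 705 bp sequence to `translate` in the same way would allow a residue-by-residue comparison with the 234 a.a. protein record.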

Domains


Predicted by InterProScan.

(87-140)

(169-232)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A448AGW3

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA/celA/cilE Streptococcus pneumoniae TIGR4 52.361 99.573 0.521
  comEA/celA/cilE Streptococcus pneumoniae Rx1 52.361 99.573 0.521
  comEA/celA/cilE Streptococcus pneumoniae D39 52.361 99.573 0.521
  comEA/celA/cilE Streptococcus pneumoniae R6 52.361 99.573 0.521
  comEA/celA/cilE Streptococcus mitis SK321 51.931 99.573 0.517
  comEA/celA/cilE Streptococcus mitis NCTC 12261 51.073 99.573 0.509
  comEA Streptococcus thermophilus LMD-9 43.404 100 0.436
  comEA Lactococcus lactis subsp. cremoris KW2 41.202 99.573 0.41


Multiple sequence alignment