Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEA/celA/cilE
Type: Machinery gene
Locus tag: LK450_RS02090
Genome accession: NZ_CP085939
Coordinates: 417937..418641 (+)
Length: 234 a.a.
NCBI ID: WP_003031685.1
Uniprot ID: A0A448AGW3
Organism: Streptococcus anginosus subsp. anginosus strain FDAARGOS_1569
Function: dsDNA binding to the cell surface (predicted from homology)
DNA binding and uptake
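
The coordinates and the protein length reported above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record: the variable names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon.

```python
# Values copied from the Overview fields; variable names are illustrative.
start, end = 417937, 418641   # assumed 1-based, inclusive coordinates
protein_length = 234          # a.a., as reported in the record

gene_length = end - start + 1   # inclusive span in bp
codons = gene_length // 3       # assumed to include the terminal stop codon

print(gene_length)                   # 705
print(codons)                        # 235
print(codons - 1 == protein_length)  # True
```

Under these assumptions the 705 bp span corresponds to 234 coding codons plus one stop codon.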

Genomic Context


Location: 412937..423641
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
LK450_RS02065 (LK450_02065)   -   413290..414495 (+)   1206   WP_003032823.1   multidrug efflux MFS transporter   -
LK450_RS02070 (LK450_02070)   rpmG   414479..414628 (+)   150   WP_003024746.1   50S ribosomal protein L33   -
LK450_RS02075 (LK450_02075)   secG   414668..414901 (+)   234   WP_003024744.1   preprotein translocase subunit SecG   -
LK450_RS02080 (LK450_02080)   rnr   414993..417326 (+)   2334   WP_018543505.1   ribonuclease R   -
LK450_RS02085 (LK450_02085)   smpB   417289..417756 (+)   468   WP_003031704.1   SsrA-binding protein SmpB   -
LK450_RS02090 (LK450_02090)   comEA/celA/cilE   417937..418641 (+)   705   WP_003031685.1   helix-hairpin-helix domain-containing protein   Machinery gene
LK450_RS02095 (LK450_02095)   comEC/celB   418625..420853 (+)   2229   WP_003031687.1   DNA internalization-related competence protein ComEC/Rec2   Machinery gene
LK450_RS02100 (LK450_02100)   -   420936..421502 (+)   567   WP_223349562.1   DUF805 domain-containing protein   -
LK450_RS02105 (LK450_02105)   -   422133..423119 (+)   987   WP_003024723.1   Gfo/Idh/MocA family protein   -
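
The reported Location appears to be the comEA/celA/cilE coordinates extended by 5000 bp on each side, and the table coordinates also give the spacing between neighbouring genes. The sketch below illustrates both calculations. The 5000 bp flank is an assumption inferred from the numbers rather than something the record states, and the variable names are illustrative.

```python
# (label, start, end) copied from the Genomic Context table; 1-based inclusive assumed.
genes = [
    ("smpB", 417289, 417756),
    ("comEA/celA/cilE", 417937, 418641),
    ("comEC/celB", 418625, 420853),
]

FLANK = 5000  # assumption: inferred from the reported Location, not stated in the record
_, target_start, target_end = genes[1]
window = (target_start - FLANK, target_end + FLANK)
print(window)  # (412937, 423641)

# Distance between consecutive genes; a negative value means the genes overlap.
gaps = []
for (name_a, _, end_a), (name_b, start_b, _) in zip(genes, genes[1:]):
    gap = start_b - end_a - 1
    gaps.append((name_a, name_b, gap))
    print(name_a, "->", name_b, gap)
```

With these coordinates, smpB and comEA/celA/cilE are separated by 180 bp, while comEA/celA/cilE and comEC/celB overlap by 17 bp.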

Sequence


Protein


Length: 234 a.a.   Molecular weight: 25081.12 Da   Isoelectric point: 4.7769

>NTDB_id=621383 LK450_RS02090 WP_003031685.1 417937..418641(+) (comEA/celA/cilE) [Streptococcus anginosus subsp. anginosus strain FDAARGOS_1569]
MLETIIEKMKEYKILIGLSLIGLIIAGFFMINGQSSRRSNVAELAQETVTSSEAELEEISTGTKKNSQKEKAEPQTSSSE
ESEFLTVDVKGAVKNPGIYQLKKTSRINDAIQKAGGLMTDADSKSINLAQKLTDEAVVYVATMGENAASVSSNTGQSSTS
GTSEVASQKGNKVNLNTADLSELQTISGIGQKRAQDILDYREANGKFNSVDDLKNVSGVGAKTLEKLKEYVTVD

Nucleotide


Length: 705 bp

>NTDB_id=621383 LK450_RS02090 WP_003031685.1 417937..418641(+) (comEA/celA/cilE) [Streptococcus anginosus subsp. anginosus strain FDAARGOS_1569]
ATGTTAGAGACAATCATTGAAAAGATGAAAGAGTATAAAATTTTGATTGGTTTAAGTTTGATTGGTTTGATAATAGCAGG
ATTTTTTATGATAAATGGTCAATCTAGTAGACGATCAAATGTAGCAGAGCTCGCACAGGAAACAGTTACGAGTTCGGAGG
CAGAATTAGAAGAAATTTCAACTGGAACGAAGAAAAACTCACAGAAAGAAAAAGCAGAGCCTCAAACGAGTTCCAGTGAA
GAATCAGAATTTTTAACCGTAGATGTCAAGGGTGCAGTCAAAAATCCAGGTATTTATCAACTGAAAAAGACTAGCCGTAT
CAATGATGCGATTCAAAAAGCAGGTGGTTTAATGACAGACGCTGACAGCAAGTCTATCAATTTGGCGCAGAAACTAACGG
ATGAAGCTGTTGTTTATGTGGCAACTATGGGTGAGAATGCGGCGAGTGTTTCAAGCAATACAGGACAATCTTCAACGTCA
GGAACTAGTGAAGTCGCATCGCAAAAAGGGAATAAGGTCAATCTAAATACAGCTGATTTATCTGAGTTGCAGACCATTTC
TGGTATTGGTCAAAAACGCGCGCAAGATATTTTAGATTATCGGGAGGCAAACGGGAAATTTAATTCCGTTGATGACCTGA
AAAATGTATCAGGAGTAGGTGCTAAAACACTAGAGAAATTGAAAGAATATGTCACAGTGGATTAA
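
The nucleotide record can be checked against the protein record by translating it. The sketch below translates only the first sequence line and the last nine nucleotides, both copied from the record above, using the standard codon-to-amino-acid assignments (the bacterial code, NCBI table 11, uses the same assignments). The helper name `translate` is illustrative, not part of any database tooling.

```python
# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate complete codons from the first base; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First line of the nucleotide record (80 nt, so 26 complete codons).
first_line = (
    "ATGTTAGAGACAATCATTGAAAAGATGAAAGAGTATAAAA"
    "TTTTGATTGGTTTAAGTTTGATTGGTTTGATAATAGCAGG"
)
# Last nine nucleotides of the record.
last_codons = "GTGGATTAA"

print(translate(first_line))   # MLETIIEKMKEYKILIGLSLIGLIIA
print(translate(last_codons))  # VD*
```

The first output matches the start of the protein sequence, and the second matches its final two residues followed by a stop codon.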

Domains


Predicted by InterProScan.

(87-140)

(169-232)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A448AGW3

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein           Organism                                 Identities (%)   Coverage (%)   Ha-value
comEA/celA/cilE   Streptococcus pneumoniae TIGR4           52.361           99.573         0.521
comEA/celA/cilE   Streptococcus pneumoniae Rx1             52.361           99.573         0.521
comEA/celA/cilE   Streptococcus pneumoniae D39             52.361           99.573         0.521
comEA/celA/cilE   Streptococcus pneumoniae R6              52.361           99.573         0.521
comEA/celA/cilE   Streptococcus mitis SK321                51.931           99.573         0.517
comEA/celA/cilE   Streptococcus mitis NCTC 12261           51.073           99.573         0.509
comEA             Streptococcus thermophilus LMD-9         43.404           100            0.436
comEA             Lactococcus lactis subsp. cremoris KW2   41.202           99.573         0.41