Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA/celA/cilE   Type   Machinery gene
Locus tag   R4708_RS10625 Genome accession   NZ_CP137104
Coordinates   2036007..2036657 (+) Length   216 a.a.
NCBI ID   WP_000387328.1    Uniprot ID   A0A4N9N7Z7
Organism   Streptococcus pneumoniae strain LM     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2031007..2041657
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R4708_RS10595 - 2031490..2031705 (+) 216 WP_001232081.1 YozE family protein -
  R4708_RS10600 - 2031791..2032759 (+) 969 WP_000658198.1 PhoH family protein -
  R4708_RS10605 - 2032952..2033452 (+) 501 WP_000566986.1 GNAT family N-acetyltransferase -
  R4708_RS10610 - 2033455..2033781 (+) 327 Protein_2050 TfoX/Sxy family protein -
  R4708_RS10615 ald 2034082..2035193 (-) 1112 Protein_2051 alanine dehydrogenase -
  R4708_RS10620 - 2035370..2035939 (+) 570 WP_000443775.1 GNAT family N-acetyltransferase -
  R4708_RS10625 comEA/celA/cilE 2036007..2036657 (+) 651 WP_000387328.1 ComEA family DNA-binding protein Machinery gene
  R4708_RS10630 comEC/celB 2036641..2038881 (+) 2241 WP_000942392.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  R4708_RS10635 - 2039023..2039247 (+) 225 WP_000583432.1 hypothetical protein -
  R4708_RS10640 - 2039280..2039867 (+) 588 WP_000939884.1 ATP-binding cassette domain-containing protein -
  R4708_RS10645 - 2039871..2041052 (+) 1182 WP_000655951.1 membrane protein -

Sequence


Protein


Download         Length: 216 a.a.        Molecular weight: 23230.59 Da        Isoelectric Point: 5.0667

>NTDB_id=895446 R4708_RS10625 WP_000387328.1 2036007..2036657(+) (comEA/celA/cilE) [Streptococcus pneumoniae strain LM]
MEAIIEKIKEYKIIVICTGLGLLVGGFFLLKPAPQTPVKETNLQAEVAAVSKDLVSEEEVNKEEKEEPLEQDLITVDVKG
AVKSPGIYDLPVGSRVNDAVQKAGGLTEQADSKSLNLAQKVSDEALVYVPTKGEEAVSQQTGLGTASSISKEKKVNLNKA
SLEELKQVKGLGGKRAQDIIDHREANGKFKSVDELKKVSGIGGKTIEKLKDYVTVD

Nucleotide


Download         Length: 651 bp        

>NTDB_id=895446 R4708_RS10625 WP_000387328.1 2036007..2036657(+) (comEA/celA/cilE) [Streptococcus pneumoniae strain LM]
ATGGAAGCAATTATCGAGAAAATCAAAGAGTATAAAATCATCGTCATCTGTACTGGTCTGGGCTTGCTTGTAGGAGGATT
TTTCCTGCTAAAACCAGCTCCACAAACACCTGTCAAAGAGACGAATTTGCAGGCTGAAGTCGCAGCTGTTTCCAAGGATT
TGGTATCCGAAGAGGAAGTGAACAAGGAAGAAAAGGAAGAACCCCTTGAACAAGATCTAATCACAGTAGATGTCAAAGGT
GCTGTCAAATCGCCAGGGATTTATGACTTGCCTGTAGGTAGTCGAGTCAATGATGCTGTTCAGAAGGCTGGTGGCTTGAC
AGAGCAAGCAGACAGCAAGTCGCTCAATCTAGCTCAGAAAGTTAGTGATGAGGCTCTGGTTTACGTTCCTACTAAGGGAG
AAGAAGCAGTTAGCCAACAGACTGGTTTGGGGACAGCTTCTTCAATAAGCAAGGAAAAGAAGGTCAATCTCAACAAGGCC
AGTCTGGAAGAACTCAAGCAGGTCAAGGGACTGGGAGGAAAACGAGCTCAGGACATTATCGACCATCGTGAGGCAAATGG
CAAGTTCAAGTCAGTAGACGAGCTCAAGAAGGTCTCTGGCATTGGTGGCAAAACAATAGAAAAGCTTAAAGACTATGTTA
CAGTGGATTAA

Domains


Predicted by InterproScan.

(76-126)

(151-214)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4N9N7Z7

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA/celA/cilE Streptococcus pneumoniae Rx1

99.537

100

0.995

  comEA/celA/cilE Streptococcus pneumoniae D39

99.537

100

0.995

  comEA/celA/cilE Streptococcus pneumoniae R6

99.537

100

0.995

  comEA/celA/cilE Streptococcus pneumoniae TIGR4

96.759

100

0.968

  comEA/celA/cilE Streptococcus mitis NCTC 12261

95.833

100

0.958

  comEA/celA/cilE Streptococcus mitis SK321

89.815

100

0.898

  comEA Lactococcus lactis subsp. cremoris KW2

43.172

100

0.454

  comEA Bacillus subtilis subsp. subtilis str. 168

41.579

87.963

0.366