Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEA/celA/cilE
Type: Machinery gene
Locus tag: DV119_RS07710
Genome accession: NZ_CP031245
Coordinates: 1413376..1414026 (+)
Length: 216 a.a.
NCBI ID: WP_114880828.1
Uniprot ID: -
Organism: Streptococcus pneumoniae strain M16808
Function: dsDNA binding to the cell surface (predicted from homology)
DNA binding and uptake
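
The coordinates and the protein length in this overview agree with each other: the inclusive span 1413376..1414026 covers 651 bp, which is 217 codons, and dropping the terminal stop codon leaves 216 amino acids. A minimal Python sketch of that arithmetic is given here; the function names are illustrative only and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate span."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino-acid count of a complete CDS, assuming it ends in exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates of DV119_RS07710 as listed in the overview.
cds_bp = span_length(1413376, 1414026)
aa_len = protein_length(cds_bp)
print(cds_bp, aa_len)  # 651 216
```

The same check fails on entries whose size is not a multiple of 3, which is one quick way to spot records that are not intact coding sequences.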

Genomic Context


Location: 1408376..1419026
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  DV119_RS07665   -   1408860..1409075 (+)   216   WP_001232082.1   YozE family protein   -
  DV119_RS07670   -   1409161..1410129 (+)   969   WP_000658183.1   PhoH family protein   -
  DV119_RS07680   -   1410322..1410822 (+)   501   WP_000566982.1   GNAT family N-acetyltransferase   -
  DV119_RS07685   -   1410825..1411150 (+)   326   Protein_1407   TfoX/Sxy family protein   -
  DV119_RS11905   ald   1411451..1412562 (-)   1112   Protein_1408   alanine dehydrogenase   -
  DV119_RS07705   -   1412739..1413308 (+)   570   WP_114880827.1   GNAT family N-acetyltransferase   -
  DV119_RS07710   comEA/celA/cilE   1413376..1414026 (+)   651   WP_114880828.1   helix-hairpin-helix domain-containing protein   Machinery gene
  DV119_RS07715   comEC/celB   1414010..1416250 (+)   2241   WP_044727637.1   DNA internalization-related competence protein ComEC/Rec2   Machinery gene
  DV119_RS07720   -   1416429..1416617 (+)   189   WP_001809102.1   hypothetical protein   -
  DV119_RS07725   -   1416649..1417236 (+)   588   WP_000933542.1   ATP-binding cassette domain-containing protein   -
  DV119_RS07730   -   1417240..1418420 (+)   1181   Protein_1414   hypothetical protein   -

Sequence


Protein


Length: 216 a.a.        Molecular weight: 23274.65 Da        Isoelectric Point: 6.0746

>NTDB_id=304646 DV119_RS07710 WP_114880828.1 1413376..1414026(+) (comEA/celA/cilE) [Streptococcus pneumoniae strain M16808]
MEAIIEKIKEYKIIVICTGLGLLVGGFFLLKPAPQTPVKETNVQAEVAAVSKDLVSEKEVNKEEKEEPVEQDLITVDVKG
AVKSPGIYDLPVGSRVNDAVQKAGGLTEQADSKSLNLAQKVSDEALVYVPTKGEEAVSQQTGSGTASSISKEKKVNLNKA
SLEELKQVKRLGGKRAQDIIDHREANGKFKSVDELKKVSGIGGKTIEKLKDYVTVD

Nucleotide


Length: 651 bp

>NTDB_id=304646 DV119_RS07710 WP_114880828.1 1413376..1414026(+) (comEA/celA/cilE) [Streptococcus pneumoniae strain M16808]
ATGGAAGCAATTATCGAGAAAATCAAAGAGTATAAAATCATCGTCATCTGTACTGGTCTGGGCTTGCTTGTAGGCGGATT
TTTCCTGCTAAAGCCAGCTCCACAAACACCTGTAAAGGAAACGAATGTGCAGGCTGAAGTTGCAGCTGTTTCCAAGGATT
TGGTATCCGAAAAGGAAGTGAACAAGGAAGAGAAGGAAGAACCAGTTGAACAAGATCTAATCACAGTAGATGTCAAAGGT
GCTGTCAAATCGCCAGGGATTTATGACTTGCCTGTAGGTAGTCGAGTCAATGATGCTGTTCAGAAGGCTGGTGGCTTGAC
AGAGCAAGCAGACAGCAAGTCGCTCAATCTAGCTCAGAAAGTTAGTGATGAGGCTCTGGTTTACGTTCCTACTAAGGGAG
AAGAAGCAGTTAGTCAACAGACTGGTTCGGGGACAGCTTCTTCAATAAGCAAGGAAAAGAAGGTCAATCTCAACAAGGCC
AGTCTGGAAGAACTCAAGCAGGTCAAGAGACTGGGAGGAAAACGAGCTCAGGACATTATCGACCATCGTGAGGCAAATGG
CAAGTTCAAGTCAGTAGACGAGCTCAAGAAGGTCTCTGGCATTGGTGGCAAAACAATAGAAAAGCTTAAAGACTATGTTA
CAGTGGATTAA
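
The nucleotide record can be checked against the protein record by translating it. The Python sketch below assumes the standard genetic code applies to internal codons; the function name `translate` is hypothetical, and only two short excerpts of the sequence (its first 10 codons and its last 7) are used for illustration.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks from FASTA-style input
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 10 codons of the record above.
print(translate("ATGGAAGCAATTATCGAGAAAATCAAAGAG"))  # MEAIIEKIKE
# Last 7 codons, including the TAA stop.
print(translate("GACTATGTTACAGTGGATTAA"))  # DYVTVD
```

Both excerpts reproduce the start (MEAIIEKIKE) and the end (DYVTVD) of the protein sequence listed under Protein.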

Domains


Predicted by InterProScan.

(151-214)

(76-126)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comEA/celA/cilE   Streptococcus pneumoniae Rx1   98.148   100   0.981
  comEA/celA/cilE   Streptococcus pneumoniae D39   98.148   100   0.981
  comEA/celA/cilE   Streptococcus pneumoniae R6   98.148   100   0.981
  comEA/celA/cilE   Streptococcus pneumoniae TIGR4   96.296   100   0.963
  comEA/celA/cilE   Streptococcus mitis NCTC 12261   94.907   100   0.949
  comEA/celA/cilE   Streptococcus mitis SK321   90.278   100   0.903
  comEA   Lactococcus lactis subsp. cremoris KW2   43.172   100   0.454
  comEA   Streptococcus thermophilus LMD-9   39.738   100   0.421
  comEA   Bacillus subtilis subsp. subtilis str. 168   41.053   87.963   0.361


Multiple sequence alignment