Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGE/cglE   Type   Machinery gene
Locus tag   SPG_1964 Genome accession   CP001015
Coordinates   1861624..1861926 (-) Length   100 a.a.
NCBI ID   ACF56096.1    Uniprot ID   -
Organism   Streptococcus pneumoniae G54     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1856624..1866926
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SPG_1956 rnpA 1857303..1857674 (-) 372 ACF55368.1 ribonuclease P protein component -
  SPG_1957 - 1857691..1857822 (-) 132 ACF54788.1 conserved hypothetical protein -
  SPG_1958 ackA 1857823..1859013 (-) 1191 ACF56573.1 acetate kinase -
  SPG_1959 comYH 1859064..1860017 (-) 954 ACF56004.1 conserved hypothetical protein Machinery gene
  SPG_1960 - 1860078..1860458 (-) 381 ACF56765.1 hypothetical protein -
  SPG_1961 - 1860490..1860672 (-) 183 ACF55149.1 conserved hypothetical protein -
  SPG_1962 comGG/cglG 1860809..1861222 (-) 414 ACF54761.1 conserved hypothetical protein Machinery gene
  SPG_1963 comGF/cglF 1861200..1861661 (-) 462 ACF55334.1 conserved hypothetical protein Machinery gene
  SPG_1964 comGE/cglE 1861624..1861926 (-) 303 ACF56096.1 hypothetical protein Machinery gene
  SPG_1965 comGD/cglD 1861889..1862293 (-) 405 ACF56539.1 competence protein CglD Machinery gene
  SPG_1966 comGC/cglC 1862286..1862612 (-) 327 ACF55098.1 competence protein CglC Machinery gene
  SPG_1967 comGB/cglB 1862614..1863630 (-) 1017 ACF55582.1 competence protein CglB Machinery gene
  SPG_1968 comGA/cglA/cilD 1863578..1864519 (-) 942 ACF55208.1 competence protein CglA Machinery gene
  SPG_1969 - 1864595..1864960 (-) 366 ACF55572.1 conserved hypothetical protein -
  SPG_1970 - 1865111..1866169 (-) 1059 ACF56524.1 alcohol dehydrogenase, zinc-containing -

Sequence


Protein


Download         Length: 100 a.a.        Molecular weight: 11108.08 Da        Isoelectric Point: 9.7273

>NTDB_id=20234 SPG_1964 ACF56096.1 1861624..1861926(-) (comGE/cglE) [Streptococcus pneumoniae G54]
MEKLNALRKQKIRAVILLEAVVALAIFASIATLLLGQIQKNRQEEAKILQKEEVLRVAKMALQTGQNQVSINGVEIQVFS
SEKGLEVYHGSEQLLAIKEP

Nucleotide


Download         Length: 303 bp        

>NTDB_id=20234 SPG_1964 ACF56096.1 1861624..1861926(-) (comGE/cglE) [Streptococcus pneumoniae G54]
ATGGAAAAATTAAACGCATTAAGGAAACAAAAAATTAGGGCAGTGATTTTACTGGAAGCAGTAGTCGCTCTAGCTATCTT
TGCCAGCATTGCGACCCTCCTTTTGGGACAAATTCAAAAAAATAGGCAAGAGGAAGCAAAAATCTTGCAAAAGGAAGAAG
TCTTGAGGGTAGCTAAGATGGCCCTGCAGACGGGGCAAAATCAGGTAAGCATCAACGGAGTTGAGATTCAGGTATTTTCT
AGTGAAAAAGGATTGGAGGTCTACCATGGTTCAGAACAGTTGTTGGCAATCAAAGAGCCATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGE/cglE Streptococcus pneumoniae Rx1

100

100

1

  comGE/cglE Streptococcus pneumoniae D39

100

100

1

  comGE/cglE Streptococcus pneumoniae R6

100

100

1

  comGE/cglE Streptococcus pneumoniae TIGR4

100

100

1

  comGE/cglE Streptococcus mitis NCTC 12261

98

100

0.98

  comGE/cglE Streptococcus mitis SK321

98

100

0.98

  comYE Streptococcus mutans UA140

44.444

90

0.4

  comYE Streptococcus mutans UA159

44.444

90

0.4


Multiple sequence alignment