Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGF/cglF
Type: Machinery gene
Locus tag: SPG_1963
Genome accession: CP001015
Coordinates: 1861200..1861661 (-)
Length: 153 a.a.
NCBI ID: ACF55334.1
UniProt ID: -
Organism: Streptococcus pneumoniae G54
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
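
The coordinates and the protein length in this record are related by simple arithmetic: the inclusive range 1861200..1861661 spans 462 bp, which is 154 codons, and one of those is the stop codon, leaving 153 amino acids. A minimal Python sketch of that check follows. The function names are hypothetical, and the sketch assumes 1-based inclusive coordinates and an unspliced coding sequence ending in a single stop codon.

```python
def span_bp(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends with one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_bp(1861200, 1861661)
print(cds_bp, protein_length(cds_bp))  # 462 153
```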

Genomic Context


Location: 1856200..1866661
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SPG_1955 - 1856504..1857328 (-) 825 ACF56295.1 SpoIIIJ family protein -
  SPG_1956 rnpA 1857303..1857674 (-) 372 ACF55368.1 ribonuclease P protein component -
  SPG_1957 - 1857691..1857822 (-) 132 ACF54788.1 conserved hypothetical protein -
  SPG_1958 ackA 1857823..1859013 (-) 1191 ACF56573.1 acetate kinase -
  SPG_1959 comYH 1859064..1860017 (-) 954 ACF56004.1 conserved hypothetical protein Machinery gene
  SPG_1960 - 1860078..1860458 (-) 381 ACF56765.1 hypothetical protein -
  SPG_1961 - 1860490..1860672 (-) 183 ACF55149.1 conserved hypothetical protein -
  SPG_1962 comGG/cglG 1860809..1861222 (-) 414 ACF54761.1 conserved hypothetical protein Machinery gene
  SPG_1963 comGF/cglF 1861200..1861661 (-) 462 ACF55334.1 conserved hypothetical protein Machinery gene
  SPG_1964 comGE/cglE 1861624..1861926 (-) 303 ACF56096.1 hypothetical protein Machinery gene
  SPG_1965 comGD/cglD 1861889..1862293 (-) 405 ACF56539.1 competence protein CglD Machinery gene
  SPG_1966 comGC/cglC 1862286..1862612 (-) 327 ACF55098.1 competence protein CglC Machinery gene
  SPG_1967 comGB/cglB 1862614..1863630 (-) 1017 ACF55582.1 competence protein CglB Machinery gene
  SPG_1968 comGA/cglA/cilD 1863578..1864519 (-) 942 ACF55208.1 competence protein CglA Machinery gene
  SPG_1969 - 1864595..1864960 (-) 366 ACF55572.1 conserved hypothetical protein -
  SPG_1970 - 1865111..1866169 (-) 1059 ACF56524.1 alcohol dehydrogenase, zinc-containing -

Sequence


Protein


Length: 153 a.a.
Molecular weight: 17918.53 Da
Isoelectric point: 9.3137

>NTDB_id=20233 SPG_1963 ACF55334.1 1861200..1861661(-) (comGF/cglF) [Streptococcus pneumoniae G54]
MVQNSCWQSKSHKVKAFTLLESLLALIVISGGLLLFQTMSQLLISEVRYQQQSEQKEWLLFVDQLEAELDRSQFEKVEGN
RLYMKQDGKDIAIGKSKSDDFRKTNARGRGYQPMVYGLKSAQITEDNQLVHFRFQFQKGLEREFIYRVEKEKS

Nucleotide


Length: 462 bp

>NTDB_id=20233 SPG_1963 ACF55334.1 1861200..1861661(-) (comGF/cglF) [Streptococcus pneumoniae G54]
ATGGTTCAGAACAGTTGTTGGCAATCAAAGAGCCATAAGGTCAAGGCTTTTACCTTGTTAGAATCCCTACTTGCCCTCAT
TGTCATCAGTGGGGGATTACTCCTTTTTCAAACTATGAGTCAGCTCCTCATTTCAGAAGTTCGCTACCAGCAACAAAGCG
AGCAAAAGGAGTGGCTCTTGTTTGTGGACCAACTTGAGGCAGAATTAGACCGTTCGCAGTTCGAAAAAGTAGAAGGCAAT
CGCCTATACATGAAGCAAGATGGCAAGGACATCGCCATCGGTAAGTCAAAGTCAGACGATTTTCGTAAAACGAATGCTCG
TGGTCGAGGTTATCAGCCTATGGTTTATGGCCTCAAATCAGCTCAGATTACAGAGGACAATCAACTGGTTCATTTTCGTT
TCCAGTTTCAAAAAGGCTTAGAAAGGGAGTTCATCTATCGTGTGGAAAAAGAAAAAAGTTAA
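
Although the gene lies on the minus strand, the nucleotide record above is already written in the coding direction (it starts with ATG and ends with TAA), so it can be translated directly. The sketch below is one way to check the nucleotide record against the protein record. It assumes the standard codon assignments; `translate` and the table names are hypothetical helpers, not part of any tool mentioned on this page. Only the first 30 and last 18 nucleotides of the record are used as input here.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt of the record: matches the start of the protein sequence.
print(translate("ATGGTTCAGAACAGTTGTTGGCAATCAAAG"))  # MVQNSCWQSK
# Last 18 nt of the record: matches the end of the protein, then the TAA stop.
print(translate("GAAAAAGAAAAAAGTTAA"))  # EKEKS
```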


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGF/cglF Streptococcus pneumoniae Rx1 96.078 100 0.961
  comGF/cglF Streptococcus pneumoniae D39 96.078 100 0.961
  comGF/cglF Streptococcus pneumoniae R6 96.078 100 0.961
  comGF/cglF Streptococcus pneumoniae TIGR4 96.078 100 0.961
  comGF/cglF Streptococcus mitis SK321 92.157 100 0.922
  comGF/cglF Streptococcus mitis NCTC 12261 90.85 100 0.909
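
The Ha-value column is not defined on this page. The listed numbers are consistent with the product of fractional identity and fractional coverage, agreeing to within rounding in the third decimal place. That relationship is an inference from the table, not a documented definition, and the function name below is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: fractional identity times fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Two rows taken from the table of similar proteins.
rows = [
    ("Streptococcus pneumoniae Rx1", 96.078, 100),
    ("Streptococcus mitis SK321", 92.157, 100),
]
for organism, identity, coverage in rows:
    print(organism, round(ha_value(identity, coverage), 3))
```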


Multiple sequence alignment