Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGG/cglG
Type: Machinery gene
Locus tag: SPG_1962
Genome accession: CP001015
Coordinates: 1860809..1861222 (-)
Length: 137 a.a.
NCBI ID: ACF54761.1
UniProt ID: -
Organism: Streptococcus pneumoniae G54
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 1855809..1866222
Locus tag  Gene name        Coordinates (strand)  Size (bp)  Protein ID  Product                               Description
SPG_1955   -                1856504..1857328 (-)  825        ACF56295.1  SpoIIIJ family protein                -
SPG_1956   rnpA             1857303..1857674 (-)  372        ACF55368.1  ribonuclease P protein component      -
SPG_1957   -                1857691..1857822 (-)  132        ACF54788.1  conserved hypothetical protein        -
SPG_1958   ackA             1857823..1859013 (-)  1191       ACF56573.1  acetate kinase                        -
SPG_1959   comYH            1859064..1860017 (-)  954        ACF56004.1  conserved hypothetical protein        Machinery gene
SPG_1960   -                1860078..1860458 (-)  381        ACF56765.1  hypothetical protein                  -
SPG_1961   -                1860490..1860672 (-)  183        ACF55149.1  conserved hypothetical protein        -
SPG_1962   comGG/cglG       1860809..1861222 (-)  414        ACF54761.1  conserved hypothetical protein        Machinery gene
SPG_1963   comGF/cglF       1861200..1861661 (-)  462        ACF55334.1  conserved hypothetical protein        Machinery gene
SPG_1964   comGE/cglE       1861624..1861926 (-)  303        ACF56096.1  hypothetical protein                  Machinery gene
SPG_1965   comGD/cglD       1861889..1862293 (-)  405        ACF56539.1  competence protein CglD               Machinery gene
SPG_1966   comGC/cglC       1862286..1862612 (-)  327        ACF55098.1  competence protein CglC               Machinery gene
SPG_1967   comGB/cglB       1862614..1863630 (-)  1017       ACF55582.1  competence protein CglB               Machinery gene
SPG_1968   comGA/cglA/cilD  1863578..1864519 (-)  942        ACF55208.1  competence protein CglA               Machinery gene
SPG_1969   -                1864595..1864960 (-)  366        ACF55572.1  conserved hypothetical protein        -
SPG_1970   -                1865111..1866169 (-)  1059       ACF56524.1  alcohol dehydrogenase, zinc-containing  -
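The coordinates in the Genomic Context table are 1-based and inclusive, so each size in bp should equal end - start + 1, and neighbouring genes in the comG/cgl cluster may share a few bases. The sketch below recomputes the sizes for SPG_1962 to SPG_1968 and reports overlaps between adjacent genes. It also checks that the stated location window matches the gene coordinates extended by 5000 bp on each side; that flank size is inferred from the numbers in this record, not stated by it. The names `GENES`, `span_length` and `overlap` are illustrative only.

```python
# Rows copied from the Genomic Context table: (locus tag, start, end, listed size in bp).
GENES = [
    ("SPG_1962", 1860809, 1861222, 414),
    ("SPG_1963", 1861200, 1861661, 462),
    ("SPG_1964", 1861624, 1861926, 303),
    ("SPG_1965", 1861889, 1862293, 405),
    ("SPG_1966", 1862286, 1862612, 327),
    ("SPG_1967", 1862614, 1863630, 1017),
    ("SPG_1968", 1863578, 1864519, 942),
]


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def overlap(a, b):
    """Number of bases shared by two (tag, start, end, size) rows; 0 if disjoint."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)


# Recompute every listed size from its coordinates.
size_ok = all(span_length(s, e) == size for _, s, e, size in GENES)

# Overlap in bp between each pair of adjacent genes.
overlaps = {
    (a[0], b[0]): overlap(a, b) for a, b in zip(GENES, GENES[1:])
}

# Assumption: the displayed window is the gene extended by 5000 bp per side.
FLANK = 5000
window = (GENES[0][1] - FLANK, GENES[0][2] + FLANK)

if __name__ == "__main__":
    print("sizes consistent:", size_ok)
    for pair, bp in overlaps.items():
        print(pair, bp, "bp overlap")
    print("window:", window)
```

Under these assumptions, comGG/cglG (SPG_1962) shares its last bases with comGF/cglF (SPG_1963), which is common for genes arranged in a single operon-like cluster on the same strand.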

Sequence


Protein


Length: 137 a.a.        Molecular weight: 15955.42 Da        Isoelectric point: 10.2472

>NTDB_id=20232 SPG_1962 ACF54761.1 1860809..1861222(-) (comGG/cglG) [Streptococcus pneumoniae G54]
MWKKKKVKAGVLLYAVTIAAIFSLLLQFYLNRQVAHYQDYALNKEKLVAFAMAKRTKDKVEQESGEQVFNLGQVSYQNKK
TGLVTRVRTDKSQYEFLFPSVKIKEEKRDKKEEVATDSSEKVEKKKSEEKPEKKENS
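A high isoelectric point such as the one listed above generally goes with an excess of basic residues (K, R) over acidic ones (D, E). The sketch below only counts residues in the sequence as given; it does not reproduce the database's molecular weight or pI calculation, whose exact method is not stated in this record. The variable names are illustrative.

```python
from collections import Counter

# Protein sequence copied from the FASTA record above.
PROTEIN = (
    "MWKKKKVKAGVLLYAVTIAAIFSLLLQFYLNRQVAHYQDYALNKEKLVAFAMAKRTKDKVEQESGEQVFNLGQVSYQNKK"
    "TGLVTRVRTDKSQYEFLFPSVKIKEEKRDKKEEVATDSSEKVEKKKSEEKPEKKENS"
)

counts = Counter(PROTEIN)

# Side chains that are mostly charged near neutral pH (histidine ignored here).
basic = counts["K"] + counts["R"]
acidic = counts["D"] + counts["E"]

if __name__ == "__main__":
    print("length:", len(PROTEIN))
    print("basic (K+R):", basic)
    print("acidic (D+E):", acidic)
    print("most common residues:", counts.most_common(5))
```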

Nucleotide


Length: 414 bp

>NTDB_id=20232 SPG_1962 ACF54761.1 1860809..1861222(-) (comGG/cglG) [Streptococcus pneumoniae G54]
GTGTGGAAAAAGAAAAAAGTTAAGGCAGGTGTTCTCCTCTACGCAGTCACCATAGCAGCCATCTTTAGTCTTTTGTTGCA
ATTTTATTTGAACCGACAAGTCGCCCACTATCAAGACTATGCTTTGAATAAAGAAAAATTGGTTGCTTTTGCTATGGCTA
AACGAACCAAAGATAAGGTTGAGCAAGAAAGTGGGGAACAGGTTTTTAATCTAGGTCAGGTAAGCTATCAAAACAAGAAA
ACTGGCTTAGTGACGAGGGTTCGTACGGATAAGAGCCAATATGAGTTTCTGTTTCCTTCAGTCAAAATCAAAGAAGAGAA
AAGAGATAAAAAGGAAGAGGTAGCGACCGATTCAAGCGAAAAAGTGGAGAAGAAAAAATCAGAAGAGAAGCCTGAAAAGA
AAGAGAATTCCTAG
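The nucleotide record is 414 bp, which corresponds to 137 codons plus a stop codon. The sketch below translates it with the standard codon assignments and compares the result with the protein record. Note that the sequence starts with GTG, so the first residue is read as methionine; the code assumes the usual bacterial convention that ATG, GTG and TTG can act as initiation codons. `translate_cds` is a hypothetical helper written for this example, not a function of any library.

```python
# Standard codon assignments, listed in NCBI order over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: alternative bacterial start codons are translated as Met.
START_CODONS = {"ATG", "GTG", "TTG"}

# Sequences copied from the FASTA records above.
CDS = (
    "GTGTGGAAAAAGAAAAAAGTTAAGGCAGGTGTTCTCCTCTACGCAGTCACCATAGCAGCCATCTTTAGTCTTTTGTTGCA"
    "ATTTTATTTGAACCGACAAGTCGCCCACTATCAAGACTATGCTTTGAATAAAGAAAAATTGGTTGCTTTTGCTATGGCTA"
    "AACGAACCAAAGATAAGGTTGAGCAAGAAAGTGGGGAACAGGTTTTTAATCTAGGTCAGGTAAGCTATCAAAACAAGAAA"
    "ACTGGCTTAGTGACGAGGGTTCGTACGGATAAGAGCCAATATGAGTTTCTGTTTCCTTCAGTCAAAATCAAAGAAGAGAA"
    "AAGAGATAAAAAGGAAGAGGTAGCGACCGATTCAAGCGAAAAAGTGGAGAAGAAAAAATCAGAAGAGAAGCCTGAAAAGA"
    "AAGAGAATTCCTAG"
)
PROTEIN = (
    "MWKKKKVKAGVLLYAVTIAAIFSLLLQFYLNRQVAHYQDYALNKEKLVAFAMAKRTKDKVEQESGEQVFNLGQVSYQNKK"
    "TGLVTRVRTDKSQYEFLFPSVKIKEEKRDKKEEVATDSSEKVEKKKSEEKPEKKENS"
)


def translate_cds(cds):
    """Translate a coding sequence given 5'->3' on its coding strand."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is read as methionine
    protein = "".join(residues)
    # Drop the trailing stop codon symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


translated = translate_cds(CDS)

if __name__ == "__main__":
    print("CDS length:", len(CDS))
    print("protein length:", len(translated))
    print("matches protein record:", translated == PROTEIN)
```

The gene lies on the minus strand, but the record already lists the sequence in coding orientation, so no reverse complement is applied here.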

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein     Organism                             Identities (%)  Coverage (%)  Ha-value
comGG/cglG  Streptococcus pneumoniae Rx1         99.27           100           0.993
comGG/cglG  Streptococcus pneumoniae D39         99.27           100           0.993
comGG/cglG  Streptococcus pneumoniae R6          99.27           100           0.993
comGG/cglG  Streptococcus pneumoniae TIGR4       99.27           100           0.993
comGG/cglG  Streptococcus mitis SK321            87.591          100           0.876
comGG/cglG  Streptococcus mitis NCTC 12261       89.256          88.321        0.788
comYG       Streptococcus mutans UA140           38.806          97.81         0.38
comYG       Streptococcus mutans UA159           38.806          97.81         0.38
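The record does not define the Ha-value, but the listed numbers appear consistent with the product of identity and coverage, each expressed as a fraction. The sketch below checks that reading against the rows listed for this entry. This is an inference from the data shown here, not a documented definition, and `identity_times_coverage` is a hypothetical helper.

```python
# Rows copied from the Similar proteins table:
# (organism, identities %, coverage %, listed Ha-value).
ROWS = [
    ("Streptococcus pneumoniae Rx1", 99.27, 100.0, 0.993),
    ("Streptococcus pneumoniae D39", 99.27, 100.0, 0.993),
    ("Streptococcus pneumoniae R6", 99.27, 100.0, 0.993),
    ("Streptococcus pneumoniae TIGR4", 99.27, 100.0, 0.993),
    ("Streptococcus mitis SK321", 87.591, 100.0, 0.876),
    ("Streptococcus mitis NCTC 12261", 89.256, 88.321, 0.788),
    ("Streptococcus mutans UA140", 38.806, 97.81, 0.38),
    ("Streptococcus mutans UA159", 38.806, 97.81, 0.38),
]


def identity_times_coverage(identity_pct, coverage_pct):
    """Assumed relation: product of identity and coverage as fractions."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Largest gap between the assumed formula and the listed values.
max_deviation = max(
    abs(identity_times_coverage(ident, cov) - listed)
    for _, ident, cov, listed in ROWS
)

if __name__ == "__main__":
    for organism, ident, cov, listed in ROWS:
        computed = identity_times_coverage(ident, cov)
        print(f"{organism}: computed {computed:.4f}, listed {listed}")
    print("max deviation:", max_deviation)
```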

