Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGF/cglF
Type: Machinery gene
Locus tag: SPPN_RS10335
Genome accession: NC_015875
Coordinates: 1981713..1982174 (-)
Length: 153 a.a.
NCBI ID: WP_000250511.1
UniProt ID: -
Organism: Streptococcus pseudopneumoniae IS7493
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 1976713..1987174
Locus tag                  Gene name        Coordinates (strand)  Size (bp)  Protein ID      Product                                              Description
SPPN_RS10310 (SPPN_10435)  -                1977129..1978028 (+)  900        WP_000771264.1  hypothetical protein                                 -
SPPN_RS10315 (SPPN_10440)  -                1978030..1979232 (+)  1203       WP_000855226.1  ATP-grasp domain-containing protein                  -
SPPN_RS10320 (SPPN_10445)  -                1979345..1979821 (+)  477        WP_000777684.1  GNAT family N-acetyltransferase                      -
SPPN_RS10325 (SPPN_10450)  -                1980018..1980860 (+)  843        WP_000840058.1  helix-turn-helix domain-containing protein           -
SPPN_RS10330 (SPPN_10455)  comGG/cglG       1981322..1981735 (-)  414        WP_000265635.1  competence type IV pilus minor pilin ComGG           Machinery gene
SPPN_RS10335 (SPPN_10460)  comGF/cglF       1981713..1982174 (-)  462        WP_000250511.1  competence type IV pilus minor pilin ComGF           Machinery gene
SPPN_RS10340 (SPPN_10465)  comGE/cglE       1982137..1982439 (-)  303        WP_000413380.1  competence type IV pilus minor pilin ComGE           Machinery gene
SPPN_RS10345 (SPPN_10470)  comGD/cglD       1982402..1982836 (-)  435        WP_044012294.1  competence type IV pilus minor pilin ComGD           Machinery gene
SPPN_RS10350 (SPPN_10475)  comGC/cglC       1982799..1983122 (-)  324        WP_000738647.1  comG operon protein ComGC                            Machinery gene
SPPN_RS10355 (SPPN_10480)  comGB/cglB       1983124..1984140 (-)  1017       WP_080564700.1  competence type IV pilus assembly protein ComGB      Machinery gene
SPPN_RS10360 (SPPN_10485)  comGA/cglA/cilD  1984088..1985029 (-)  942        WP_000249547.1  competence type IV pilus ATPase ComGA                Machinery gene
SPPN_RS10365 (SPPN_10490)  -                1985105..1985470 (-)  366        WP_000286407.1  DUF1033 family protein                               -
SPPN_RS10370 (SPPN_10495)  -                1985621..1986658 (-)  1038       Protein_2082    zinc-dependent alcohol dehydrogenase family protein  -
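
The coordinate arithmetic behind this table can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, so that size = end - start + 1, and it treats the displayed Location window as the comGF/cglF span padded by 5000 bp on each side. Both are assumptions that happen to reproduce the listed numbers, not documented behaviour of the database. The names `size_bp`, `overlap_bp` and `FLANK` are illustrative only.

```python
# Coordinates (start, end) copied from the comG rows of the genomic context table.
# Assumption: coordinates are 1-based and inclusive.
genes = [
    ("comGG/cglG", 1981322, 1981735),
    ("comGF/cglF", 1981713, 1982174),
    ("comGE/cglE", 1982137, 1982439),
    ("comGD/cglD", 1982402, 1982836),
    ("comGC/cglC", 1982799, 1983122),
    ("comGB/cglB", 1983124, 1984140),
    ("comGA/cglA/cilD", 1984088, 1985029),
]


def size_bp(start, end):
    """Length in bp of an inclusive interval."""
    return end - start + 1


def overlap_bp(a, b):
    """Number of bases shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)


sizes = {name: size_bp(start, end) for name, start, end in genes}

# Overlap between each pair of neighbouring genes, in table order.
overlaps = {(a[0], b[0]): overlap_bp(a, b) for a, b in zip(genes, genes[1:])}

# Hypothetical reconstruction of the displayed window: comGF span +/- 5000 bp.
FLANK = 5000
window = (1981713 - FLANK, 1982174 + FLANK)

for pair, n in overlaps.items():
    print(pair, n)
```

Under these assumptions comGF/cglF shares 23 bp with comGG/cglG and 38 bp with comGE/cglE, while comGC/cglC and comGB/cglB do not overlap.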

Sequence


Protein


Length: 153 a.a.        Molecular weight: 17868.55 Da        Isoelectric point: 9.3016

>NTDB_id=42032 SPPN_RS10335 WP_000250511.1 1981713..1982174(-) (comGF/cglF) [Streptococcus pseudopneumoniae IS7493]
MVQNSCWLSKSHKVKAFTLLESLIALIVISGSLLLFQAMSQLLISDVRYQQQSEQKEWLLFVDQLEVELDRSQFEKVEGN
RLYMKQDGKEIAIGKSKSDDFRKTDASGRGYQPMVYGLKSAQITEDNQVVRFRFQFKKGLEREFIYRVEKEKS

Nucleotide


Length: 462 bp

>NTDB_id=42032 SPPN_RS10335 WP_000250511.1 1981713..1982174(-) (comGF/cglF) [Streptococcus pseudopneumoniae IS7493]
ATGGTTCAGAACAGTTGTTGGCTATCAAAGAGCCATAAGGTCAAGGCTTTTACCTTGTTAGAATCCTTGATTGCCCTCAT
TGTCATCAGTGGAAGCTTACTTCTCTTTCAAGCCATGAGTCAGCTCCTCATTTCAGATGTTCGCTATCAGCAGCAAAGCG
AGCAAAAAGAGTGGCTCTTGTTTGTGGACCAGTTGGAGGTAGAATTAGACCGTTCGCAGTTCGAAAAAGTAGAAGGCAAT
CGCCTCTATATGAAGCAGGATGGCAAGGAAATTGCGATAGGCAAGTCTAAATCAGATGATTTTCGGAAAACCGATGCCAG
TGGACGGGGTTATCAGCCGATGGTTTATGGCCTCAAATCCGCACAGATTACAGAGGACAATCAAGTGGTTCGCTTTCGTT
TCCAGTTCAAAAAAGGCTTAGAAAGGGAGTTCATCTATCGTGTGGAAAAAGAAAAAAGTTAA
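
As a consistency check between the two sequences above, the following sketch translates the nucleotide record and compares it with the protein record. It assumes the nucleotide sequence is already given on the coding strand (it starts with ATG even though the gene lies on the minus strand) and uses the standard genetic code, whose codon-to-amino-acid assignments are the same as in the bacterial translation table 11. The helper name `translate` is illustrative.

```python
# Nucleotide and protein sequences copied from the record above.
CDS = (
    "ATGGTTCAGAACAGTTGTTGGCTATCAAAGAGCCATAAGGTCAAGGCTTTTACCTTGTTAGAATCCTTGATTGCCCTCAT"
    "TGTCATCAGTGGAAGCTTACTTCTCTTTCAAGCCATGAGTCAGCTCCTCATTTCAGATGTTCGCTATCAGCAGCAAAGCG"
    "AGCAAAAAGAGTGGCTCTTGTTTGTGGACCAGTTGGAGGTAGAATTAGACCGTTCGCAGTTCGAAAAAGTAGAAGGCAAT"
    "CGCCTCTATATGAAGCAGGATGGCAAGGAAATTGCGATAGGCAAGTCTAAATCAGATGATTTTCGGAAAACCGATGCCAG"
    "TGGACGGGGTTATCAGCCGATGGTTTATGGCCTCAAATCCGCACAGATTACAGAGGACAATCAAGTGGTTCGCTTTCGTT"
    "TCCAGTTCAAAAAAGGCTTAGAAAGGGAGTTCATCTATCGTGTGGAAAAAGAAAAAAGTTAA"
)
PROTEIN = (
    "MVQNSCWLSKSHKVKAFTLLESLIALIVISGSLLLFQAMSQLLISDVRYQQQSEQKEWLLFVDQLEVELDRSQFEKVEGN"
    "RLYMKQDGKEIAIGKSKSDDFRKTDASGRGYQPMVYGLKSAQITEDNQVVRFRFQFKKGLEREFIYRVEKEKS"
)

# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


translated = translate(CDS)
print(len(CDS), len(translated), translated == PROTEIN)
```

The 462 bp correspond to 153 sense codons plus the terminal TAA stop codon (154 x 3 = 462), which agrees with the lengths stated in the record.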


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein     Organism                        Identities (%)  Coverage (%)  Ha-value
comGF/cglF  Streptococcus mitis NCTC 12261  94.771          100           0.948
comGF/cglF  Streptococcus mitis SK321       93.464          100           0.935
comGF/cglF  Streptococcus pneumoniae Rx1    92.157          100           0.922
comGF/cglF  Streptococcus pneumoniae D39    92.157          100           0.922
comGF/cglF  Streptococcus pneumoniae R6     92.157          100           0.922
comGF/cglF  Streptococcus pneumoniae TIGR4  92.157          100           0.922
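
The record does not define the Ha-value. The listed numbers are, however, consistent with the product of the identity and coverage fractions, rounded to three decimals. The sketch below checks that reading against every row; the formula and the function names `ha_value` and `identical_residues` are assumptions made for illustration, not the database's documented method.

```python
# (organism, identities %, coverage %, listed Ha-value) copied from the table above.
hits = [
    ("Streptococcus mitis NCTC 12261", 94.771, 100, 0.948),
    ("Streptococcus mitis SK321", 93.464, 100, 0.935),
    ("Streptococcus pneumoniae Rx1", 92.157, 100, 0.922),
    ("Streptococcus pneumoniae D39", 92.157, 100, 0.922),
    ("Streptococcus pneumoniae R6", 92.157, 100, 0.922),
    ("Streptococcus pneumoniae TIGR4", 92.157, 100, 0.922),
]

QUERY_LENGTH = 153  # length of this ComGF protein in residues


def ha_value(identity_pct, coverage_pct):
    """Assumed definition: identity fraction times coverage fraction."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


def identical_residues(identity_pct, length=QUERY_LENGTH):
    """Identical positions implied if identity is taken over the full query length."""
    return round(identity_pct / 100 * length)


for organism, identity, coverage, listed in hits:
    print(organism, ha_value(identity, coverage), listed, identical_residues(identity))
```

If identities are counted over the full 153-residue query, the percentages correspond to 145, 143 and 141 identical positions respectively.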


Multiple sequence alignment