Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGE/cglE
Type: Machinery gene
Locus tag: SPPN_RS10340
Genome accession: NC_015875
Coordinates: 1982137..1982439 (-)
Length: 100 a.a.
NCBI ID: WP_000413380.1
UniProt ID: -
Organism: Streptococcus pseudopneumoniae IS7493
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
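
The coordinates and the protein length listed above are related by simple arithmetic: an inclusive span of 1982137..1982439 covers 303 bp, which is 101 codons, or 100 residues once the stop codon is excluded. The following minimal sketch checks this; the function names are hypothetical and it assumes the coordinates are 1-based, inclusive, and include the stop codon.

```python
import re

def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)

def span_bp(start, end):
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1

def protein_length(bp, has_stop=True):
    """Residue count for a coding sequence of `bp` bases (assumes no frameshift)."""
    codons, remainder = divmod(bp, 3)
    if remainder:
        raise ValueError("coding sequence length is not a multiple of 3")
    return codons - 1 if has_stop else codons

start, end, strand = parse_coordinates("1982137..1982439 (-)")
gene_bp = span_bp(start, end)
residues = protein_length(gene_bp)
print(strand, gene_bp, residues)
```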

Genomic Context


Location: 1977137..1987439
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SPPN_RS10315 (SPPN_10440) - 1978030..1979232 (+) 1203 WP_000855226.1 ATP-grasp domain-containing protein -
  SPPN_RS10320 (SPPN_10445) - 1979345..1979821 (+) 477 WP_000777684.1 GNAT family N-acetyltransferase -
  SPPN_RS10325 (SPPN_10450) - 1980018..1980860 (+) 843 WP_000840058.1 helix-turn-helix domain-containing protein -
  SPPN_RS10330 (SPPN_10455) comGG/cglG 1981322..1981735 (-) 414 WP_000265635.1 competence type IV pilus minor pilin ComGG Machinery gene
  SPPN_RS10335 (SPPN_10460) comGF/cglF 1981713..1982174 (-) 462 WP_000250511.1 competence type IV pilus minor pilin ComGF Machinery gene
  SPPN_RS10340 (SPPN_10465) comGE/cglE 1982137..1982439 (-) 303 WP_000413380.1 competence type IV pilus minor pilin ComGE Machinery gene
  SPPN_RS10345 (SPPN_10470) comGD/cglD 1982402..1982836 (-) 435 WP_044012294.1 competence type IV pilus minor pilin ComGD Machinery gene
  SPPN_RS10350 (SPPN_10475) comGC/cglC 1982799..1983122 (-) 324 WP_000738647.1 comG operon protein ComGC Machinery gene
  SPPN_RS10355 (SPPN_10480) comGB/cglB 1983124..1984140 (-) 1017 WP_080564700.1 competence type IV pilus assembly protein ComGB Machinery gene
  SPPN_RS10360 (SPPN_10485) comGA/cglA/cilD 1984088..1985029 (-) 942 WP_000249547.1 competence type IV pilus ATPase ComGA Machinery gene
  SPPN_RS10365 (SPPN_10490) - 1985105..1985470 (-) 366 WP_000286407.1 DUF1033 family protein -
  SPPN_RS10370 (SPPN_10495) - 1985621..1986658 (-) 1038 Protein_2082 zinc-dependent alcohol dehydrogenase family protein -
  SPPN_RS10375 (SPPN_10500) - 1986977..1987216 (-) 240 WP_000907298.1 hypothetical protein -
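
The Location window appears to extend 5,000 bp on either side of comGE/cglE (1982137 - 5000 = 1977137 and 1982439 + 5000 = 1987439); that flank size is inferred from the numbers rather than stated on the page. The sketch below reproduces the window under that assumption and also reports how many bases adjacent comG genes share, using the coordinates from the table. The helper names are hypothetical.

```python
# Coordinates copied from the table above (1-based, inclusive, all on the - strand).
COMG_GENES = [
    ("comGG/cglG", 1981322, 1981735),
    ("comGF/cglF", 1981713, 1982174),
    ("comGE/cglE", 1982137, 1982439),
    ("comGD/cglD", 1982402, 1982836),
    ("comGC/cglC", 1982799, 1983122),
    ("comGB/cglB", 1983124, 1984140),
    ("comGA/cglA/cilD", 1984088, 1985029),
]

def context_window(start, end, flank=5000):
    """Window around a gene; the 5,000 bp flank is an assumption inferred from the page."""
    return max(1, start - flank), end + flank

def shared_bp(a, b):
    """Bases shared by two intervals; zero or negative means they do not overlap."""
    return min(a[2], b[2]) - max(a[1], b[1]) + 1

window = context_window(1982137, 1982439)
overlaps = {
    (a[0], b[0]): shared_bp(a, b)
    for a, b in zip(COMG_GENES, COMG_GENES[1:])
}

print(window)
for pair, bp in overlaps.items():
    print(pair, bp)
```

With these coordinates, most neighbouring comG open reading frames overlap by a few dozen bases, and comGC/cglC and comGB/cglB are separated by a single base.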

Sequence


Protein


Length: 100 a.a.        Molecular weight: 11135.11 Da        Isoelectric Point: 9.7273

>NTDB_id=42033 SPPN_RS10340 WP_000413380.1 1982137..1982439(-) (comGE/cglE) [Streptococcus pseudopneumoniae IS7493]
MEKLNALRKQKIRAVILLEAVVALAIFASIATLLLGQIQKNRQEEAKILQKEEVLRVAKMALQTGQNQVNINGVEIQVFS
SEKGLEVYHGSEQLLAIKEP

Nucleotide


Length: 303 bp

>NTDB_id=42033 SPPN_RS10340 WP_000413380.1 1982137..1982439(-) (comGE/cglE) [Streptococcus pseudopneumoniae IS7493]
ATGGAAAAATTAAACGCATTAAGGAAACAAAAAATTAGGGCAGTAATTTTACTGGAAGCAGTAGTCGCTCTAGCTATCTT
TGCCAGCATTGCGACCCTCCTTTTGGGACAAATTCAAAAAAATAGACAAGAAGAAGCAAAAATCTTGCAAAAGGAAGAAG
TCTTGAGGGTAGCTAAGATGGCCTTGCAGACAGGTCAAAATCAGGTAAACATAAATGGAGTTGAGATTCAGGTGTTTTCT
AGTGAAAAGGGATTGGAGGTTTATCATGGTTCAGAACAGTTGTTGGCTATCAAAGAGCCATAA
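
The nucleotide record is given in coding orientation (it starts with ATG and ends with TAA), so it can be checked against the protein record by direct translation. The following is a minimal sketch using the standard codon assignments, which the bacterial translation table shares for elongation codons; the `translate` helper is hypothetical and stops at the first stop codon.

```python
# Sequences copied from the FASTA records above, with line breaks removed.
NUCLEOTIDE = (
    "ATGGAAAAATTAAACGCATTAAGGAAACAAAAAATTAGGGCAGTAATTTTACTGGAAGCAGTAGTCGCTCTAGCTATCTT"
    "TGCCAGCATTGCGACCCTCCTTTTGGGACAAATTCAAAAAAATAGACAAGAAGAAGCAAAAATCTTGCAAAAGGAAGAAG"
    "TCTTGAGGGTAGCTAAGATGGCCTTGCAGACAGGTCAAAATCAGGTAAACATAAATGGAGTTGAGATTCAGGTGTTTTCT"
    "AGTGAAAAGGGATTGGAGGTTTATCATGGTTCAGAACAGTTGTTGGCTATCAAAGAGCCATAA"
)
PROTEIN = (
    "MEKLNALRKQKIRAVILLEAVVALAIFASIATLLLGQIQKNRQEEAKILQKEEVLRVAKMALQTGQNQVNINGVEIQVFS"
    "SEKGLEVYHGSEQLLAIKEP"
)

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)

translated = translate(NUCLEOTIDE)
print(len(NUCLEOTIDE), len(translated), translated == PROTEIN)
```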


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.
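
For a rough, independent look at where a membrane-spanning segment might lie, a Kyte-Doolittle hydropathy scan can be run on the protein sequence. This is not TMHMM and does not reproduce its hidden Markov model; it is a much simpler sliding-window average, shown here only as a sketch. The 19-residue window and the 1.6 cut-off are a commonly cited rule of thumb for this scale and are assumptions here, as are the helper names.

```python
# Kyte-Doolittle hydropathy index per residue.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence copied from the FASTA record on this page.
PROTEIN = (
    "MEKLNALRKQKIRAVILLEAVVALAIFASIATLLLGQIQKNRQEEAKILQKEEVLRVAKMALQTGQNQVNINGVEIQVFS"
    "SEKGLEVYHGSEQLLAIKEP"
)

WINDOW = 19       # assumed window size (rule of thumb for membrane-spanning segments)
THRESHOLD = 1.6   # assumed cut-off on the mean hydropathy

def window_means(seq, window=WINDOW):
    """Mean hydropathy of every full-length window along the sequence."""
    return [
        sum(KD[aa] for aa in seq[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]

means = window_means(PROTEIN)
best_start = max(range(len(means)), key=means.__getitem__)  # 0-based index
best_mean = means[best_start]

print(f"most hydrophobic window: residues {best_start + 1}-{best_start + WINDOW}")
print(f"mean hydropathy {best_mean:.2f}, above threshold: {best_mean > THRESHOLD}")
```

The scan places its most hydrophobic window in the N-terminal part of the protein; this is only a heuristic and should not be read as the TMHMM result.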



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGE/cglE Streptococcus mitis NCTC 12261 99 100 0.99
  comGE/cglE Streptococcus pneumoniae Rx1 99 100 0.99
  comGE/cglE Streptococcus pneumoniae D39 99 100 0.99
  comGE/cglE Streptococcus pneumoniae R6 99 100 0.99
  comGE/cglE Streptococcus pneumoniae TIGR4 99 100 0.99
  comGE/cglE Streptococcus mitis SK321 98 100 0.98
  comYE Streptococcus mutans UA140 43.333 90 0.39
  comYE Streptococcus mutans UA159 43.333 90 0.39
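
The Ha-value column is not defined on this page. The listed values are consistent with the product of the identity and coverage fractions (for example 0.43333 × 0.90 ≈ 0.39), so the sketch below assumes that relationship and checks it against every row. The `ha_value` function is hypothetical and is not taken from the database's own code.

```python
# Rows copied from the table above: (protein, organism, identity %, coverage %, Ha-value).
SIMILAR_PROTEINS = [
    ("comGE/cglE", "Streptococcus mitis NCTC 12261", 99.0, 100.0, 0.99),
    ("comGE/cglE", "Streptococcus pneumoniae Rx1", 99.0, 100.0, 0.99),
    ("comGE/cglE", "Streptococcus pneumoniae D39", 99.0, 100.0, 0.99),
    ("comGE/cglE", "Streptococcus pneumoniae R6", 99.0, 100.0, 0.99),
    ("comGE/cglE", "Streptococcus pneumoniae TIGR4", 99.0, 100.0, 0.99),
    ("comGE/cglE", "Streptococcus mitis SK321", 98.0, 100.0, 0.98),
    ("comYE", "Streptococcus mutans UA140", 43.333, 90.0, 0.39),
    ("comYE", "Streptococcus mutans UA159", 43.333, 90.0, 0.39),
]

def ha_value(identity_pct, coverage_pct):
    """Assumed definition: identity fraction times coverage fraction, to 2 decimals."""
    return round((identity_pct / 100.0) * (coverage_pct / 100.0), 2)

# Compare the assumed formula with the value listed in each row.
mismatches = [
    (protein, organism)
    for protein, organism, identity, coverage, listed in SIMILAR_PROTEINS
    if abs(ha_value(identity, coverage) - listed) > 0.005
]
print("rows that disagree with the assumed formula:", mismatches)
```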


Multiple sequence alignment