Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGE/cglE   Type   Machinery gene
Locus tag   SNAG_RS01195 Genome accession   NZ_AP017652
Coordinates   207483..207785 (+) Length   100 a.a.
NCBI ID   WP_172842372.1    Uniprot ID   A0A1E1G8U1
Organism   Streptococcus sp. NPS 308     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 202483..212785
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SNAG_RS01165 (SNAG_0235) nagA 203162..204313 (+) 1152 WP_096405914.1 N-acetylglucosamine-6-phosphate deacetylase -
  SNAG_RS01170 (SNAG_0236) - 204465..204830 (+) 366 WP_096405916.1 DUF1033 family protein -
  SNAG_RS01175 (SNAG_0237) comGA/cglA/cilD 204893..205834 (+) 942 WP_096405917.1 competence type IV pilus ATPase ComGA Machinery gene
  SNAG_RS01180 (SNAG_0238) comGB/cglB 205782..206798 (+) 1017 WP_096405919.1 competence type IV pilus assembly protein ComGB Machinery gene
  SNAG_RS01185 (SNAG_0239) comGC/cglC 206800..207123 (+) 324 WP_001037925.1 competence type IV pilus major pilin ComGC Machinery gene
  SNAG_RS01190 (SNAG_0240) comGD/cglD 207086..207520 (+) 435 WP_172842371.1 competence type IV pilus minor pilin ComGD Machinery gene
  SNAG_RS01195 (SNAG_0241) comGE/cglE 207483..207785 (+) 303 WP_172842372.1 competence type IV pilus minor pilin ComGE Machinery gene
  SNAG_RS01200 (SNAG_0242) comGF/cglF 207748..208209 (+) 462 WP_096405922.1 competence type IV pilus minor pilin ComGF Machinery gene
  SNAG_RS01205 (SNAG_0243) comGG/cglG 208187..208600 (+) 414 WP_096405924.1 competence type IV pilus minor pilin ComGG Machinery gene
  SNAG_RS01210 (SNAG_0244) - 208633..209223 (+) 591 WP_172842373.1 class I SAM-dependent methyltransferase -
  SNAG_RS01215 (SNAG_0245) comYH 209282..210235 (+) 954 WP_096405925.1 class I SAM-dependent methyltransferase Machinery gene
  SNAG_RS01220 (SNAG_0246) - 210285..211475 (+) 1191 WP_000167798.1 acetate kinase -
  SNAG_RS01225 (SNAG_0247) rnpA 211635..212006 (+) 372 WP_000739254.1 ribonuclease P protein component -

Sequence


Protein


Download         Length: 100 a.a.        Molecular weight: 11224.25 Da        Isoelectric Point: 10.0119

>NTDB_id=68322 SNAG_RS01195 WP_172842372.1 207483..207785(+) (comGE/cglE) [Streptococcus sp. NPS 308]
MEKLNALRKQKIRAVILLEAVVALAIFASIATLLLGQIQKNRQEEAKILQKEEVLRVAKMALQTGQNQVKINGVEIQVFS
SEKGLEVYHGSEKLLDLKEQ

Nucleotide


Download         Length: 303 bp        

>NTDB_id=68322 SNAG_RS01195 WP_172842372.1 207483..207785(+) (comGE/cglE) [Streptococcus sp. NPS 308]
ATGGAAAAATTAAACGCATTAAGGAAACAAAAAATTAGGGCAGTGATTTTACTGGAAGCAGTAGTTGCTTTAGCTATCTT
TGCCAGCATTGCGACGCTTCTTTTGGGACAAATTCAGAAAAATAGGCAAGAAGAAGCAAAAATCTTACAAAAGGAAGAAG
TCTTGAGGGTAGCTAAGATGGCACTGCAGACAGGTCAAAATCAGGTAAAGATAAACGGAGTTGAGATTCAGGTGTTTTCT
AGTGAAAAGGGATTGGAGGTCTACCATGGTTCAGAGAAGTTACTCGACCTTAAAGAGCAGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1E1G8U1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGE/cglE Streptococcus mitis SK321

95.96

99

0.95

  comGE/cglE Streptococcus pneumoniae Rx1

95.96

99

0.95

  comGE/cglE Streptococcus pneumoniae D39

95.96

99

0.95

  comGE/cglE Streptococcus pneumoniae R6

95.96

99

0.95

  comGE/cglE Streptococcus pneumoniae TIGR4

95.96

99

0.95

  comGE/cglE Streptococcus mitis NCTC 12261

94.949

99

0.94

  comYE Streptococcus mutans UA140

43.478

92

0.4

  comYE Streptococcus mutans UA159

43.478

92

0.4


Multiple sequence alignment