Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA/cglA/cilD   Type   Machinery gene
Locus tag   SNAG_RS01175 Genome accession   NZ_AP017652
Coordinates   204893..205834 (+) Length   313 a.a.
NCBI ID   WP_096405917.1    Uniprot ID   A0A1E1G8V7
Organism   Streptococcus sp. NPS 308     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 199893..210834
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SNAG_RS01155 (SNAG_0233) tgt 199960..201102 (+) 1143 WP_096405911.1 tRNA guanosine(34) transglycosylase Tgt -
  SNAG_RS01160 (SNAG_0234) - 201204..203021 (+) 1818 WP_096405913.1 acyltransferase family protein -
  SNAG_RS01165 (SNAG_0235) nagA 203162..204313 (+) 1152 WP_096405914.1 N-acetylglucosamine-6-phosphate deacetylase -
  SNAG_RS01170 (SNAG_0236) - 204465..204830 (+) 366 WP_096405916.1 DUF1033 family protein -
  SNAG_RS01175 (SNAG_0237) comGA/cglA/cilD 204893..205834 (+) 942 WP_096405917.1 competence type IV pilus ATPase ComGA Machinery gene
  SNAG_RS01180 (SNAG_0238) comGB/cglB 205782..206798 (+) 1017 WP_096405919.1 competence type IV pilus assembly protein ComGB Machinery gene
  SNAG_RS01185 (SNAG_0239) comGC/cglC 206800..207123 (+) 324 WP_001037925.1 competence type IV pilus major pilin ComGC Machinery gene
  SNAG_RS01190 (SNAG_0240) comGD/cglD 207086..207520 (+) 435 WP_172842371.1 competence type IV pilus minor pilin ComGD Machinery gene
  SNAG_RS01195 (SNAG_0241) comGE/cglE 207483..207785 (+) 303 WP_172842372.1 competence type IV pilus minor pilin ComGE Machinery gene
  SNAG_RS01200 (SNAG_0242) comGF/cglF 207748..208209 (+) 462 WP_096405922.1 competence type IV pilus minor pilin ComGF Machinery gene
  SNAG_RS01205 (SNAG_0243) comGG/cglG 208187..208600 (+) 414 WP_096405924.1 competence type IV pilus minor pilin ComGG Machinery gene
  SNAG_RS01210 (SNAG_0244) - 208633..209223 (+) 591 WP_172842373.1 class I SAM-dependent methyltransferase -
  SNAG_RS01215 (SNAG_0245) comYH 209282..210235 (+) 954 WP_096405925.1 class I SAM-dependent methyltransferase Machinery gene

Sequence


Protein


Download         Length: 313 a.a.        Molecular weight: 35561.61 Da        Isoelectric Point: 5.8189

>NTDB_id=68318 SNAG_RS01175 WP_096405917.1 204893..205834(+) (comGA/cglA/cilD) [Streptococcus sp. NPS 308]
MVQEIAQKMIATAKEKGAQDIYFIPKEATYELHMRIGDERCLIDSYDFEVLAAVISHFKFVAGMNVGEKRRSQLGSCDYQ
HGKKVSSLRLSTVGDYRGHESLVIRLLHDEERELRFWFQDLVELGEQYRQRGLYLFAGPVGSGKTTLMHELAKSLFKGQQ
VMSIEDPVEIKQDDMLQLQLNEAIGLTYENLIKLSLRHRPDLLIIGEIRDSETARAVVRASLTGATVFSTIHAKSIRGVY
ERLLELGVTEEELAVVLQGVCYQRLIGGGGILDFANKDYQEHQPTSWNEQIDQLLKDGHITSLQAETEKITYR

Nucleotide


Download         Length: 942 bp        

>NTDB_id=68318 SNAG_RS01175 WP_096405917.1 204893..205834(+) (comGA/cglA/cilD) [Streptococcus sp. NPS 308]
ATGGTACAAGAAATTGCACAGAAAATGATTGCGACTGCTAAGGAAAAAGGAGCTCAGGATATCTATTTCATTCCCAAGGA
AGCTACTTATGAACTTCATATGCGAATTGGTGATGAGAGGTGTCTCATTGACTCCTATGATTTTGAGGTTTTAGCTGCAG
TGATTAGTCATTTCAAGTTTGTGGCGGGTATGAATGTAGGAGAAAAACGCCGTAGTCAGCTGGGCTCTTGCGACTACCAG
CATGGAAAGAAGGTATCTTCTCTGCGCTTGTCAACCGTAGGAGATTATCGGGGACATGAGAGTTTGGTCATTCGTTTGTT
GCACGACGAGGAAAGGGAGTTGCGCTTTTGGTTTCAGGATCTAGTTGAGTTAGGAGAGCAGTACAGGCAACGGGGCCTCT
ATCTCTTTGCAGGTCCAGTCGGCAGTGGTAAGACGACCTTGATGCACGAATTAGCTAAGTCCCTCTTTAAGGGACAGCAG
GTCATGTCTATCGAAGATCCAGTAGAAATCAAGCAGGACGACATGCTCCAGTTGCAGTTGAATGAGGCGATTGGACTGAC
CTATGAAAATCTAATCAAACTGTCTCTCCGACATCGCCCAGACCTCTTGATTATCGGTGAAATTCGGGACAGTGAGACGG
CGCGTGCAGTTGTCAGAGCCAGTTTGACAGGTGCGACGGTCTTTTCAACCATTCATGCCAAAAGTATCCGAGGTGTTTAT
GAGCGCCTTCTGGAGTTAGGTGTGACGGAGGAGGAACTAGCAGTTGTTCTGCAAGGCGTCTGCTACCAGAGATTAATCGG
GGGAGGAGGAATCCTTGACTTTGCAAACAAAGACTATCAAGAACACCAGCCAACTAGCTGGAATGAGCAGATTGACCAGC
TTCTTAAAGATGGACATATCACAAGTCTTCAGGCTGAAACGGAAAAAATTACCTACCGCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1E1G8V7

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA/cglA/cilD Streptococcus mitis NCTC 12261

89.103

99.681

0.888

  comGA/cglA/cilD Streptococcus pneumoniae Rx1

88.141

99.681

0.879

  comGA/cglA/cilD Streptococcus pneumoniae D39

88.141

99.681

0.879

  comGA/cglA/cilD Streptococcus pneumoniae R6

88.141

99.681

0.879

  comGA/cglA/cilD Streptococcus pneumoniae TIGR4

88.141

99.681

0.879

  comYA Streptococcus gordonii str. Challis substr. CH1

78.71

99.042

0.78

  comYA Streptococcus mutans UA159

65.916

99.361

0.655

  comYA Streptococcus mutans UA140

65.916

99.361

0.655

  comGA/cglA Streptococcus sobrinus strain NIDR 6715-7

62.581

99.042

0.62

  comGA Lactococcus lactis subsp. cremoris KW2

55.769

99.681

0.556

  comGA Latilactobacillus sakei subsp. sakei 23K

43.295

83.387

0.361


Multiple sequence alignment