Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGB/cglB
Type: Machinery gene
Locus tag: SNAG_RS01180
Genome accession: NZ_AP017652
Coordinates: 205782..206798 (+)
Length: 338 a.a.
NCBI ID: WP_096405919.1
UniProt ID: -
Organism: Streptococcus sp. NPS 308
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
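The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank-style convention) and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the terminal stop codon


cds_bp = feature_length_bp(205782, 206798)
print(cds_bp)                     # 1017
print(protein_length_aa(cds_bp))  # 338
```

Both values agree with the lengths reported for this entry (1017 bp and 338 a.a.).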

Genomic Context


Location: 200782..211798
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SNAG_RS01160 (SNAG_0234) - 201204..203021 (+) 1818 WP_096405913.1 acyltransferase family protein -
  SNAG_RS01165 (SNAG_0235) nagA 203162..204313 (+) 1152 WP_096405914.1 N-acetylglucosamine-6-phosphate deacetylase -
  SNAG_RS01170 (SNAG_0236) - 204465..204830 (+) 366 WP_096405916.1 DUF1033 family protein -
  SNAG_RS01175 (SNAG_0237) comGA/cglA/cilD 204893..205834 (+) 942 WP_096405917.1 competence type IV pilus ATPase ComGA Machinery gene
  SNAG_RS01180 (SNAG_0238) comGB/cglB 205782..206798 (+) 1017 WP_096405919.1 competence type IV pilus assembly protein ComGB Machinery gene
  SNAG_RS01185 (SNAG_0239) comGC/cglC 206800..207123 (+) 324 WP_001037925.1 competence type IV pilus major pilin ComGC Machinery gene
  SNAG_RS01190 (SNAG_0240) comGD/cglD 207086..207520 (+) 435 WP_172842371.1 competence type IV pilus minor pilin ComGD Machinery gene
  SNAG_RS01195 (SNAG_0241) comGE/cglE 207483..207785 (+) 303 WP_172842372.1 competence type IV pilus minor pilin ComGE Machinery gene
  SNAG_RS01200 (SNAG_0242) comGF/cglF 207748..208209 (+) 462 WP_096405922.1 competence type IV pilus minor pilin ComGF Machinery gene
  SNAG_RS01205 (SNAG_0243) comGG/cglG 208187..208600 (+) 414 WP_096405924.1 competence type IV pilus minor pilin ComGG Machinery gene
  SNAG_RS01210 (SNAG_0244) - 208633..209223 (+) 591 WP_172842373.1 class I SAM-dependent methyltransferase -
  SNAG_RS01215 (SNAG_0245) comYH 209282..210235 (+) 954 WP_096405925.1 class I SAM-dependent methyltransferase Machinery gene
  SNAG_RS01220 (SNAG_0246) - 210285..211475 (+) 1191 WP_000167798.1 acetate kinase -
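The listed location (200782..211798) is consistent with a window extending 5000 bp on either side of the comGB/cglB coordinates, and several adjacent genes in the table share bases with their neighbours. The sketch below reproduces both observations. The 5000 bp flank is an inference from the numbers above rather than a documented parameter, and the helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the listed window, not documented


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Window around a gene, clamped at position 1 (1-based, inclusive)."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Bases shared by two 1-based inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


com_ga = (204893, 205834)  # comGA/cglA/cilD
com_gb = (205782, 206798)  # comGB/cglB
com_gc = (206800, 207123)  # comGC/cglC

print(context_window(*com_gb))     # (200782, 211798)
print(overlap_bp(com_ga, com_gb))  # 53
print(overlap_bp(com_gb, com_gc))  # 0
```

Under these assumptions, comGA and comGB overlap by 53 bp, while comGB and comGC are separated by a single base.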

Sequence


Protein


Length: 338 a.a.; Molecular weight: 38645.97 Da; Isoelectric point: 9.5636

>NTDB_id=68319 SNAG_RS01180 WP_096405919.1 205782..206798(+) (comGB/cglB) [Streptococcus sp. NPS 308]
MDISQVFRLKRKKLPTAKQKKIITLFHNLFSSGFHLVEIISFLGRSALLEKDYVAQMHQGLSQGKSFSEMMNSLGFSSAI
VTQLSLAEVHGNLHLSLGKIEEYLDNLAKVKKKLIEVATYPLILLGFLLLIMLGLRNYLLPQLDSSNIATQIIGNLPQIF
LVLVLIFSLILLLALTFYKRSSKMCVFSILARIPFLGIFVQTYLTAYYAREWGNMISQGMELMQIFQIMQEQGSQLFKEI
GQDLAQALQNGREFSQIIATYPFFKKELSLIIEYGEVKSKLGSELEIYAEKTWEAFFTRVNRTMNLVQPLVFIFVALIIV
LLYAAMLMPMYQNMEVNF

Nucleotide


Length: 1017 bp

>NTDB_id=68319 SNAG_RS01180 WP_096405919.1 205782..206798(+) (comGB/cglB) [Streptococcus sp. NPS 308]
ATGGACATATCACAAGTCTTCAGGCTGAAACGGAAAAAATTACCTACCGCTAAGCAGAAGAAAATTATCACCCTGTTTCA
TAATCTCTTCTCTAGTGGATTTCACTTGGTGGAAATTATTTCTTTCTTGGGCAGAAGTGCGCTGCTGGAAAAGGACTATG
TGGCCCAGATGCACCAAGGCTTGTCTCAGGGAAAATCTTTCTCAGAAATGATGAACAGCTTGGGCTTTTCAAGTGCTATT
GTGACCCAGCTATCCCTAGCTGAAGTTCATGGAAATCTTCATCTGAGTTTGGGCAAGATAGAAGAATATCTGGATAATTT
GGCCAAGGTTAAGAAAAAGTTAATCGAAGTGGCGACTTATCCCTTGATTTTGCTGGGATTTCTCCTGCTAATCATGCTGG
GGTTGAGGAACTATTTGCTCCCTCAACTAGATAGTAGCAATATTGCTACCCAAATCATCGGCAATCTGCCACAAATTTTT
CTGGTACTCGTGCTGATTTTCTCTCTAATTCTACTTTTAGCCCTCACTTTCTATAAAAGAAGTTCCAAAATGTGCGTCTT
TTCTATTTTGGCACGGATTCCTTTCCTTGGAATCTTTGTTCAGACCTATCTGACGGCCTATTACGCGCGTGAATGGGGCA
ATATGATTTCGCAGGGGATGGAGCTGATGCAGATTTTTCAGATTATGCAGGAACAAGGCTCTCAGCTCTTTAAAGAAATC
GGTCAAGATCTGGCTCAAGCCCTGCAAAATGGTCGTGAATTTTCCCAAATCATAGCAACCTATCCCTTCTTTAAAAAGGA
GTTGAGTCTCATCATCGAGTATGGGGAAGTCAAGTCCAAGCTGGGGAGCGAGTTGGAAATCTATGCCGAAAAAACTTGGG
AAGCCTTTTTTACCCGAGTCAATCGCACCATGAACTTAGTACAGCCACTGGTTTTTATTTTTGTGGCCCTGATTATCGTT
TTACTTTATGCGGCAATGCTTATGCCCATGTATCAAAATATGGAGGTAAATTTTTAA
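The nucleotide and protein records can be checked against each other by translating the coding sequence. The following is a minimal sketch using only the Python standard library and the standard genetic code. It translates just the first six and last five codons of the sequence above; the helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


print(translate("ATGGACATATCACAAGTC"))  # MDISQV
print(translate("GAGGTAAATTTTTAA"))     # EVNF*
```

The two fragments match the start (MDISQV) and end (EVNF) of the protein sequence listed above, with the final TAA codon acting as the stop.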


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein     Organism                                         Identities (%)  Coverage (%)  Ha-value
comGB/cglB  Streptococcus mitis SK321                        92.604          100           0.926
comGB/cglB  Streptococcus mitis NCTC 12261                   91.716          100           0.917
comGB/cglB  Streptococcus pneumoniae Rx1                     90.237          100           0.902
comGB/cglB  Streptococcus pneumoniae D39                     90.237          100           0.902
comGB/cglB  Streptococcus pneumoniae R6                      90.237          100           0.902
comGB/cglB  Streptococcus pneumoniae TIGR4                   90.237          100           0.902
comYB       Streptococcus gordonii str. Challis substr. CH1  73.512          99.408        0.731
comYB       Streptococcus mutans UA140                       58.228          93.491        0.544
comYB       Streptococcus mutans UA159                       58.228          93.491        0.544
comGB       Lactococcus lactis subsp. cremoris KW2           51.497          98.817        0.509
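The Ha-values listed here are numerically consistent with the product of the identity and coverage fractions, rounded to three decimals. This is an inference from the listed rows, not a definition given in this record, so the sketch below should be read as a consistency check. The function name is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relation: (identity / 100) * (coverage / 100), 3 decimals."""
    return round(identity_pct * coverage_pct / 10_000, 3)


# (identity %, coverage %, Ha-value as listed above)
rows = [
    (92.604, 100.0, 0.926),   # S. mitis SK321
    (73.512, 99.408, 0.731),  # S. gordonii str. Challis substr. CH1
    (58.228, 93.491, 0.544),  # S. mutans UA140 / UA159
    (51.497, 98.817, 0.509),  # L. lactis subsp. cremoris KW2
]

for identity, coverage, listed in rows:
    print(identity, coverage, ha_value(identity, coverage), listed)
```

For each of these rows the computed value matches the listed one, which is why the Ha-value equals the identity fraction whenever coverage is 100%.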


Multiple sequence alignment