Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   STU_RS18150 Genome accession   NC_006448
Coordinates   1658549..1659490 (-) Length   313 a.a.
NCBI ID   WP_011226629.1    Uniprot ID   -
Organism   Streptococcus thermophilus LMG 18311     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1653549..1664490
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  STU_RS18110 (stu1857) - 1653625..1654818 (-) 1194 WP_022096898.1 acetate kinase -
  STU_RS18115 (stu1858) comYH 1654874..1655830 (-) 957 WP_011226622.1 class I SAM-dependent methyltransferase Machinery gene
  STU_RS18120 (stu1859) comGG 1655875..1656192 (-) 318 WP_011226623.1 competence type IV pilus minor pilin ComGG Machinery gene
  STU_RS18125 (stu1860) comGF 1656170..1656607 (-) 438 WP_014608742.1 competence type IV pilus minor pilin ComGF Machinery gene
  STU_RS18130 (stu1861) comGE 1656591..1656884 (-) 294 WP_011226625.1 competence type IV pilus minor pilin ComGE Machinery gene
  STU_RS18135 (stu1862) comGD 1656856..1657284 (-) 429 WP_011226626.1 competence type IV pilus minor pilin ComGD Machinery gene
  STU_RS18140 (stu1863) comGC 1657244..1657570 (-) 327 WP_011226627.1 competence type IV pilus major pilin ComGC Machinery gene
  STU_RS18145 (stu1864) comGB 1657567..1658667 (-) 1101 WP_148148900.1 competence type IV pilus assembly protein ComGB Machinery gene
  STU_RS18150 (stu1865) comGA 1658549..1659490 (-) 942 WP_011226629.1 competence type IV pilus ATPase ComGA Machinery gene
  STU_RS18155 (stu1866) - 1659571..1659933 (-) 363 WP_011226630.1 DUF1033 family protein -
  STU_RS18160 (stu1867) rpoC 1660092..1663730 (-) 3639 WP_011226631.1 DNA-directed RNA polymerase subunit beta' -

Regulatory network


Positive effect      
Negative effect
Regulator Target Regulation
  comX comGA positive effect
  comX ssbA positive effect
  comX comGE positive effect
  comX comGB positive effect
  comX comGD positive effect
  comX comGF positive effect
  comX comGG positive effect
  comX radA positive effect
  comX comEC positive effect
  comX comFC positive effect
  comX recA positive effect
  comX comEA positive effect
  comX comGC positive effect
  comX cinA positive effect
  comX coiA positive effect
  comX comFA positive effect
  comX dprA positive effect
  clpC comX negative effect
  comR comX positive effect
  comR comS positive effect
  comS comX positive effect
  comS comS positive effect
  amiC comS positive effect
  eeP comS positive effect
  amiD comS positive effect
  amiF comS positive effect
  amiA3 comS positive effect
  amiE comS positive effect
  clpP comX negative effect
  mecA comX negative effect

Sequence


Protein


Download         Length: 313 a.a.        Molecular weight: 35422.56 Da        Isoelectric Point: 6.3927

>NTDB_id=336 STU_RS18150 WP_011226629.1 1658549..1659490(-) (comGA) [Streptococcus thermophilus LMG 18311]
MVTEFAKEMIKNADSCGAQDIYVIPRQDNYELYMRVGQERRLIDVYRPDFMASLIGHFKFVAGMMVGEKRRSQLGSCDYD
CGDGHLVSLRLSTVGDYRGLESLVIRILHSERRELVYWNQGIQPIKDALDYRGLYLFAGPVGSGKTTLMHELVQERFKGQ
QVISIEDPVEIKQDNVLQLQVNQAIDMNYDSLIKLSLRHRPDVLIIGEIRDKETARAVIRASLTGVTVLSTIHAKSIAGV
YERLLDLGVDKSELDNALQGIAYMRLIKGGGVIDFARENFQNHSSTNWNQQLEGLVKQGCLTERDIQGEKIKD

Nucleotide


Download         Length: 942 bp        

>NTDB_id=336 STU_RS18150 WP_011226629.1 1658549..1659490(-) (comGA) [Streptococcus thermophilus LMG 18311]
ATGGTAACAGAATTTGCTAAGGAAATGATAAAAAATGCTGATAGTTGTGGGGCTCAAGACATCTATGTCATTCCACGTCA
GGATAATTATGAGCTTTATATGCGAGTCGGGCAAGAGAGGAGATTGATTGATGTATATCGGCCTGATTTCATGGCTAGTC
TTATTGGTCACTTTAAATTTGTGGCGGGAATGATGGTGGGTGAGAAGCGTAGGAGTCAACTGGGTTCATGTGACTATGAT
TGTGGTGACGGTCACTTGGTTTCTCTGCGTTTATCAACTGTTGGAGACTATCGAGGTTTGGAGAGCTTGGTTATCCGTAT
CCTACATTCCGAACGTCGAGAATTAGTGTATTGGAATCAAGGAATCCAGCCTATTAAGGATGCCTTGGATTATAGAGGAC
TGTATCTCTTTGCGGGTCCCGTAGGTTCTGGGAAGACCACACTTATGCATGAGTTGGTTCAGGAGCGTTTTAAGGGGCAG
CAGGTAATTTCTATTGAGGATCCTGTTGAAATCAAACAGGATAATGTCTTGCAACTCCAGGTTAATCAGGCGATTGATAT
GAACTATGATAGTTTGATTAAGCTATCTCTACGTCACCGTCCAGATGTTTTGATTATTGGGGAGATTCGGGATAAGGAGA
CTGCCCGGGCAGTTATTAGAGCTAGTCTAACAGGTGTAACTGTTCTCTCAACTATTCATGCTAAGAGTATTGCTGGCGTT
TACGAACGTCTCTTGGACCTTGGCGTAGATAAGTCTGAGTTGGATAATGCTCTACAAGGCATTGCCTACATGCGTTTGAT
CAAGGGAGGAGGTGTGATTGATTTTGCCAGAGAAAATTTCCAAAACCATTCGTCTACCAACTGGAATCAGCAGTTGGAAG
GTTTGGTTAAACAAGGATGTCTCACTGAGAGGGATATCCAAGGGGAAAAAATTAAAGATTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA/cglA/cilD Streptococcus mitis NCTC 12261

65.273

99.361

0.649

  comYA Streptococcus mutans UA159

64.537

100

0.645

  comYA Streptococcus mutans UA140

64.537

100

0.645

  comGA/cglA/cilD Streptococcus pneumoniae D39

63.987

99.361

0.636

  comGA/cglA/cilD Streptococcus pneumoniae Rx1

63.987

99.361

0.636

  comGA/cglA/cilD Streptococcus pneumoniae R6

63.987

99.361

0.636

  comGA/cglA/cilD Streptococcus pneumoniae TIGR4

63.987

99.361

0.636

  comYA Streptococcus gordonii str. Challis substr. CH1

62.939

100

0.629

  comGA/cglA Streptococcus sobrinus strain NIDR 6715-7

61.218

99.681

0.61

  comGA Lactococcus lactis subsp. cremoris KW2

51.125

99.361

0.508

  comGA Latilactobacillus sakei subsp. sakei 23K

38.816

97.125

0.377


Multiple sequence alignment