Detailed information    

in silico (bioinformatically predicted)

Overview


Name: comYH
Type: Machinery gene
Locus tag: SGGB_RS00665
Genome accession: NC_017576
Coordinates: 102840..103796 (+)
Length: 318 a.a.
NCBI ID: WP_009853236.1
UniProt ID: A0A139R3Z2
Organism: Streptococcus gallolyticus subsp. gallolyticus ATCC 43143
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
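
The coordinates and the protein length listed in this overview can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon; the helper name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1

# Coordinates of SGGB_RS00665 (comYH) as listed in the overview.
gene_bp = span_length(102840, 103796)

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_bp // 3
protein_aa = codons - 1

print(gene_bp, protein_aa)
```

Under these assumptions the result agrees with the 957 bp nucleotide length and the 318 a.a. protein length given on this page.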

Genomic Context


Location: 97840..108796
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
SGGB_RS00625 (SGGB_0086) | - | 98758..99126 (+) | 369 | WP_009853228.1 | DUF1033 family protein | -
SGGB_RS00630 (SGGB_0087) | comYA | 99197..100138 (+) | 942 | WP_012961327.1 | competence type IV pilus ATPase ComGA | Machinery gene
SGGB_RS00635 (SGGB_0088) | comYB | 100020..101105 (+) | 1086 | WP_012961328.1 | competence type IV pilus assembly protein ComGB | Machinery gene
SGGB_RS00640 (SGGB_0089) | comYC | 101105..101398 (+) | 294 | WP_009853231.1 | competence type IV pilus major pilin ComGC | Machinery gene
SGGB_RS00645 (SGGB_0090) | comYD | 101382..101813 (+) | 432 | WP_009853232.1 | competence type IV pilus minor pilin ComGD | Machinery gene
SGGB_RS00650 (SGGB_0091) | comGE | 101767..102060 (+) | 294 | WP_012961329.1 | competence type IV pilus minor pilin ComGE | -
SGGB_RS00655 (SGGB_0092) | comYF | 102014..102481 (+) | 468 | WP_014619875.1 | competence type IV pilus minor pilin ComGF | Machinery gene
SGGB_RS00660 (SGGB_0093) | comYG | 102453..102767 (+) | 315 | WP_009853235.1 | competence type IV pilus minor pilin ComGG | Machinery gene
SGGB_RS00665 (SGGB_0094) | comYH | 102840..103796 (+) | 957 | WP_009853236.1 | class I SAM-dependent methyltransferase | Machinery gene
SGGB_RS00670 (SGGB_0095) | - | 103849..105048 (+) | 1200 | WP_009853237.1 | acetate kinase | -
SGGB_RS00675 (SGGB_0096) | - | 105279..105479 (+) | 201 | WP_012961332.1 | helix-turn-helix transcriptional regulator | -
SGGB_RS00680 (SGGB_0097) | - | 105489..105698 (+) | 210 | WP_009853239.1 | hypothetical protein | -
SGGB_RS00685 (SGGB_0098) | - | 105711..106163 (+) | 453 | WP_009853240.1 | hypothetical protein | -
SGGB_RS00690 (SGGB_0099) | - | 106175..106813 (+) | 639 | WP_009853241.1 | CPBP family intramembrane glutamic endopeptidase | -
SGGB_RS00695 (SGGB_0100) | proC | 106873..107643 (-) | 771 | WP_009853242.1 | pyrroline-5-carboxylate reductase | -
SGGB_RS00700 (SGGB_0101) | pepA | 107704..108771 (-) | 1068 | WP_009853243.1 | glutamyl aminopeptidase | -
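
The genomic context rows can be checked for internal consistency and used to see how tightly the competence genes are packed. The following is a minimal sketch, assuming 1-based inclusive coordinates; the tuple layout and the names `genes`, `spacing` and `flanks` are illustrative choices, and only the comYA to comYH rows are copied from the table above.

```python
# (gene label, start, end, listed size in bp), copied from the genomic context table.
genes = [
    ("comYA", 99197, 100138, 942),
    ("comYB", 100020, 101105, 1086),
    ("comYC", 101105, 101398, 294),
    ("comYD", 101382, 101813, 432),
    ("comGE", 101767, 102060, 294),
    ("comYF", 102014, 102481, 468),
    ("comYG", 102453, 102767, 315),
    ("comYH", 102840, 103796, 957),
]

def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1

# Each listed size should equal end - start + 1.
size_ok = all(span_length(s, e) == size for _, s, e, size in genes)

# Spacing between neighbouring genes: positive = intergenic gap (bp),
# negative = overlap (bp) between the two coordinate ranges.
spacing = {
    f"{a[0]}->{b[0]}": b[1] - a[2] - 1
    for a, b in zip(genes, genes[1:])
}

# Flanking region shown around comYH (Location: 97840..108796).
target_start, target_end = 102840, 103796
window_start, window_end = 97840, 108796
flanks = (target_start - window_start, window_end - target_end)

print(size_ok)
for pair, bp in spacing.items():
    print(pair, bp)
print(flanks)
```

With the values listed here, the displayed location corresponds to the comYH coordinates extended by the same number of bases on each side, and comYA through comYG have overlapping coordinate ranges while comYH is separated from comYG by a short gap.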

Sequence


Protein


Length: 318 a.a.        Molecular weight: 36027.96 Da        Isoelectric Point: 4.4127

>NTDB_id=48892 SGGB_RS00665 WP_009853236.1 102840..103796(+) (comYH) [Streptococcus gallolyticus subsp. gallolyticus ATCC 43143]
MNFEKIETAYELILENIQLIENELKTHIYDALIEQNSFYLGAEGASEEVAANNEKLRQLALTKEEWRRAFQFIFIKAGQT
EQLQANHQFTPDAIGFILLFLIENLTDSDKIDLLEIGSGTGNLAQTLLNNSSKELNYLGIEVDDLLIDLSASIAEVMDSD
AQFIQEDAVRPQILKESDVIISDLPVGFYPNDDIAKRYKVASSDEHTYAHHLLMEQSLKYLKKDGIAVFLAPVSLLTSKQ
SDLLKQWLKDYADIIAVITLPESIFGNAANAKSIFVLKKQAAHTPETFVYPLSDLQSREALTDFIRKFQKWKVDNMNF

Nucleotide


Length: 957 bp

>NTDB_id=48892 SGGB_RS00665 WP_009853236.1 102840..103796(+) (comYH) [Streptococcus gallolyticus subsp. gallolyticus ATCC 43143]
ATGAATTTTGAAAAAATTGAAACAGCCTATGAGCTGATTTTAGAAAATATCCAATTAATTGAAAATGAGTTAAAAACTCA
TATTTATGATGCGCTTATTGAACAGAATTCTTTTTACTTGGGGGCTGAAGGTGCCAGTGAAGAAGTTGCTGCCAACAATG
AGAAACTGCGTCAGCTTGCATTGACCAAAGAAGAGTGGCGTCGAGCTTTCCAATTTATCTTTATCAAAGCTGGTCAAACA
GAGCAGCTGCAAGCCAATCATCAATTTACACCAGATGCTATTGGTTTTATTTTGCTGTTCTTGATTGAAAATCTGACAGA
TTCAGATAAAATTGATCTTTTAGAAATTGGTAGTGGGACAGGAAACCTTGCTCAAACATTGTTAAACAATTCGTCTAAAG
AATTAAATTATCTTGGGATTGAAGTTGACGATTTATTGATTGATTTATCAGCAAGTATTGCAGAAGTGATGGATTCTGAT
GCTCAGTTTATTCAAGAAGATGCTGTGCGTCCACAAATTCTGAAAGAAAGTGATGTGATTATTAGTGATTTGCCAGTTGG
TTTTTATCCTAATGATGACATTGCCAAACGTTATAAAGTGGCAAGTTCTGATGAGCATACCTATGCCCACCATTTGTTAA
TGGAACAATCGTTAAAATATCTCAAAAAAGATGGTATTGCAGTCTTTTTGGCGCCCGTCAGTCTTTTGACAAGTAAGCAA
AGTGATTTATTGAAACAATGGTTGAAAGATTACGCGGATATTATCGCCGTGATTACCTTGCCAGAATCTATTTTTGGTAA
TGCAGCGAATGCAAAATCAATTTTTGTTTTGAAAAAACAGGCTGCGCATACGCCAGAAACCTTTGTTTATCCACTTTCTG
ACTTACAAAGTCGTGAAGCTCTGACTGATTTCATTAGAAAATTTCAAAAATGGAAAGTTGATAATATGAATTTTTAA
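
The nucleotide record above can be related to the protein record by translating it codon by codon. The sketch below is not the database's own procedure: it assumes the standard genetic code (whose amino acid assignments are shared by the bacterial translation table), and the function name `translate` and the two short test fragments (the first 78 and last 18 bases of the sequence above) are chosen for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon.

    Trailing bases that do not form a full codon are ignored, and codons
    containing unexpected characters are rendered as 'X'.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 78 bases and last 18 bases of the comYH nucleotide sequence above.
cds_start = (
    "ATGAATTTTGAAAAAATTGAAACAGCCTATGAGCTGATT"
    "TTAGAAAATATCCAATTAATTGAAAATGAGTTAAAAACT"
)
cds_end = "GATAATATGAATTTTTAA"

print(translate(cds_start))
print(translate(cds_end))
```

Applied to the full 957 bp sequence, the same function would be expected to give the 318-residue protein followed by a stop codon, provided the standard-code assumption holds for this gene.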

Domains


Predicted by InterProScan.

(71-296)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | A0A139R3Z2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comYH | Streptococcus mutans UA140 | 68.987 | 99.371 | 0.686
comYH | Streptococcus mutans UA159 | 68.671 | 99.371 | 0.682
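
The Ha-values listed for the two Streptococcus mutans hits are numerically consistent with the product of the fractional identity and the fractional coverage. This page does not define the Ha-value, so the formula in the sketch below is an inference from the listed numbers, and the names `ha_value` and `hits` are illustrative.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: fractional identity multiplied by fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)

# Identities (%) and coverage (%) as listed for the experimentally validated hits.
hits = {
    "Streptococcus mutans UA140": (68.987, 99.371),
    "Streptococcus mutans UA159": (68.671, 99.371),
}

ha = {
    organism: round(ha_value(identity, coverage), 3)
    for organism, (identity, coverage) in hits.items()
}

for organism, value in ha.items():
    print(f"{organism}: {value:.3f}")
```

Rounded to three decimals, the computed values match the Ha-value column for both hits.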


Multiple sequence alignment