Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   GALLO_RS00665 Genome accession   NC_013798
Coordinates   104576..105532 (+) Length   318 a.a.
NCBI ID   WP_009853236.1    Uniprot ID   A0A139R3Z2
Organism   Streptococcus gallolyticus UCN34     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 99576..110532
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GALLO_RS00625 (GALLO_0086) - 100494..100862 (+) 369 WP_009853228.1 DUF1033 family protein -
  GALLO_RS00630 (GALLO_0087) comYA 100933..101874 (+) 942 WP_012961327.1 competence type IV pilus ATPase ComGA Machinery gene
  GALLO_RS00635 (GALLO_0088) comYB 101756..102841 (+) 1086 WP_012961328.1 competence type IV pilus assembly protein ComGB Machinery gene
  GALLO_RS00640 (GALLO_0089) comYC 102841..103134 (+) 294 WP_009853231.1 competence type IV pilus major pilin ComGC Machinery gene
  GALLO_RS00645 (GALLO_0090) comYD 103118..103549 (+) 432 WP_009853232.1 competence type IV pilus minor pilin ComGD Machinery gene
  GALLO_RS00650 (GALLO_0091) comGE 103503..103796 (+) 294 WP_012961329.1 competence type IV pilus minor pilin ComGE -
  GALLO_RS00655 (GALLO_0092) comYF 103750..104217 (+) 468 WP_012961330.1 competence type IV pilus minor pilin ComGF Machinery gene
  GALLO_RS00660 (GALLO_0093) comYG 104189..104503 (+) 315 WP_009853235.1 competence type IV pilus minor pilin ComGG Machinery gene
  GALLO_RS00665 (GALLO_0094) comYH 104576..105532 (+) 957 WP_009853236.1 class I SAM-dependent methyltransferase Machinery gene
  GALLO_RS00670 (GALLO_0095) - 105585..106784 (+) 1200 WP_012961331.1 acetate kinase -
  GALLO_RS00675 (GALLO_0096) - 106946..107146 (+) 201 WP_012961332.1 helix-turn-helix transcriptional regulator -
  GALLO_RS00680 (GALLO_0097) - 107156..107365 (+) 210 WP_009853239.1 hypothetical protein -
  GALLO_RS00685 (GALLO_0098) - 107378..107830 (+) 453 WP_009853240.1 hypothetical protein -
  GALLO_RS00690 (GALLO_0099) - 107842..108480 (+) 639 WP_009853241.1 CPBP family intramembrane glutamic endopeptidase -
  GALLO_RS00695 (GALLO_0100) proC 108540..109310 (-) 771 WP_009853242.1 pyrroline-5-carboxylate reductase -
  GALLO_RS00700 (GALLO_0101) pepA 109371..110438 (-) 1068 WP_009853243.1 glutamyl aminopeptidase -

Sequence


Protein


Download         Length: 318 a.a.        Molecular weight: 36027.96 Da        Isoelectric Point: 4.4127

>NTDB_id=36171 GALLO_RS00665 WP_009853236.1 104576..105532(+) (comYH) [Streptococcus gallolyticus UCN34]
MNFEKIETAYELILENIQLIENELKTHIYDALIEQNSFYLGAEGASEEVAANNEKLRQLALTKEEWRRAFQFIFIKAGQT
EQLQANHQFTPDAIGFILLFLIENLTDSDKIDLLEIGSGTGNLAQTLLNNSSKELNYLGIEVDDLLIDLSASIAEVMDSD
AQFIQEDAVRPQILKESDVIISDLPVGFYPNDDIAKRYKVASSDEHTYAHHLLMEQSLKYLKKDGIAVFLAPVSLLTSKQ
SDLLKQWLKDYADIIAVITLPESIFGNAANAKSIFVLKKQAAHTPETFVYPLSDLQSREALTDFIRKFQKWKVDNMNF

Nucleotide


Download         Length: 957 bp        

>NTDB_id=36171 GALLO_RS00665 WP_009853236.1 104576..105532(+) (comYH) [Streptococcus gallolyticus UCN34]
ATGAATTTTGAAAAAATTGAAACAGCCTATGAGCTGATTTTAGAAAATATCCAATTAATTGAAAATGAGTTAAAAACTCA
TATTTATGATGCGCTTATTGAACAGAATTCTTTTTACTTGGGGGCTGAAGGTGCCAGTGAAGAAGTTGCTGCCAACAATG
AGAAACTGCGTCAGCTTGCATTGACCAAAGAAGAGTGGCGTCGAGCTTTCCAATTTATCTTTATCAAAGCTGGTCAAACA
GAGCAGCTGCAAGCCAATCATCAATTTACACCAGATGCTATTGGTTTTATTTTGCTGTTCTTGATTGAAAATCTGACAGA
TTCAGATAAAATTGATCTTTTAGAAATTGGTAGTGGGACAGGAAACCTTGCTCAAACATTGTTAAACAATTCGTCTAAAG
AATTAAATTATCTTGGGATTGAAGTTGACGATTTATTGATTGATTTATCAGCAAGTATTGCAGAAGTGATGGATTCTGAT
GCTCAGTTTATTCAAGAAGATGCTGTGCGTCCACAAATTCTGAAAGAAAGTGATGTGATTATTAGTGATTTGCCAGTTGG
TTTTTATCCTAATGATGACATTGCCAAACGTTATAAAGTGGCAAGTTCTGATGAGCATACCTATGCCCACCATTTGTTAA
TGGAACAATCGTTAAAATATCTCAAAAAAGATGGTATTGCAGTCTTTTTGGCGCCCGTCAGTCTTTTGACAAGTAAGCAA
AGTGATTTATTGAAACAATGGTTGAAAGATTACGCGGATATTATCGCCGTGATTACCTTGCCAGAATCTATTTTTGGTAA
TGCAGCGAATGCAAAATCAATTTTTGTTTTGAAAAAACAGGCTGCGCATACGCCAGAAACCTTTGTTTATCCACTTTCTG
ACTTACAAAGTCGTGAAGCTCTGACTGATTTCATTAGAAAATTTCAAAAATGGAAAGTTGATAATATGAATTTTTAA

Domains


Predicted by InterproScan.

(71-296)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A139R3Z2

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

68.987

99.371

0.686

  comYH Streptococcus mutans UA159

68.671

99.371

0.682


Multiple sequence alignment