Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   VOI49_RS00785 Genome accession   NZ_CP142018
Coordinates   122280..123233 (+) Length   317 a.a.
NCBI ID   WP_115246884.1    Uniprot ID   A0A380JYU6
Organism   Streptococcus dysgalactiae strain lu24     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 117280..128233
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  VOI49_RS00745 - 118106..118471 (+) 366 WP_003049248.1 DUF1033 family protein -
  VOI49_RS00750 comYA 118552..119493 (+) 942 WP_003049251.1 competence type IV pilus ATPase ComGA Machinery gene
  VOI49_RS00755 comYB 119429..120460 (+) 1032 WP_226313561.1 competence type IV pilus assembly protein ComGB Machinery gene
  VOI49_RS00760 comYC 120462..120788 (+) 327 WP_115246891.1 competence type IV pilus major pilin ComGC Machinery gene
  VOI49_RS00765 comGD 120763..121191 (+) 429 WP_324739066.1 competence type IV pilus minor pilin ComGD -
  VOI49_RS00770 comGE 121148..121444 (+) 297 WP_003061797.1 competence type IV pilus minor pilin ComGE -
  VOI49_RS00775 comGF 121485..121865 (+) 381 WP_324739576.1 competence type IV pilus minor pilin ComGF -
  VOI49_RS00780 comGG 121843..122217 (+) 375 WP_003049269.1 competence type IV pilus minor pilin ComGG -
  VOI49_RS00785 comYH 122280..123233 (+) 954 WP_115246884.1 class I SAM-dependent methyltransferase Machinery gene
  VOI49_RS00790 - 123292..124488 (+) 1197 WP_003049273.1 acetate kinase -
  VOI49_RS00795 - 124798..125004 (+) 207 WP_003058508.1 helix-turn-helix transcriptional regulator -
  VOI49_RS00800 - 125160..125765 (+) 606 WP_226313560.1 CPBP family intramembrane glutamic endopeptidase -
  VOI49_RS00805 - 125783..126220 (+) 438 WP_003049278.1 hypothetical protein -
  VOI49_RS00810 proC 126359..127135 (-) 777 WP_115246875.1 pyrroline-5-carboxylate reductase -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 36018.92 Da        Isoelectric Point: 4.7910

>NTDB_id=920996 VOI49_RS00785 WP_115246884.1 122280..123233(+) (comYH) [Streptococcus dysgalactiae strain lu24]
MNFEKIEEAYQLLLENSQLIENDLKTHIYDAIVEQNSFYLGAEGASPQVAQNINKLKALHLTKEEWRRAYQFIFIKAAQT
EQLQANHQFTPDAIGFILLYLLEQLSDKDSLEVLEIGSGTGNLAQTLLNNTSKKLDYVGIELDDLLIDLSASIAEVMDSS
ARFIQEDAVRPQLLKESDVVISDLPVGYYPNDDIAKRYKVASSEEHTYAHHLLMEQSLKYLKKDGFAIFLAPVNLLTSSQ
SQLLKNWLTGYAQVVALITLPDSIFGHPSNAKSIIVLQKQTDHPTETFVYPIRDLKLAENIHDFMQNFKKWKQDNVN

Nucleotide


Download         Length: 954 bp        

>NTDB_id=920996 VOI49_RS00785 WP_115246884.1 122280..123233(+) (comYH) [Streptococcus dysgalactiae strain lu24]
ATGAATTTTGAAAAAATTGAAGAAGCCTATCAGTTACTTTTAGAGAATAGTCAACTGATTGAAAACGATTTAAAAACACA
TATTTATGATGCAATCGTTGAGCAAAACTCTTTTTATTTAGGTGCTGAGGGAGCTAGTCCTCAAGTTGCCCAGAACATTA
ACAAACTGAAAGCCTTACACCTGACTAAAGAAGAATGGCGTAGAGCTTACCAGTTTATTTTTATTAAGGCAGCTCAGACT
GAGCAGCTCCAAGCAAACCATCAGTTCACGCCAGATGCTATTGGTTTCATTCTGCTGTATCTTTTGGAACAATTGAGTGA
CAAAGATAGCCTAGAAGTCCTTGAAATTGGCAGTGGTACGGGAAATCTAGCACAAACTCTCCTCAATAATACGAGTAAGA
AACTTGATTATGTAGGGATTGAACTTGACGATCTCTTAATTGATTTATCAGCAAGTATTGCTGAAGTAATGGACTCTTCA
GCTCGTTTTATTCAAGAAGATGCGGTAAGACCGCAATTGTTAAAAGAAAGCGACGTTGTTATCAGTGACTTGCCAGTGGG
TTATTATCCTAATGATGACATTGCTAAGCGCTATAAGGTAGCTAGTTCTGAGGAACACACCTATGCTCATCATTTACTCA
TGGAACAATCTTTAAAGTATTTGAAAAAAGATGGTTTTGCCATTTTCTTGGCACCTGTCAATTTGTTAACAAGCTCTCAG
AGCCAATTGTTGAAAAATTGGTTAACTGGCTATGCCCAGGTAGTGGCCTTGATTACTCTACCAGATTCTATATTTGGTCA
TCCCTCAAATGCTAAGTCCATTATTGTCTTGCAAAAGCAGACAGACCATCCGACGGAAACCTTTGTTTATCCTATTCGAG
ACTTGAAACTTGCAGAGAATATTCATGACTTTATGCAAAATTTCAAAAAATGGAAACAGGATAATGTCAATTAA

Domains


Predicted by InterproScan.

(68-290)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A380JYU6

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

66.456

99.685

0.662

  comYH Streptococcus mutans UA159

66.139

99.685

0.659