Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYH
Type: Machinery gene
Locus tag: GPA00_RS03665
Genome accession: NZ_CP046629
Coordinates: 709695..710651 (+)
Length: 318 a.a.
NCBI ID: WP_024344497.1
UniProt ID: A0A239RCI2
Organism: Streptococcus equinus strain CNU G6
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
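
The coordinates and the protein length in the overview can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Coordinates of comYH as listed in the overview.
start, end = 709695, 710651
length_bp = gene_length_bp(start, end)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the result agrees with the 957 bp and 318 a.a. reported on this page.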

Genomic Context


Location: 704695..715651
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GPA00_RS03625 (GPA00_03625) - 705589..705957 (+) 369 WP_094140982.1 DUF1033 family protein -
  GPA00_RS03630 (GPA00_03630) comYA 706026..706967 (+) 942 WP_027968018.1 competence type IV pilus ATPase ComGA Machinery gene
  GPA00_RS03635 (GPA00_03635) comYB 706891..707934 (+) 1044 WP_167697815.1 competence type IV pilus assembly protein ComGB Machinery gene
  GPA00_RS03640 (GPA00_03640) comYC 707934..708236 (+) 303 WP_021141433.1 competence type IV pilus major pilin ComGC Machinery gene
  GPA00_RS03645 (GPA00_03645) comGD 708211..708651 (+) 441 WP_157327520.1 competence type IV pilus minor pilin ComGD -
  GPA00_RS03650 (GPA00_03650) comYE 708605..708898 (+) 294 WP_074629067.1 competence type IV pilus minor pilin ComGE Machinery gene
  GPA00_RS03655 (GPA00_03655) comYF 708882..709319 (+) 438 WP_157327522.1 competence type IV pilus minor pilin ComGF Machinery gene
  GPA00_RS03660 (GPA00_03660) comGG 709348..709641 (+) 294 WP_268894141.1 competence type IV pilus minor pilin ComGG -
  GPA00_RS03665 (GPA00_03665) comYH 709695..710651 (+) 957 WP_024344497.1 class I SAM-dependent methyltransferase Machinery gene
  GPA00_RS03670 (GPA00_03670) - 710705..711904 (+) 1200 WP_045798327.1 acetate kinase -
  GPA00_RS03675 (GPA00_03675) - 712062..712259 (+) 198 WP_027968026.1 helix-turn-helix transcriptional regulator -
  GPA00_RS03680 (GPA00_03680) - 712315..712767 (+) 453 WP_074483978.1 ABC transporter permease -
  GPA00_RS03685 (GPA00_03685) - 712779..713414 (+) 636 WP_141348477.1 CPBP family intramembrane glutamic endopeptidase -
  GPA00_RS03690 (GPA00_03690) proC 713466..714236 (-) 771 WP_157327526.1 pyrroline-5-carboxylate reductase -
  GPA00_RS03695 (GPA00_03695) pepA 714298..715365 (-) 1068 WP_074534071.1 glutamyl aminopeptidase -
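
Several neighbouring genes in the table share coordinates, which is easier to see when the spacing between consecutive genes is computed explicitly. The sketch below uses the comYA to comYH rows from the table; a negative spacing means the two genes overlap by that many base pairs. The 5000 bp flank used to reproduce the displayed location is an inference from the numbers on this page, not a documented setting, and the function name is hypothetical.

```python
# (gene name, start, end) copied from the genomic context table, all on the + strand.
genes = [
    ("comYA", 706026, 706967),
    ("comYB", 706891, 707934),
    ("comYC", 707934, 708236),
    ("comGD", 708211, 708651),
    ("comYE", 708605, 708898),
    ("comYF", 708882, 709319),
    ("comGG", 709348, 709641),
    ("comYH", 709695, 710651),
]


def adjacent_spacing(features):
    """Return (left, right, spacing_bp) for consecutive features.

    spacing_bp > 0: intergenic gap; spacing_bp < 0: overlap of -spacing_bp bp.
    """
    result = []
    for (left, _, left_end), (right, right_start, _) in zip(features, features[1:]):
        result.append((left, right, right_start - left_end - 1))
    return result


spacings = adjacent_spacing(genes)
for left, right, bp in spacings:
    kind = "overlap" if bp < 0 else "gap"
    print(f"{left} -> {right}: {kind} of {abs(bp)} bp")

# Assumption: the displayed window is the target gene plus a fixed flank on each side.
FLANK = 5000
window = (709695 - FLANK, 710651 + FLANK)
print(window)
```

With this input, the first five gene pairs overlap and only comYF/comGG and comGG/comYH are separated by a gap.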

Sequence


Protein


Length: 318 a.a.; Molecular weight: 35783.86 Da; Isoelectric point: 4.4780

>NTDB_id=406296 GPA00_RS03665 WP_024344497.1 709695..710651(+) (comYH) [Streptococcus equinus strain CNU G6]
MNFENIETAYGLILENIQLIENELKTHIYDALIEQNSFYLGAEGASEVVAANNEKLRQLNLTKEEWRRAFQFIFIKAAQT
EALQANHQFTPDAIGFILMFLIENLTASKELDVLEIGSGTGNLAQTLLNNSSKDLNYLGIEVDDLLIDLSASIAEVMDSK
AQFVQEDAVRPQILKESDVIISDLPVGFYPNDEIAKRYKVASSEGHTYAHHLLMEQSLKYLKKDGIAVFLAPVSLLTSKQ
SDLLKAWLKDYADVIAVITLPESIFGNAANAKSIFVLKKQAEHTPETFVYPLADLQSREVLTDFIDKFKKWNVENMIF

Nucleotide


Length: 957 bp

>NTDB_id=406296 GPA00_RS03665 WP_024344497.1 709695..710651(+) (comYH) [Streptococcus equinus strain CNU G6]
ATGAATTTTGAAAATATCGAAACAGCCTATGGGTTGATTCTTGAAAATATACAATTAATCGAAAATGAGTTGAAAACACA
CATTTACGATGCACTTATTGAACAAAACTCTTTTTATCTTGGTGCTGAAGGTGCCAGTGAAGTTGTAGCTGCAAATAATG
AGAAACTACGCCAACTTAACTTAACTAAGGAAGAATGGCGCCGTGCTTTTCAGTTTATCTTTATTAAAGCTGCGCAAACA
GAAGCTCTTCAGGCAAATCACCAATTTACACCTGATGCTATTGGCTTCATTTTAATGTTCCTCATTGAGAATTTGACAGC
TTCTAAGGAACTTGATGTTTTGGAAATCGGTAGCGGAACAGGTAACCTTGCTCAAACGTTGTTGAACAACTCATCTAAAG
ACCTAAACTATCTAGGTATTGAAGTTGATGATTTGTTGATTGACTTGTCAGCAAGTATCGCTGAAGTTATGGATTCTAAA
GCTCAATTCGTTCAAGAAGATGCTGTACGCCCACAGATTCTTAAGGAAAGTGATGTCATCATCAGTGACCTTCCAGTCGG
ATTCTATCCAAATGATGAAATTGCAAAACGTTACAAAGTAGCTAGCAGTGAAGGGCACACTTATGCGCATCATTTGTTGA
TGGAACAATCTCTTAAATATCTCAAAAAAGACGGGATTGCTGTCTTTTTAGCACCAGTTAGTCTTTTGACAAGTAAGCAA
AGTGACCTGTTGAAAGCATGGTTGAAGGATTATGCTGATGTGATTGCTGTAATTACTTTGCCAGAATCTATCTTTGGAAA
TGCAGCCAACGCGAAATCAATTTTTGTTTTGAAAAAACAAGCTGAACATACTCCAGAAACCTTTGTTTATCCACTTGCTG
ACTTGCAAAGTCGAGAAGTGTTGACAGACTTCATTGATAAATTTAAAAAATGGAATGTTGAAAATATGATTTTTTAA
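
The nucleotide and protein records can be checked against each other by translating a few codons. The sketch below builds a codon table from the standard amino acid assignments (the bacterial table uses the same assignments) and translates the first ten and last seven codons of the sequence above. The helper name `translate` and the choice of excerpts are illustrative only.

```python
from itertools import product

# Standard codon table, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Excerpts copied from the start and end of the nucleotide record.
first_codons = "ATGAATTTTGAAAATATCGAAACAGCCTAT"
last_codons = "GTTGAAAATATGATTTTTTAA"

print(translate(first_codons))
print(translate(last_codons))
```

The two translations correspond to the start (MNFENIETAY) and end (VENMIF, followed by the stop codon) of the protein record.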

Domains


Predicted by InterProScan.

Predicted domain: residues 69-291


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A239RCI2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140 68.987 99.371 0.686
  comYH Streptococcus mutans UA159 68.671 99.371 0.682
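
The Ha-values listed here are numerically consistent with the product of identity and coverage, each expressed as a fraction. This page does not define the Ha-value, so the formula in the sketch below is an assumption that merely reproduces the two rows shown; the function name is hypothetical.

```python
def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Assumed score: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (organism, identities %, coverage %, Ha-value as listed on the page)
hits = [
    ("Streptococcus mutans UA140", 68.987, 99.371, 0.686),
    ("Streptococcus mutans UA159", 68.671, 99.371, 0.682),
]

for organism, identity, coverage, listed in hits:
    computed = round(identity_times_coverage(identity, coverage), 3)
    print(f"{organism}: computed {computed}, listed {listed}")
```

If the database uses a different definition, the agreement for these two hits may be coincidental.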


Multiple sequence alignment