Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   Sp14A_RS00775 Genome accession   NZ_CP022601
Coordinates   126020..126973 (+) Length   317 a.a.
NCBI ID   WP_115129542.1    Uniprot ID   A0A345VHB0
Organism   Streptococcus pluranimalium strain 14A0014     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 121020..131973
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  Sp14A_RS00735 (Sp14A_01490) - 121873..122253 (+) 381 WP_205407082.1 DUF1033 family protein -
  Sp14A_RS00740 (Sp14A_01500) comYA 122316..123269 (+) 954 WP_115129537.1 competence type IV pilus ATPase ComGA Machinery gene
  Sp14A_RS00745 (Sp14A_01510) comYB 123193..124221 (+) 1029 WP_115129538.1 competence type IV pilus assembly protein ComGB Machinery gene
  Sp14A_RS00750 (Sp14A_01520) comYC 124222..124515 (+) 294 WP_115129539.1 competence type IV pilus major pilin ComGC Machinery gene
  Sp14A_RS00755 (Sp14A_01530) comGD 124496..124912 (+) 417 WP_104967107.1 competence type IV pilus minor pilin ComGD -
  Sp14A_RS00760 comGE 124884..125183 (+) 300 WP_115129540.1 competence type IV pilus minor pilin ComGE -
  Sp14A_RS00765 (Sp14A_01540) comYF 125161..125598 (+) 438 WP_115129541.1 competence type IV pilus minor pilin ComGF Machinery gene
  Sp14A_RS00770 (Sp14A_01550) comGG 125582..125977 (+) 396 WP_205407083.1 competence type IV pilus minor pilin ComGG -
  Sp14A_RS00775 (Sp14A_01560) comYH 126020..126973 (+) 954 WP_115129542.1 class I SAM-dependent methyltransferase Machinery gene
  Sp14A_RS00780 (Sp14A_01570) - 127034..128224 (+) 1191 WP_115129543.1 acetate kinase -
  Sp14A_RS00785 (Sp14A_01580) - 128609..128806 (+) 198 WP_115129544.1 helix-turn-helix transcriptional regulator -
  Sp14A_RS00790 (Sp14A_01590) - 128819..129271 (+) 453 WP_115129545.1 hypothetical protein -
  Sp14A_RS00795 (Sp14A_01600) - 129305..129976 (+) 672 WP_115129546.1 type II CAAX endopeptidase family protein -
  Sp14A_RS00800 (Sp14A_01610) proC 130021..130791 (-) 771 WP_115129547.1 pyrroline-5-carboxylate reductase -
  Sp14A_RS00805 (Sp14A_01620) pepA 130830..131894 (-) 1065 WP_115129548.1 glutamyl aminopeptidase -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 35691.53 Da        Isoelectric Point: 4.3566

>NTDB_id=241031 Sp14A_RS00775 WP_115129542.1 126020..126973(+) (comYH) [Streptococcus pluranimalium strain 14A0014]
MNFEKIESAYQLILENTQLIENDIKTNIYDALIEQNAFYLGAENAPTKVAENNQALKDLQLSKEEWRRAYQFVFIKASQT
EKLQANHQFTPDAIGFLVLYLLENLTSQEHLDVIEIGSGTGNLAQTLLNNSNRDLDYLGLELDDLLIDLSASIAEVMGAN
VSFVQEDAVRPQILKESDVIISDLPVGYYPNDAIAKRYQVAATDEHTYAHHLLMEQSLKYLKKDGLAILLAPTNLLTSGQ
SDLLKKWLAGYADILAVITLPETLFGNPANAKSLFVLQKQAKTKPETFVYHLSDIQNPEILQDFMENLKIWKDDNAI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=241031 Sp14A_RS00775 WP_115129542.1 126020..126973(+) (comYH) [Streptococcus pluranimalium strain 14A0014]
ATGAATTTTGAAAAAATCGAATCAGCCTATCAGTTGATTTTAGAAAATACCCAGCTTATTGAAAATGACATTAAGACCAA
TATTTACGATGCTTTGATTGAGCAAAATGCCTTTTATTTGGGTGCTGAAAATGCTCCTACAAAAGTGGCTGAAAATAATC
AAGCCTTGAAAGATTTGCAACTGAGTAAAGAAGAATGGCGTCGTGCCTATCAGTTCGTTTTTATCAAGGCTAGTCAGACG
GAGAAACTTCAAGCCAATCATCAGTTTACACCAGATGCGATTGGCTTCTTGGTACTTTATCTTTTGGAAAATTTGACCAG
TCAAGAGCATTTGGATGTGATTGAAATCGGTAGTGGTACTGGTAATTTGGCGCAAACCCTTCTCAATAATAGTAATCGTG
ATTTAGACTATCTCGGTTTGGAGTTGGATGATTTATTGATTGATTTATCAGCTTCGATCGCTGAAGTGATGGGAGCAAAT
GTTAGCTTCGTTCAAGAAGATGCAGTGCGTCCACAGATTTTGAAAGAGAGTGACGTCATCATCAGTGATTTACCAGTTGG
TTATTATCCGAATGACGCTATCGCTAAACGTTATCAAGTAGCTGCTACTGATGAGCATACTTATGCGCATCATCTCTTGA
TGGAGCAATCCTTGAAGTATTTGAAAAAAGATGGTTTAGCGATTCTTTTGGCACCGACTAATCTCTTAACTAGTGGGCAA
AGTGATCTCTTGAAAAAATGGTTGGCAGGTTATGCAGATATCTTGGCAGTGATTACCTTACCAGAAACTTTATTTGGTAA
TCCGGCTAATGCTAAATCTCTGTTTGTCTTGCAAAAGCAGGCTAAAACTAAGCCGGAGACTTTTGTTTATCATTTGTCAG
ATATTCAAAATCCGGAAATTTTACAGGATTTTATGGAAAATTTGAAAATCTGGAAAGATGATAATGCCATTTAG

Domains


Predicted by InterproScan.

(68-294)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A345VHB0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

67.089

99.685

0.669

  comYH Streptococcus mutans UA159

66.772

99.685

0.666


Multiple sequence alignment