Detailed information    

In silico: bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   SPSF3K_RS01970 Genome accession   NZ_CP025420
Coordinates   410526..411482 (+) Length   318 a.a.
NCBI ID   WP_003103712.1    Uniprot ID   A0A0E2UAX9
Organism   Streptococcus parauberis strain SPOF3K     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 405526..416482
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SPSF3K_RS01930 (SPSF3K_00399) - 406202..406576 (+) 375 WP_003108753.1 DUF1033 family protein -
  SPSF3K_RS01935 (SPSF3K_00400) comYA 406630..407571 (+) 942 WP_003108751.1 competence type IV pilus ATPase ComGA Machinery gene
  SPSF3K_RS01940 (SPSF3K_00401) comYB 407483..408538 (+) 1056 WP_152414240.1 competence type IV pilus assembly protein ComGB Machinery gene
  SPSF3K_RS01945 (SPSF3K_00402) comYC 408535..408861 (+) 327 WP_004347181.1 competence type IV pilus major pilin ComGC Machinery gene
  SPSF3K_RS01950 (SPSF3K_00403) comGD 408851..409258 (+) 408 WP_243619590.1 competence type IV pilus minor pilin ComGD -
  SPSF3K_RS01955 (SPSF3K_00404) comGE 409230..409529 (+) 300 WP_003108643.1 competence type IV pilus minor pilin ComGE -
  SPSF3K_RS01960 (SPSF3K_00405) comGF 409507..409944 (+) 438 WP_003102915.1 competence type IV pilus minor pilin ComGF -
  SPSF3K_RS01965 (SPSF3K_00406) comGG 409922..410419 (+) 498 WP_003108640.1 competence type IV pilus minor pilin ComGG -
  SPSF3K_RS01970 (SPSF3K_00407) comYH 410526..411482 (+) 957 WP_003103712.1 class I SAM-dependent methyltransferase Machinery gene
  SPSF3K_RS01975 (SPSF3K_00408) - 411538..412731 (+) 1194 WP_003108638.1 acetate kinase -
  SPSF3K_RS01980 (SPSF3K_00409) - 412842..413054 (+) 213 WP_003108636.1 helix-turn-helix transcriptional regulator -
  SPSF3K_RS01985 (SPSF3K_00410) - 413047..413499 (+) 453 WP_003108634.1 hypothetical protein -
  SPSF3K_RS01990 (SPSF3K_00411) - 413508..414161 (+) 654 WP_003108632.1 CPBP family intramembrane glutamic endopeptidase -
  SPSF3K_RS01995 (SPSF3K_00412) proC 414178..414948 (-) 771 WP_003104671.1 pyrroline-5-carboxylate reductase -
  SPSF3K_RS02000 (SPSF3K_00413) pepA 414960..416027 (-) 1068 WP_003108630.1 glutamyl aminopeptidase -
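
The coordinates in this table can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (so size = end - start + 1) and that the displayed location window is the gene span padded by 5000 bp on each side. Both are inferred from the numbers on this page, not stated by the database. The names `parse_span`, `span_length`, and `FLANK` are hypothetical.

```python
def parse_span(coords):
    """Split a 'start..end' coordinate string into two integers."""
    start, end = coords.split("..")
    return int(start), int(end)


def span_length(coords):
    """Length in bp, assuming 1-based inclusive coordinates."""
    start, end = parse_span(coords)
    return end - start + 1


FLANK = 5000  # assumption: flank size inferred from the values on this page

gene = "410526..411482"  # comYH
g_start, g_end = parse_span(gene)
window = f"{g_start - FLANK}..{g_end + FLANK}"

# A few rows copied from the table above: gene -> (coordinates, listed size in bp)
rows = {
    "comYA": ("406630..407571", 942),
    "comYH": ("410526..411482", 957),
    "proC": ("414178..414948", 771),
}
sizes_ok = all(span_length(coords) == size for coords, size in rows.values())

print(window)
print(sizes_ok)
```

Under these assumptions the computed window matches the listed location, and the listed sizes agree with the coordinates for the sampled rows.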

Sequence


Protein


Length: 318 a.a.        Molecular weight: 36032.20 Da        Isoelectric Point: 4.9473

>NTDB_id=261155 SPSF3K_RS01970 WP_003103712.1 410526..411482(+) (comYH) [Streptococcus parauberis strain SPOF3K]
MNFEKIEKAFELILENSQLIETDLHTHIYDAIIEQNSYLLGAKGANKQVEGNIQALKELHLTKEEWRRAFQFVFIKASQT
ERLQANHQFTPDSLGFILLYMMETLVSNHSFDLLEIGSGTGNLAQTILNNTSKSIDYLGIELDDLLIDLSASISEIMDSS
AKFLQEDAVRPQILKVSDVIISDLPVGYYPNDDIASRYKVASKEEHTYAHHLLMEQSLKYLKEGGYAIFLAPTNILTSPQ
SDLLKQWLKNYAQVMAVVTLPETMFGNPNNAKSIFVLGKTKNQAVETFVFPITNIQSTELIQDFMNKFKNWKLANVIS

Nucleotide


Length: 957 bp

>NTDB_id=261155 SPSF3K_RS01970 WP_003103712.1 410526..411482(+) (comYH) [Streptococcus parauberis strain SPOF3K]
ATGAATTTTGAAAAAATAGAAAAAGCCTTTGAGCTTATTTTAGAGAATAGTCAGCTTATTGAAACGGACTTACATACACA
TATTTATGATGCAATTATTGAACAAAATTCCTACTTATTGGGTGCAAAAGGGGCAAACAAACAAGTTGAAGGAAATATTC
AAGCATTAAAAGAATTACACTTAACAAAAGAAGAATGGCGACGTGCCTTTCAATTTGTATTTATTAAGGCCTCACAGACC
GAGAGATTGCAAGCCAATCATCAATTTACTCCAGACAGTCTTGGTTTTATCTTGCTCTATATGATGGAGACTTTAGTATC
TAATCATTCATTTGATTTATTAGAAATTGGTAGTGGAACTGGTAATTTGGCTCAAACTATCTTAAACAATACGAGTAAGT
CTATTGACTATCTTGGTATTGAGCTGGATGACTTGCTTATCGATTTATCAGCAAGTATTTCTGAAATAATGGATTCTTCT
GCAAAATTTTTACAGGAAGATGCTGTGAGACCACAAATTCTTAAAGTAAGTGATGTTATCATTAGTGACTTACCAGTTGG
TTATTACCCAAATGACGATATAGCAAGTCGTTATAAAGTAGCTAGTAAGGAAGAGCATACTTATGCTCACCATTTATTAA
TGGAACAGTCATTGAAGTATCTAAAAGAAGGTGGTTATGCTATATTTTTAGCTCCAACTAATATTTTAACTAGTCCACAA
AGCGATCTGTTGAAGCAATGGTTAAAAAACTATGCCCAGGTTATGGCAGTTGTTACTTTACCTGAAACGATGTTTGGCAA
TCCAAATAATGCCAAGTCGATTTTTGTCTTAGGAAAAACGAAAAATCAAGCGGTTGAAACATTTGTCTTTCCAATTACTA
ACATTCAATCCACAGAGCTTATTCAAGATTTCATGAACAAGTTTAAAAATTGGAAACTTGCTAATGTCATTTCATAG
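
As a consistency check between the two records, the nucleotide sequence can be translated and compared with the protein sequence. The sketch below is a minimal standard-library version that uses the standard codon assignments, which the bacterial table (NCBI table 11) shares for elongation codons. It translates only the first line of the nucleotide record. `CODON_TABLE` and `translate` are hypothetical helper names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the nucleotide record above (80 nt, so 26 complete codons).
first_line = (
    "ATGAATTTTGAAAAAATAGAAAAAGCCTTTGAGCTTATTTTAGAGAATAGTCAGCTTATT"
    "GAAACGGACTTACATACACA"
)
print(translate(first_line))

# Length relation: 318 residues plus one stop codon account for 957 bp.
print(3 * (318 + 1) == 957)
```

The translated prefix should match the start of the protein record, and the 957 bp length is consistent with 318 codons followed by the terminal TAG stop codon.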

Domains


Predicted by InterProScan.

(78-310)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0E2UAX9

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140 63.608 99.371 0.632
  comYH Streptococcus mutans UA159 63.291 99.371 0.629
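
The Ha-values listed above are numerically consistent with the product of identity and coverage, each taken as a fraction: identity (%) × coverage (%) / 10,000, rounded to three decimals. This relation is inferred from the two rows shown here and is not a definition given on this page. The function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed relation: identity fraction times coverage fraction, 3 decimals."""
    return round(identity_pct * coverage_pct / 10_000, 3)


# Rows copied from the table above: (organism, identities %, coverage %)
hits = [
    ("Streptococcus mutans UA140", 63.608, 99.371),
    ("Streptococcus mutans UA159", 63.291, 99.371),
]
for organism, identity, coverage in hits:
    print(organism, ha_value(identity, coverage))
```

Both rows reproduce the listed values (0.632 and 0.629) under this assumption.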


Multiple sequence alignment