Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYH
Type: Machinery gene
Locus tag: HMPREF0833_RS08330
Genome accession: NC_015678
Coordinates: 1785975..1786928 (+)
Length: 317 a.a.
NCBI ID: WP_013904490.1
UniProt ID: F8DGZ7
Organism: Streptococcus parasanguinis ATCC 15912
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 1780975..1791928
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
HMPREF0833_RS08290 (HMPREF0833_11654) | - | 1781734..1782126 (+) | 393 | WP_037617571.1 | DUF1033 family protein | -
HMPREF0833_RS08295 (HMPREF0833_11655) | comYA | 1782212..1783153 (+) | 942 | WP_013904485.1 | competence type IV pilus ATPase ComGA | Machinery gene
HMPREF0833_RS08300 (HMPREF0833_11656) | comYB | 1783086..1784117 (+) | 1032 | WP_115276785.1 | competence type IV pilus assembly protein ComGB | Machinery gene
HMPREF0833_RS08305 (HMPREF0833_11657) | comYC | 1784114..1784431 (+) | 318 | WP_003019171.1 | competence type IV pilus major pilin ComGC | Machinery gene
HMPREF0833_RS08310 (HMPREF0833_11658) | comYD | 1784421..1784825 (+) | 405 | WP_041818518.1 | competence type IV pilus minor pilin ComGD | Machinery gene
HMPREF0833_RS08315 (HMPREF0833_11659) | comGE | 1784791..1785078 (+) | 288 | WP_013904488.1 | competence type IV pilus minor pilin ComGE | -
HMPREF0833_RS08320 (HMPREF0833_11660) | comGF/cglF | 1785068..1785520 (+) | 453 | WP_003019060.1 | competence type IV pilus minor pilin ComGF | Machinery gene
HMPREF0833_RS08325 (HMPREF0833_11661) | comGG | 1785477..1785944 (+) | 468 | WP_255309063.1 | competence type IV pilus minor pilin ComGG | -
HMPREF0833_RS08330 (HMPREF0833_11662) | comYH | 1785975..1786928 (+) | 954 | WP_013904490.1 | class I SAM-dependent methyltransferase | Machinery gene
HMPREF0833_RS08335 (HMPREF0833_11663) | - | 1786980..1788173 (+) | 1194 | WP_013904491.1 | acetate kinase | -
HMPREF0833_RS08340 (HMPREF0833_11664) | - | 1788246..1788977 (+) | 732 | WP_013904492.1 | CPBP family intramembrane glutamic endopeptidase | -
HMPREF0833_RS08345 (HMPREF0833_11665) | folP | 1789139..1790089 (+) | 951 | WP_013904493.1 | dihydropteroate synthase | -
HMPREF0833_RS08350 (HMPREF0833_11666) | - | 1790090..1791400 (+) | 1311 | WP_013904494.1 | folylpolyglutamate synthase/dihydrofolate synthase family protein | -
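
The coordinates, sizes, and context window listed above are related by simple interval arithmetic. The sketch below checks those relationships for comYH. It assumes 1-based inclusive coordinates and that the displayed window is the gene extended by a fixed 5000 bp flank on each side; that flank size is inferred from the numbers in this record, not stated by it. The names `span_length` and `FLANK_BP` are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a closed, 1-based interval start..end."""
    return end - start + 1


# Values copied from the record above.
gene_start, gene_end = 1785975, 1786928
gene_bp = span_length(gene_start, gene_end)

# A coding sequence of N bp holds N/3 codons; the last one is the stop codon,
# so the protein is one residue shorter than the codon count.
protein_aa = gene_bp // 3 - 1

# Assumption: the context window is the gene plus a fixed flank on both sides.
FLANK_BP = 5000
window = (gene_start - FLANK_BP, gene_end + FLANK_BP)

print(gene_bp, protein_aa, window)
```

With the values from this record, the computed gene size, protein length, and window agree with the 954 bp, 317 a.a., and 1780975..1791928 shown on the page.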

Sequence


Protein


Length: 317 a.a.; Molecular weight: 35964.79 Da; Isoelectric point: 4.2014

>NTDB_id=41403 HMPREF0833_RS08330 WP_013904490.1 1785975..1786928(+) (comYH) [Streptococcus parasanguinis ATCC 15912]
MNFEKIEQAYTYLLENTQSIQNELSTNFYDALIEQNVMYLEGKTDLDIVKNNSKKLKELGLSKEEWRRAYQFLFMKAAQT
EPLQANHQFTPDAIGFIITFLIDQLAKSDQLDVLEVGSGTGNLAETIVNNSRLTIDYLGLEVDDLLIDLSASIADVMESS
VVFAQGDAVRPQVLKESDLIVSDLPIGYYPDDAIAQRYQVVSSEGHTYAHHLMMEQALKYLKPQGVAIFLAPNNLLTSPQ
SDLLKAWLTDKAQILAMLTLPESLFSNPAYAKTIFVLRKQEEESVQPFVYPFTDLQDQDQVVHFMESFQNWLKDSEI

Nucleotide


Length: 954 bp

>NTDB_id=41403 HMPREF0833_RS08330 WP_013904490.1 1785975..1786928(+) (comYH) [Streptococcus parasanguinis ATCC 15912]
ATGAATTTCGAAAAAATTGAACAAGCCTATACCTATCTATTAGAAAACACTCAAAGTATTCAAAATGAATTGTCGACCAA
CTTTTATGATGCCTTGATTGAACAGAATGTCATGTATTTGGAGGGCAAGACAGACCTAGACATTGTTAAAAACAATAGCA
AAAAATTAAAAGAACTAGGTTTAAGTAAGGAAGAATGGCGCAGAGCCTACCAATTCCTTTTTATGAAAGCTGCTCAGACA
GAACCTTTACAAGCGAATCACCAGTTCACACCAGATGCGATTGGTTTTATCATTACATTTTTGATCGATCAGTTGGCTAA
AAGCGACCAACTGGATGTCTTAGAAGTGGGAAGTGGAACCGGAAATCTCGCTGAGACTATTGTCAACAATAGCCGTCTCA
CGATTGATTACTTGGGATTGGAAGTAGATGATCTCTTGATTGACCTATCTGCTAGTATCGCAGATGTGATGGAGTCTAGC
GTTGTCTTTGCACAAGGCGACGCGGTGCGTCCACAAGTTTTAAAAGAAAGTGACTTGATCGTTAGCGACTTACCGATTGG
CTATTATCCAGATGATGCGATTGCACAGCGCTATCAGGTAGTGAGCTCCGAAGGCCATACCTATGCCCATCACCTCATGA
TGGAACAGGCTTTGAAATATCTGAAACCTCAAGGAGTTGCCATCTTTTTAGCTCCAAATAACCTCTTGACAAGCCCTCAG
AGTGATCTGTTGAAAGCTTGGCTAACAGACAAAGCCCAAATCCTTGCCATGTTGACCTTGCCAGAATCTCTTTTTTCAAA
TCCAGCCTATGCTAAGACGATTTTCGTCCTACGAAAACAAGAAGAAGAGTCTGTTCAGCCCTTTGTCTATCCTTTTACCG
ATCTCCAGGATCAAGATCAGGTGGTTCACTTCATGGAAAGTTTCCAAAACTGGTTAAAGGATAGTGAAATTTGA
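
Both FASTA headers in this record follow the same layout: database ID, locus tag, protein ID, coordinates with strand, gene name in parentheses, and organism in square brackets. The following is a minimal sketch of how such a header could be parsed. The pattern is inferred from the two headers shown here and may not cover every record in the database; `HEADER_RE` and `parse_header` are hypothetical names.

```python
import re

# Pattern inferred from the headers in this record (an assumption, not a
# documented format).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a FASTA header of the layout above into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Convert the numeric fields so they can be used in arithmetic.
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=41403 HMPREF0833_RS08330 WP_013904490.1 "
    "1785975..1786928(+) (comYH) [Streptococcus parasanguinis ATCC 15912]"
)
record = parse_header(header)
print(record["gene"], record["end"] - record["start"] + 1)
```

The parsed coordinates can then be compared against the stated sequence length as a quick consistency check.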

Domains


Predicted by InterProScan.

Domain location: residues 68-286


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB
ID: F8DGZ7

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comYH | Streptococcus mutans UA159 | 61.905 | 99.369 | 0.615
comYH | Streptococcus mutans UA140 | 61.905 | 99.369 | 0.615


Multiple sequence alignment