Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   I2437_RS08940 Genome accession   NZ_CP065061
Coordinates   1906557..1907510 (-) Length   317 a.a.
NCBI ID   WP_043024962.1    Uniprot ID   A0A7Z8ZY98
Organism   Streptococcus equi subsp. zooepidemicus strain SEZ33     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1901557..1912510
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I2437_RS08925 (IEMOCGPF_01782) - 1901771..1903813 (-) 2043 WP_165628051.1 BglG family transcription antiterminator -
  I2437_RS08930 (IEMOCGPF_01783) - 1904345..1904935 (+) 591 WP_165628050.1 hypothetical protein -
  I2437_RS08935 (IEMOCGPF_01784) - 1905298..1906497 (-) 1200 WP_012678817.1 acetate kinase -
  I2437_RS08940 (IEMOCGPF_01785) comYH 1906557..1907510 (-) 954 WP_043024962.1 class I SAM-dependent methyltransferase Machinery gene
  I2437_RS08945 (IEMOCGPF_01786) comGG 1907572..1907934 (-) 363 WP_012677238.1 competence type IV pilus minor pilin ComGG -
  I2437_RS08950 (IEMOCGPF_01787) comYF 1907912..1908346 (-) 435 WP_043983927.1 competence type IV pilus minor pilin ComGF Machinery gene
  I2437_RS08955 (IEMOCGPF_01788) comGE 1908333..1908623 (-) 291 WP_014622143.1 competence type IV pilus minor pilin ComGE -
  I2437_RS08960 (IEMOCGPF_01789) comGD 1908580..1909005 (-) 426 WP_012514783.1 competence type IV pilus minor pilin ComGD -
  I2437_RS08965 (IEMOCGPF_01790) comGC/cglC 1908983..1909306 (-) 324 WP_012514782.1 competence type IV pilus major pilin ComGC Machinery gene
  I2437_RS08970 (IEMOCGPF_01791) comYB 1909307..1910341 (-) 1035 WP_228274584.1 competence type IV pilus assembly protein ComGB Machinery gene
  I2437_RS08975 (IEMOCGPF_01792) comGA/cglA 1910274..1911212 (-) 939 WP_165628048.1 competence type IV pilus ATPase ComGA Machinery gene
  I2437_RS08980 (IEMOCGPF_01793) - 1911369..1912037 (-) 669 WP_165628047.1 ATP-binding cassette domain-containing protein -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 35708.69 Da        Isoelectric Point: 4.6973

>NTDB_id=506626 I2437_RS08940 WP_043024962.1 1906557..1907510(-) (comYH) [Streptococcus equi subsp. zooepidemicus strain SEZ33]
MNFEKIEQAYELILENSQLIENDLKTHIYDAIVEQNSFYLGAQGASPQVAKNIETLKALQLTKEEWRQAYQFVLIKAGKT
EPLQANHQFTPDAIGFIMLYILETLSSQESLDVLEIGSGTGNLAQTILNHSHKSIDYLGIELDDLLIDLSASIAEIMGSS
AQFIQEDAVRPQLLKESDMIISDLPVGFYPNDDIASRYQVASSDEHTYAHHLLIEQALKYLKKDGFAIFLAPVNLLSSPQ
SHLLKQWLKGYAQVAALITLPEAVFGNPANAKSIIVLCKQSNRFAETFVYPIRDLKSVDNVRDFMENFKNWKRDNVI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=506626 I2437_RS08940 WP_043024962.1 1906557..1907510(-) (comYH) [Streptococcus equi subsp. zooepidemicus strain SEZ33]
ATGAATTTTGAAAAGATTGAACAAGCCTATGAGCTTATATTAGAAAATAGCCAGCTGATTGAAAATGATTTAAAAACGCA
TATCTATGATGCTATTGTTGAGCAGAATTCTTTTTATCTGGGAGCCCAAGGAGCAAGCCCTCAGGTTGCTAAAAATATTG
AGACGTTGAAGGCCTTGCAGCTAACCAAGGAGGAGTGGCGTCAGGCTTACCAGTTTGTTTTGATCAAGGCTGGGAAAACA
GAGCCATTACAGGCCAACCACCAATTTACCCCAGACGCGATCGGTTTTATCATGCTTTATATTTTGGAGACCTTGAGTTC
ACAAGAGTCACTTGATGTGCTTGAGATTGGCAGTGGAACAGGTAATTTAGCTCAAACTATTTTAAACCACTCACATAAGA
GCATTGATTATTTAGGCATTGAGCTTGATGATTTATTAATTGATCTATCAGCTAGTATTGCTGAAATCATGGGGTCGTCA
GCCCAGTTTATTCAAGAGGATGCTGTCAGACCTCAGCTGTTAAAGGAAAGTGATATGATCATTAGTGATTTGCCAGTTGG
CTTTTATCCTAACGATGATATTGCCAGCAGGTATCAGGTGGCTAGCTCAGATGAGCATACCTATGCCCATCATTTACTGA
TAGAGCAGGCCTTAAAGTACCTGAAAAAAGATGGCTTTGCTATTTTCTTAGCACCGGTAAATCTTTTGAGCAGCCCACAA
AGTCACCTCTTAAAACAATGGCTGAAGGGCTATGCTCAGGTTGCGGCCTTGATTACTTTGCCTGAGGCAGTGTTTGGAAA
TCCAGCTAATGCAAAATCGATTATTGTTCTTTGTAAGCAATCGAATCGCTTTGCAGAAACCTTTGTTTACCCCATTAGGG
ATTTGAAATCTGTTGATAATGTTCGTGATTTTATGGAAAACTTCAAAAATTGGAAACGGGATAATGTTATTTAA

Domains


Predicted by InterproScan.

(68-294)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7Z8ZY98

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159

63.291

99.685

0.631

  comYH Streptococcus mutans UA140

63.291

99.685

0.631