Detailed information    

In silico: bioinformatically predicted

Overview


Name   comYH
Type   Machinery gene
Locus tag   N597_RS00270
Genome accession   NC_022582
Coordinates   50510..51463 (+)
Length   317 a.a.
NCBI ID   WP_023022079.1
UniProt ID   -
Organism   Streptococcus ilei
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
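
The coordinates and the protein length listed above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the helper name `span_length` is illustrative and not part of the database.

```python
# Values copied from the Overview above.
start, end = 50510, 51463  # assumed 1-based, inclusive, (+) strand


def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based interval."""
    return end - start + 1


gene_bp = span_length(start, end)  # length of the coding region in bp
codons = gene_bp // 3              # number of codons, including the stop codon
protein_aa = codons - 1            # assumption: the last codon is a stop codon

print(gene_bp, codons, protein_aa)
```

Under these assumptions the result agrees with the 954 bp and 317 a.a. reported in the Sequence section.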

Genomic Context


Location: 45510..56463
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
N597_RS00230 (N597_00240)   -   46275..46652 (+)   378   WP_006595139.1   DUF1033 family protein   -
N597_RS00235 (N597_00245)   comYA   46758..47699 (+)   942   WP_006597118.1   competence type IV pilus ATPase ComGA   Machinery gene
N597_RS00240 (N597_00250)   comYB   47641..48663 (+)   1023   WP_023022075.1   competence type IV pilus assembly protein ComGB   Machinery gene
N597_RS00245 (N597_00255)   comYC   48660..48977 (+)   318   WP_023022076.1   competence type IV pilus major pilin ComGC   Machinery gene
N597_RS00250 (N597_00260)   comGD/cglD   48946..49371 (+)   426   WP_042507426.1   competence type IV pilus minor pilin ComGD   Machinery gene
N597_RS00255 (N597_00265)   comGE   49337..49630 (+)   294   WP_006595134.1   competence type IV pilus minor pilin ComGE   -
N597_RS00260 (N597_00270)   comGF/cglF   49614..50066 (+)   453   WP_040803180.1   competence type IV pilus minor pilin ComGF   Machinery gene
N597_RS00265 (N597_00275)   comGG   50032..50478 (+)   447   WP_006595132.1   competence type IV pilus minor pilin ComGG   -
N597_RS00270 (N597_00280)   comYH   50510..51463 (+)   954   WP_023022079.1   class I SAM-dependent methyltransferase   Machinery gene
N597_RS00275 (N597_00285)   -   51512..52708 (+)   1197   WP_023022080.1   acetate kinase   -
N597_RS00280 (N597_00290)   -   52860..53534 (+)   675   WP_224781595.1   CPBP family intramembrane glutamic endopeptidase   -
N597_RS00285 (N597_00295)   folP   53694..54650 (+)   957   WP_023022082.1   dihydropteroate synthase   -
N597_RS00290 (N597_00300)   -   54667..55971 (+)   1305   WP_006595127.1   folylpolyglutamate synthase/dihydrofolate synthase family protein   -
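
The coordinates in the table can be used to see how closely the comYA to comYH genes are packed. The sketch below computes the spacing between each pair of neighbouring genes, assuming inclusive 1-based coordinates on the same strand; the function name `spacing` and the output labels are illustrative, and the result says nothing by itself about transcriptional organization.

```python
# (gene name, start, end) copied from the comYA..comYH rows; all on the (+) strand.
genes = [
    ("comYA", 46758, 47699),
    ("comYB", 47641, 48663),
    ("comYC", 48660, 48977),
    ("comGD/cglD", 48946, 49371),
    ("comGE", 49337, 49630),
    ("comGF/cglF", 49614, 50066),
    ("comGG", 50032, 50478),
    ("comYH", 50510, 51463),
]


def spacing(upstream, downstream):
    """Bases strictly between two inclusive intervals; negative means overlap."""
    return downstream[1] - upstream[2] - 1


# Map "upstream->downstream" to the spacing in bp for each neighbouring pair.
spacings = {
    f"{a[0]}->{b[0]}": spacing(a, b) for a, b in zip(genes, genes[1:])
}

for pair, bp in spacings.items():
    label = "gap" if bp >= 0 else "overlap"
    print(f"{pair}: {abs(bp)} bp {label}")
```

With the listed coordinates, every neighbouring pair from comYA to comGG overlaps, while comYH starts 31 bp after the end of comGG.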

Sequence


Protein


Length: 317 a.a.   Molecular weight: 36075.86 Da   Isoelectric point: 4.2657

>NTDB_id=62878 N597_RS00270 WP_023022079.1 50510..51463(+) (comYH) [Streptococcus ilei]
MNFEKIEKAYGYLLENTQTIQNDLQTNFYDALVEQNAIYLDGQTELTLVKENNQRLKDLNLNKEEWRRSFQYLLMKAAQT
EPLQANHQFTPDGIGFLLVFLVDQLASSDQVDVLEMGSGTGNLAQTLMNNCQRSLDYLGLEIDDLLIDLAASMAEVMKAD
VNFAQGDAIRPQVLKESDVIISDLPVGYYPDDAIASRYQVASPQGHTYAHHLLIEQSLKYLKPGGVAIFLAPNDLLTSEQ
SPLLKQWMQDHAQVLAMVTLPENLFRSANLAKTIFVFRKQEEAVVQPFVYPLTDLQDQEDLMKFRESFQNWNKESEI

Nucleotide


Length: 954 bp

>NTDB_id=62878 N597_RS00270 WP_023022079.1 50510..51463(+) (comYH) [Streptococcus ilei]
ATGAATTTCGAAAAAATTGAGAAAGCCTACGGCTACCTATTAGAGAATACCCAAACTATCCAAAATGATTTGCAGACCAA
CTTTTATGATGCTCTAGTTGAGCAGAATGCCATCTATCTGGATGGGCAAACAGAGTTGACTCTAGTGAAGGAAAACAACC
AGCGCCTGAAGGACTTGAACTTAAACAAGGAAGAGTGGCGTCGTTCCTTCCAGTATCTTTTGATGAAGGCTGCCCAAACA
GAGCCCCTACAGGCCAATCACCAATTTACGCCAGATGGGATTGGATTTCTTCTGGTCTTCCTAGTGGATCAGTTGGCGAG
TTCCGATCAAGTAGATGTGCTAGAAATGGGGAGTGGAACAGGGAACTTGGCCCAAACCTTGATGAACAACTGTCAGCGTT
CCTTAGATTATTTGGGCTTGGAAATTGATGATCTCTTGATTGACCTTGCGGCTAGTATGGCAGAAGTGATGAAGGCGGAT
GTGAATTTTGCCCAAGGTGATGCCATTCGTCCACAAGTTTTGAAAGAGAGCGATGTGATTATCAGCGATTTACCTGTCGG
TTATTATCCAGATGATGCCATTGCGAGTCGTTACCAGGTCGCTTCCCCTCAGGGCCACACCTATGCCCATCATTTATTGA
TTGAACAATCGCTAAAATACTTAAAACCAGGTGGCGTCGCTATTTTTCTAGCTCCGAATGATCTCTTGACGAGCGAGCAG
AGTCCTCTGCTGAAACAATGGATGCAGGATCATGCTCAGGTCTTGGCCATGGTGACCTTGCCAGAGAACCTCTTTCGATC
AGCCAATCTAGCAAAAACCATCTTTGTTTTCCGCAAGCAAGAAGAAGCGGTGGTCCAACCATTTGTCTATCCTTTGACTG
ATTTGCAAGACCAGGAAGACCTCATGAAATTCCGTGAAAGTTTTCAAAACTGGAATAAAGAAAGTGAAATTTAA

Domains


Predicted by InterProScan.

Predicted domain region: residues 70-290


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comYH   Streptococcus mutans UA140   58.73   99.369   0.584
comYH   Streptococcus mutans UA159   58.413   99.369   0.58
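
This page does not define the Ha-value. As a hedged observation only, the two listed values are numerically consistent with the product of identity and coverage, both taken as fractions. The sketch below checks that relationship; it is an assumption inferred from these two rows, not the database's documented formula, and the function name is illustrative.

```python
# (organism, identity %, coverage %, listed Ha-value) copied from the table above.
rows = [
    ("Streptococcus mutans UA140", 58.73, 99.369, 0.584),
    ("Streptococcus mutans UA159", 58.413, 99.369, 0.58),
]


def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


# Round to three decimals to compare with the precision shown in the table.
products = [round(identity_times_coverage(i, c), 3) for _, i, c, _ in rows]

for (organism, _, _, listed), product in zip(rows, products):
    print(f"{organism}: product={product}, listed={listed}")
```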

