Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   LPB404_RS01935 Genome accession   NZ_CP079821
Coordinates   376336..377289 (+) Length   317 a.a.
NCBI ID   WP_120701351.1    Uniprot ID   -
Organism   Streptococcus rubneri strain LPB0404     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 371336..382289
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LPB404_RS01895 (LPB404_01895) - 372102..372479 (+) 378 WP_219074901.1 DUF1033 family protein -
  LPB404_RS01900 (LPB404_01900) comYA 372584..373543 (+) 960 WP_219074902.1 competence type IV pilus ATPase ComGA Machinery gene
  LPB404_RS01905 (LPB404_01905) comYB 373467..374489 (+) 1023 WP_257573088.1 competence type IV pilus assembly protein ComGB Machinery gene
  LPB404_RS01910 (LPB404_01910) comGC/cglC 374486..374803 (+) 318 WP_219074903.1 competence type IV pilus major pilin ComGC Machinery gene
  LPB404_RS01915 (LPB404_01915) comGD/cglD 374793..375197 (+) 405 WP_219075204.1 competence type IV pilus minor pilin ComGD Machinery gene
  LPB404_RS01920 (LPB404_01920) comGE 375163..375456 (+) 294 WP_219074904.1 competence type IV pilus minor pilin ComGE -
  LPB404_RS01925 (LPB404_01925) comGF/cglF 375440..375892 (+) 453 WP_219075203.1 competence type IV pilus minor pilin ComGF Machinery gene
  LPB404_RS01930 (LPB404_01930) comGG 375921..376304 (+) 384 WP_374030902.1 competence type IV pilus minor pilin ComGG -
  LPB404_RS01935 (LPB404_01935) comYH 376336..377289 (+) 954 WP_120701351.1 class I SAM-dependent methyltransferase Machinery gene
  LPB404_RS01940 (LPB404_01940) - 377338..378534 (+) 1197 WP_219074906.1 acetate kinase -
  LPB404_RS01945 (LPB404_01945) - 378687..379361 (+) 675 WP_244917016.1 CPBP family intramembrane glutamic endopeptidase -
  LPB404_RS01950 (LPB404_01950) folP 379520..380476 (+) 957 WP_257573047.1 dihydropteroate synthase -
  LPB404_RS01955 (LPB404_01955) - 380493..381797 (+) 1305 WP_219074908.1 folylpolyglutamate synthase/dihydrofolate synthase family protein -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 36022.95 Da        Isoelectric Point: 4.4354

>NTDB_id=589733 LPB404_RS01935 WP_120701351.1 376336..377289(+) (comYH) [Streptococcus rubneri strain LPB0404]
MNFEKIEKAYGYLLENTQTIQNDLQTNFYDALVEQNAIYLDGQTELTLVKENNQRLKDLNLNKEEWRRSFQYLLMKAAQT
EPLQANHQFTPDGIGFLLVFLVDQLASSDKVDVLEIGSGTGNLAQTLMNNCQRSLDYLGLEIDDLLIDLAASMAEVMKAD
VKFAQGDAVRPQVLKESDVIISDLPVGYYPDDAIASRYQVASPQGHTYAHHLLIEQSLKYLKPGGIAIFLAPNDLLTSEQ
SPLLKKWMQDHAQVLAMVTLPENLFRSANLAKTIFVLRKQEEAVVQPFVYPLTDLQNQEDVMKFRESFQNWNKESEI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=589733 LPB404_RS01935 WP_120701351.1 376336..377289(+) (comYH) [Streptococcus rubneri strain LPB0404]
ATGAATTTCGAAAAAATTGAGAAAGCCTATGGCTATCTATTAGAAAATACCCAAACCATCCAAAATGATTTGCAGACCAA
CTTTTATGATGCTCTAGTGGAGCAGAATGCAATCTATCTGGATGGGCAGACTGAGTTGACTCTAGTGAAGGAAAACAACC
AGCGACTGAAGGACTTGAATTTGAACAAAGAAGAGTGGCGTCGCTCCTTCCAATATCTTTTGATGAAGGCGGCCCAAACA
GAGCCCCTACAAGCCAATCACCAATTTACCCCAGATGGGATTGGATTTCTTCTGGTCTTTCTAGTGGATCAGTTAGCTAG
TTCTGATAAAGTAGATGTGCTAGAAATTGGAAGCGGAACAGGAAACTTAGCCCAAACCTTAATGAACAACTGCCAGCGCT
CGCTAGATTATTTGGGCTTGGAAATCGATGATCTGTTGATTGATCTTGCGGCCAGTATGGCAGAAGTGATGAAGGCGGAT
GTGAAGTTCGCCCAAGGTGATGCGGTTCGTCCACAGGTTTTGAAAGAGAGTGATGTGATTATCAGTGATTTACCTGTCGG
TTATTATCCAGATGATGCCATTGCAAGTCGTTATCAGGTCGCTTCCCCTCAGGGCCACACCTATGCCCATCATTTATTAA
TCGAACAATCGTTAAAATACTTAAAGCCGGGTGGTATCGCTATTTTTCTAGCTCCGAATGATCTGTTGACGAGCGAGCAG
AGCCCTCTCTTGAAAAAATGGATGCAGGATCATGCTCAGGTCTTGGCCATGGTTACCTTGCCAGAGAACCTCTTTCGATC
AGCTAATCTAGCAAAAACCATCTTTGTTTTGCGCAAGCAAGAAGAAGCAGTGGTCCAACCATTTGTCTATCCCTTAACCG
ATTTGCAAAATCAGGAAGACGTCATGAAATTCCGTGAAAGTTTTCAAAACTGGAATAAAGAAAGTGAAATTTAA

Domains


Predicted by InterproScan.

(70-290)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

59.048

99.369

0.587

  comYH Streptococcus mutans UA159

58.73

99.369

0.584