Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   D7D53_RS08305 Genome accession   NZ_CP032621
Coordinates   1645880..1646833 (-) Length   317 a.a.
NCBI ID   WP_120770670.1    Uniprot ID   A0A387AZ60
Organism   Streptococcus gwangjuensis strain KCOM 1679 (=ChDC B345)     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1640880..1651833
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D7D53_RS08280 (D7D53_08280) - 1641639..1642262 (-) 624 Protein_1546 HI_0552 family protein -
  D7D53_RS08285 (D7D53_08285) jag 1642313..1643302 (-) 990 WP_049492474.1 RNA-binding cell elongation regulator Jag/EloR -
  D7D53_RS08290 (D7D53_08290) - 1643320..1644144 (-) 825 WP_218961675.1 membrane protein insertase YidC -
  D7D53_RS08295 (D7D53_08295) rnpA 1644119..1644490 (-) 372 WP_000739243.1 ribonuclease P protein component -
  D7D53_RS08300 (D7D53_08300) - 1644639..1645829 (-) 1191 WP_120770669.1 acetate kinase -
  D7D53_RS08305 (D7D53_08305) comYH 1645880..1646833 (-) 954 WP_120770670.1 class I SAM-dependent methyltransferase Machinery gene
  D7D53_RS08310 (D7D53_08310) - 1646894..1647481 (-) 588 WP_162927893.1 class I SAM-dependent methyltransferase -
  D7D53_RS08315 (D7D53_08315) comGG/cglG 1647514..1647915 (-) 402 WP_120770671.1 competence type IV pilus minor pilin ComGG Machinery gene
  D7D53_RS08320 (D7D53_08320) comGF/cglF 1647893..1648354 (-) 462 WP_000250521.1 competence type IV pilus minor pilin ComGF Machinery gene
  D7D53_RS08325 (D7D53_08325) comGE/cglE 1648317..1648619 (-) 303 WP_281268606.1 competence type IV pilus minor pilin ComGE Machinery gene
  D7D53_RS08330 (D7D53_08330) comGD/cglD 1648582..1648986 (-) 405 WP_245941793.1 competence type IV pilus minor pilin ComGD Machinery gene
  D7D53_RS08335 (D7D53_08335) comGC/cglC 1648979..1649305 (-) 327 WP_045606658.1 competence type IV pilus major pilin ComGC Machinery gene
  D7D53_RS08340 (D7D53_08340) comGB/cglB 1649307..1650323 (-) 1017 WP_120770673.1 competence type IV pilus assembly protein ComGB Machinery gene
  D7D53_RS08345 (D7D53_08345) comGA/cglA/cilD 1650271..1651212 (-) 942 WP_120770674.1 competence type IV pilus ATPase ComGA Machinery gene
  D7D53_RS08350 (D7D53_08350) - 1651288..1651653 (-) 366 WP_120770675.1 DUF1033 family protein -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 35727.84 Da        Isoelectric Point: 4.2215

>NTDB_id=317213 D7D53_RS08305 WP_120770670.1 1645880..1646833(-) (comYH) [Streptococcus gwangjuensis strain KCOM 1679 (=ChDC B345)]
MDFEKIEQAYTYLLENVQVIQSDLATNFYDALVEQNSIYLDGETELEQVKENNQALKRLALRKEEWLKTYQFLLMKAGQT
EPLQANHQFTPDAIALLLVFIVEELFKEEEITILEMGSGMGILGATFLTSLDKKVDYLGMEVDDLLIDLAASMADVIGLQ
AGFVQGDAVRPQMLKESDVVISDLPVGYYPDDDVASRHQVASSQEHTYAHHLLIEQGLKYLKSDGYAIFLAPSDLLTSPQ
SDLLKGWLKEEASLVTMISLPENLFANAKQSKTIFILQKKNEIAVEPFVYPLASLQDASVLMKFKENFQKWTQGTEI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=317213 D7D53_RS08305 WP_120770670.1 1645880..1646833(-) (comYH) [Streptococcus gwangjuensis strain KCOM 1679 (=ChDC B345)]
ATGGATTTTGAAAAAATTGAACAAGCTTATACGTATTTACTAGAGAATGTCCAAGTCATCCAAAGTGATTTGGCGACCAA
CTTTTATGACGCCCTGGTGGAGCAAAACAGCATTTATCTGGATGGTGAAACTGAGTTAGAGCAGGTCAAGGAGAACAATC
AGGCCCTTAAACGTTTAGCGCTTCGCAAAGAAGAATGGCTCAAGACCTACCAGTTTCTCTTGATGAAGGCTGGGCAGACA
GAGCCCCTACAGGCCAATCACCAGTTTACGCCAGATGCCATTGCTTTACTTTTGGTGTTTATTGTGGAAGAGTTGTTTAA
AGAGGAGGAAATTACTATCCTCGAAATGGGTTCCGGGATGGGGATTCTGGGCGCTACTTTCTTGACCTCGCTTGATAAAA
AGGTGGATTATTTGGGAATGGAAGTGGATGATTTGCTGATTGATTTGGCAGCAAGCATGGCAGATGTGATTGGTTTGCAG
GCTGGCTTTGTCCAAGGAGATGCCGTTCGTCCACAAATGCTCAAAGAAAGCGATGTGGTCATCAGCGACTTGCCTGTCGG
CTATTATCCTGATGATGACGTTGCGTCGCGCCATCAAGTTGCTTCTAGTCAAGAACATACTTACGCCCATCACTTGCTCA
TAGAACAAGGACTTAAGTACCTCAAGTCAGATGGATACGCTATTTTTCTCGCTCCGAGTGATTTGTTGACCAGTCCTCAG
AGTGATTTGTTGAAAGGGTGGTTGAAAGAGGAAGCGAGTCTGGTTACCATGATCAGTCTGCCTGAAAATCTCTTTGCTAA
TGCTAAACAATCTAAGACTATTTTTATCCTACAGAAGAAAAATGAGATAGCAGTAGAACCTTTTGTTTATCCACTTGCTA
GCTTGCAAGATGCAAGTGTTTTAATGAAATTTAAAGAAAATTTTCAAAAATGGACTCAAGGTACTGAAATATAA

Domains


Predicted by InterproScan.

(69-282)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A387AZ60

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

54.313

98.738

0.536

  comYH Streptococcus mutans UA159

53.994

98.738

0.533


Multiple sequence alignment