Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYH
Type: Machinery gene
Locus tag: SM121_RS01065
Genome accession: NZ_CP139418
Coordinates: 222592..223545 (-)
Length: 317 a.a.
NCBI ID: WP_155127115.1
UniProt ID: A0A6A8VAE9
Organism: Streptococcus dentalis strain S1
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 217592..228545
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
SM121_RS01045 (SM121_01045) | - | 218119..219429 (-) | 1311 | WP_320910979.1 | folylpolyglutamate synthase/dihydrofolate synthase family protein | -
SM121_RS01050 (SM121_01050) | folP | 219430..220380 (-) | 951 | WP_320910980.1 | dihydropteroate synthase | -
SM121_RS01055 (SM121_01055) | - | 220543..221274 (-) | 732 | WP_320910981.1 | type II CAAX endopeptidase family protein | -
SM121_RS01060 (SM121_01060) | - | 221347..222540 (-) | 1194 | WP_003004202.1 | acetate kinase | -
SM121_RS01065 (SM121_01065) | comYH | 222592..223545 (-) | 954 | WP_155127115.1 | class I SAM-dependent methyltransferase | Machinery gene
SM121_RS01070 (SM121_01070) | comGG | 223576..224013 (-) | 438 | WP_320910982.1 | competence type IV pilus minor pilin ComGG | -
SM121_RS01075 (SM121_01075) | comGF/cglF | 224000..224452 (-) | 453 | WP_151190960.1 | competence type IV pilus minor pilin ComGF | Machinery gene
SM121_RS01080 (SM121_01080) | comGE | 224442..224729 (-) | 288 | WP_003004353.1 | competence type IV pilus minor pilin ComGE | -
SM121_RS01085 (SM121_01085) | comYD | 224695..225099 (-) | 405 | WP_003004513.1 | competence type IV pilus minor pilin ComGD | Machinery gene
SM121_RS01090 (SM121_01090) | comYC | 225089..225406 (-) | 318 | WP_003013558.1 | competence type IV pilus major pilin ComGC | Machinery gene
SM121_RS01095 (SM121_01095) | comYB | 225403..226434 (-) | 1032 | WP_320911301.1 | competence type IV pilus assembly protein ComGB | Machinery gene
SM121_RS01100 (SM121_01100) | comYA | 226367..227308 (-) | 942 | WP_155127112.1 | competence type IV pilus ATPase ComGA | Machinery gene
SM121_RS01105 (SM121_01105) | - | 227396..227785 (-) | 390 | WP_037615753.1 | DUF1033 family protein | -
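
The coordinates and sizes listed for this entry are consistent with 1-based, inclusive positions, and the genomic context window matches the comYH coordinates extended by 5000 bp on each side. The short Python sketch below checks that arithmetic. The 5000 bp flank is an assumption inferred from the numbers on this page, not a documented setting of the database, and the function name `span_length` is illustrative only.

```python
def span_length(start, end):
    """Length in bp of a 1-based, inclusive coordinate span."""
    return end - start + 1

# comYH coordinates as listed on this page
gene_start, gene_end = 222592, 223545

gene_bp = span_length(gene_start, gene_end)  # nucleotide length of the gene
codons = gene_bp // 3                        # includes the stop codon
protein_aa = codons - 1                      # residues in the translated protein

# Assumption: the context window is the gene extended by a fixed flank.
FLANK = 5000
window = (gene_start - FLANK, gene_end + FLANK)

print(gene_bp, protein_aa, window)
```

Under these assumptions the script reproduces the 954 bp gene size, the 317 a.a. protein length, and the 217592..228545 window shown above.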

Sequence


Protein


Length: 317 a.a.; Molecular weight: 35957.80 Da; Isoelectric point: 4.2369

>NTDB_id=909467 SM121_RS01065 WP_155127115.1 222592..223545(-) (comYH) [Streptococcus dentalis strain S1]
MNFEKIEQAYTYLLENTQSIQNELSTNFYDALIEQNVMYLDGKTDLDLVKNNSKKLKELGLSKEEWRRAYQFLFMKAAQT
EPLQANHQFTPDAIGFIITFLIDQLAKSDHLDVLEVGSGTGNLAETIVNNSRLTIDYLGLEVDDLLIDLSASIADVMESS
VVFAQGDAVRPQVLKESDLIVSDLPIGYYPDDAIAQRYQVASSEGHTYAHHLMMEQALKYLKPQGVAIFLAPNNLLTSPQ
SDLLKAWLTDKAQLLAMLTLPESLFLNPAYAKTIFVLRKQEEESVQPFVYPFTDLQDQDQVVHFMESFQNWLKDSEI

Nucleotide


Length: 954 bp

>NTDB_id=909467 SM121_RS01065 WP_155127115.1 222592..223545(-) (comYH) [Streptococcus dentalis strain S1]
ATGAATTTCGAAAAAATTGAACAAGCTTATACCTATCTATTAGAAAACACTCAAAGTATTCAAAATGAATTGTCGACCAA
CTTTTATGACGCCTTGATTGAACAAAATGTCATGTATTTGGATGGCAAGACGGATCTAGACCTTGTTAAAAACAATAGCA
AAAAATTAAAAGAACTAGGTTTAAGTAAGGAAGAATGGCGCAGAGCCTACCAATTCCTTTTTATGAAAGCTGCTCAGACA
GAACCTTTACAAGCGAATCACCAGTTCACACCAGATGCGATTGGTTTTATCATTACATTTTTGATCGATCAGTTGGCTAA
AAGCGACCATTTGGATGTCTTAGAAGTGGGAAGTGGAACCGGAAATCTCGCTGAGACCATTGTCAACAATAGCCGCCTCA
CGATTGATTACTTAGGATTGGAAGTGGATGATCTTTTGATTGACCTATCTGCTAGTATCGCAGATGTGATGGAATCCAGT
GTTGTCTTTGCACAAGGCGACGCGGTGCGTCCACAAGTGTTGAAAGAAAGTGACTTGATCGTTAGCGACTTACCGATTGG
CTATTATCCAGATGATGCGATTGCACAGCGCTATCAGGTAGCGAGCTCCGAAGGCCATACCTATGCCCATCACCTTATGA
TGGAACAGGCTCTGAAATATCTGAAACCTCAAGGAGTTGCCATCTTTTTAGCTCCAAATAACCTCTTGACGAGCCCTCAG
AGTGACCTTTTAAAAGCTTGGCTAACAGACAAAGCTCAACTCCTTGCCATGCTGACCTTGCCAGAATCTCTTTTTTTAAA
TCCAGCCTATGCTAAGACGATTTTCGTCCTACGAAAACAAGAAGAAGAGTCTGTTCAGCCCTTTGTCTATCCGTTTACCG
ATCTCCAGGATCAAGACCAGGTGGTTCACTTTATGGAAAGTTTCCAAAACTGGTTAAAGGATAGTGAAATTTGA
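
Although the gene lies on the minus strand, the nucleotide record above starts with ATG and ends with TGA, so it appears to be given in coding orientation. As a rough consistency check, the sketch below translates the first 30 and last 9 nucleotides of that record with the standard genetic code (NCBI translation table 1) and compares them with the ends of the protein record. Using table 1 here is an assumption; the page does not state which translation table was applied, and the helper `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nt and last 9 nt of the comYH nucleotide record on this page
head = translate("ATGAATTTCGAAAAAATTGAACAAGCTTAT")
tail = translate("GAAATTTGA")

print(head)  # MNFEKIEQAY
print(tail)  # EI*
```

Both fragments agree with the protein record, which begins MNFEKIEQAY and ends with EI before the stop codon.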

Domains


Predicted by InterProScan.

(68-286)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | A0A6A8VAE9

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comYH | Streptococcus mutans UA159 | 61.587 | 99.369 | 0.612
comYH | Streptococcus mutans UA140 | 61.587 | 99.369 | 0.612