Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   H1W98_RS09630 Genome accession   NZ_LR822026
Coordinates   1874206..1875162 (-) Length   318 a.a.
NCBI ID   WP_179972341.1    Uniprot ID   A0A7U7CBK4
Organism   Streptococcus thermophilus isolate STH_CIRM_967     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1869206..1880162
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H1W98_RS09605 (STHERMO_2196) pepA 1869417..1870484 (+) 1068 WP_180482293.1 glutamyl aminopeptidase -
  H1W98_RS09610 (STHERMO_2197) proC 1870500..1871270 (+) 771 WP_179972337.1 pyrroline-5-carboxylate reductase -
  H1W98_RS09615 (STHERMO_2198) - 1871300..1871962 (-) 663 WP_179972338.1 CPBP family intramembrane glutamic endopeptidase -
  H1W98_RS11415 (STHERMO_2200) - 1872059..1872500 (-) 442 Protein_1850 CAAX protease -
  H1W98_RS09620 (STHERMO_2201) - 1872512..1872709 (-) 198 WP_179972339.1 helix-turn-helix transcriptional regulator -
  H1W98_RS09625 (STHERMO_2202) - 1872957..1874150 (-) 1194 WP_179972340.1 acetate kinase -
  H1W98_RS09630 (STHERMO_2203) comYH 1874206..1875162 (-) 957 WP_179972341.1 class I SAM-dependent methyltransferase Machinery gene
  H1W98_RS09635 (STHERMO_2204) comGG 1875213..1875524 (-) 312 WP_179972342.1 competence type IV pilus minor pilin ComGG -
  H1W98_RS09640 (STHERMO_2205) comYF 1875502..1875939 (-) 438 WP_179972343.1 competence type IV pilus minor pilin ComGF Machinery gene
  H1W98_RS09645 (STHERMO_2206) comGE 1875923..1876216 (-) 294 WP_179972344.1 competence type IV pilus minor pilin ComGE -
  H1W98_RS09650 (STHERMO_2207) comGD 1876188..1876616 (-) 429 WP_269473123.1 competence type IV pilus minor pilin ComGD -
  H1W98_RS09655 (STHERMO_2208) comYC 1876576..1876902 (-) 327 WP_179972346.1 competence type IV pilus major pilin ComGC Machinery gene
  H1W98_RS09660 (STHERMO_2209) comYB 1876899..1877999 (-) 1101 WP_179973355.1 competence type IV pilus assembly protein ComGB Machinery gene
  H1W98_RS09665 (STHERMO_2210) comGA/cglA/cilD 1877881..1878822 (-) 942 WP_179972347.1 competence type IV pilus ATPase ComGA Machinery gene
  H1W98_RS09670 (STHERMO_2211) - 1878903..1879265 (-) 363 WP_180482294.1 DUF1033 family protein -

Sequence


Protein


Download         Length: 318 a.a.        Molecular weight: 35987.11 Da        Isoelectric Point: 4.6574

>NTDB_id=1131305 H1W98_RS09630 WP_179972341.1 1874206..1875162(-) (comYH) [Streptococcus thermophilus isolate STH_CIRM_967]
MNFEAIETAFELLLENVQTIENDLGTHAYDALIEQNSYYLGAEVANELIIKNNEKLRALNLSKEEWRRAFQFLFIKLGQL
EALQANHQFTPDAIGFIILYLLEGLTQEKQLDILEIGSGTGNLAETLLNNTQRTLNYMGMEVDDLLIDLSASIAEVVNSV
AVYIQEDAVRPHILKESNVIISDLPIGYYPNDEIASRFKVAATGEHTYAHHLLMEQSLKYLKKDGIAIFLAPTNLLTSPQ
SDLLKKWLSGYADIIAVITLPEAAFSNKHNMKSIFVLKKQTKNAPETFVYPLSDLQNPRVLKDFTENFQKWKSDNSIF

Nucleotide


Download         Length: 957 bp        

>NTDB_id=1131305 H1W98_RS09630 WP_179972341.1 1874206..1875162(-) (comYH) [Streptococcus thermophilus isolate STH_CIRM_967]
ATGAATTTTGAAGCAATTGAGACAGCTTTTGAGCTGTTGTTAGAAAATGTCCAAACTATTGAAAATGATCTTGGAACCCA
TGCTTACGATGCACTTATTGAGCAAAATTCCTATTATTTGGGGGCTGAGGTTGCTAATGAGCTCATCATCAAAAACAACG
AGAAATTACGGGCGCTTAATCTAAGTAAAGAGGAGTGGCGTCGTGCTTTTCAGTTTTTGTTTATCAAACTAGGGCAATTG
GAAGCTTTACAAGCCAATCACCAATTTACACCAGATGCTATCGGATTTATCATTCTGTACTTGCTCGAAGGTTTGACCCA
GGAAAAACAATTGGATATCTTGGAGATTGGTTCGGGAACAGGGAACTTGGCTGAAACTCTTCTAAATAATACTCAGAGAA
CCCTTAATTATATGGGGATGGAAGTTGATGATCTTCTTATCGATTTGTCAGCTAGTATTGCTGAGGTGGTAAATTCAGTA
GCGGTTTATATCCAAGAGGATGCTGTTCGACCACATATTCTCAAAGAGAGTAACGTTATTATCAGCGATTTACCTATAGG
TTACTACCCTAATGATGAGATTGCGAGTCGTTTCAAGGTGGCAGCAACCGGCGAACACACTTATGCCCATCATCTTCTTA
TGGAGCAATCGCTTAAGTATTTGAAGAAAGATGGTATTGCTATTTTTTTGGCACCAACCAATCTTTTGACAAGCCCTCAA
AGTGATCTGCTTAAGAAGTGGTTATCAGGATATGCTGATATTATTGCTGTTATTACTCTTCCAGAAGCAGCTTTTAGCAA
TAAACATAACATGAAGTCTATCTTTGTGCTAAAAAAACAAACTAAAAATGCTCCTGAGACCTTCGTTTACCCACTTAGCG
ATTTGCAAAATCCAAGGGTCCTCAAGGATTTTACAGAGAATTTCCAAAAATGGAAATCAGATAATTCCATTTTCTAG

Domains


Predicted by InterproScan.

(69-293)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7U7CBK4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159

68.889

99.057

0.682

  comYH Streptococcus mutans UA140

68.889

99.057

0.682


Multiple sequence alignment