Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   H0514_RS00865 Genome accession   NZ_LR822020
Coordinates   150842..151798 (+) Length   318 a.a.
NCBI ID   WP_179972341.1    Uniprot ID   A0A7U7CBK4
Organism   Streptococcus thermophilus isolate STH_CIRM_956     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 145842..156798
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H0514_RS00825 (STHERMO_0166) - 146739..147101 (+) 363 WP_179972348.1 DUF1033 family protein -
  H0514_RS00830 (STHERMO_0167) comGA/cglA/cilD 147182..148123 (+) 942 WP_179972347.1 competence type IV pilus ATPase ComGA Machinery gene
  H0514_RS00835 (STHERMO_0168) comYB 148005..149105 (+) 1101 WP_179973355.1 competence type IV pilus assembly protein ComGB Machinery gene
  H0514_RS00840 (STHERMO_0169) comYC 149102..149428 (+) 327 WP_179972346.1 competence type IV pilus major pilin ComGC Machinery gene
  H0514_RS00845 (STHERMO_0170) comGD 149388..149816 (+) 429 WP_269473123.1 competence type IV pilus minor pilin ComGD -
  H0514_RS00850 (STHERMO_0171) comGE 149788..150081 (+) 294 WP_179972344.1 competence type IV pilus minor pilin ComGE -
  H0514_RS00855 (STHERMO_0172) comYF 150065..150502 (+) 438 WP_179972343.1 competence type IV pilus minor pilin ComGF Machinery gene
  H0514_RS00860 (STHERMO_0173) comGG 150480..150791 (+) 312 WP_179972342.1 competence type IV pilus minor pilin ComGG -
  H0514_RS00865 (STHERMO_0174) comYH 150842..151798 (+) 957 WP_179972341.1 class I SAM-dependent methyltransferase Machinery gene
  H0514_RS00870 (STHERMO_0175) - 151854..153047 (+) 1194 WP_179972340.1 acetate kinase -
  H0514_RS00875 (STHERMO_0176) - 153295..153492 (+) 198 WP_179972339.1 helix-turn-helix transcriptional regulator -
  H0514_RS11190 (STHERMO_0177) - 153504..153945 (+) 442 Protein_129 CAAX protease -
  H0514_RS00880 (STHERMO_0178) - 154042..154704 (+) 663 WP_179972338.1 CPBP family intramembrane glutamic endopeptidase -
  H0514_RS00885 (STHERMO_0179) proC 154734..155504 (-) 771 WP_179972337.1 pyrroline-5-carboxylate reductase -
  H0514_RS00890 (STHERMO_0180) pepA 155520..156587 (-) 1068 WP_179972336.1 glutamyl aminopeptidase -

Sequence


Protein


Download         Length: 318 a.a.        Molecular weight: 35987.11 Da        Isoelectric Point: 4.6574

>NTDB_id=1131068 H0514_RS00865 WP_179972341.1 150842..151798(+) (comYH) [Streptococcus thermophilus isolate STH_CIRM_956]
MNFEAIETAFELLLENVQTIENDLGTHAYDALIEQNSYYLGAEVANELIIKNNEKLRALNLSKEEWRRAFQFLFIKLGQL
EALQANHQFTPDAIGFIILYLLEGLTQEKQLDILEIGSGTGNLAETLLNNTQRTLNYMGMEVDDLLIDLSASIAEVVNSV
AVYIQEDAVRPHILKESNVIISDLPIGYYPNDEIASRFKVAATGEHTYAHHLLMEQSLKYLKKDGIAIFLAPTNLLTSPQ
SDLLKKWLSGYADIIAVITLPEAAFSNKHNMKSIFVLKKQTKNAPETFVYPLSDLQNPRVLKDFTENFQKWKSDNSIF

Nucleotide


Download         Length: 957 bp        

>NTDB_id=1131068 H0514_RS00865 WP_179972341.1 150842..151798(+) (comYH) [Streptococcus thermophilus isolate STH_CIRM_956]
ATGAATTTTGAAGCAATTGAGACAGCTTTTGAGCTGTTGTTAGAAAATGTCCAAACTATTGAAAATGATCTTGGAACCCA
TGCTTACGATGCACTTATTGAGCAAAATTCCTATTATTTGGGGGCTGAGGTTGCTAATGAGCTCATCATCAAAAACAACG
AGAAATTACGGGCGCTTAATCTAAGTAAAGAGGAGTGGCGTCGTGCTTTTCAGTTTTTGTTTATCAAACTAGGGCAATTG
GAAGCTTTACAAGCCAATCACCAATTTACACCAGATGCTATCGGATTTATCATTCTGTACTTGCTCGAAGGTTTGACCCA
GGAAAAACAATTGGATATCTTGGAGATTGGTTCGGGAACAGGGAACTTGGCTGAAACTCTTCTAAATAATACTCAGAGAA
CCCTTAATTATATGGGGATGGAAGTTGATGATCTTCTTATCGATTTGTCAGCTAGTATTGCTGAGGTGGTAAATTCAGTA
GCGGTTTATATCCAAGAGGATGCTGTTCGACCACATATTCTCAAAGAGAGTAACGTTATTATCAGCGATTTACCTATAGG
TTACTACCCTAATGATGAGATTGCGAGTCGTTTCAAGGTGGCAGCAACCGGCGAACACACTTATGCCCATCATCTTCTTA
TGGAGCAATCGCTTAAGTATTTGAAGAAAGATGGTATTGCTATTTTTTTGGCACCAACCAATCTTTTGACAAGCCCTCAA
AGTGATCTGCTTAAGAAGTGGTTATCAGGATATGCTGATATTATTGCTGTTATTACTCTTCCAGAAGCAGCTTTTAGCAA
TAAACATAACATGAAGTCTATCTTTGTGCTAAAAAAACAAACTAAAAATGCTCCTGAGACCTTCGTTTACCCACTTAGCG
ATTTGCAAAATCCAAGGGTCCTCAAGGATTTTACAGAGAATTTCCAAAAATGGAAATCAGATAATTCCATTTTCTAG

Domains


Predicted by InterproScan.

(69-293)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7U7CBK4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159

68.889

99.057

0.682

  comYH Streptococcus mutans UA140

68.889

99.057

0.682


Multiple sequence alignment