Detailed information    

In silico: bioinformatically predicted

Overview


Name: comYH
Type: Machinery gene
Locus tag: E3C75_RS01055
Genome accession: NZ_CP038020
Coordinates: 205828..206784 (-)
Length: 318 a.a.
NCBI ID: WP_023909974.1
UniProt ID: -
Organism: Streptococcus thermophilus strain ATCC 19258
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
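
The coordinates and the protein length in this overview can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a stop codon; the helper names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


gene_bp = span_length(205828, 206784)   # coordinates of comYH from the overview
print(gene_bp, protein_length(gene_bp))  # 957 318
```

Both values agree with the record: 957 bp for the gene and 318 a.a. for the protein.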

Genomic Context


Location: 200828..211784
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E3C75_RS01025 proC 201383..202153 (+) 771 WP_011226616.1 pyrroline-5-carboxylate reductase -
  E3C75_RS01030 - 202183..202845 (-) 663 WP_096811680.1 type II CAAX endopeptidase family protein -
  E3C75_RS01035 - 202948..203391 (-) 444 WP_011681680.1 hypothetical protein -
  E3C75_RS11415 - 203465..203890 (-) 426 WP_111679733.1 CPBP family intramembrane glutamic endopeptidase -
  E3C75_RS01045 - 204133..204330 (-) 198 WP_014727723.1 helix-turn-helix transcriptional regulator -
  E3C75_RS01050 - 204579..205772 (-) 1194 WP_084826168.1 acetate kinase -
  E3C75_RS01055 comYH 205828..206784 (-) 957 WP_023909974.1 class I SAM-dependent methyltransferase Machinery gene
  E3C75_RS01060 comGG 206829..207146 (-) 318 WP_023909975.1 competence type IV pilus minor pilin ComGG -
  E3C75_RS01065 comYF 207124..207561 (-) 438 WP_084825661.1 competence type IV pilus minor pilin ComGF Machinery gene
  E3C75_RS01070 comGE 207545..207838 (-) 294 WP_100262274.1 competence type IV pilus minor pilin ComGE -
  E3C75_RS01075 comYD 207810..208238 (-) 429 WP_171815055.1 competence type IV pilus minor pilin ComGD Machinery gene
  E3C75_RS01080 comYC 208198..208524 (-) 327 WP_023909980.1 competence type IV pilus major pilin ComGC Machinery gene
  E3C75_RS01085 comYB 208521..209621 (-) 1101 WP_133264040.1 competence type IV pilus assembly protein ComGB Machinery gene
  E3C75_RS01090 comYA 209503..210444 (-) 942 WP_014622005.1 competence type IV pilus ATPase ComGA Machinery gene
  E3C75_RS01095 - 210525..210887 (-) 363 WP_111679734.1 DUF1033 family protein -
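
The rows of the table above can be parsed and sanity-checked programmatically. The sketch below is a minimal example under two assumptions that the page does not state: that columns are separated by whitespace in the order of the header line, and that the displayed location is the gene extended by 5000 bp on each side (which matches 200828..211784). Because the product and description columns both contain free text, they are kept together as one trailing field.

```python
import re

# locus tag, gene name, start..end, (strand), size, protein ID, trailing free text
ROW = re.compile(
    r"^\s*(\S+)\s+(\S+)\s+(\d+)\.\.(\d+)\s+\(([+-])\)\s+(\d+)\s+(\S+)\s+(.*)$"
)


def parse_row(line: str) -> dict:
    """Split one genomic-context row into named fields."""
    m = ROW.match(line)
    if m is None:
        raise ValueError(f"unrecognized row: {line!r}")
    locus, gene, start, end, strand, size, protein_id, rest = m.groups()
    return {
        "locus_tag": locus,
        "gene": gene,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "size_bp": int(size),
        "protein_id": protein_id,
        "product_and_description": rest.strip(),
    }


def context_window(start: int, end: int, flank: int = 5000) -> tuple:
    """Region around a gene; the 5000 bp flank is an assumption."""
    return (start - flank, end + flank)


row = parse_row(
    "  E3C75_RS01055 comYH 205828..206784 (-) 957 WP_023909974.1 "
    "class I SAM-dependent methyltransferase Machinery gene"
)
# The Size (bp) column should equal end - start + 1.
assert row["size_bp"] == row["end"] - row["start"] + 1
print(context_window(row["start"], row["end"]))  # (200828, 211784)
```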

Sequence


Protein


Length: 318 a.a.        Molecular weight: 35916.03 Da        Isoelectric Point: 4.6046

>NTDB_id=351378 E3C75_RS01055 WP_023909974.1 205828..206784(-) (comYH) [Streptococcus thermophilus strain ATCC 19258]
MNFEAIETAFELLLENVQTIENDLGTHAYDALIEQNSYYLGAEVANELIIKNNEKIRALNLSKEEWRRAFQFLFIKLGQL
EALQANHQFTPDAIGFIILYLLEGLTQEKQLDILEIGSGTGNLAETLLNNSQKTLNYMGMEVDDLLIDLSASIAEVVNSV
AVYIQEDAVRPHILKESNVIISDLPIGYYPNDEIASRFKVAATGEHTYAHHLLMEQSLKYLKKDGIAIFLAPTNLLTSPQ
SDLLKKWLSGYADIIAVITLPEAAFGNKHNMKSIFVLKKQTKDAPETFVYPLSDLQNPRVLKDFTENFQKWKSDNSIF
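
The FASTA header of this record packs several identifiers into one line. A minimal sketch of how it could be split into fields follows; the field layout is inferred from this single header, so the pattern and the `parse_header` helper are assumptions rather than a documented format.

```python
import re

# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER = re.compile(
    r"^>NTDB_id=(\d+)\s+(\S+)\s+(\S+)\s+(\d+)\.\.(\d+)\(([+-])\)\s+\((\S+)\)\s+\[(.+)\]$"
)


def parse_header(line: str) -> dict:
    """Extract identifiers and coordinates from a record header."""
    m = HEADER.match(line.strip())
    if m is None:
        raise ValueError("header does not match the assumed layout")
    ntdb_id, locus, protein_id, start, end, strand, gene, organism = m.groups()
    return {
        "ntdb_id": int(ntdb_id),
        "locus_tag": locus,
        "protein_id": protein_id,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "gene": gene,
        "organism": organism,
    }


header = (
    ">NTDB_id=351378 E3C75_RS01055 WP_023909974.1 205828..206784(-) "
    "(comYH) [Streptococcus thermophilus strain ATCC 19258]"
)
print(parse_header(header)["gene"])  # comYH
```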

Nucleotide


Length: 957 bp

>NTDB_id=351378 E3C75_RS01055 WP_023909974.1 205828..206784(-) (comYH) [Streptococcus thermophilus strain ATCC 19258]
ATGAATTTTGAAGCAATTGAGACAGCTTTTGAGCTGTTGTTAGAAAATGTCCAAACTATTGAAAATGATCTTGGAACCCA
TGCTTACGATGCACTTATTGAGCAAAATTCCTATTATTTGGGGGCTGAGGTTGCTAATGAGCTCATCATCAAAAACAACG
AGAAAATACGGGCGCTTAATCTAAGTAAAGAGGAGTGGCGTCGTGCTTTTCAGTTTTTGTTTATCAAACTAGGGCAATTG
GAAGCTTTACAAGCCAATCACCAATTTACACCAGATGCTATCGGATTTATCATTCTGTACTTGCTCGAAGGTTTGACCCA
GGAAAAACAATTAGATATCTTGGAGATTGGTTCGGGAACAGGAAACTTGGCTGAAACTCTTCTAAATAATAGTCAGAAAA
CCCTTAATTATATGGGGATGGAAGTTGATGATCTTCTTATCGATTTGTCAGCTAGTATTGCTGAGGTGGTGAATTCAGTA
GCGGTTTATATCCAAGAGGATGCTGTTCGACCACATATTCTCAAAGAGAGCAACGTTATTATCAGCGATTTACCTATAGG
TTACTACCCTAATGATGAGATTGCGAGTCGTTTCAAGGTGGCAGCAACTGGCGAACACACTTATGCCCATCATCTTCTTA
TGGAGCAATCGCTTAAGTATTTGAAGAAAGACGGTATTGCTATTTTTTTGGCACCAACCAATCTTTTGACAAGCCCTCAA
AGTGATCTGCTTAAGAAGTGGTTATCAGGATATGCTGATATTATTGCTGTTATTACTCTTCCAGAAGCAGCTTTTGGCAA
TAAACATAACATGAAGTCTATCTTTGTGCTAAAAAAACAAACTAAAGATGCTCCTGAGACCTTCGTTTACCCACTTAGCG
ATTTGCAAAATCCAAGGGTCCTCAAGGATTTTACAGAGAATTTCCAAAAATGGAAATCAGATAATTCCATTTTCTAG
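
Although the gene lies on the minus strand, the nucleotide record above is written on the coding strand: it starts with ATG and ends with the stop codon TAG. Translating it directly should therefore reproduce the protein record. The sketch below uses the standard codon assignments and, for brevity, only the first and last few codons of the sequence; the `translate` helper is illustrative and not part of the database.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAATTTTGAAGCAATTGAGACAGCT"))  # MNFEAIETA
print(translate("GATAATTCCATTTTCTAG"))           # DNSIF
```

The two outputs match the first nine and the last five residues of the protein sequence listed in this record.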

Domains


Predicted by InterProScan.

Predicted domain region: residues 69-292.


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159 69.206 99.057 0.686
  comYH Streptococcus mutans UA140 69.206 99.057 0.686

