Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYH
Type: Machinery gene
Locus tag: DK182_RS00935
Genome accession: NZ_CP029491
Coordinates: 150961..151914 (+)
Length: 317 a.a.
NCBI ID: WP_019773539.1
UniProt ID: -
Organism: Streptococcus sobrinus strain 10919
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 145961..156914
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
DK182_RS00885 (DK181_00885) | comGA/cglA | 146382..147329 (+) | 948 | WP_019777327.1 | competence type IV pilus ATPase ComGA | Machinery gene
DK182_RS00890 (DK181_00890) | comYB | 147253..148296 (+) | 1044 | WP_275425199.1 | competence type IV pilus assembly protein ComGB | Machinery gene
DK182_RS00895 (DK181_00895) | comYC | 148293..148619 (+) | 327 | WP_019777325.1 | competence type IV pilus major pilin ComGC | Machinery gene
DK182_RS00900 (DK181_00900) | comYD | 148573..149007 (+) | 435 | WP_254051091.1 | competence type IV pilus minor pilin ComGD | Machinery gene
DK182_RS00905 (DK181_00905) | comYE | 148979..149272 (+) | 294 | WP_002959510.1 | competence type IV pilus minor pilin ComGE | Machinery gene
DK182_RS00910 (DK181_00910) | comYF | 149256..149693 (+) | 438 | WP_002959513.1 | competence type IV pilus minor pilin ComGF | Machinery gene
DK182_RS00915 (DK181_00915) | comGG | 149671..150132 (+) | 462 | WP_002959515.1 | competence type IV pilus minor pilin ComGG | -
DK182_RS00925 (DK181_00925) | - | 150334..150600 (+) | 267 | WP_002959516.1 | type II toxin-antitoxin system RelE/ParE family toxin | -
DK182_RS00930 (DK181_00930) | - | 150587..150862 (+) | 276 | WP_019777324.1 | helix-turn-helix transcriptional regulator | -
DK182_RS00935 (DK181_00935) | comYH | 150961..151914 (+) | 954 | WP_019773539.1 | class I SAM-dependent methyltransferase | Machinery gene
DK182_RS00940 (DK181_00940) | - | 151977..153173 (+) | 1197 | WP_109833408.1 | acetate kinase | -
DK182_RS00945 (DK181_00945) | - | 153651..154556 (+) | 906 | WP_109833723.1 | LysR family transcriptional regulator | -
DK182_RS00950 (DK181_00950) | - | 154651..154917 (+) | 267 | WP_002959526.1 | ACT domain-containing protein | -
DK182_RS00955 (DK181_00955) | - | 154929..156266 (+) | 1338 | WP_002959528.1 | PFL family protein | -
DK182_RS00960 (DK181_00960) | - | 156506..156889 (-) | 384 | Protein_147 | hypothetical protein | -
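
The coordinate strings in this record can be cross-checked against the reported sizes. The following is a minimal sketch, not part of the database's own tooling: the function names are hypothetical, coordinates are assumed to be 1-based and inclusive, and the 5,000 bp flank is an inference from the fact that the displayed location (145961..156914) equals the comYH span extended by that amount on each side.

```python
import re

def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)

def gene_length_bp(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

start, end, strand = parse_coordinates("150961..151914 (+)")
length_bp = gene_length_bp(start, end)

# One codon is the stop codon, so it does not contribute a residue.
protein_length = length_bp // 3 - 1

# Assumed flank size (inferred from the displayed location, not documented).
FLANK_BP = 5000
window = (start - FLANK_BP, end + FLANK_BP)

print(length_bp, protein_length, window)
```

For comYH this gives 954 bp, 317 residues, and a window of 145961..156914, which agrees with the values listed in the record.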

Sequence


Protein


Length: 317 a.a.; Molecular weight: 35550.30 Da; Isoelectric point: 4.1336

>NTDB_id=293203 DK182_RS00935 WP_019773539.1 150961..151914(+) (comYH) [Streptococcus sobrinus strain 10919]
MNFETIEKAYELLLENVQLLQNDLKTNSYDALIEQNAIYLDGKTENKTVLANDQALCDLNLSKEEWRRAYQFLFIKLAQS
EPLQANHQFTPDSIGFVLLFLLENLTKEESLDLLEIGSGTGNLAQTLLNNSSKGLNYLGLEVDDLLIDLSASIADVVGSD
ASFVQEDAVRPSLLKESDIIVSDLPVGYYPDDAIASRYQVAAQDDHTYAHHLLMEQSLKYLKANGFAIFLAPVNLLTSPQ
SDLLKAWLKGYADIVAVITLPEELFGNPANAKSIFVLQKQTVQAPETFVYPLSDLQDRDKLLDFMENFKKWSAEYIL
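
The FASTA header used for this record packs several fields into one line. A small sketch of how it might be split into named fields is shown below; the regular expression and the field names are assumptions based only on the header shown here, not a documented format of the database.

```python
import re

# Assumed layout: >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)

def parse_header(line):
    """Split a header line of the assumed layout into a dict of fields."""
    match = HEADER_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"header does not match the assumed layout: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields

header = (
    ">NTDB_id=293203 DK182_RS00935 WP_019773539.1 "
    "150961..151914(+) (comYH) [Streptococcus sobrinus strain 10919]"
)
fields = parse_header(header)
print(fields["gene"], fields["locus_tag"], fields["organism"])
```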

Nucleotide


Length: 954 bp

>NTDB_id=293203 DK182_RS00935 WP_019773539.1 150961..151914(+) (comYH) [Streptococcus sobrinus strain 10919]
ATGAATTTTGAAACGATTGAAAAAGCTTATGAGCTGCTATTAGAGAATGTTCAGCTCTTGCAAAATGACTTGAAGACCAA
TAGTTACGATGCCTTGATTGAGCAAAATGCTATCTATCTGGATGGCAAAACTGAAAATAAGACGGTTTTGGCCAATGACC
AGGCCTTGTGTGACTTGAATTTGAGCAAGGAAGAGTGGCGGAGGGCCTACCAGTTTCTCTTTATTAAGCTGGCCCAGTCT
GAACCCTTGCAGGCCAACCACCAGTTTACGCCGGATAGTATTGGTTTTGTCCTGCTTTTCTTGCTGGAAAATCTGACCAA
GGAAGAAAGTCTGGACCTTCTGGAGATTGGCTCTGGAACGGGCAATCTAGCTCAAACCCTGCTCAATAACAGCAGTAAAG
GTCTGAATTATCTGGGGCTAGAAGTGGATGATTTGCTGATTGATTTGTCAGCCAGCATCGCTGATGTTGTGGGATCTGAC
GCTAGTTTCGTGCAGGAGGATGCGGTTCGTCCTTCCCTGCTCAAGGAGAGCGACATTATTGTCAGCGATCTTCCCGTTGG
CTATTATCCCGATGATGCTATTGCCAGTCGCTACCAGGTGGCTGCCCAGGATGACCACACCTATGCCCATCACCTGCTTA
TGGAGCAGTCGCTCAAGTACTTGAAGGCCAATGGTTTTGCCATTTTCTTGGCTCCAGTCAATCTTCTGACCAGTCCCCAG
AGTGATCTGCTCAAGGCTTGGCTCAAGGGTTATGCTGACATCGTGGCCGTCATTACCCTACCTGAGGAACTTTTTGGCAA
TCCGGCCAATGCCAAGTCTATCTTTGTCCTGCAGAAGCAGACTGTTCAGGCGCCCGAGACCTTCGTCTATCCGCTGAGTG
ATTTGCAAGACCGAGATAAGCTCTTGGATTTTATGGAAAATTTCAAGAAATGGTCAGCTGAATATATTCTTTGA
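
The nucleotide record can be checked against the protein record by translating it. The sketch below assumes the standard codon assignments and uses a hypothetical helper name, `translate`; only the first eleven codons and the last five codons of the sequence above are translated here, for brevity.

```python
from itertools import product

# Standard codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

first_codons = "ATGAATTTTGAAACGATTGAAAAAGCTTATGAG"  # start of the record
last_codons = "GAATATATTCTTTGA"                     # end of the record

n_terminus = translate(first_codons)
c_terminus = translate(last_codons)
print(n_terminus)  # MNFETIEKAYE
print(c_terminus)  # EYIL*
```

Both fragments agree with the start (MNFETIEKAYE) and end (EYIL) of the protein sequence listed above, with the final TGA codon acting as the stop.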

Domains


Predicted by InterProScan.

Predicted domain: residues 68-295


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comYH | Streptococcus mutans UA159 | 64.984 | 100 | 0.65
comYH | Streptococcus mutans UA140 | 64.669 | 100 | 0.647


Multiple sequence alignment