Detailed information    

in silico (bioinformatically predicted)

Overview


Name: comYH
Type: Machinery gene
Locus tag: GE023_RS00860
Genome accession: NZ_CP053789
Coordinates: 136772..137725 (+)
Length: 317 a.a.
NCBI ID: WP_003046195.1
UniProt ID: A0AAE4Q9T5
Organism: Streptococcus canis strain HL_98_2
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 131772..142725
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GE023_RS00820 (GE023_000820) - 132598..132963 (+) 366 WP_125073632.1 DUF1033 family protein -
  GE023_RS00825 (GE023_000825) comYA 133050..133991 (+) 942 WP_003046182.1 competence type IV pilus ATPase ComGA Machinery gene
  GE023_RS00830 (GE023_000830) comYB 133924..134958 (+) 1035 WP_223824829.1 competence type IV pilus assembly protein ComGB Machinery gene
  GE023_RS00835 (GE023_000835) comYC 134961..135287 (+) 327 WP_159305269.1 competence type IV pilus major pilin ComGC Machinery gene
  GE023_RS00840 (GE023_000840) comGD 135262..135690 (+) 429 WP_236494085.1 competence type IV pilus minor pilin ComGD -
  GE023_RS00845 (GE023_000845) comGE 135668..135943 (+) 276 WP_003046190.1 competence type IV pilus minor pilin ComGE -
  GE023_RS00850 (GE023_000850) comGF 135924..136364 (+) 441 WP_003046192.1 competence type IV pilus minor pilin ComGF -
  GE023_RS00855 (GE023_000855) comGG 136342..136722 (+) 381 WP_003046193.1 competence type IV pilus minor pilin ComGG -
  GE023_RS00860 (GE023_000860) comYH 136772..137725 (+) 954 WP_003046195.1 class I SAM-dependent methyltransferase Machinery gene
  GE023_RS00865 (GE023_000865) - 137784..138980 (+) 1197 WP_093998734.1 acetate kinase -
  GE023_RS00870 (GE023_000870) - 139300..139506 (+) 207 WP_003046200.1 helix-turn-helix transcriptional regulator -
  GE023_RS00875 (GE023_000875) - 139507..139647 (+) 141 Protein_126 CPBP family glutamic-type intramembrane protease -
  GE023_RS10810 - 139772..139927 (+) 156 WP_236493542.1 hypothetical protein -
  GE023_RS00885 (GE023_000885) proC 140233..141009 (-) 777 WP_159305271.1 pyrroline-5-carboxylate reductase -
  GE023_RS00890 (GE023_000890) pepA 141057..142124 (-) 1068 WP_003046205.1 glutamyl aminopeptidase -
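
The coordinates in this table read as 1-based and inclusive, so a feature's size is end - start + 1. The sketch below assumes that convention. It also assumes that the Location window shown above is the comYH gene extended by 5,000 bp on each side; this is inferred from the numbers in this record, not stated by it. The function names are illustrative.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def flanking_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Window extending `flank` bp on each side, clipped at position 1.

    The 5000 bp default is an assumption inferred from this record.
    """
    return max(1, start - flank), end + flank


com_yh = (136772, 137725)
print(feature_length(*com_yh))   # 954
print(flanking_window(*com_yh))  # (131772, 142725)
```

The same arithmetic reproduces the Size (bp) column for the other rows, for example 133050..133991 gives 942 bp for comYA.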

Sequence


Protein


Length: 317 a.a.        Molecular weight: 35915.02 Da        Isoelectric Point: 4.7866

>NTDB_id=447141 GE023_RS00860 WP_003046195.1 136772..137725(+) (comYH) [Streptococcus canis strain HL_98_2]
MNFEKIEEAYQLLLENSQLIENDLKTHIYDAIVEQNSFYLGAEGASPQVAQNIDKLKALCLTKEEWRRAYQFIFIKAAQT
EQLQANHQFTPDTIGFILLYLLEQLSDKDSLEVLEIGSGTGNLAQTLLNNTSKSLDYVGIELDDLLIDLSASIAEVMDSS
AHFIQEDAVRPQLLKESDVVISDLPVGYYPNDAIAKRYKVASSDEHTYAHHLLMEQSLKYLKKDGFAIFLAPVNLLTSPQ
SQLLKHWLKGYAQVVALITLPDTIFGHPSNAKSIIVLQKQTDRPMETFVYPIRDLKLAENIHALMDNFKKWKLDNVN

Nucleotide


Length: 954 bp

>NTDB_id=447141 GE023_RS00860 WP_003046195.1 136772..137725(+) (comYH) [Streptococcus canis strain HL_98_2]
ATGAATTTTGAAAAAATTGAAGAAGCTTATCAGTTGCTTTTAGAGAATAGCCAACTGATTGAAAATGACTTAAAAACCCA
TATTTATGATGCCATTGTTGAACAAAATTCCTTTTATTTAGGGGCAGAGGGAGCCAGTCCTCAGGTGGCTCAAAACATTG
ATAAACTGAAAGCCTTGTGCCTGACAAAAGAAGAATGGCGCAGAGCCTACCAGTTTATTTTTATTAAGGCAGCTCAGACT
GAACAACTGCAAGCCAACCATCAGTTCACACCAGATACTATTGGTTTTATTCTTCTCTATCTGTTGGAACAATTGAGCGA
TAAAGATAGCTTAGAGGTGCTTGAAATTGGGAGTGGAACAGGGAACCTAGCTCAAACCCTTCTCAACAACACGAGCAAGT
CTCTTGATTATGTAGGAATTGAACTTGATGATCTCTTGATTGATTTGTCAGCCAGCATTGCAGAGGTGATGGATTCTTCA
GCTCATTTTATTCAAGAAGATGCGGTAAGACCTCAATTATTGAAAGAAAGTGATGTTGTCATCAGTGACTTACCAGTTGG
CTATTATCCTAACGATGCTATTGCCAAACGTTACAAGGTGGCTAGTTCAGATGAGCATACGTATGCTCACCATTTATTAA
TGGAACAGTCTCTAAAATACTTGAAAAAAGATGGCTTTGCTATTTTTCTGGCACCAGTCAATTTATTGACGAGCCCTCAG
AGCCAGTTATTGAAACATTGGTTAAAAGGATATGCTCAGGTGGTGGCTCTAATTACTCTACCAGACACCATTTTTGGTCA
TCCATCCAATGCCAAGTCTATTATTGTCTTGCAAAAACAAACAGACCGCCCAATGGAAACCTTTGTTTATCCAATTCGGG
ACTTGAAACTTGCAGAGAATATTCATGCTCTTATGGATAATTTCAAAAAGTGGAAACTGGATAATGTCAATTAA
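
The nucleotide and protein records are consistent with each other: 954 bp is 318 codons, and dropping the terminal TAA stop codon leaves 317 residues. A minimal Python sketch of that check follows. It assumes the standard codon-to-amino-acid assignments, which the bacterial translation table 11 shares with the standard code. The `translate` helper is illustrative and is not part of this database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, dropping one trailing stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last codons of the comYH sequence shown above.
print(translate("ATGAATTTTGAAAAAATTGAA"))  # MNFEKIE
print(translate("CTGGATAATGTCAATTAA"))     # LDNVN
print(954 // 3 - 1)                        # 317 residues after removing the stop codon
```

Passing the full 954 bp sequence to `translate` should return the 317-residue protein listed above.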

Domains


Predicted by InterproScan.

(68-290)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140 66.772 99.685 0.666
  comYH Streptococcus mutans UA159 66.456 99.685 0.662
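
This record does not define the Ha-value. The two values listed are numerically consistent with the product of fractional identity and fractional coverage, and the sketch below assumes that definition. It is an inference from these two rows, not a documented formula, and the function name `ha_value` is illustrative.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


hits = [
    ("Streptococcus mutans UA140", 66.772, 99.685),
    ("Streptococcus mutans UA159", 66.456, 99.685),
]
for organism, identity, coverage in hits:
    print(organism, round(ha_value(identity, coverage), 3))
```

Under this assumption the two hits round to 0.666 and 0.662, matching the listed values.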