Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   SMA_0099 Genome accession   HE613569
Coordinates   105311..106267 (+) Length   318 a.a.
NCBI ID   CCF01390.1    Uniprot ID   -
Organism   Streptococcus macedonicus ACA-DC 198     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 100311..111267
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SMA_0091 - 101229..101558 (+) 330 CCF01382.1 DNA binding protein -
  SMA_0092 comYA 101668..102609 (+) 942 CCF01383.1 Late competence protein ComGA, access of DNA to ComEA Machinery gene
  SMA_0093 comYB 102491..103576 (+) 1086 CCF01384.1 Late competence protein ComGB, access of DNA to ComEA Machinery gene
  SMA_0094 comYC 103576..103869 (+) 294 CCF01385.1 Late competence protein ComGC, access of DNA to ComEA Machinery gene
  SMA_0095 comYD 103853..104284 (+) 432 CCF01386.1 Late competence protein ComGD, access of DNA to ComEA Machinery gene
  SMA_0096 comGE 104289..104531 (+) 243 CCF01387.1 Late competence protein ComGE -
  SMA_0097 comYF 104485..104952 (+) 468 CCF01388.1 Late competence protein ComGF, access of DNA to ComEA Machinery gene
  SMA_0098 comYG 104924..105229 (+) 306 CCF01389.1 Late competence protein ComGG Machinery gene
  SMA_0099 comYH 105311..106267 (+) 957 CCF01390.1 Adenine-specific methyltransferase Machinery gene
  SMA_0100 ackA 106320..107519 (+) 1200 CCF01391.1 Acetate kinase -
  SMA_0101 - 107681..107881 (+) 201 CCF01392.1 Transcriptional regulator, Cro/CI family -
  SMA_0102 - 107891..108100 (+) 210 CCF01393.1 Hypothetical protein -
  SMA_0103 - 108113..108565 (+) 453 CCF01394.1 Hypothetical protein -
  SMA_0104 - 108577..109212 (+) 636 CCF01395.1 Membrane-bound protease, CAAX family -
  SMA_0105 comR 109391..110290 (+) 900 CCF01396.1 Transcriptional regulator Regulator
  - comS 110367..110411 (+) 45 - - Regulator

Sequence


Protein


Download         Length: 318 a.a.        Molecular weight: 36058.98 Da        Isoelectric Point: 4.3705

>NTDB_id=20327 SMA_0099 CCF01390.1 105311..106267(+) (comYH) [Streptococcus macedonicus ACA-DC 198]
MNFEKIETAYELILENIQLIENELKTHIYDALIEQNSFYLGAEGASEEVAANNEKLRQLALTKEEWRRAFQFIFIKAGQT
EQLQANHQFTPDAIGFILLFLIENLTDSDKIDLLEIGSGTGNLAQTLLNNSSKELNYLGIEVDDLLIDLSASIAEVLDSD
AQFIQEDAVRPQILKESDVIISDLPVGFYPNDDIAKRYKVANSDEHTYAHHLLMEQSLKYLKKDGIAVFLAPVSLLTSKQ
SDLLKQWLKDYADIIAVITLPESIFGNAANAKSIFVLKKQAAYTPETFVYPLSDLQSREALTDFIRKFQKWKIDNINF

Nucleotide


Download         Length: 957 bp        

>NTDB_id=20327 SMA_0099 CCF01390.1 105311..106267(+) (comYH) [Streptococcus macedonicus ACA-DC 198]
ATGAATTTTGAAAAAATTGAAACAGCCTATGAGCTGATTTTAGAAAATATCCAATTAATTGAAAATGAGTTAAAAACTCA
TATTTATGATGCGCTTATTGAACAGAATTCTTTTTACTTGGGGGCTGAAGGTGCCAGTGAAGAAGTTGCTGCCAACAATG
AAAAACTACGTCAGCTTGCATTGACCAAAGAAGAGTGGCGTCGAGCTTTCCAATTTATCTTTATTAAAGCTGGTCAAACA
GAGCAGCTGCAAGCCAATCATCAATTTACACCAGATGCTATTGGTTTTATTTTGCTGTTCTTGATTGAAAATCTGACAGA
TTCAGATAAAATTGATCTTTTAGAAATTGGTAGTGGGACAGGAAACCTTGCTCAAACATTGTTAAATAATTCGTCTAAAG
AATTAAATTATCTTGGTATTGAAGTTGACGATTTATTGATTGATTTATCAGCAAGTATTGCAGAAGTGTTGGATTCTGAT
GCTCAGTTTATTCAAGAAGATGCTGTGCGTCCACAAATTCTGAAAGAAAGTGATGTGATTATTAGTGATTTGCCAGTTGG
TTTTTATCCTAATGATGACATTGCCAAACGTTATAAAGTGGCAAATTCTGATGAGCATACTTATGCCCATCATTTATTGA
TGGAACAATCGTTAAAATATCTCAAAAAAGATGGTATTGCAGTCTTTTTGGCGCCCGTCAGTCTTTTGACAAGTAAGCAA
AGTGATTTATTGAAACAATGGTTGAAAGATTACGCGGATATTATCGCCGTGATTACCTTGCCAGAATCTATTTTTGGTAA
TGCAGCGAATGCAAAATCAATTTTTGTTTTGAAAAAACAGGCTGCGTATACGCCAGAAACCTTTGTTTATCCACTTTCTG
ACTTACAAAGTCGTGAAGCTCTGACTGATTTCATTAGAAAATTTCAAAAATGGAAAATTGATAATATTAATTTTTAA

Domains


Predicted by InterproScan.

(71-298)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

69.304

99.371

0.689

  comYH Streptococcus mutans UA159

68.987

99.371

0.686


Multiple sequence alignment