Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   ATM98_RS07975 Genome accession   NZ_CP013651
Coordinates   1638254..1639207 (+) Length   317 a.a.
NCBI ID   WP_061564649.1    Uniprot ID   -
Organism   Streptococcus sp. A12     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1633254..1644207
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ATM98_RS07935 (ATM98_07935) - 1634019..1634396 (+) 378 WP_061564644.1 DUF1033 family protein -
  ATM98_RS07940 (ATM98_07940) comYA 1634502..1635443 (+) 942 WP_048691344.1 competence type IV pilus ATPase ComGA Machinery gene
  ATM98_RS07945 (ATM98_07945) comYB 1635385..1636407 (+) 1023 WP_061564645.1 competence type IV pilus assembly protein ComGB Machinery gene
  ATM98_RS07950 (ATM98_07950) comYC 1636404..1636721 (+) 318 WP_023022076.1 competence type IV pilus major pilin ComGC Machinery gene
  ATM98_RS07955 (ATM98_07955) comGD/cglD 1636690..1637115 (+) 426 WP_061564646.1 competence type IV pilus minor pilin ComGD Machinery gene
  ATM98_RS07960 (ATM98_07960) comGE 1637081..1637374 (+) 294 WP_061564647.1 competence type IV pilus minor pilin ComGE -
  ATM98_RS07965 (ATM98_07965) comGF/cglF 1637358..1637810 (+) 453 WP_061564817.1 competence type IV pilus minor pilin ComGF Machinery gene
  ATM98_RS07970 (ATM98_07970) comGG 1637776..1638222 (+) 447 WP_061564648.1 competence type IV pilus minor pilin ComGG -
  ATM98_RS07975 (ATM98_07975) comYH 1638254..1639207 (+) 954 WP_061564649.1 class I SAM-dependent methyltransferase Machinery gene
  ATM98_RS07980 (ATM98_07980) - 1639256..1640452 (+) 1197 WP_048789482.1 acetate kinase -
  ATM98_RS07985 (ATM98_07985) - 1640604..1641278 (+) 675 WP_223363686.1 type II CAAX endopeptidase family protein -
  ATM98_RS07990 (ATM98_07990) folP 1641438..1642394 (+) 957 WP_061564651.1 dihydropteroate synthase -
  ATM98_RS07995 (ATM98_07995) - 1642411..1643715 (+) 1305 WP_061564652.1 folylpolyglutamate synthase/dihydrofolate synthase family protein -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 36018.76 Da        Isoelectric Point: 4.2243

>NTDB_id=163427 ATM98_RS07975 WP_061564649.1 1638254..1639207(+) (comYH) [Streptococcus sp. A12]
MNFEKIEKAYGYLLENTQTIQNDLQTNFYDALVEQNAIYLDGQTELTLVKENNQRLRDLNLNKEEWRRSFQYLLMKAAQT
EPLQANHQFTPDGIGFLLVFLVDQLASSDQVDVLEMGSGTGNLAQTLMNNCQRSLDYLGLEIDDLLIDLAASMAEVMKAD
VNFAQGDAVRPQVLKESDVIVSDLPVGYYPDDAIASRYQVASPQGNTYAHHLLIEQSLKYLKPGGVAIFLAPNDLLTSEQ
SPLLKQWMQDHAQVLAMVTLPENLFRSANLAKTIFVLRKQEEAVVQPFVYPLTDLQDQEDLMKFRESFQNWNKESEI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=163427 ATM98_RS07975 WP_061564649.1 1638254..1639207(+) (comYH) [Streptococcus sp. A12]
ATGAATTTCGAAAAAATTGAGAAAGCCTACGGCTACCTATTAGAAAATACCCAAACTATCCAAAATGATTTGCAGACCAA
CTTTTATGATGCTCTAGTGGAGCAGAATGCCATCTATCTGGATGGGCAAACAGAGTTGACTCTAGTGAAGGAAAACAATC
AGCGGCTGAGGGACTTGAACTTGAACAAGGAAGAATGGCGTCGCTCCTTCCAGTATCTTTTGATGAAGGCTGCTCAAACA
GAGCCTCTACAAGCCAATCACCAATTTACGCCAGATGGGATTGGATTTCTTCTGGTCTTTCTAGTGGATCAGTTGGCTAG
TTCCGATCAAGTAGATGTGCTAGAAATGGGAAGTGGAACGGGGAACTTGGCCCAAACCTTGATGAACAACTGTCAGCGCT
CCTTAGATTATTTGGGCTTGGAAATCGATGATCTCTTGATTGACCTTGCGGCCAGTATGGCAGAAGTGATGAAGGCGGAT
GTGAATTTTGCCCAAGGTGATGCCGTTCGGCCACAGGTTTTGAAGGAGAGCGATGTGATTGTCAGCGATTTACCTGTCGG
TTATTATCCAGATGATGCCATTGCGAGTCGTTACCAGGTCGCTTCCCCTCAGGGCAACACCTATGCCCATCATTTATTGA
TCGAACAATCGCTAAAATACTTAAAACCAGGTGGTGTCGCTATTTTTCTAGCTCCGAATGATCTTTTGACGAGCGAGCAG
AGTCCTCTGTTGAAACAATGGATGCAGGATCATGCTCAGGTCTTGGCCATGGTGACCTTGCCAGAGAACCTCTTTCGATC
AGCCAATCTAGCAAAAACCATCTTTGTTTTACGCAAGCAAGAAGAAGCGGTGGTCCAACCATTTGTCTATCCCTTGACCG
ATTTGCAAGATCAGGAAGACCTTATGAAATTCCGTGAAAGTTTTCAAAACTGGAATAAAGAAAGTGAAATTTAA

Domains


Predicted by InterproScan.

(70-290)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

59.048

99.369

0.587

  comYH Streptococcus mutans UA159

58.73

99.369

0.584


Multiple sequence alignment