Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   SPG_1959 Genome accession   CP001015
Coordinates   1859064..1860017 (-) Length   317 a.a.
NCBI ID   ACF56004.1    Uniprot ID   -
Organism   Streptococcus pneumoniae G54     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1854064..1865017
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SPG_1953 - 1854825..1855448 (-) 624 ACF55028.1 conserved hypothetical protein -
  SPG_1954 - 1855499..1856485 (-) 987 ACF56836.1 hypothetical protein -
  SPG_1955 - 1856504..1857328 (-) 825 ACF56295.1 SpoIIIJ family protein -
  SPG_1956 rnpA 1857303..1857674 (-) 372 ACF55368.1 ribonuclease P protein component -
  SPG_1957 - 1857691..1857822 (-) 132 ACF54788.1 conserved hypothetical protein -
  SPG_1958 ackA 1857823..1859013 (-) 1191 ACF56573.1 acetate kinase -
  SPG_1959 comYH 1859064..1860017 (-) 954 ACF56004.1 conserved hypothetical protein Machinery gene
  SPG_1960 - 1860078..1860458 (-) 381 ACF56765.1 hypothetical protein -
  SPG_1961 - 1860490..1860672 (-) 183 ACF55149.1 conserved hypothetical protein -
  SPG_1962 comGG/cglG 1860809..1861222 (-) 414 ACF54761.1 conserved hypothetical protein Machinery gene
  SPG_1963 comGF/cglF 1861200..1861661 (-) 462 ACF55334.1 conserved hypothetical protein Machinery gene
  SPG_1964 comGE/cglE 1861624..1861926 (-) 303 ACF56096.1 hypothetical protein Machinery gene
  SPG_1965 comGD/cglD 1861889..1862293 (-) 405 ACF56539.1 competence protein CglD Machinery gene
  SPG_1966 comGC/cglC 1862286..1862612 (-) 327 ACF55098.1 competence protein CglC Machinery gene
  SPG_1967 comGB/cglB 1862614..1863630 (-) 1017 ACF55582.1 competence protein CglB Machinery gene
  SPG_1968 comGA/cglA/cilD 1863578..1864519 (-) 942 ACF55208.1 competence protein CglA Machinery gene
  SPG_1969 - 1864595..1864960 (-) 366 ACF55572.1 conserved hypothetical protein -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 35717.95 Da        Isoelectric Point: 4.2995

>NTDB_id=20231 SPG_1959 ACF56004.1 1859064..1860017(-) (comYH) [Streptococcus pneumoniae G54]
MDFEKIEQAYTYLLENVQVIQSDLATNFYDALVEQNSIYLDGETELEQVKDNNQTLKRLALRKEEWLKTYQFLLMKAGQT
EPLQANHQFTPDAIALLLVFIVEELFKEEEITILEMGSGMGILGATFLTSLTKKVDYLGMEVDDLLIDLAASMADVIGLQ
AGFVQGDAVRPQMLKESDVVISDLPVGYYPDDAVASRHQVASSQEHTYTHHLLMEQGLKYLKSDGYAIFLAPSDLLTSPQ
SNLLKVWLKEEASLVAMISLPENLFANAKQSKTIFILQKKSEIAVEPFVYPLASLQDASVLMKFKENFQKWTQGTEI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=20231 SPG_1959 ACF56004.1 1859064..1860017(-) (comYH) [Streptococcus pneumoniae G54]
ATGGATTTTGAAAAAATTGAACAAGCTTATACCTATTTACTAGAGAATGTCCAAGTCATCCAAAGTGATTTGGCGACCAA
CTTTTATGACGCCTTGGTGGAGCAAAACAGCATCTATCTGGATGGTGAGACTGAGCTAGAACAGGTCAAAGACAACAATC
AGACCCTTAAGCGTTTAGCACTACGCAAAGAAGAATGGCTCAAGACCTACCAGTTTCTCTTGATGAAGGCTGGGCAAACA
GAACCCTTGCAGGCCAATCACCAGTTTACACCGGATGCTATTGCTTTACTTTTGGTGTTTATTGTGGAAGAGCTGTTTAA
AGAGGAGGAAATTACTATCCTCGAAATGGGTTCTGGGATGGGAATTTTGGGCGCTACTTTCTTGACCTCGCTTACTAAAA
AGGTGGATTATTTGGGAATGGAAGTGGATGATTTGCTGATTGATTTGGCAGCTAGCATGGCAGATGTAATTGGTTTGCAG
GCTGGCTTTGTCCAAGGAGATGCCGTTCGTCCACAAATGCTCAAAGAAAGTGATGTGGTCATCAGCGACTTGCCTGTTGG
CTATTATCCTGATGATGCCGTTGCGTCGCGCCATCAAGTTGCTTCTAGTCAAGAACATACTTACACCCATCACTTGCTCA
TGGAACAAGGACTTAAGTACCTCAAGTCAGACGGATACGCTATTTTTCTAGCTCCGAGTGATTTGTTGACCAGTCCTCAA
AGTAATTTGTTGAAAGTCTGGTTGAAAGAGGAGGCAAGTCTGGTTGCTATGATTAGTTTACCTGAAAATCTCTTTGCTAA
TGCTAAACAATCTAAGACTATTTTTATCCTACAGAAGAAAAGTGAAATAGCAGTAGAGCCTTTTGTTTATCCACTTGCTA
GCTTGCAAGATGCAAGTGTTTTAATGAAATTTAAAGAAAATTTTCAAAAATGGACTCAAGGTACTGAAATATAA

Domains


Predicted by InterproScan.

(69-282)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

54.633

98.738

0.539

  comYH Streptococcus mutans UA159

54.313

98.738

0.536


Multiple sequence alignment