Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   AAAF57_RS00830 Genome accession   NZ_CP150847
Coordinates   157330..158286 (+) Length   318 a.a.
NCBI ID   WP_227071975.1    Uniprot ID   -
Organism   Streptococcus salivarius strain KSS5     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 152330..163286
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AAAF57_RS00790 (AAAF57_00790) - 153228..153590 (+) 363 WP_014635121.1 DUF1033 family protein -
  AAAF57_RS00795 (AAAF57_00795) comGA/cglA/cilD 153670..154611 (+) 942 WP_002887018.1 competence type IV pilus ATPase ComGA Machinery gene
  AAAF57_RS00800 (AAAF57_00800) comYB 154493..155593 (+) 1101 WP_148263031.1 competence type IV pilus assembly protein ComGB Machinery gene
  AAAF57_RS00805 (AAAF57_00805) comYC 155602..155916 (+) 315 WP_037611125.1 competence type IV pilus major pilin ComGC Machinery gene
  AAAF57_RS00810 (AAAF57_00810) comYD 155876..156304 (+) 429 WP_270334813.1 competence type IV pilus minor pilin ComGD Machinery gene
  AAAF57_RS00815 (AAAF57_00815) comGE 156336..156566 (+) 231 WP_021144685.1 competence type IV pilus minor pilin ComGE -
  AAAF57_RS00820 (AAAF57_00820) comYF 156553..156990 (+) 438 WP_227071977.1 competence type IV pilus minor pilin ComGF Machinery gene
  AAAF57_RS00825 (AAAF57_00825) comGG 156968..157285 (+) 318 WP_227071976.1 competence type IV pilus minor pilin ComGG -
  AAAF57_RS00830 (AAAF57_00830) comYH 157330..158286 (+) 957 WP_227071975.1 class I SAM-dependent methyltransferase Machinery gene
  AAAF57_RS00835 (AAAF57_00835) - 158343..159536 (+) 1194 WP_002885831.1 acetate kinase -
  AAAF57_RS00840 (AAAF57_00840) - 159789..159986 (+) 198 WP_002885716.1 helix-turn-helix transcriptional regulator -
  AAAF57_RS00845 (AAAF57_00845) - 159998..160594 (+) 597 WP_410536720.1 CPBP family intramembrane glutamic endopeptidase -
  AAAF57_RS00850 (AAAF57_00850) - 160734..161180 (+) 447 WP_073686090.1 CAAX protease -
  AAAF57_RS00855 (AAAF57_00855) - 161293..161955 (+) 663 WP_002885821.1 CPBP family intramembrane glutamic endopeptidase -
  AAAF57_RS00860 (AAAF57_00860) proC 161982..162752 (-) 771 WP_073686089.1 pyrroline-5-carboxylate reductase -

Sequence


Protein


Download         Length: 318 a.a.        Molecular weight: 35772.70 Da        Isoelectric Point: 4.4586

>NTDB_id=972940 AAAF57_RS00830 WP_227071975.1 157330..158286(+) (comYH) [Streptococcus salivarius strain KSS5]
MNFEAIETAFELLLENVQTIENDLGTHAYDALIEQNSYYLGAEVANEVIIKNNEKLRALNLSKEEWRRAFQFLFIKLGQL
EALQANHQFTPDAIGFIILYLLEGLTKDDQLDVLEIGSGTGNLAETLLNNSQKNLNYMGMEVDDLLIDLSASIADVVNSS
AVYIQEDAVRPHILKESDVIISDLPVGYYPNDEIASRFKVAATGEHTYAHHLLMEQSLKYLKKDGIAILLAPTNLLTSPQ
SDLLKKWLSGYADIIAVITLPEAAFGNKHNMKSIFVLKKQTENAPETFVYPLSDLQNPKVLKDFTENFQKWKSDNSIF

Nucleotide


Download         Length: 957 bp        

>NTDB_id=972940 AAAF57_RS00830 WP_227071975.1 157330..158286(+) (comYH) [Streptococcus salivarius strain KSS5]
ATGAATTTTGAAGCAATTGAGACAGCCTTTGAGCTGTTGTTAGAAAATGTCCAAACCATTGAAAATGATCTTGGAACCCA
TGCTTACGATGCGCTTATTGAGCAAAATTCCTATTATTTGGGGGCTGAGGTAGCTAATGAAGTCATCATCAAAAACAACG
AGAAATTACGTGCCCTTAATCTAAGCAAAGAGGAGTGGCGTCGTGCTTTTCAGTTCTTGTTTATCAAGCTAGGGCAATTG
GAAGCCTTACAAGCCAATCACCAGTTTACACCTGATGCTATTGGATTTATCATTCTTTACCTACTTGAAGGTTTGACCAA
GGACGACCAATTAGATGTTTTGGAGATTGGTTCGGGGACAGGAAACTTGGCTGAAACTCTTCTAAATAATAGTCAGAAAA
ACCTTAATTATATGGGAATGGAAGTTGACGATCTCCTTATCGATTTGTCAGCTAGTATTGCTGATGTGGTGAATTCAAGT
GCAGTTTATATCCAAGAAGATGCTGTTCGACCACATATTCTCAAAGAGAGTGATGTTATTATTAGTGACCTACCTGTTGG
TTACTACCCTAATGATGAAATTGCGAGTCGTTTCAAGGTTGCAGCAACTGGTGAACACACCTATGCTCATCACCTTCTCA
TGGAGCAATCACTCAAGTACTTGAAGAAAGATGGTATTGCTATTCTTTTGGCACCAACTAATCTTTTGACAAGTCCACAA
AGTGATTTGCTTAAGAAATGGCTATCAGGATATGCTGATATTATTGCAGTTATCACTCTTCCAGAAGCAGCTTTTGGCAA
TAAACATAACATGAAGTCTATCTTTGTGCTTAAAAAACAAACTGAAAATGCTCCTGAGACCTTTGTTTATCCACTTAGTG
ATTTGCAAAACCCAAAGGTCCTCAAAGATTTTACAGAGAATTTCCAAAAATGGAAATCAGATAATTCCATTTTCTAG

Domains


Predicted by InterproScan.

(69-296)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159

70.159

99.057

0.695

  comYH Streptococcus mutans UA140

70.159

99.057

0.695


Multiple sequence alignment