Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   NSQ79_RS09530 Genome accession   NZ_CP150200
Coordinates   2042045..2043001 (-) Length   318 a.a.
NCBI ID   WP_138261573.1    Uniprot ID   A0AAW7PI86
Organism   Streptococcus sp. FSL W7-1342     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2037045..2048001
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NSQ79_RS09500 (NSQ79_09500) proC 2037583..2038353 (+) 771 WP_002885846.1 pyrroline-5-carboxylate reductase -
  NSQ79_RS09505 (NSQ79_09505) - 2038380..2039042 (-) 663 WP_111686674.1 type II CAAX endopeptidase family protein -
  NSQ79_RS09510 (NSQ79_09510) - 2039155..2039601 (-) 447 WP_013991259.1 hypothetical protein -
  NSQ79_RS09515 (NSQ79_09515) - 2039679..2040335 (-) 657 WP_037601483.1 type II CAAX endopeptidase family protein -
  NSQ79_RS09520 (NSQ79_09520) - 2040347..2040544 (-) 198 WP_002885716.1 helix-turn-helix transcriptional regulator -
  NSQ79_RS09525 (NSQ79_09525) - 2040795..2041988 (-) 1194 WP_138261572.1 acetate kinase -
  NSQ79_RS09530 (NSQ79_09530) comYH 2042045..2043001 (-) 957 WP_138261573.1 class I SAM-dependent methyltransferase Machinery gene
  NSQ79_RS09535 (NSQ79_09535) comGG 2043046..2043363 (-) 318 WP_021144683.1 competence type IV pilus minor pilin ComGG -
  NSQ79_RS09540 (NSQ79_09540) comYF 2043341..2043778 (-) 438 WP_045769383.1 competence type IV pilus minor pilin ComGF Machinery gene
  NSQ79_RS09545 (NSQ79_09545) comGE 2043765..2043995 (-) 231 WP_002887015.1 competence type IV pilus minor pilin ComGE -
  NSQ79_RS09550 (NSQ79_09550) comYD 2044027..2044455 (-) 429 WP_014632541.1 competence type IV pilus minor pilin ComGD Machinery gene
  NSQ79_RS09555 (NSQ79_09555) comYC 2044415..2044729 (-) 315 WP_037611125.1 competence type IV pilus major pilin ComGC Machinery gene
  NSQ79_RS09560 (NSQ79_09560) comYB 2044738..2045838 (-) 1101 WP_138261574.1 competence type IV pilus assembly protein ComGB Machinery gene
  NSQ79_RS09565 (NSQ79_09565) comGA/cglA/cilD 2045720..2046661 (-) 942 WP_002887018.1 competence type IV pilus ATPase ComGA Machinery gene
  NSQ79_RS09570 (NSQ79_09570) - 2046741..2047103 (-) 363 WP_049529260.1 DUF1033 family protein -

Sequence


Protein


Download         Length: 318 a.a.        Molecular weight: 35779.69 Da        Isoelectric Point: 4.3808

>NTDB_id=966127 NSQ79_RS09530 WP_138261573.1 2042045..2043001(-) (comYH) [Streptococcus sp. FSL W7-1342]
MNFEAIETAFELLLENVQTIENDLGTHAYDALIEQNSYYLGAEVANEVIIKNNEKLRALNLSKEEWRRAFQFLFIKLGQL
EALQANHQFTPDAIGFIILYLLEGLTKDDQLDVLEIGSGTGNLAETLLNNSQKTLNYMGMEVDDLLIDLSASIADVVNSS
AVYIQEDAVRPHILKESDVIISDLPVGYYPNDEIASRFKVAATGEHTYAHHLLMEQSLKYLKKDGIAILLAPTNLLTSPQ
SDLLKKWLSGYADIIAVITLPEAAFGNKRNMKSIFVLKKQTENAPETFVYPLSDLQNPEVLKDFTENFQKWKSDNSIF

Nucleotide


Download         Length: 957 bp        

>NTDB_id=966127 NSQ79_RS09530 WP_138261573.1 2042045..2043001(-) (comYH) [Streptococcus sp. FSL W7-1342]
ATGAATTTTGAAGCAATTGAGACAGCCTTTGAGCTGTTGTTAGAAAATGTCCAAACCATTGAAAATGATCTTGGAACCCA
TGCTTACGATGCGCTTATTGAGCAAAATTCCTATTATTTGGGGGCTGAGGTAGCTAATGAAGTCATCATCAAAAACAACG
AGAAATTACGTGCCCTTAATCTAAGCAAAGAGGAGTGGCGTCGCGCTTTTCAGTTCTTGTTTATCAAGCTAGGGCAATTG
GAAGCCTTACAAGCCAATCACCAATTTACACCTGATGCTATTGGATTTATCATTCTTTACCTACTTGAAGGGTTGACCAA
GGACGACCAATTAGATGTTTTGGAGATTGGTTCGGGGACAGGAAACTTGGCTGAGACTCTTCTAAATAATAGTCAGAAAA
CCCTTAATTATATGGGAATGGAAGTTGACGATCTCCTTATCGATTTGTCAGCTAGTATTGCTGATGTGGTGAATTCAAGT
GCAGTTTACATCCAAGAAGATGCTGTTCGACCACACATTCTCAAAGAGAGTGATGTTATTATTAGTGACCTACCTGTTGG
TTACTACCCTAATGATGAGATTGCGAGTCGTTTCAAGGTTGCAGCAACTGGTGAACACACCTATGCCCATCACCTTCTCA
TGGAGCAATCACTCAAGTACTTGAAGAAAGATGGTATTGCTATTCTTTTGGCACCAACTAATCTTTTAACAAGCCCACAA
AGTGATTTGCTTAAGAAATGGCTGTCAGGCTATGCTGATATTATTGCAGTTATCACTCTTCCAGAAGCAGCTTTTGGCAA
TAAACGTAACATGAAGTCTATCTTTGTGCTTAAAAAACAAACTGAAAATGCTCCTGAGACCTTTGTTTATCCACTTAGCG
ATTTGCAAAACCCAGAGGTCCTCAAAGATTTTACAGAGAATTTTCAAAAATGGAAATCAGATAATTCCATTTTCTAG

Domains


Predicted by InterproScan.

(69-295)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159

70.476

99.057

0.698

  comYH Streptococcus mutans UA140

70.476

99.057

0.698


Multiple sequence alignment