Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   QLH56_RS04205 Genome accession   NZ_CP125107
Coordinates   901542..902498 (-) Length   318 a.a.
NCBI ID   WP_003098815.1    Uniprot ID   -
Organism   Streptococcus iniae strain DFSM220524     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 896542..907498
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QLH56_RS04175 (QLH56_04175) pepA 896937..898004 (+) 1068 Protein_809 glutamyl aminopeptidase -
  QLH56_RS04180 (QLH56_04180) proC 898028..898798 (+) 771 WP_003098824.1 pyrroline-5-carboxylate reductase -
  QLH56_RS04185 (QLH56_04185) - 898832..899488 (-) 657 WP_003098822.1 type II CAAX endopeptidase family protein -
  QLH56_RS04190 (QLH56_04190) - 899498..899950 (-) 453 WP_003098820.1 hypothetical protein -
  QLH56_RS04195 (QLH56_04195) - 899963..900166 (-) 204 WP_003098819.1 helix-turn-helix transcriptional regulator -
  QLH56_RS04200 (QLH56_04200) - 900289..901485 (-) 1197 WP_003098816.1 acetate kinase -
  QLH56_RS04205 (QLH56_04205) comYH 901542..902498 (-) 957 WP_003098815.1 class I SAM-dependent methyltransferase Machinery gene
  QLH56_RS04210 (QLH56_04210) comGG 902560..902913 (-) 354 WP_003098812.1 competence type IV pilus minor pilin ComGG -
  QLH56_RS04215 (QLH56_04215) comGF 902903..903283 (-) 381 WP_003098811.1 competence type IV pilus minor pilin ComGF -
  QLH56_RS04220 (QLH56_04220) comGE 903312..903569 (-) 258 WP_003098809.1 competence type IV pilus minor pilin ComGE -
  QLH56_RS04225 (QLH56_04225) comGD 903577..904005 (-) 429 WP_003098807.1 competence type IV pilus minor pilin ComGD -
  QLH56_RS04230 (QLH56_04230) comGC/cglC 903965..904291 (-) 327 WP_003098806.1 competence type IV pilus major pilin ComGC Machinery gene
  QLH56_RS04235 (QLH56_04235) comYB 904292..905308 (-) 1017 WP_003098804.1 competence type IV pilus assembly protein ComGB Machinery gene
  QLH56_RS04240 (QLH56_04240) comYA 905259..906200 (-) 942 WP_003098801.1 competence type IV pilus ATPase ComGA Machinery gene
  QLH56_RS04245 (QLH56_04245) - 906261..906626 (-) 366 WP_003098799.1 DUF1033 family protein -

Sequence


Protein


Download         Length: 318 a.a.        Molecular weight: 35968.86 Da        Isoelectric Point: 4.5568

>NTDB_id=829953 QLH56_RS04205 WP_003098815.1 901542..902498(-) (comYH) [Streptococcus iniae strain DFSM220524]
MNFENIEKAFELILENSQLIENELKTHIYDALIEQNSFYLGAEGASEQVAKNNEVLRSLNLSKEEWRRAFQFIFIKVGQT
EKLQANHQFTPDSLGFLILFLIETLTQEDSLDILEIGSGTGNLAQTLLNNSGKQLDYLGIEVDDLLIDLSASIAEIMNST
ARFVQEDAVRPQILKESHIIISDLPVGYYPNDEIASRYQVASPDGHTYAHHLLMEQALKYLKQDGFAIFLAPASLLQSQQ
SHLLKEWLKGYAQLSAVITLPETFFGDPSVAKSLIVLQKQSDKKGETFVYPLTDLQSADKVRLFMENFKKWKADNVFS

Nucleotide


Download         Length: 957 bp        

>NTDB_id=829953 QLH56_RS04205 WP_003098815.1 901542..902498(-) (comYH) [Streptococcus iniae strain DFSM220524]
ATGAATTTTGAAAATATTGAGAAAGCTTTTGAGCTTATTTTAGAAAATAGTCAACTTATTGAAAATGAGTTGAAAACGCA
CATTTATGATGCCCTTATTGAGCAGAATTCTTTTTATTTGGGAGCTGAAGGTGCTAGTGAGCAGGTCGCTAAAAACAATG
AAGTTTTAAGGTCACTCAATTTAAGTAAAGAAGAGTGGCGTCGCGCTTTTCAGTTTATTTTTATCAAGGTTGGTCAAACA
GAAAAACTGCAGGCGAATCACCAATTTACACCAGATAGTCTAGGTTTCCTTATTTTATTTTTAATTGAAACCTTGACACA
AGAAGACTCTCTTGACATTTTAGAGATTGGCAGTGGGACAGGTAATCTAGCACAAACACTTTTGAATAATAGTGGTAAAC
AATTAGATTACTTAGGTATTGAAGTGGACGATTTACTCATTGATTTATCAGCTAGTATAGCTGAGATAATGAATTCGACA
GCACGTTTTGTTCAAGAAGATGCCGTTAGACCACAAATTTTGAAAGAAAGTCATATTATTATCAGTGATTTACCAGTAGG
TTATTATCCTAATGATGAGATTGCTAGTCGTTATCAGGTTGCAAGTCCAGATGGTCATACCTATGCCCATCACTTATTGA
TGGAACAGGCTTTGAAATATTTAAAACAAGATGGCTTTGCAATTTTCTTAGCACCAGCCAGTCTTTTGCAAAGTCAACAA
AGCCATCTCCTTAAAGAGTGGTTAAAAGGTTATGCTCAGTTATCTGCAGTGATTACCTTGCCAGAAACATTCTTTGGAGA
TCCATCCGTGGCTAAATCACTGATTGTTCTGCAAAAGCAGAGTGACAAAAAAGGAGAAACCTTTGTTTATCCATTGACTG
ATTTACAGTCTGCAGATAAGGTTCGTCTCTTCATGGAGAACTTCAAAAAATGGAAAGCTGATAATGTCTTTTCATGA

Domains


Predicted by InterproScan.

(69-298)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159

67.192

99.686

0.67

  comYH Streptococcus mutans UA140

66.877

99.686

0.667