Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   EL020_RS03715 Genome accession   NZ_LR134203
Coordinates   707165..708121 (+) Length   318 a.a.
NCBI ID   WP_020915994.1    Uniprot ID   A0AB33AJ02
Organism   Streptococcus lutetiensis strain NCTC11436     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 702165..713121
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL020_RS03675 (NCTC11436_00764) - 703065..703433 (+) 369 WP_126402407.1 DUF1033 family protein -
  EL020_RS03680 (NCTC11436_00765) comYA 703505..704446 (+) 942 WP_020915987.1 competence type IV pilus ATPase ComGA Machinery gene
  EL020_RS03685 (NCTC11436_00766) comYB 704370..705413 (+) 1044 WP_231997012.1 competence type IV pilus assembly protein ComGB Machinery gene
  EL020_RS03690 (NCTC11436_00767) comYC 705413..705706 (+) 294 WP_020915989.1 competence type IV pilus major pilin ComGC Machinery gene
  EL020_RS03695 (NCTC11436_00768) comYD 705690..706121 (+) 432 WP_020915990.1 competence type IV pilus minor pilin ComGD Machinery gene
  EL020_RS03700 (NCTC11436_00769) comYE 706075..706368 (+) 294 WP_020915991.1 competence type IV pilus minor pilin ComGE Machinery gene
  EL020_RS03705 (NCTC11436_00770) comYF 706352..706789 (+) 438 WP_043894883.1 competence type IV pilus minor pilin ComGF Machinery gene
  EL020_RS03710 (NCTC11436_00771) comGG 706761..707111 (+) 351 WP_020915993.1 competence type IV pilus minor pilin ComGG -
  EL020_RS03715 (NCTC11436_00772) comYH 707165..708121 (+) 957 WP_020915994.1 class I SAM-dependent methyltransferase Machinery gene
  EL020_RS03720 (NCTC11436_00773) - 708175..709374 (+) 1200 WP_126402408.1 acetate kinase -
  EL020_RS03725 (NCTC11436_00774) - 709534..709731 (+) 198 WP_020915996.1 helix-turn-helix transcriptional regulator -
  EL020_RS03730 (NCTC11436_00775) - 709721..709948 (+) 228 WP_231997018.1 hypothetical protein -
  EL020_RS03735 (NCTC11436_00776) - 709949..710404 (+) 456 WP_231853254.1 ABC transporter permease -
  EL020_RS03740 (NCTC11436_00777) - 710419..711054 (+) 636 WP_058833293.1 CPBP family intramembrane glutamic endopeptidase -
  EL020_RS03745 (NCTC11436_00778) proC 711091..711861 (-) 771 WP_126402409.1 pyrroline-5-carboxylate reductase -
  EL020_RS03750 (NCTC11436_00779) pepA 711923..712990 (-) 1068 WP_020916001.1 glutamyl aminopeptidase -

Sequence


Protein


Download         Length: 318 a.a.        Molecular weight: 35938.07 Da        Isoelectric Point: 4.4591

>NTDB_id=1118957 EL020_RS03715 WP_020915994.1 707165..708121(+) (comYH) [Streptococcus lutetiensis strain NCTC11436]
MNFEKIETAYGLILENIQLIENELKTHIYDALIEQNSFYLGAEGANETVAANNEKLRQLDLTKEEWRRAFQFIFIKAAQT
EALQANHQFTPDAIGFILMFLIENLTASKELDVLEIGSGTGNLAQTLLNNSSKDLNYLGIEVDDLLIDLSASIAEVMDSK
AQFIQEDAVRPQILKESDVIISDLPVGFYPNDEIAKRYKVASSEEHTYAHHLLMEQSLKYLKKDGIAVFLAPVSLLTSKQ
SDLLKAWLKDYADVIAVITLPEPIFGNAANAKSIFVLKKQAEHTPETFVYPLADLQSREVLTDFIEKFKKWNVENMIF

Nucleotide


Download         Length: 957 bp        

>NTDB_id=1118957 EL020_RS03715 WP_020915994.1 707165..708121(+) (comYH) [Streptococcus lutetiensis strain NCTC11436]
ATGAATTTTGAAAAAATCGAAACAGCTTACGGGCTGATTCTAGAAAATATACAGTTAATCGAAAATGAGTTGAAAACACA
CATTTACGATGCTCTTATTGAACAAAACTCTTTTTATCTTGGTGCTGAAGGTGCCAATGAGACAGTAGCTGCTAATAATG
AAAAACTACGCCAACTTGATTTAACTAAAGAAGAATGGCGTCGTGCTTTTCAGTTTATTTTTATTAAAGCTGCTCAAACA
GAAGCACTTCAGGCAAATCACCAATTTACACCTGATGCAATTGGCTTTATCTTAATGTTCCTTATTGAGAATTTGACAGC
TTCAAAAGAACTTGATGTTTTAGAAATTGGTAGCGGAACAGGTAACCTTGCACAAACCTTGCTCAATAATTCATCTAAAG
ACTTGAATTACCTTGGAATTGAAGTTGATGATTTGTTGATTGACTTGTCAGCAAGTATTGCCGAAGTGATGGATTCTAAA
GCTCAGTTCATTCAAGAAGATGCTGTGCGCCCACAGATTCTTAAGGAAAGTGACGTCATCATCAGTGACCTTCCAGTCGG
CTTCTATCCCAATGATGAAATTGCAAAACGCTACAAAGTAGCAAGCAGTGAAGAACACACTTATGCGCATCATTTGTTGA
TGGAACAATCTCTCAAATATCTCAAAAAAGATGGTATTGCTGTCTTTTTAGCACCGGTTAGTCTTTTGACAAGTAAACAA
AGTGATTTGTTAAAGGCATGGTTGAAGGATTACGCTGATGTTATTGCGGTGATTACTTTACCAGAACCTATTTTCGGCAA
TGCGGCCAATGCTAAGTCAATTTTTGTCTTGAAGAAACAGGCTGAACATACTCCAGAAACATTTGTTTACCCGCTTGCTG
ACTTGCAAAGTCGCGAAGTTTTAACAGATTTTATTGAGAAATTTAAAAAATGGAATGTTGAAAATATGATTTTTTAA

Domains


Predicted by InterproScan.

(69-292)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

68.354

99.371

0.679

  comYH Streptococcus mutans UA159

68.038

99.371

0.676


Multiple sequence alignment