Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comYA
Type   Machinery gene
Locus tag   ABZ559_RS11745
Genome accession   NZ_CP160400
Coordinates   2356170..2357111 (-)
Length   313 a.a.
NCBI ID   WP_018376801.1
UniProt ID   -
Organism   Streptococcus sp. ZY19097
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
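
The Coordinates and Length fields above are linked by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a single terminal stop codon; the helper name `parse_coordinates` is hypothetical and not part of any database API.

```python
import re

def parse_coordinates(text):
    """Parse a string like '2356170..2357111 (-)' into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)

start, end, strand = parse_coordinates("2356170..2357111 (-)")

# Inclusive coordinates: both endpoints count toward the gene length.
gene_length_bp = end - start + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa, strand)  # 942 313 -
```

The result agrees with the 942 bp nucleotide length and the 313 a.a. protein length reported on this page.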

Genomic Context


Location: 2351170..2362111
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  ABZ559_RS11705 (ABZ559_11705)   -   2351210..2352403 (-)   1194   WP_367006522.1   acetate kinase   -
  ABZ559_RS11710 (ABZ559_11710)   comYH   2352455..2353411 (-)   957   WP_018376808.1   class I SAM-dependent methyltransferase   Machinery gene
  ABZ559_RS11715 (ABZ559_11715)   comYG   2353465..2353821 (-)   357   WP_367006523.1   competence type IV pilus minor pilin ComGG   Machinery gene
  ABZ559_RS11720 (ABZ559_11720)   comYF   2353799..2354260 (-)   462   WP_367006524.1   competence type IV pilus minor pilin ComGF   Machinery gene
  ABZ559_RS11725 (ABZ559_11725)   comGE   2354220..2354510 (-)   291   WP_367006526.1   competence type IV pilus minor pilin ComGE   -
  ABZ559_RS11730 (ABZ559_11730)   comYD   2354467..2354916 (-)   450   WP_277290537.1   competence type IV pilus minor pilin ComGD   Machinery gene
  ABZ559_RS11735 (ABZ559_11735)   comGC/cglC   2354870..2355202 (-)   333   WP_018376803.1   competence type IV pilus major pilin ComGC   Machinery gene
  ABZ559_RS11740 (ABZ559_11740)   comYB   2355203..2356246 (-)   1044   WP_367006528.1   competence type IV pilus assembly protein ComGB   Machinery gene
  ABZ559_RS11745 (ABZ559_11745)   comYA   2356170..2357111 (-)   942   WP_018376801.1   competence type IV pilus ATPase ComGA   Machinery gene
  ABZ559_RS11750 (ABZ559_11750)   -   2357206..2357571 (-)   366   WP_273415187.1   DUF1033 family protein   -
  ABZ559_RS11755 (ABZ559_11755)   -   2357705..2358556 (-)   852   WP_367006530.1   ABC transporter ATP-binding protein   -
  ABZ559_RS11760 (ABZ559_11760)   -   2358559..2359239 (-)   681   WP_367006532.1   hypothetical protein   -
  ABZ559_RS11765 (ABZ559_11765)   -   2359254..2359760 (-)   507   WP_367006534.1   hypothetical protein   -
  ABZ559_RS11770 (ABZ559_11770)   -   2359750..2360391 (-)   642   WP_367006536.1   hypothetical protein   -
  ABZ559_RS11775 (ABZ559_11775)   -   2360384..2361031 (-)   648   WP_367006538.1   ATP-binding cassette domain-containing protein   -
  ABZ559_RS11780 (ABZ559_11780)   -   2361120..2361251 (-)   132   WP_321058657.1   aureocin A53 family class IId bacteriocin   -
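
The Location range appears to be the comYA coordinates extended by 5,000 bp on each side. This is inferred from the numbers on this page rather than stated by it. The sketch below checks that inference and also measures how far comYA and the adjacent comYB overlap; the function names `flank` and `overlap_bp` are illustrative only.

```python
def flank(start, end, padding):
    """Extend an inclusive interval by `padding` bases on both sides."""
    return start - padding, end + padding

def overlap_bp(a, b):
    """Number of bases shared by two inclusive (start, end) intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

com_ya = (2356170, 2357111)  # coordinates listed for comYA
com_yb = (2355203, 2356246)  # coordinates listed for comYB

# Assumption: the displayed window is the gene plus 5,000 bp of flanking sequence.
window = flank(*com_ya, padding=5000)

print(window)                      # (2351170, 2362111)
print(overlap_bp(com_ya, com_yb))  # 77
```

Overlapping coordinates of this kind are common between adjacent genes of one operon, which fits the same-strand arrangement of the competence genes in the table.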

Sequence


Protein


Length: 313 a.a.   Molecular weight: 35378.53 Da   Isoelectric point: 5.8773

>NTDB_id=1020961 ABZ559_RS11745 WP_018376801.1 2356170..2357111(-) (comYA) [Streptococcus sp. ZY19097]
MVQDMAKKIISEAVALNAQDIYMIPLSENYELYMRIGDERRFINDYDTEQMMSLISHFKFVSGMNVGEKRRSQLGSCDYP
FSDEEEISLRLSSVGDYRGRESLVIRLLYSGRHDLKYWFNGMHNIMEAIGGRGLYLFSGPVGSGKTTLMYQLIAEKFPDK
QIITIEDPVEIKQDNMLQLQLNDSIGMTYDNLIKLSLRHRPDILIIGEIRDSETARAVIRASLTGAMVFSTIHAKSIQGV
YARLLELGVSREELDNSLRLIAYQRLIGGGGVIDFACKDFQNHIADKWNGQIESLAHDGHISAAQAKIEKITA

Nucleotide


Length: 942 bp

>NTDB_id=1020961 ABZ559_RS11745 WP_018376801.1 2356170..2357111(-) (comYA) [Streptococcus sp. ZY19097]
ATGGTTCAAGACATGGCAAAAAAGATTATCAGTGAGGCTGTGGCACTCAATGCCCAGGATATTTATATGATACCTCTATC
TGAGAATTATGAGCTTTATATGAGGATTGGAGATGAACGCCGCTTTATCAACGATTATGACACAGAGCAAATGATGAGTT
TGATTAGTCATTTTAAGTTTGTATCTGGGATGAATGTCGGAGAGAAACGACGTAGTCAACTGGGATCCTGTGATTACCCA
TTTTCAGATGAAGAGGAGATTTCGCTCAGACTTTCCAGTGTGGGAGATTATCGTGGGCGTGAGAGTTTGGTCATTCGCTT
GCTCTACTCGGGTCGTCATGATTTGAAATATTGGTTTAATGGCATGCACAATATTATGGAAGCGATTGGTGGTCGAGGAC
TTTACCTTTTTTCGGGTCCAGTTGGGAGTGGTAAAACCACTCTCATGTACCAGCTGATTGCGGAGAAATTTCCAGACAAG
CAAATTATTACCATTGAAGATCCAGTTGAGATTAAGCAGGATAATATGTTGCAACTACAATTAAACGATAGCATTGGTAT
GACTTATGACAATCTGATAAAGCTATCTCTGCGACACCGTCCTGATATTCTTATCATCGGGGAAATCCGAGATAGCGAAA
CAGCACGGGCGGTCATTCGAGCCAGTCTGACGGGAGCTATGGTTTTCTCAACCATTCACGCTAAGAGTATTCAAGGAGTT
TATGCCCGTCTGCTAGAGCTTGGTGTCAGCCGGGAAGAGCTGGACAATAGCTTGCGCTTAATTGCCTACCAGCGTTTGAT
TGGGGGAGGAGGTGTGATTGACTTTGCATGTAAAGACTTCCAAAACCATATCGCAGACAAGTGGAATGGGCAAATTGAAA
GCCTTGCTCATGACGGACATATCAGTGCTGCGCAGGCAAAAATCGAAAAAATTACCGCTTAA
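
The nucleotide record is given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly to compare against the protein record. The following is a minimal sketch using the standard codon assignments; `translate` is a hypothetical helper written for this illustration, not a library function, and it does not handle ambiguous bases.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    cds = "".join(cds.split()).upper()  # remove FASTA line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)

# First ten codons of the nucleotide record above.
print(translate("ATGGTTCAAGACATGGCAAAAAAGATTATC"))  # MVQDMAKKII
```

Passing the full 942 bp sequence to `translate` should give a 313-residue string that can be compared with the protein record.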


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comYA   Streptococcus mutans UA159   75.241   99.361   0.748
  comYA   Streptococcus mutans UA140   74.92   99.361   0.744
  comGA/cglA   Streptococcus sobrinus strain NIDR 6715-7   67.097   99.042   0.665
  comYA   Streptococcus gordonii str. Challis substr. CH1   66.881   99.361   0.665
  comGA/cglA/cilD   Streptococcus mitis NCTC 12261   65.705   99.681   0.655
  comGA/cglA/cilD   Streptococcus pneumoniae Rx1   65.385   99.681   0.652
  comGA/cglA/cilD   Streptococcus pneumoniae D39   65.385   99.681   0.652
  comGA/cglA/cilD   Streptococcus pneumoniae R6   65.385   99.681   0.652
  comGA/cglA/cilD   Streptococcus pneumoniae TIGR4   65.385   99.681   0.652
  comGA   Lactococcus lactis subsp. cremoris KW2   51.923   99.681   0.518
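
The Ha-values listed above are numerically consistent with the product of the identity and coverage fractions, rounded to three decimals. This page does not define the Ha-value, so the formula in the sketch below is an inference from the listed numbers, and `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed relation: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)

# (organism, identities %, coverage %, Ha-value as listed on this page)
rows = [
    ("Streptococcus mutans UA159", 75.241, 99.361, 0.748),
    ("Streptococcus sobrinus strain NIDR 6715-7", 67.097, 99.042, 0.665),
    ("Streptococcus pneumoniae R6", 65.385, 99.681, 0.652),
    ("Lactococcus lactis subsp. cremoris KW2", 51.923, 99.681, 0.518),
]

for organism, identity, coverage, listed in rows:
    # Print the recomputed value next to the listed one for comparison.
    print(organism, ha_value(identity, coverage), listed)
```

For each of the rows shown, the recomputed value matches the listed one.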


Multiple sequence alignment