Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYB   Type   Machinery gene
Locus tag   SGPB_RS00610 Genome accession   NC_015600
Coordinates   95466..96509 (+) Length   347 a.a.
NCBI ID   WP_003062919.1    Uniprot ID   E0PB48
Organism   Streptococcus pasteurianus ATCC 43144     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 90466..101509
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SGPB_RS00600 (SGPB_0085) - 94168..94530 (+) 363 WP_003062916.1 DUF1033 family protein -
  SGPB_RS00605 (SGPB_0086) comYA 94601..95542 (+) 942 WP_003062917.1 competence type IV pilus ATPase ComGA Machinery gene
  SGPB_RS00610 (SGPB_0087) comYB 95466..96509 (+) 1044 WP_003062919.1 competence type IV pilus assembly protein ComGB Machinery gene
  SGPB_RS00615 (SGPB_0088) comGC 96509..96803 (+) 295 Protein_87 competence type IV pilus major pilin ComGC -
  SGPB_RS00620 (SGPB_0089) comYD 96787..97218 (+) 432 WP_003062922.1 competence type IV pilus minor pilin ComGD Machinery gene
  SGPB_RS00625 (SGPB_0090) comGE 97172..97465 (+) 294 WP_080543606.1 competence type IV pilus minor pilin ComGE -
  SGPB_RS00630 (SGPB_0091) comYF 97449..97886 (+) 438 WP_041974065.1 competence type IV pilus minor pilin ComGF Machinery gene
  SGPB_RS00635 (SGPB_0092) comYG 97858..98190 (+) 333 WP_003062925.1 competence type IV pilus minor pilin ComGG Machinery gene
  SGPB_RS00640 (SGPB_0093) comYH 98245..99216 (+) 972 WP_003062926.1 class I SAM-dependent methyltransferase Machinery gene
  SGPB_RS00645 (SGPB_0094) - 99254..100453 (+) 1200 WP_003062927.1 acetate kinase -
  SGPB_RS00650 (SGPB_0095) - 100614..100814 (+) 201 WP_013851401.1 helix-turn-helix transcriptional regulator -
  SGPB_RS00655 (SGPB_0096) - 100824..101054 (+) 231 WP_013851402.1 hypothetical protein -
  SGPB_RS00660 (SGPB_0097) - 101045..101497 (+) 453 WP_003062930.1 hypothetical protein -

Sequence


Protein


Download         Length: 347 a.a.        Molecular weight: 39817.38 Da        Isoelectric Point: 9.8728

>NTDB_id=41049 SGPB_RS00610 WP_003062919.1 95466..96509(+) (comYB) [Streptococcus pasteurianus ATCC 43144]
MQKLKVLLKTDISQLNKQKSKKLPFKKQRKVIQLFNNLFKSGFNLTEIVFFLRRSQLLSEVYVERMQESLLNGASLAAMM
VDLGFSDNIVTQIALADVHGNSQKSLLKIESYLSSMTVVRKKLIEVATYPLILFLFLILIMLGLKNYLLPQLESQNVATQ
IIAHFPTIFLLSIFSIGVLLICTTFYARRLSQIDLYSRISRIPLVGNYVRLYLTAYYAREWGNLIGQGIELMAIVGIMQK
QKSLLFQEIGKDMEEALLSGQAFHQKVLDYPFFLRELSLMIEYGEIKSKLGRELDIYAEETWQSFFGKLTQATQLIQPLV
FVFVALIIVLIYVAMLLPMYQNMGGNF

Nucleotide


Download         Length: 1044 bp        

>NTDB_id=41049 SGPB_RS00610 WP_003062919.1 95466..96509(+) (comYB) [Streptococcus pasteurianus ATCC 43144]
ATGCAAAAATTGAAAGTCTTGTTAAAGACGGACATATCACAGCTGAACAAGCAAAAATCGAAAAAATTGCCATTTAAAAA
ACAGCGCAAGGTTATCCAACTCTTTAATAATCTTTTTAAAAGTGGGTTTAATTTAACAGAAATTGTGTTTTTTCTCCGAA
GAAGTCAATTGTTGTCAGAGGTTTATGTTGAGAGGATGCAAGAAAGTTTGTTAAATGGTGCTAGTCTAGCAGCAATGATG
GTAGATTTAGGGTTTTCGGACAATATTGTCACACAAATTGCTTTGGCTGATGTTCATGGAAACAGTCAGAAAAGTCTGCT
AAAAATTGAGTCTTACCTTTCTAGCATGACTGTCGTCAGAAAAAAGTTAATTGAAGTTGCAACGTATCCATTGATTTTGT
TCTTGTTTCTTATTTTGATTATGCTGGGGTTGAAGAATTATTTACTGCCGCAGCTGGAAAGCCAAAATGTGGCAACGCAG
ATTATTGCGCATTTTCCAACGATTTTTTTGTTAAGTATTTTCTCGATTGGAGTGTTGCTTATTTGTACGACATTTTATGC
TAGGCGTTTATCGCAGATTGATTTATATAGTCGAATAAGCCGGATTCCACTTGTGGGAAACTATGTTAGGTTATATTTGA
CAGCTTACTATGCGCGTGAATGGGGGAATTTGATTGGGCAAGGTATTGAATTAATGGCAATCGTGGGAATCATGCAAAAG
CAAAAGTCGCTCTTATTTCAAGAGATTGGAAAGGATATGGAAGAAGCGCTGCTTTCAGGGCAAGCTTTTCATCAAAAAGT
TCTGGATTATCCATTCTTTTTGCGAGAATTGAGCTTGATGATTGAATATGGTGAGATCAAATCAAAGCTTGGGCGTGAGT
TGGACATTTATGCTGAGGAAACATGGCAGAGCTTTTTTGGCAAATTGACTCAAGCAACACAGCTCATTCAACCACTTGTT
TTTGTCTTTGTAGCTTTGATTATTGTGTTAATTTATGTGGCAATGCTGTTGCCAATGTATCAAAATATGGGAGGAAATTT
TTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB E0PB48

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYB Streptococcus mutans UA140

66.456

91.066

0.605

  comYB Streptococcus mutans UA159

66.139

91.066

0.602

  comGB/cglB Streptococcus mitis NCTC 12261

59.587

97.695

0.582

  comYB Streptococcus gordonii str. Challis substr. CH1

59.524

96.83

0.576

  comGB/cglB Streptococcus mitis SK321

56.677

97.118

0.55

  comGB/cglB Streptococcus pneumoniae Rx1

55.786

97.118

0.542

  comGB/cglB Streptococcus pneumoniae D39

55.786

97.118

0.542

  comGB/cglB Streptococcus pneumoniae R6

55.786

97.118

0.542

  comGB/cglB Streptococcus pneumoniae TIGR4

55.786

97.118

0.542

  comGB Lactococcus lactis subsp. cremoris KW2

50

98.559

0.493


Multiple sequence alignment