Detailed information    

In silico: bioinformatically predicted

Overview


Name: comYB
Type: Machinery gene
Locus tag: SM121_RS01095
Genome accession: NZ_CP139418
Coordinates: 225403..226434 (-)
Length: 343 a.a.
NCBI ID: WP_320911301.1
UniProt ID: -
Organism: Streptococcus dentalis strain S1
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
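
The coordinates and the protein length in this record can be cross-checked with a short calculation. The Python sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate span."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS that ends with one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(225403, 226434)
print(cds_bp, protein_length(cds_bp))  # 1032 343
```

Under those assumptions the span 225403..226434 gives 1032 bp, that is 344 codons including the stop, which agrees with the stated 343 a.a.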

Genomic Context


Location: 220403..231434
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SM121_RS01055 (SM121_01055) - 220543..221274 (-) 732 WP_320910981.1 type II CAAX endopeptidase family protein -
  SM121_RS01060 (SM121_01060) - 221347..222540 (-) 1194 WP_003004202.1 acetate kinase -
  SM121_RS01065 (SM121_01065) comYH 222592..223545 (-) 954 WP_155127115.1 class I SAM-dependent methyltransferase Machinery gene
  SM121_RS01070 (SM121_01070) comGG 223576..224013 (-) 438 WP_320910982.1 competence type IV pilus minor pilin ComGG -
  SM121_RS01075 (SM121_01075) comGF/cglF 224000..224452 (-) 453 WP_151190960.1 competence type IV pilus minor pilin ComGF Machinery gene
  SM121_RS01080 (SM121_01080) comGE 224442..224729 (-) 288 WP_003004353.1 competence type IV pilus minor pilin ComGE -
  SM121_RS01085 (SM121_01085) comYD 224695..225099 (-) 405 WP_003004513.1 competence type IV pilus minor pilin ComGD Machinery gene
  SM121_RS01090 (SM121_01090) comYC 225089..225406 (-) 318 WP_003013558.1 competence type IV pilus major pilin ComGC Machinery gene
  SM121_RS01095 (SM121_01095) comYB 225403..226434 (-) 1032 WP_320911301.1 competence type IV pilus assembly protein ComGB Machinery gene
  SM121_RS01100 (SM121_01100) comYA 226367..227308 (-) 942 WP_155127112.1 competence type IV pilus ATPase ComGA Machinery gene
  SM121_RS01105 (SM121_01105) - 227396..227785 (-) 390 WP_037615753.1 DUF1033 family protein -
  SM121_RS01110 (SM121_01110) - 228007..229110 (-) 1104 WP_320910983.1 glycosyl hydrolase family 8 -
  SM121_RS01115 (SM121_01115) - 229119..230429 (-) 1311 WP_003004511.1 glycosyltransferase -
  SM121_RS01120 (SM121_01120) - 230422..230562 (-) 141 WP_003004308.1 hypothetical protein -
  SM121_RS01125 (SM121_01125) - 230572..231360 (-) 789 WP_003004116.1 hypothetical protein -
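
Several neighbouring genes in this table have coordinates that overlap, which is common for genes in a tightly packed operon. The Python sketch below computes the overlap between adjacent genes from the listed coordinates, again assuming 1-based inclusive spans. The helper names are hypothetical, and only four rows are copied in to keep the example short.

```python
# (gene name, start, end) copied from the table above; all on the (-) strand.
GENES = [
    ("comYD", 224695, 225099),
    ("comYC", 225089, 225406),
    ("comYB", 225403, 226434),
    ("comYA", 226367, 227308),
]


def overlap_bp(a_start, a_end, b_start, b_end):
    """Number of bases shared by two inclusive spans (0 if they do not overlap)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


def adjacent_overlaps(genes):
    """Overlap in bp between each gene and the next one in coordinate order."""
    ordered = sorted(genes, key=lambda g: g[1])
    return [
        (left[0], right[0], overlap_bp(left[1], left[2], right[1], right[2]))
        for left, right in zip(ordered, ordered[1:])
    ]


for left, right, bp in adjacent_overlaps(GENES):
    print(f"{left}/{right}: {bp} bp overlap")

# Flank on each side of comYB within the listed Location window.
WINDOW = (220403, 231434)
print(225403 - WINDOW[0], WINDOW[1] - 226434)  # 5000 5000
```

With these rows the overlaps come out as 11 bp (comYD/comYC), 4 bp (comYC/comYB), and 68 bp (comYB/comYA). The last line suggests that the Location window spans 5000 bp on either side of comYB, though the record itself does not state how the window is chosen.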

Sequence


Protein


Length: 343 a.a.        Molecular weight: 39330.14 Da        Isoelectric point: 9.3919

>NTDB_id=909471 SM121_RS01095 WP_320911301.1 225403..226434(-) (comYB) [Streptococcus dentalis strain S1]
MSWLNRDISSWLRRKPKKLSTAKQKQIIELFLNLYSSGFHLSEIVDFLDRSHLVESRLVSQMREDLSRGRSFSEMMAGIG
FSDAVTTQLSLAELHGNLSLSLEKISAYLENMRKVKKKLIEVSTYPLILLGFLVLIMLGLRNYLLPQMDAQNIGTQLISS
FPQLFLALGAGLVTFFLLGFLYYRKSGKINVFRTLSHLPFGKGMIQAYLTAYYAREWGNLIGQGLELSQIFSMMQDQKSQ
LFQEIGRDLALSLDRGQSFSETVGGYPFFKEELPLMIEYGEVKSKLGNELEIYAEKTWEDFFRRVHKAMNVIQPLVFIFV
ALVIVLLYAAMLLPIYQNMEVHL
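
The FASTA header above packs several fields into one line. The Python sketch below pulls them apart with a regular expression. The pattern is inferred from this single header and is an assumption, not a documented format specification, so other records may need adjustments.

```python
import re

# Pattern inferred from the header in this record; not an official format spec.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise ValueError if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=909471 SM121_RS01095 WP_320911301.1 "
          "225403..226434(-) (comYB) [Streptococcus dentalis strain S1]")
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```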

Nucleotide


Length: 1032 bp

>NTDB_id=909471 SM121_RS01095 WP_320911301.1 225403..226434(-) (comYB) [Streptococcus dentalis strain S1]
ATCTCTTGGCTCAATCGGGATATATCCAGCTGGCTCAGGCGCAAGCCGAAAAAATTATCTACCGCTAAACAAAAGCAAAT
CATTGAATTGTTTTTGAATCTTTATTCGAGTGGTTTTCATCTGTCTGAGATCGTCGATTTTCTGGATCGCTCTCACCTAG
TGGAGAGTCGTTTGGTTTCCCAGATGCGAGAGGACCTTTCTCGGGGGCGGAGTTTTTCAGAGATGATGGCGGGGATCGGT
TTTTCAGATGCAGTGACGACACAGCTGTCTCTCGCTGAGCTCCATGGCAATCTTTCATTGAGCTTGGAGAAAATCAGTGC
TTACTTAGAAAACATGCGCAAGGTCAAGAAAAAGCTGATTGAGGTGAGCACCTATCCTCTCATCTTGCTTGGATTTTTAG
TTCTGATTATGCTAGGCTTGCGTAATTATTTGCTCCCCCAAATGGATGCTCAAAATATTGGGACGCAATTGATCAGTTCC
TTCCCCCAACTCTTTTTGGCCTTAGGAGCGGGACTGGTGACCTTCTTCTTACTCGGCTTTCTCTATTATCGAAAGTCAGG
CAAGATCAACGTTTTTAGAACCTTGTCTCATCTGCCTTTCGGAAAAGGCATGATTCAAGCTTATTTGACAGCCTATTATG
CTAGAGAATGGGGCAATCTGATTGGGCAAGGATTGGAGTTGTCTCAGATTTTTTCCATGATGCAGGACCAAAAATCCCAG
CTTTTTCAAGAAATTGGAAGGGACTTAGCTCTTTCTTTAGACCGTGGCCAGTCTTTTTCAGAGACGGTCGGGGGGTATCC
TTTTTTCAAAGAAGAATTGCCCCTTATGATTGAATATGGTGAAGTCAAATCAAAGCTCGGAAATGAACTAGAGATCTACG
CTGAAAAAACATGGGAAGATTTCTTTCGTCGGGTTCACAAGGCCATGAATGTGATACAACCCTTGGTGTTTATCTTTGTG
GCTCTTGTGATTGTGTTACTCTATGCAGCCATGTTGCTGCCGATTTATCAAAATATGGAGGTTCATTTATGA
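
The nucleotide sequence above is written in the coding direction, so it can be translated directly and compared with the protein sequence. The Python sketch below builds the standard codon table and translates a fragment. It assumes that the first codon is read as methionine whatever its sequence, as happens at bacterial initiation. That matters here because the listed sequence begins with ATC rather than ATG. The function name and the `is_start` flag are illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, is_start: bool = True) -> str:
    """Translate an in-frame coding fragment; drop one trailing stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("fragment length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if is_start and residues:
        residues[0] = "M"  # assumption: the initiator codon is decoded as Met
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First seven codons and the final line of the nucleotide record above.
print(translate("ATCTCTTGGCTCAATCGGGAT"))  # MSWLNRD
print(translate(
    "GCTCTTGTGATTGTGTTACTCTATGCAGCCATGTTGCTGCCGATTTATCAAAATATGGAGGTTCATTTATGA",
    is_start=False,
))  # ALVIVLLYAAMLLPIYQNMEVHL
```

Both fragments reproduce the corresponding ends of the protein sequence listed in this record, including the terminal TGA stop codon.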


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYB Streptococcus gordonii str. Challis substr. CH1 65.015 100 0.65
  comGB/cglB Streptococcus mitis SK321 66.071 97.959 0.647
  comGB/cglB Streptococcus mitis NCTC 12261 65.774 97.959 0.644
  comGB/cglB Streptococcus pneumoniae TIGR4 65.476 97.959 0.641
  comGB/cglB Streptococcus pneumoniae R6 65.476 97.959 0.641
  comGB/cglB Streptococcus pneumoniae Rx1 65.476 97.959 0.641
  comGB/cglB Streptococcus pneumoniae D39 65.476 97.959 0.641
  comYB Streptococcus mutans UA159 57.88 100 0.589
  comYB Streptococcus mutans UA140 57.88 100 0.589
  comGB Lactococcus lactis subsp. cremoris KW2 48.961 98.251 0.481