Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYB
Type: Machinery gene
Locus tag: SIR_RS18230
Genome accession: NC_022246
Coordinates: 1760243..1761265 (-)
Length: 340 a.a.
NCBI ID: WP_021003258.1
UniProt ID: -
Organism: Streptococcus intermedius B196
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 1755243..1766265
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
SIR_RS18195 (SIR_1661) | - | 1756258..1757457 (-) | 1200 | WP_021003254.1 | acetate kinase | -
SIR_RS18200 (SIR_1662) | comYH | 1757504..1758457 (-) | 954 | WP_021003255.1 | class I SAM-dependent methyltransferase | Machinery gene
SIR_RS18205 (SIR_1663) | comGG | 1758548..1758874 (-) | 327 | WP_021003256.1 | competence type IV pilus minor pilin ComGG | -
SIR_RS18210 (SIR_1664) | comGF/cglF | 1758855..1759292 (-) | 438 | WP_003073990.1 | competence type IV pilus minor pilin ComGF | Machinery gene
SIR_RS18215 (SIR_1665) | comGE/cglE | 1759276..1759569 (-) | 294 | WP_003073987.1 | competence type IV pilus minor pilin ComGE | Machinery gene
SIR_RS18220 (SIR_1666) | comYD | 1759541..1759969 (-) | 429 | WP_021003257.1 | competence type IV pilus minor pilin ComGD | Machinery gene
SIR_RS18225 (SIR_1667) | comYC | 1759929..1760246 (-) | 318 | WP_003075998.1 | competence type IV pilus major pilin ComGC | Machinery gene
SIR_RS18230 (SIR_1668) | comYB | 1760243..1761265 (-) | 1023 | WP_021003258.1 | competence type IV pilus assembly protein ComGB | Machinery gene
SIR_RS18235 (SIR_1669) | comYA | 1761207..1762148 (-) | 942 | WP_003073980.1 | competence type IV pilus ATPase ComGA | Machinery gene
SIR_RS18240 (SIR_1670) | - | 1762217..1762582 (-) | 366 | WP_021003259.1 | DUF1033 family protein | -
SIR_RS18245 (SIR_1671) | glnA | 1762804..1764150 (-) | 1347 | WP_021003260.1 | type I glutamate--ammonia ligase | -
SIR_RS18250 (SIR_1672) | - | 1764189..1764548 (-) | 360 | WP_003073972.1 | MerR family transcriptional regulator | -
SIR_RS18255 (SIR_1673) | - | 1764625..1765155 (-) | 531 | WP_009569519.1 | FUSC family protein | -
SIR_RS18260 (SIR_1674) | - | 1765330..1766010 (-) | 681 | WP_003075761.1 | COG3942 and LysM peptidoglycan-binding domain-containing protein | -
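
The Location window given for the genomic context (1755243..1766265) is consistent with the comYB coordinates extended by 5,000 bp on each side. The record does not state that flank size; it is inferred here from the numbers. A minimal Python sketch of that arithmetic, with a hypothetical helper name:

```python
FLANK_BP = 5000  # assumption: inferred from the record, not stated in it


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend an inclusive 1-based gene range by `flank` bp on both sides.

    The lower bound is clamped at 1 so the window never starts before
    the first base of the replicon.
    """
    return max(1, start - flank), end + flank


# comYB (SIR_RS18230) coordinates from the record
print(context_window(1760243, 1761265))  # (1755243, 1766265)
```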

Sequence


Protein


Length: 340 a.a.   Molecular weight: 39205.35 Da   Isoelectric point: 9.7483

>NTDB_id=61916 SIR_RS18230 WP_021003258.1 1760243..1761265(-) (comYB) [Streptococcus intermedius B196]
MQQDISILNRQKRKKLSTVKQKKVVELLNNFFSSGFHLAEIIDFLKRSALLEKAYVEKMKEGLATGKPFSDIMASLGFSD
NVVTQLSLAELHGNLSLSLSKIEEYLENISKVKKKLIEVATYPFILFIFLVFIMLGLRNYLLPQLESQNIATQLISRLPQ
IFLGSVGVVFALFAIGFYWFRKSSQIRVFSLLSRLPFWGTFVQTYLTAYYAREWGNMIGQGLELSQIFQMMQGQRSAMFQ
EIGKDLEVALQNGQEFSQVVKHYPFFKKELGLMIEYGEVKSKLGCELEVYAQKTWEVFFKRVHQAMNIIQPLVFIFVALM
IVLLYAAMLLPIYQNMEVQL

Nucleotide


Length: 1023 bp

>NTDB_id=61916 SIR_RS18230 WP_021003258.1 1760243..1761265(-) (comYB) [Streptococcus intermedius B196]
ATGCAGCAGGACATATCAATCTTGAACAGGCAAAAGCGGAAAAAATTATCCACAGTTAAGCAGAAAAAAGTCGTAGAGTT
ACTAAATAATTTTTTTTCTAGTGGTTTTCACTTGGCAGAAATTATTGATTTTTTGAAACGGAGTGCTTTGTTAGAAAAAG
CCTATGTGGAAAAGATGAAAGAGGGTTTGGCAACTGGAAAGCCTTTTTCAGACATCATGGCAAGTTTAGGTTTTTCGGAT
AATGTAGTAACACAGCTTTCATTGGCAGAATTGCATGGAAATTTATCGCTCAGTTTGAGTAAAATTGAAGAGTATTTGGA
AAATATCTCAAAAGTCAAGAAAAAGTTAATCGAAGTGGCGACTTATCCATTTATTTTATTCATTTTTCTAGTGTTTATTA
TGTTAGGATTGCGCAATTATTTGTTACCGCAGTTGGAGAGTCAGAATATTGCAACACAACTGATTAGTCGTTTACCACAA
ATTTTTCTGGGTTCAGTAGGAGTTGTTTTTGCTTTGTTTGCGATCGGTTTTTATTGGTTTAGAAAAAGCTCACAAATAAG
AGTATTTAGTTTGTTGTCTCGCCTTCCTTTTTGGGGCACTTTTGTTCAAACCTATTTGACAGCTTATTATGCGAGAGAAT
GGGGGAATATGATTGGTCAAGGCTTAGAACTCAGTCAGATTTTCCAGATGATGCAAGGACAGCGCTCTGCTATGTTTCAA
GAAATTGGAAAAGATTTAGAAGTAGCACTGCAAAATGGTCAGGAATTTTCACAAGTAGTTAAACATTATCCATTTTTTAA
AAAAGAATTGGGCTTGATGATCGAATATGGGGAAGTGAAATCCAAATTGGGGTGCGAATTAGAAGTTTATGCTCAAAAAA
CGTGGGAGGTGTTTTTTAAACGTGTTCATCAAGCGATGAATATCATTCAGCCATTAGTATTTATCTTTGTGGCGTTAATG
ATTGTATTGTTGTATGCAGCCATGTTGCTGCCAATTTATCAAAATATGGAGGTTCAATTGTGA
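
The coordinates, nucleotide length, and protein length reported in this record agree with one another. The sketch below (Python, hypothetical helper names) derives the nucleotide length from the inclusive coordinates 1760243..1761265, then the protein length from the codon count. It assumes the final codon is a stop codon that is not counted as a residue, which matches the TGA at the end of the nucleotide sequence.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS of `cds_length` bp.

    Assumes the CDS ends with a stop codon, which is excluded from the count.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


nt_len = span_length(1760243, 1761265)
aa_len = protein_length(nt_len)
print(nt_len, aa_len)  # 1023 340
```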


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comYB | Streptococcus gordonii str. Challis substr. CH1 | 72.941 | 100 | 0.729
comGB/cglB | Streptococcus mitis SK321 | 69.254 | 98.529 | 0.682
comGB/cglB | Streptococcus mitis NCTC 12261 | 69.254 | 98.529 | 0.682
comGB/cglB | Streptococcus pneumoniae Rx1 | 68.358 | 98.529 | 0.674
comGB/cglB | Streptococcus pneumoniae D39 | 68.358 | 98.529 | 0.674
comGB/cglB | Streptococcus pneumoniae R6 | 68.358 | 98.529 | 0.674
comGB/cglB | Streptococcus pneumoniae TIGR4 | 68.358 | 98.529 | 0.674
comYB | Streptococcus mutans UA140 | 60.234 | 100 | 0.606
comYB | Streptococcus mutans UA159 | 60.234 | 100 | 0.606
comGB | Lactococcus lactis subsp. cremoris KW2 | 51.929 | 99.118 | 0.515


Multiple sequence alignment