Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYC   Type   Machinery gene
Locus tag   AB4X21_RS01145 Genome accession   NZ_CP163380
Coordinates   203321..203638 (+) Length   105 a.a.
NCBI ID   WP_369088025.1    Uniprot ID   A0AB39LBR4
Organism   Streptococcus sp. CP1998     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 198321..208638
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AB4X21_RS01125 (AB4X21_01125) - 199618..200721 (+) 1104 WP_369088023.1 glycosyl hydrolase family 8 -
  AB4X21_RS01130 (AB4X21_01130) - 200943..201332 (+) 390 WP_037607163.1 DUF1033 family protein -
  AB4X21_RS01135 (AB4X21_01135) comYA 201419..202360 (+) 942 WP_369088024.1 competence type IV pilus ATPase ComGA Machinery gene
  AB4X21_RS01140 (AB4X21_01140) comGB/cglB 202293..203324 (+) 1032 WP_369088354.1 competence type IV pilus assembly protein ComGB Machinery gene
  AB4X21_RS01145 (AB4X21_01145) comYC 203321..203638 (+) 318 WP_369088025.1 competence type IV pilus major pilin ComGC Machinery gene
  AB4X21_RS01150 (AB4X21_01150) comYD 203628..204032 (+) 405 WP_003004513.1 competence type IV pilus minor pilin ComGD Machinery gene
  AB4X21_RS01155 (AB4X21_01155) comGE 203998..204285 (+) 288 WP_369088026.1 competence type IV pilus minor pilin ComGE -
  AB4X21_RS01160 (AB4X21_01160) comGF/cglF 204275..204712 (+) 438 WP_369088027.1 competence type IV pilus minor pilin ComGF Machinery gene
  AB4X21_RS01165 (AB4X21_01165) comGG 204714..205151 (+) 438 WP_369088028.1 competence type IV pilus minor pilin ComGG -
  AB4X21_RS01170 (AB4X21_01170) comYH 205182..206135 (+) 954 WP_369088029.1 class I SAM-dependent methyltransferase Machinery gene
  AB4X21_RS01175 (AB4X21_01175) - 206187..207380 (+) 1194 WP_003004202.1 acetate kinase -
  AB4X21_RS01180 (AB4X21_01180) - 207453..208184 (+) 732 WP_369088030.1 CPBP family intramembrane glutamic endopeptidase -

Sequence


Protein


Download         Length: 105 a.a.        Molecular weight: 11633.55 Da        Isoelectric Point: 9.6953

>NTDB_id=1029399 AB4X21_RS01145 WP_369088025.1 203321..203638(+) (comYC) [Streptococcus sp. CP1998]
MKKLKTYKVKAFTLIEMLVVLLIISVLLLLFVPNLTKQKDSVKETGNAAVVKVVESQAELYELNHTNDQATLAKLIADGN
ITNKQAESYRAYHAKNSGETRAVAD

Nucleotide


Download         Length: 318 bp        

>NTDB_id=1029399 AB4X21_RS01145 WP_369088025.1 203321..203638(+) (comYC) [Streptococcus sp. CP1998]
ATGAAAAAATTAAAAACCTATAAGGTTAAAGCCTTTACACTTATTGAAATGTTGGTGGTCTTATTGATCATCAGTGTGCT
CTTATTGCTCTTTGTGCCGAATTTGACCAAGCAAAAAGACTCCGTGAAAGAGACAGGAAATGCAGCGGTTGTGAAGGTGG
TCGAAAGCCAGGCTGAATTGTATGAGCTCAATCATACCAATGATCAAGCCACTCTAGCAAAACTGATTGCGGATGGAAAT
ATTACCAACAAACAAGCAGAATCCTACCGTGCCTATCATGCAAAAAATAGTGGAGAAACTCGTGCGGTTGCAGATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYC Streptococcus gordonii str. Challis substr. CH1

75.238

100

0.752

  comYC Streptococcus mutans UA159

72.381

100

0.724

  comYC Streptococcus mutans UA140

72.381

100

0.724

  comGC/cglC Streptococcus mitis NCTC 12261

67.273

100

0.705

  comGC/cglC Streptococcus mitis SK321

66.972

100

0.695

  comGC/cglC Streptococcus pneumoniae R6

66.055

100

0.686

  comGC/cglC Streptococcus pneumoniae Rx1

66.055

100

0.686

  comGC/cglC Streptococcus pneumoniae D39

66.055

100

0.686

  comGC/cglC Streptococcus pneumoniae TIGR4

65.138

100

0.676

  comGC Lactococcus lactis subsp. cremoris KW2

58.654

99.048

0.581

  comYC Streptococcus suis isolate S10

67.416

84.762

0.571

  comGC Staphylococcus aureus MW2

45.918

93.333

0.429

  comGC Staphylococcus aureus N315

45.918

93.333

0.429


Multiple sequence alignment