Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYC   Type   Machinery gene
Locus tag   DQ228_RS09170 Genome accession   NZ_CP030250
Coordinates   1779511..1779837 (-) Length   108 a.a.
NCBI ID   WP_002946126.1    Uniprot ID   A0A0E2Q1B3
Organism   Streptococcus thermophilus strain CS20     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1774511..1784837
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQ228_RS09125 - 1774776..1775114 (-) 339 WP_224103197.1 CPBP family intramembrane glutamic endopeptidase -
  DQ228_RS09130 - 1775158..1775433 (-) 276 WP_011681681.1 hypothetical protein -
  DQ228_RS09135 - 1775445..1775643 (-) 199 Protein_1785 helix-turn-helix transcriptional regulator -
  DQ228_RS09140 - 1775892..1777085 (-) 1194 WP_014608740.1 acetate kinase -
  DQ228_RS09145 comYH 1777141..1778097 (-) 957 WP_011226622.1 class I SAM-dependent methyltransferase Machinery gene
  DQ228_RS09150 comGG 1778142..1778459 (-) 318 WP_011681685.1 competence type IV pilus minor pilin ComGG -
  DQ228_RS09155 comYF 1778437..1778874 (-) 438 WP_011681686.1 competence type IV pilus minor pilin ComGF Machinery gene
  DQ228_RS09160 comGE 1778858..1779151 (-) 294 WP_011226625.1 competence type IV pilus minor pilin ComGE -
  DQ228_RS09165 comYD 1779123..1779551 (-) 429 WP_011226626.1 competence type IV pilus minor pilin ComGD Machinery gene
  DQ228_RS09170 comYC 1779511..1779837 (-) 327 WP_002946126.1 competence type IV pilus major pilin ComGC Machinery gene
  DQ228_RS09175 comYB 1779834..1780934 (-) 1101 WP_120764773.1 competence type IV pilus assembly protein ComGB Machinery gene
  DQ228_RS09180 comGA/cglA/cilD 1780816..1781757 (-) 942 WP_022096896.1 competence type IV pilus ATPase ComGA Machinery gene
  DQ228_RS09185 - 1781838..1782200 (-) 363 WP_014622006.1 DUF1033 family protein -

Sequence


Protein


Download         Length: 108 a.a.        Molecular weight: 11914.00 Da        Isoelectric Point: 9.9219

>NTDB_id=300452 DQ228_RS09170 WP_002946126.1 1779511..1779837(-) (comYC) [Streptococcus thermophilus strain CS20]
MKLMLKKLNAVKLRAFTLIEMLVVLLIISILLLLFVPNLSKQKDSVKETGNAAVVKVVDSQAELYEMKNNKTASLAALVS
EGQITQKQADSYNDYYAKHGGESRSVAN

Nucleotide


Download         Length: 327 bp        

>NTDB_id=300452 DQ228_RS09170 WP_002946126.1 1779511..1779837(-) (comYC) [Streptococcus thermophilus strain CS20]
ATGAAATTGATGTTAAAAAAATTGAATGCCGTTAAATTACGAGCCTTTACTCTGATTGAAATGCTAGTCGTACTTCTCAT
TATCAGTATTCTCCTTTTGCTCTTTGTTCCTAACTTAAGCAAGCAGAAGGATTCCGTTAAGGAAACTGGAAATGCGGCTG
TAGTCAAGGTCGTGGATTCTCAGGCAGAACTTTATGAAATGAAGAATAACAAGACAGCTAGCTTGGCAGCTCTTGTTTCA
GAAGGTCAAATTACGCAAAAACAGGCAGATTCATACAATGATTACTATGCGAAACATGGTGGTGAAAGCCGCTCAGTGGC
CAATTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0E2Q1B3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYC Streptococcus mutans UA140

69.231

96.296

0.667

  comYC Streptococcus mutans UA159

69.231

96.296

0.667

  comYC Streptococcus gordonii str. Challis substr. CH1

63.81

97.222

0.62

  comGC/cglC Streptococcus mitis SK321

62.264

98.148

0.611

  comGC/cglC Streptococcus mitis NCTC 12261

58.333

100

0.583

  comGC/cglC Streptococcus pneumoniae D39

58.491

98.148

0.574

  comGC/cglC Streptococcus pneumoniae R6

58.491

98.148

0.574

  comGC/cglC Streptococcus pneumoniae Rx1

58.491

98.148

0.574

  comGC/cglC Streptococcus pneumoniae TIGR4

58.491

98.148

0.574

  comGC Lactococcus lactis subsp. cremoris KW2

56.604

98.148

0.556

  comYC Streptococcus suis isolate S10

60.227

81.481

0.491

  comGC Staphylococcus aureus MW2

50

72.222

0.361

  comGC Staphylococcus aureus N315

50

72.222

0.361


Multiple sequence alignment