Detailed information    

in silico: bioinformatically predicted

Overview


Name: comGC
Type: Machinery gene
Locus tag: GC56T3_RS05655
Genome accession: NC_014206
Coordinates: 1108926..1109222 (+)
Length: 98 a.a.
NCBI ID: WP_012820551.1
UniProt ID: -
Organism: Geobacillus sp. C56-T3
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
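
The coordinates and the protein length listed in the Overview can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative only.

```python
# Coordinates from the Overview (assumed 1-based, inclusive).
start, end = 1108926, 1109222

gene_length_bp = end - start + 1      # inclusive span of the gene
codons = gene_length_bp // 3          # number of codons, stop codon included
protein_length_aa = codons - 1        # assumption: the stop codon is not counted

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the span is a whole number of codons, and the result agrees with the nucleotide and protein lengths given in the Sequence section.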

Genomic Context


Location: 1103926..1114222
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GC56T3_RS05630 (GC56T3_1049) - 1104117..1105250 (+) 1134 WP_012820545.1 class I SAM-dependent methyltransferase -
  GC56T3_RS05635 (GC56T3_1050) - 1105334..1105576 (-) 243 WP_008879854.1 DUF2626 family protein -
  GC56T3_RS05640 (GC56T3_1051) - 1105734..1106432 (-) 699 WP_013144741.1 metalloregulator ArsR/SmtB family transcription factor -
  GC56T3_RS05645 (GC56T3_1052) comGA 1106625..1107698 (+) 1074 WP_013144742.1 competence type IV pilus ATPase ComGA Machinery gene
  GC56T3_RS05650 (GC56T3_1053) comGB 1107695..1108723 (+) 1029 WP_013144743.1 competence type IV pilus assembly protein ComGB -
  GC56T3_RS05655 (GC56T3_1054) comGC 1108926..1109222 (+) 297 WP_012820551.1 competence type IV pilus major pilin ComGC Machinery gene
  GC56T3_RS05660 (GC56T3_1055) comGD 1109212..1109649 (+) 438 WP_013144744.1 competence type IV pilus minor pilin ComGD -
  GC56T3_RS05665 (GC56T3_1056) - 1109633..1109959 (+) 327 WP_013144745.1 competence protein ComG -
  GC56T3_RS05670 (GC56T3_1057) comGF 1109938..1110384 (+) 447 WP_013144746.1 competence type IV pilus minor pilin ComGF -
  GC56T3_RS05675 (GC56T3_1058) comGG 1110459..1110848 (+) 390 WP_013144747.1 competence type IV pilus minor pilin ComGG -
  GC56T3_RS05680 (GC56T3_1059) - 1110894..1111073 (+) 180 WP_011231910.1 YqzE family protein -
  GC56T3_RS05685 (GC56T3_1060) - 1111176..1111970 (-) 795 WP_013144748.1 YqhG family protein -
  GC56T3_RS05690 (GC56T3_1061) - 1111957..1113621 (-) 1665 WP_013144749.1 DEAD/DEAH box helicase -
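
The table above lists seven consecutive plus-strand genes from comGA to comGG. A short sketch can show how closely they are packed by computing the spacing between each neighboring pair. It assumes 1-based inclusive coordinates; the helper name `spacing` is hypothetical and not part of any database tooling.

```python
# (gene label, start, end) for the consecutive plus-strand genes in the table,
# comGA through comGG; coordinates assumed 1-based and inclusive.
genes = [
    ("comGA", 1106625, 1107698),
    ("comGB", 1107695, 1108723),
    ("comGC", 1108926, 1109222),
    ("comGD", 1109212, 1109649),
    ("GC56T3_RS05665", 1109633, 1109959),
    ("comGF", 1109938, 1110384),
    ("comGG", 1110459, 1110848),
]

def spacing(upstream, downstream):
    """Bases between two genes; a negative value means they overlap."""
    return downstream[1] - upstream[2] - 1

spacings = {
    f"{a[0]}->{b[0]}": spacing(a, b) for a, b in zip(genes, genes[1:])
}
for pair, gap in spacings.items():
    kind = "gap" if gap >= 0 else "overlap"
    print(f"{pair}: {kind} of {abs(gap)} bp")

# Distance from each edge of the displayed window to comGC.
window = (1103926, 1114222)
flank_left = genes[2][1] - window[0]
flank_right = window[1] - genes[2][2]
print(flank_left, flank_right)
```

Short overlaps between adjacent genes are common in tightly packed bacterial gene clusters, but the sketch only reports the arithmetic and makes no claim about how these genes are transcribed.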

Sequence


Protein


Length: 98 a.a.        Molecular weight: 10769.62 Da        Isoelectric point: 7.1323

>NTDB_id=37346 GC56T3_RS05655 WP_012820551.1 1108926..1109222(+) (comGC) [Geobacillus sp. C56-T3]
MNQKGFTLIEMLIVMMVISVLLLIAIPNMTKHNSMINSKGCEAFLNTVQAQVKAYEMEHNKIPTVEELLAGRYIKSDKCP
NGRAIQISANGDVSESGS

Nucleotide


Length: 297 bp

>NTDB_id=37346 GC56T3_RS05655 WP_012820551.1 1108926..1109222(+) (comGC) [Geobacillus sp. C56-T3]
ATGAATCAAAAGGGATTCACATTGATCGAAATGTTGATCGTTATGATGGTGATTTCTGTCCTGTTGCTGATTGCCATTCC
GAATATGACAAAGCACAACAGCATGATCAATTCGAAAGGATGCGAGGCGTTTTTGAACACCGTGCAAGCGCAGGTGAAAG
CGTATGAGATGGAGCATAACAAAATTCCAACGGTGGAGGAATTGCTCGCTGGCCGCTATATCAAGTCGGACAAATGTCCG
AACGGTCGTGCGATCCAAATCAGCGCGAACGGCGATGTGAGTGAAAGTGGCTCGTAA
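
The nucleotide and protein records can be checked against each other by translating the coding sequence. The sketch below assumes the standard genetic code (NCBI translation table 1, whose codon assignments for elongation match the bacterial table 11) and builds the codon table by hand rather than relying on an external library; the function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Sequences copied from the FASTA records above.
nucleotide = (
    "ATGAATCAAAAGGGATTCACATTGATCGAAATGTTGATCGTTATGATGGTGATTTCTGTCCTGTTGCTGATTGCCATTCC"
    "GAATATGACAAAGCACAACAGCATGATCAATTCGAAAGGATGCGAGGCGTTTTTGAACACCGTGCAAGCGCAGGTGAAAG"
    "CGTATGAGATGGAGCATAACAAAATTCCAACGGTGGAGGAATTGCTCGCTGGCCGCTATATCAAGTCGGACAAATGTCCG"
    "AACGGTCGTGCGATCCAAATCAGCGCGAACGGCGATGTGAGTGAAAGTGGCTCGTAA"
)
protein = (
    "MNQKGFTLIEMLIVMMVISVLLLIAIPNMTKHNSMINSKGCEAFLNTVQAQVKAYEMEHNKIPTVEELLAGRYIKSDKCP"
    "NGRAIQISANGDVSESGS"
)

def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

translated = translate(nucleotide)
print(len(nucleotide), len(translated), translated == protein)
```

A mismatch here would point to a copy error or to a different translation table, so the comparison is a cheap sanity check before using either sequence downstream.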


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC Bacillus subtilis subsp. subtilis str. 168 52.747 92.857 0.49
  comGC Staphylococcus aureus MW2 43.011 94.898 0.408
  comGC Staphylococcus aureus N315 43.011 94.898 0.408
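
The page does not define the Ha-value. The three rows are numerically consistent with the product of the identity and coverage fractions, which the sketch below checks. This interpretation is an inference from the listed numbers only, not a documented definition, and the function name `identity_times_coverage` is hypothetical.

```python
# (organism, identity %, coverage %, listed Ha-value) from the table above.
hits = [
    ("Bacillus subtilis subsp. subtilis str. 168", 52.747, 92.857, 0.49),
    ("Staphylococcus aureus MW2", 43.011, 94.898, 0.408),
    ("Staphylococcus aureus N315", 43.011, 94.898, 0.408),
]

def identity_times_coverage(identity_pct, coverage_pct):
    """Hypothetical score: product of the two percentages taken as fractions."""
    return (identity_pct / 100) * (coverage_pct / 100)

for organism, identity, coverage, listed in hits:
    score = identity_times_coverage(identity, coverage)
    print(f"{organism}: computed {score:.3f}, listed {listed}")
```

If the computed and listed values diverged for other entries, the assumed relationship would need to be revisited.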


Multiple sequence alignment