Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: GC56T3_RS05645
Genome accession: NC_014206
Coordinates: 1106625..1107698 (+)
Length: 357 a.a.
NCBI ID: WP_013144742.1
UniProt ID: -
Organism: Geobacillus sp. C56-T3
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 1101625..1112698
Locus tag (old locus tag)   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
GC56T3_RS18410 (GC56T3_1047)   -   1102819..1102995 (-)   177   WP_011231921.1   DUF2759 domain-containing protein   -
GC56T3_RS05625 (GC56T3_1048)   -   1103164..1103796 (+)   633   WP_012820544.1   MBL fold metallo-hydrolase   -
GC56T3_RS05630 (GC56T3_1049)   -   1104117..1105250 (+)   1134   WP_012820545.1   class I SAM-dependent methyltransferase   -
GC56T3_RS05635 (GC56T3_1050)   -   1105334..1105576 (-)   243   WP_008879854.1   DUF2626 family protein   -
GC56T3_RS05640 (GC56T3_1051)   -   1105734..1106432 (-)   699   WP_013144741.1   metalloregulator ArsR/SmtB family transcription factor   -
GC56T3_RS05645 (GC56T3_1052)   comGA   1106625..1107698 (+)   1074   WP_013144742.1   competence type IV pilus ATPase ComGA   Machinery gene
GC56T3_RS05650 (GC56T3_1053)   comGB   1107695..1108723 (+)   1029   WP_013144743.1   competence type IV pilus assembly protein ComGB   -
GC56T3_RS05655 (GC56T3_1054)   comGC   1108926..1109222 (+)   297   WP_012820551.1   competence type IV pilus major pilin ComGC   Machinery gene
GC56T3_RS05660 (GC56T3_1055)   comGD   1109212..1109649 (+)   438   WP_013144744.1   competence type IV pilus minor pilin ComGD   -
GC56T3_RS05665 (GC56T3_1056)   -   1109633..1109959 (+)   327   WP_013144745.1   competence protein ComG   -
GC56T3_RS05670 (GC56T3_1057)   comGF   1109938..1110384 (+)   447   WP_013144746.1   competence type IV pilus minor pilin ComGF   -
GC56T3_RS05675 (GC56T3_1058)   comGG   1110459..1110848 (+)   390   WP_013144747.1   competence type IV pilus minor pilin ComGG   -
GC56T3_RS05680 (GC56T3_1059)   -   1110894..1111073 (+)   180   WP_011231910.1   YqzE family protein   -
GC56T3_RS05685 (GC56T3_1060)   -   1111176..1111970 (-)   795   WP_013144748.1   YqhG family protein   -

Sequence


Protein


Length: 357 a.a.   Molecular weight: 39650.13 Da   Isoelectric point: 8.1836

>NTDB_id=37345 GC56T3_RS05645 WP_013144742.1 1106625..1107698(+) (comGA) [Geobacillus sp. C56-T3]
MLNEIEQTANRLLAEAVQRRASDLHLVPRRRDAAVRLRLDGMLVDVGALPKETAERLIAHFKFLAGMDIGERRRPQSGAM
EVAEFGETVYLRLSTLPTLYDESLVIRLLPQRLPLPLRELSLFFHSTARLFSFMQHPQGLVLLTGPTGSGKTTTLYTLLD
LCQAERQRNIITLEDPIEKQNERLLQVQINEKAGITYAAGLKAALRHDPDVLMVGEIRDHDTAAIAIRSALSGHLVVSTM
HAADAVGAVYRLHEFGIPLGDLAETLLAVSAQRLVELCCPLCGDDCHPSCGRLGRRRRTAVYELLCGSALEDVIHFLSNG
RGKPKRSYMTLARLIRKGIVLGYLPVRMLELVGGEER

Nucleotide


Length: 1074 bp

>NTDB_id=37345 GC56T3_RS05645 WP_013144742.1 1106625..1107698(+) (comGA) [Geobacillus sp. C56-T3]
GTGCTGAACGAGATCGAACAAACGGCAAACCGCCTTCTCGCCGAAGCGGTGCAGCGCCGCGCCTCGGATCTCCACCTCGT
TCCGCGCCGCCGTGATGCCGCCGTTCGTCTTCGGCTCGATGGCATGCTTGTCGACGTTGGCGCTCTTCCGAAAGAAACGG
CGGAACGTCTCATCGCCCACTTTAAATTTTTAGCCGGCATGGACATCGGGGAACGACGCCGTCCGCAAAGCGGCGCCATG
GAAGTGGCGGAGTTTGGGGAAACGGTGTATTTGCGTTTATCGACATTGCCAACGCTCTACGACGAAAGTCTTGTCATTCG
ACTTCTGCCGCAGCGCCTCCCGTTGCCGCTTCGCGAACTATCCTTATTCTTCCACTCCACCGCACGATTATTTTCCTTCA
TGCAGCATCCGCAAGGACTCGTGCTGTTGACCGGTCCGACGGGGTCGGGAAAAACGACGACCCTGTACACCCTTCTTGAT
CTTTGCCAAGCGGAAAGGCAACGCAACATCATCACGCTTGAAGATCCAATCGAAAAACAGAACGAACGACTGTTGCAAGT
GCAAATCAATGAAAAAGCAGGCATCACATACGCCGCTGGGTTAAAAGCGGCGCTGCGCCATGACCCGGATGTGTTGATGG
TCGGCGAGATCCGCGATCATGACACGGCGGCCATTGCCATCCGTTCAGCGTTGAGCGGCCATTTGGTCGTCTCGACGATG
CACGCCGCCGATGCAGTCGGCGCCGTCTACCGGCTGCATGAATTCGGCATTCCGCTCGGCGATTTGGCTGAGACGCTGCT
TGCCGTCTCTGCCCAGCGGCTTGTCGAGCTTTGCTGCCCGTTGTGCGGCGACGATTGCCATCCATCGTGCGGCCGTCTTG
GGCGCCGCCGGCGCACGGCTGTTTATGAGTTGCTTTGCGGTTCAGCCCTGGAGGACGTGATTCATTTTCTCTCCAATGGG
AGAGGGAAGCCCAAGCGCTCTTATATGACGCTCGCTCGCTTGATCCGCAAGGGGATTGTGCTTGGCTACCTGCCCGTCCG
CATGCTTGAGCTCGTCGGAGGGGAAGAAAGATGA
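
The coordinates, nucleotide length, and protein length reported for this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank-style convention, not stated on this page) and that the coding sequence includes its stop codon, as the trailing TGA in the nucleotide sequence suggests. The function names are hypothetical helpers, not part of any database API.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Number of residues encoded by a CDS whose final codon is a stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


# comGA coordinates as listed in this record: 1106625..1107698 (+)
cds_bp = feature_length_bp(1106625, 1107698)
print(cds_bp, protein_length_aa(cds_bp))  # prints: 1074 357
```

Under these assumptions the result agrees with the 1074 bp and 357 a.a. lengths listed for the gene.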


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comGA   Bacillus subtilis subsp. subtilis str. 168   50.291   96.359   0.485
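
This page does not define the Ha-value. One possible reading, offered here only as an assumption, is that it is the product of the identity and coverage fractions. The values listed for the Bacillus subtilis comGA hit are numerically consistent with that reading, as the sketch below shows. The function name is hypothetical.

```python
def identity_coverage_product(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, each converted from percent to a fraction.

    Treating this product as the Ha-value is an assumption, not a documented definition.
    """
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values listed for the B. subtilis 168 comGA hit in this record
score = identity_coverage_product(50.291, 96.359)
print(round(score, 3))  # prints: 0.485
```

Agreement on a single row does not confirm the definition; the database's own documentation should be consulted before relying on it.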


Multiple sequence alignment