Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGB   Type   Machinery gene
Locus tag   QPL78_RS12730 Genome accession   NZ_CP126530
Coordinates   2479127..2480164 (-) Length   345 a.a.
NCBI ID   WP_284559286.1    Uniprot ID   -
Organism   Bacillus halotolerans strain Tehuacan_S4     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2474127..2485164
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QPL78_RS12680 (QPL78_12680) tasA 2474220..2475005 (-) 786 WP_101864524.1 biofilm matrix protein TasA -
  QPL78_RS12685 (QPL78_12685) - 2475070..2475654 (-) 585 WP_284559282.1 signal peptidase I -
  QPL78_RS12690 (QPL78_12690) tapA 2475626..2476387 (-) 762 WP_284559283.1 amyloid fiber anchoring/assembly protein TapA -
  QPL78_RS12695 (QPL78_12695) - 2476664..2476987 (+) 324 WP_024122040.1 YqzG/YhdC family protein -
  QPL78_RS12700 (QPL78_12700) - 2477030..2477209 (-) 180 WP_003236949.1 YqzE family protein -
  QPL78_RS12705 (QPL78_12705) comGG 2477281..2477655 (-) 375 WP_106020094.1 competence type IV pilus minor pilin ComGG Machinery gene
  QPL78_RS12710 (QPL78_12710) comGF 2477656..2478039 (-) 384 WP_284559284.1 competence type IV pilus minor pilin ComGF Machinery gene
  QPL78_RS12715 (QPL78_12715) comGE 2478065..2478412 (-) 348 WP_284559285.1 competence type IV pilus minor pilin ComGE Machinery gene
  QPL78_RS12720 (QPL78_12720) comGD 2478396..2478827 (-) 432 WP_255003480.1 competence type IV pilus minor pilin ComGD Machinery gene
  QPL78_RS12725 (QPL78_12725) comGC 2478817..2479113 (-) 297 WP_010334925.1 competence type IV pilus major pilin ComGC Machinery gene
  QPL78_RS12730 (QPL78_12730) comGB 2479127..2480164 (-) 1038 WP_284559286.1 competence type IV pilus assembly protein ComGB Machinery gene
  QPL78_RS12735 (QPL78_12735) comGA 2480151..2481221 (-) 1071 WP_095713275.1 competence protein ComGA Machinery gene
  QPL78_RS12740 (QPL78_12740) - 2481543..2481953 (-) 411 WP_185848827.1 CBS domain-containing protein -
  QPL78_RS12745 (QPL78_12745) - 2482016..2482969 (-) 954 WP_284559287.1 magnesium transporter CorA family protein -
  QPL78_RS12750 (QPL78_12750) - 2483113..2484442 (+) 1330 Protein_2460 hemolysin family protein -

Sequence


Protein


Download         Length: 345 a.a.        Molecular weight: 39994.26 Da        Isoelectric Point: 9.6928

>NTDB_id=838494 QPL78_RS12730 WP_284559286.1 2479127..2480164(-) (comGB) [Bacillus halotolerans strain Tehuacan_S4]
MKQIRKVWPLMDQAYLLKRLGEMTAGGYSLLDGLRVMQLQMNKRQLADLANAIRRLREGESFYNVLKSLSFHKEAVGICY
FAETHGELPASMIQSGELLERKASQAEQIKRVLRYPLFLIFTVVVMLYMLQNIIIPQFSGIYQSMNVETSRSTDIIFAFF
QHLDAVFILMAILAAGIGLYYWFVFKKKSPDHQMLICVSIPLFGKLVRLFNSYFFSLQLSSLLASGLSIYESLKAFRKQT
FLPFYRHEAEQMIERLKAGETIEAALCKRPFYENDFSKVISHGQLSGRLDRELFTYSQFILQRLEDKSQKWTGILQPIIY
GFVAGMILIVYLSMLLPMYQMMNQM

Nucleotide


Download         Length: 1038 bp        

>NTDB_id=838494 QPL78_RS12730 WP_284559286.1 2479127..2480164(-) (comGB) [Bacillus halotolerans strain Tehuacan_S4]
ATGAAACAGATTAGAAAAGTTTGGCCGTTAATGGATCAAGCCTATTTACTGAAAAGGCTGGGTGAAATGACAGCGGGCGG
ATACAGTCTTTTAGACGGATTGCGCGTGATGCAGCTGCAAATGAATAAAAGGCAGCTGGCGGACTTGGCTAACGCAATAA
GACGGTTGAGAGAAGGGGAATCGTTTTACAATGTATTAAAGAGCTTGTCATTTCATAAGGAAGCCGTCGGTATTTGTTAT
TTTGCTGAAACACATGGGGAATTGCCGGCTTCCATGATTCAGAGCGGAGAGCTGCTGGAACGAAAAGCAAGCCAGGCAGA
ACAGATAAAAAGGGTGTTGCGATATCCTCTTTTCCTCATCTTTACCGTTGTTGTCATGCTTTATATGCTGCAAAACATCA
TCATTCCTCAGTTTTCCGGTATTTATCAATCGATGAATGTAGAAACGTCACGCTCAACAGACATCATTTTTGCATTTTTT
CAGCACCTTGATGCTGTGTTCATATTAATGGCCATTTTGGCTGCAGGAATAGGTCTTTATTATTGGTTTGTGTTCAAGAA
AAAATCCCCTGACCATCAAATGCTCATTTGCGTAAGCATTCCTTTGTTTGGAAAGCTGGTCAGGCTGTTTAACAGCTACT
TTTTTTCTTTACAGCTGAGCAGTCTGCTTGCATCAGGTCTTTCTATTTATGAAAGTTTGAAAGCATTTAGGAAGCAAACG
TTTCTCCCTTTTTACCGCCATGAAGCTGAACAAATGATTGAACGGTTAAAAGCCGGTGAAACCATTGAAGCTGCTTTATG
CAAACGGCCTTTTTACGAAAATGATTTTTCAAAAGTGATATCTCATGGCCAGCTAAGCGGCCGCTTGGATCGGGAACTCT
TCACATACAGCCAGTTTATATTGCAGCGGCTTGAGGACAAGTCACAAAAATGGACAGGCATTCTTCAGCCCATTATTTAC
GGATTTGTCGCAGGGATGATTTTAATTGTTTATCTATCGATGCTCCTGCCTATGTATCAGATGATGAATCAAATGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB Bacillus subtilis subsp. subtilis str. 168

81.424

93.623

0.762