Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGB   Type   Machinery gene
Locus tag   NLK52_RS12175 Genome accession   NZ_CP100436
Coordinates   2378689..2379726 (-) Length   345 a.a.
NCBI ID   WP_015483430.1    Uniprot ID   -
Organism   Bacillus subtilis strain MEC_B298     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2373689..2384726
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NLK52_RS12125 (NLK52_12125) tasA 2373788..2374573 (-) 786 WP_003230183.1 biofilm matrix protein TasA -
  NLK52_RS12130 (NLK52_12130) sipW 2374637..2375209 (-) 573 WP_080030740.1 signal peptidase I SipW -
  NLK52_RS12135 (NLK52_12135) tapA 2375193..2375954 (-) 762 WP_015384085.1 amyloid fiber anchoring/assembly protein TapA -
  NLK52_RS12140 (NLK52_12140) yqzG 2376224..2376550 (+) 327 WP_015384086.1 YqzG/YhdC family protein -
  NLK52_RS12145 (NLK52_12145) spoIITA 2376592..2376771 (-) 180 WP_003230176.1 YqzE family protein -
  NLK52_RS12150 (NLK52_12150) comGG 2376843..2377217 (-) 375 WP_014480253.1 ComG operon protein ComGG Machinery gene
  NLK52_RS12155 (NLK52_12155) comGF 2377218..2377601 (-) 384 WP_015384087.1 ComG operon protein ComGF Machinery gene
  NLK52_RS12160 (NLK52_12160) comGE 2377627..2377974 (-) 348 WP_015384088.1 ComG operon protein 5 Machinery gene
  NLK52_RS12165 (NLK52_12165) comGD 2377958..2378389 (-) 432 WP_015384089.1 comG operon protein ComGD Machinery gene
  NLK52_RS12170 (NLK52_12170) comGC 2378379..2378675 (-) 297 WP_014477332.1 comG operon protein ComGC Machinery gene
  NLK52_RS12175 (NLK52_12175) comGB 2378689..2379726 (-) 1038 WP_015483430.1 comG operon protein ComGB Machinery gene
  NLK52_RS12180 (NLK52_12180) comGA 2379713..2380783 (-) 1071 WP_015483431.1 competence protein ComGA Machinery gene
  NLK52_RS12185 (NLK52_12185) corA 2381193..2382146 (-) 954 WP_015483432.1 magnesium transporter CorA -
  NLK52_RS12190 (NLK52_12190) yqhB 2382289..2383617 (+) 1329 WP_003230150.1 hemolysin family protein -
  NLK52_RS12195 (NLK52_12195) rsbRD 2383670..2384506 (-) 837 WP_015384093.1 RsbT co-antagonist protein RsbRD -

Sequence


Protein


Download         Length: 345 a.a.        Molecular weight: 39738.15 Da        Isoelectric Point: 9.6257

>NTDB_id=705438 NLK52_RS12175 WP_015483430.1 2378689..2379726(-) (comGB) [Bacillus subtilis strain MEC_B298]
MKKIRKVWSLKDQARLLKRLGEMTAGGYTLLDGLRLMELQMNKRQAADLTDTVTCLREGAPFYQVLKSLSFHKEAVGICY
FAETHGELPASMIQSGELLERKIAQSDQLKRVLRYPLFLIFTVAVMFYMLQSIIIPQFSGIYQSMNMETSRSTDMLFAFF
QHIDLVLILLALFTAGIGIYYWLVFKKKSPARQMLICVRIPLIGKLVKLFNSYFFSLQLSSLLKSGLSIYDSLNAFKHQT
FLPFYRCEAEQLIERLKAGESIEAAICGSPFYEVDFSKVISHGQLSGRLDRELFTYSQFILQRLEHKAQKWTGILQPMIY
GFVAAMILLVYLSMLVPMYQMMNQM

Nucleotide


Download         Length: 1038 bp        

>NTDB_id=705438 NLK52_RS12175 WP_015483430.1 2378689..2379726(-) (comGB) [Bacillus subtilis strain MEC_B298]
ATGAAAAAGATTAGAAAAGTCTGGTCGTTAAAGGATCAAGCCAGGTTATTAAAGCGGCTTGGTGAAATGACTGCGGGCGG
ATATACGCTTCTGGATGGATTACGCCTGATGGAACTTCAGATGAATAAGAGGCAGGCTGCTGACTTGACTGATACGGTCA
CTTGTTTGAGGGAAGGGGCTCCGTTTTATCAAGTACTAAAGAGTTTGTCATTTCATAAGGAAGCCGTAGGTATTTGTTAT
TTTGCTGAAACACATGGTGAACTGCCCGCTTCCATGATCCAGAGCGGAGAGCTGCTGGAACGAAAAATTGCACAGTCAGA
CCAGCTGAAAAGAGTGCTGCGCTATCCGCTTTTCCTCATTTTTACGGTCGCTGTCATGTTTTATATGTTACAGTCCATCA
TCATTCCTCAGTTTTCCGGTATCTATCAATCGATGAATATGGAAACCTCACGTTCAACCGATATGCTTTTTGCTTTTTTT
CAGCATATTGATCTTGTGCTCATTTTGTTGGCTCTTTTTACAGCAGGTATCGGGATTTATTATTGGCTTGTGTTTAAGAA
AAAATCACCTGCCCGGCAAATGCTGATTTGTGTCAGGATTCCTTTGATTGGAAAGCTTGTCAAGCTGTTTAACAGCTACT
TTTTTTCTTTGCAGCTAAGCAGCCTTTTAAAATCGGGCCTCTCAATTTATGACAGCCTTAATGCATTTAAACATCAAACG
TTTCTCCCTTTCTACCGCTGCGAGGCTGAACAATTGATTGAACGGCTAAAAGCCGGTGAGTCAATTGAAGCCGCTATTTG
TGGAAGCCCTTTTTATGAAGTTGATTTTTCAAAAGTCATATCTCACGGCCAGCTGAGCGGCCGATTGGATCGAGAGCTTT
TCACATACAGTCAGTTCATATTGCAGCGGCTGGAACACAAAGCGCAAAAATGGACAGGCATCCTTCAGCCGATGATTTAT
GGATTTGTTGCAGCGATGATCTTACTTGTGTATTTATCTATGCTTGTGCCTATGTATCAGATGATGAATCAAATGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB Bacillus subtilis subsp. subtilis str. 168

96.904

93.623

0.907