Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGB   Type   Machinery gene
Locus tag   WHL51_RS12365 Genome accession   NZ_CP148102
Coordinates   2421126..2422163 (-) Length   345 a.a.
NCBI ID   WP_003226315.1    Uniprot ID   -
Organism   Bacillus spizizenii str. W23     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2416126..2427163
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  WHL51_RS12315 tasA 2416226..2417011 (-) 786 WP_003226340.1 biofilm matrix protein TasA -
  WHL51_RS12320 sipW 2417075..2417647 (-) 573 WP_003226338.1 signal peptidase I SipW -
  WHL51_RS12325 tapA 2417631..2418392 (-) 762 WP_003226335.1 amyloid fiber anchoring/assembly protein TapA -
  WHL51_RS12330 - 2418661..2418987 (+) 327 WP_079996407.1 YqzG/YhdC family protein -
  WHL51_RS12335 - 2419029..2419208 (-) 180 WP_003226330.1 YqzE family protein -
  WHL51_RS12340 comGG 2419280..2419654 (-) 375 WP_003226328.1 competence type IV pilus minor pilin ComGG Machinery gene
  WHL51_RS12345 comGF 2419655..2420038 (-) 384 WP_032727308.1 competence type IV pilus minor pilin ComGF Machinery gene
  WHL51_RS12350 comGE 2420064..2420411 (-) 348 WP_003226323.1 competence type IV pilus minor pilin ComGE Machinery gene
  WHL51_RS12355 comGD 2420395..2420826 (-) 432 WP_003226321.1 competence type IV pilus minor pilin ComGD Machinery gene
  WHL51_RS12360 comGC 2420816..2421112 (-) 297 WP_003226319.1 comG operon protein ComGC Machinery gene
  WHL51_RS12365 comGB 2421126..2422163 (-) 1038 WP_003226315.1 competence type IV pilus assembly protein ComGB Machinery gene
  WHL51_RS12370 comGA 2422150..2423220 (-) 1071 WP_003226312.1 competence protein ComGA Machinery gene
  WHL51_RS12375 - 2423433..2423843 (-) 411 WP_032727412.1 CBS domain-containing protein -
  WHL51_RS12380 - 2423906..2424859 (-) 954 WP_003226308.1 magnesium transporter CorA family protein -
  WHL51_RS12385 - 2425003..2426331 (+) 1329 WP_044428668.1 hemolysin family protein -

Sequence


Protein


Download         Length: 345 a.a.        Molecular weight: 39999.27 Da        Isoelectric Point: 9.3768

>NTDB_id=950438 WHL51_RS12365 WP_003226315.1 2421126..2422163(-) (comGB) [Bacillus spizizenii str. W23]
MKQIRKTWKLKDQAQLLKRLGEMTEGGYTLLDGIRLMELQMNKRQVADLADAITRLREGAPFYQVLKSLSFHKEAVGICY
FAEIHGELPASMIQSGELLERKIAQADQLKRVLRYPLFLIFTVAVMFYMLQAVIIPQFSGIYQSMNMETTRSTDMLFAFF
QHIDLVIILLALFAAGIGLYYWLVFKKKSPARQMLIWVRIPLVGNLVRLFNSYFFSLQLSSLLQSGLSIYDSLNAFKHQT
FLPFYRCEAEQVIERLKAGESIETAICESPYYETDFSKVISHGQLSGRLDRELFTYSQFILQRLEHKAQKWTGILQPMIY
GFVAAMILIVYLSMLLPMYQMMNQM

Nucleotide


Download         Length: 1038 bp        

>NTDB_id=950438 WHL51_RS12365 WP_003226315.1 2421126..2422163(-) (comGB) [Bacillus spizizenii str. W23]
ATGAAACAGATTAGAAAAACCTGGAAGTTAAAGGATCAAGCCCAGTTATTGAAGCGGCTCGGTGAAATGACGGAGGGCGG
ATATACGCTTCTGGATGGAATACGGCTGATGGAGCTTCAGATGAATAAGAGGCAGGTGGCTGACTTGGCTGACGCGATCA
CCCGTTTGAGGGAAGGGGCTCCGTTTTATCAAGTATTAAAGAGTTTGTCATTTCATAAGGAAGCCGTAGGTATTTGTTAT
TTTGCCGAGATACATGGTGAACTGCCTGCCTCCATGATCCAAAGCGGAGAGCTGCTGGAACGAAAAATTGCACAGGCAGA
CCAGCTGAAAAGGGTGCTGCGATATCCGCTTTTCCTCATCTTCACAGTCGCTGTCATGTTTTATATGTTACAGGCCGTCA
TCATTCCTCAGTTTTCCGGTATTTATCAATCGATGAATATGGAAACCACACGTTCAACTGATATGCTTTTTGCTTTTTTT
CAGCATATTGACCTTGTAATCATATTATTGGCTCTTTTTGCAGCAGGTATCGGGCTTTATTATTGGCTTGTGTTTAAGAA
AAAATCACCTGCCCGGCAAATGCTTATTTGGGTCAGAATTCCTTTGGTTGGAAATCTTGTCAGGCTGTTTAACAGCTACT
TTTTTTCTTTGCAGCTAAGCAGCCTTTTACAATCAGGTCTCTCAATATATGACAGCCTTAACGCATTTAAACATCAAACG
TTTCTCCCTTTTTACCGCTGTGAGGCTGAACAAGTGATTGAACGGCTAAAAGCTGGTGAATCAATTGAAACTGCTATTTG
TGAAAGCCCTTATTATGAAACTGATTTTTCAAAAGTCATATCTCACGGACAGTTGAGCGGCCGCCTTGATCGGGAGCTTT
TCACATACAGTCAGTTCATATTGCAGCGGCTGGAACACAAGGCGCAAAAATGGACAGGCATCCTTCAACCTATGATTTAT
GGATTTGTTGCAGCGATGATCTTAATTGTGTATTTATCTATGCTTTTGCCTATGTATCAGATGATGAATCAAATGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB Bacillus subtilis subsp. subtilis str. 168

91.641

93.623

0.858