Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGB
Type: Machinery gene
Locus tag: BSO20_RS00215
Genome accession: NZ_CP018200
Coordinates: 34811..35848 (+)
Length: 345 a.a.
NCBI ID: WP_032874014.1
Uniprot ID: -
Organism: Bacillus amyloliquefaciens strain WS-8
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
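
The coordinates and the protein length listed above are linked by simple arithmetic. The following is a minimal sketch, assuming the span is 1-based and inclusive and that it ends with a single stop codon; the helper name `cds_lengths` is hypothetical and not part of the database.

```python
from typing import Tuple


def cds_lengths(start: int, end: int) -> Tuple[int, int]:
    """Return (nucleotide length, protein length) for a CDS span.

    Assumptions: coordinates are 1-based and inclusive, and the span
    includes exactly one terminal stop codon.
    """
    n_bp = end - start + 1
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute a residue.
    return n_bp, n_bp // 3 - 1


print(cds_lengths(34811, 35848))  # (1038, 345)
```

For comGB this gives 1038 bp and 345 a.a., matching the values reported in this entry.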

Genomic Context


Location: 29811..40848
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
BSO20_RS00195 (BSO20_00195) | - | 30292..31122 (+) | 831 | WP_032874008.1 | STAS domain-containing protein | -
BSO20_RS00200 (BSO20_00200) | - | 31160..32461 (-) | 1302 | WP_032874010.1 | hemolysin family protein | -
BSO20_RS00205 (BSO20_00205) | - | 32607..33557 (+) | 951 | WP_032874012.1 | magnesium transporter CorA family protein | -
BSO20_RS00210 (BSO20_00210) | comGA | 33754..34824 (+) | 1071 | WP_003153083.1 | competence type IV pilus ATPase ComGA | Machinery gene
BSO20_RS00215 (BSO20_00215) | comGB | 34811..35848 (+) | 1038 | WP_032874014.1 | competence type IV pilus assembly protein ComGB | Machinery gene
BSO20_RS00220 (BSO20_00220) | comGC | 35895..36161 (+) | 267 | WP_042635730.1 | competence type IV pilus major pilin ComGC | Machinery gene
BSO20_RS00225 (BSO20_00225) | comGD | 36151..36588 (+) | 438 | WP_007612572.1 | competence type IV pilus minor pilin ComGD | Machinery gene
BSO20_RS00230 (BSO20_00230) | comGE | 36572..36886 (+) | 315 | WP_032874016.1 | competence type IV pilus minor pilin ComGE | Machinery gene
BSO20_RS00235 (BSO20_00235) | comGF | 36795..37295 (+) | 501 | WP_223204419.1 | competence type IV pilus minor pilin ComGF | -
BSO20_RS00240 (BSO20_00240) | comGG | 37296..37673 (+) | 378 | WP_032874019.1 | competence type IV pilus minor pilin ComGG | Machinery gene
BSO20_RS00245 (BSO20_00245) | - | 37730..37909 (+) | 180 | WP_022552966.1 | YqzE family protein | -
BSO20_RS00250 (BSO20_00250) | - | 37950..38279 (-) | 330 | WP_032874021.1 | DUF3889 domain-containing protein | -
BSO20_RS00255 (BSO20_00255) | tapA | 38538..39209 (+) | 672 | WP_032874023.1 | amyloid fiber anchoring/assembly protein TapA | -
BSO20_RS00260 (BSO20_00260) | sipW | 39181..39765 (+) | 585 | WP_032874025.1 | signal peptidase I SipW | -
BSO20_RS00265 (BSO20_00265) | tasA | 39830..40615 (+) | 786 | WP_032874027.1 | biofilm matrix protein TasA | -
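
The Location window 29811..40848 matches the comGB span extended by 5,000 bp on each side, and several neighbouring comG genes share a few bases at their boundaries. The sketch below reproduces both observations. The 5,000 bp flank is inferred from the numbers in this entry rather than stated by it, and the helper names `context_window` and `overlap_bp` are hypothetical.

```python
FLANK = 5000  # assumption: inferred from 34811 - 29811 and 40848 - 35848


def context_window(start, end, flank=FLANK):
    """Extend a 1-based inclusive span by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def overlap_bp(a, b):
    """Number of positions shared by two 1-based inclusive spans."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Coordinates copied from the Genomic Context table (all on the + strand).
com_g = [
    ("comGA", (33754, 34824)),
    ("comGB", (34811, 35848)),
    ("comGC", (35895, 36161)),
    ("comGD", (36151, 36588)),
    ("comGE", (36572, 36886)),
    ("comGF", (36795, 37295)),
    ("comGG", (37296, 37673)),
]

print(context_window(34811, 35848))  # (29811, 40848)
for (name_a, span_a), (name_b, span_b) in zip(com_g, com_g[1:]):
    print(name_a, name_b, overlap_bp(span_a, span_b))
```

With these coordinates, comGA and comGB share 14 bp, while comGB and comGC do not overlap.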

Sequence


Protein


Length: 345 a.a.        Molecular weight: 39525.83 Da        Isoelectric Point: 9.5935

>NTDB_id=207095 BSO20_RS00215 WP_032874014.1 34811..35848(+) (comGB) [Bacillus amyloliquefaciens strain WS-8]
MKRIKRTWPLKDQAAFLKRLGEMTEKGYSLIEGLRLLKLQLHKRQLAELTDGIRRLREGDAFYQVLEALSFHKEAVSICY
FAEKHGELPGAMKQSGDLLQRKLIQTNQIKKMLRYPMFLISSVCVMFYILQSLIIPQFSGIYQSMNMNTSGATAFIFAFF
RHFHEACALALSAAFCLFLYVWFLCKKKSPQDKMLIVVKIPLLGKAAVLFNSYFFSLQLSSLLKSGLSIYDSLTAFKEQS
FLPFYREEAEMLITRLKAGETIESALSGHPCYEKDLAAAVSHGQANGLLHRELYTYSQFMMERLEQNAAKYTGILQPVIY
GVVAGMILIVYMSMLMPMYQMMNQM
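
The FASTA header above packs several fields into one line. The following sketch splits it into named fields with a regular expression. The field layout is inferred from this single header, so it is an assumption rather than a documented format, and the function name `parse_header` is hypothetical.

```python
import re

# Assumed layout, inferred from the header shown in this entry:
# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line):
    """Return the header fields as a dict, or raise ValueError if no match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=207095 BSO20_RS00215 WP_032874014.1 34811..35848(+) "
          "(comGB) [Bacillus amyloliquefaciens strain WS-8]")
print(parse_header(header)["gene"])  # comGB
```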

Nucleotide


Length: 1038 bp

>NTDB_id=207095 BSO20_RS00215 WP_032874014.1 34811..35848(+) (comGB) [Bacillus amyloliquefaciens strain WS-8]
ATGAAGCGGATTAAACGAACCTGGCCGTTGAAGGATCAAGCGGCTTTTTTGAAAAGGCTCGGTGAAATGACGGAAAAAGG
ATACAGTCTGATTGAAGGGCTAAGACTGCTGAAACTCCAGCTGCATAAACGTCAGCTTGCAGAGCTGACGGACGGAATCC
GCCGGCTGAGGGAAGGTGATGCGTTTTACCAAGTGCTTGAGGCACTGTCTTTTCACAAAGAAGCCGTCTCTATCTGCTAT
TTCGCTGAGAAACACGGGGAATTGCCGGGAGCGATGAAACAAAGCGGCGACTTGTTACAGAGAAAGCTCATACAGACGAA
TCAAATCAAAAAAATGCTCCGGTATCCGATGTTTTTAATATCTTCCGTCTGTGTCATGTTTTATATCCTGCAAAGTTTAA
TTATTCCGCAGTTTTCTGGGATTTATCAATCAATGAATATGAATACATCCGGTGCGACTGCTTTCATCTTTGCGTTTTTT
CGGCATTTTCATGAAGCCTGCGCACTGGCGCTGTCCGCCGCTTTCTGCTTGTTTTTGTACGTTTGGTTCTTATGTAAGAA
AAAATCCCCGCAAGATAAAATGCTGATTGTTGTGAAAATACCACTATTAGGAAAAGCGGCCGTTCTTTTTAACAGCTACT
TTTTCTCTTTACAGCTCAGCAGTCTTCTGAAATCCGGCCTTTCTATCTATGACAGCTTAACAGCTTTTAAAGAACAATCT
TTTCTGCCTTTTTATCGTGAAGAAGCAGAGATGCTCATTACACGTCTGAAAGCCGGAGAAACCATAGAATCCGCTCTTTC
CGGGCATCCGTGTTATGAAAAAGATCTTGCGGCTGCAGTCAGCCACGGACAGGCAAACGGCCTGCTTCACCGCGAACTTT
ATACGTACAGCCAGTTTATGATGGAGCGTCTTGAACAAAACGCGGCGAAGTATACCGGCATTCTCCAGCCTGTCATTTAC
GGTGTTGTGGCCGGTATGATTTTAATTGTCTACATGTCAATGCTGATGCCGATGTACCAGATGATGAACCAAATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comGB | Bacillus subtilis subsp. subtilis str. 168 | 65.635 | 93.623 | 0.614
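
The Ha-value listed for this hit is numerically consistent with the product of the identity and coverage fractions. This relationship is inferred from the three numbers in this entry and is an assumption, not a definition given here; the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct, coverage_pct):
    """Product of identity and coverage, both given as percentages.

    Assumption: this formula is inferred from the values in this entry.
    """
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


print(round(ha_value(65.635, 93.623), 3))  # 0.614
```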

