Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGB   Type   Machinery gene
Locus tag   U471_RS11460 Genome accession   NC_022653
Coordinates   2428269..2429306 (-) Length   345 a.a.
NCBI ID   WP_012117984.1    Uniprot ID   -
Organism   Bacillus amyloliquefaciens CC178     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2423269..2434306
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  U471_RS11410 (U471_23730) tasA 2423503..2424288 (-) 786 WP_007408329.1 biofilm matrix protein TasA -
  U471_RS11415 (U471_23740) sipW 2424353..2424937 (-) 585 WP_012117977.1 signal peptidase I SipW -
  U471_RS11420 (U471_23750) tapA 2424909..2425580 (-) 672 WP_012117978.1 amyloid fiber anchoring/assembly protein TapA -
  U471_RS11425 (U471_23780) - 2425839..2426168 (+) 330 WP_012117979.1 DUF3889 domain-containing protein -
  U471_RS11430 (U471_23790) - 2426208..2426387 (-) 180 WP_003153093.1 YqzE family protein -
  U471_RS11435 (U471_23800) comGG 2426444..2426821 (-) 378 WP_012117980.1 competence type IV pilus minor pilin ComGG Machinery gene
  U471_RS11440 (U471_23810) comGF 2426822..2427322 (-) 501 WP_012117981.1 competence type IV pilus minor pilin ComGF -
  U471_RS11445 (U471_23820) comGE 2427231..2427545 (-) 315 WP_012117982.1 competence type IV pilus minor pilin ComGE -
  U471_RS11450 (U471_23830) comGD 2427529..2427966 (-) 438 WP_012117983.1 competence type IV pilus minor pilin ComGD Machinery gene
  U471_RS11455 (U471_23840) comGC 2427956..2428222 (-) 267 WP_042635730.1 competence type IV pilus major pilin ComGC Machinery gene
  U471_RS11460 (U471_23850) comGB 2428269..2429306 (-) 1038 WP_012117984.1 competence type IV pilus assembly protein ComGB Machinery gene
  U471_RS11465 (U471_23860) comGA 2429293..2430363 (-) 1071 WP_012117985.1 competence type IV pilus ATPase ComGA Machinery gene
  U471_RS11470 (U471_23870) - 2430560..2431510 (-) 951 WP_007408319.1 magnesium transporter CorA family protein -
  U471_RS11475 (U471_23880) - 2431656..2432957 (+) 1302 WP_012117986.1 hemolysin family protein -
  U471_RS11480 (U471_23890) - 2432995..2433825 (-) 831 WP_012117987.1 STAS domain-containing protein -

Sequence


Protein


Download         Length: 345 a.a.        Molecular weight: 39527.77 Da        Isoelectric Point: 9.4012

>NTDB_id=63174 U471_RS11460 WP_012117984.1 2428269..2429306(-) (comGB) [Bacillus amyloliquefaciens CC178]
MKRINRTWPLKDQAAFLKRLGEMTEKGYSLIEGLRLLQLQLHKRQLAELTDGIRRLRQGDAFYQVLEALSFHKEAVSICY
FAEKHGELPGAMKQSGELLQRKLMQTDQIKKMLRYPMFLISSVCVMFYILQSLIIPQFAGIYQSMNMNTSGATAFIFAFF
RHFHEACALALSAAFCLFLYVWFLCKKKSPQDKMLIVVKIPLLGKAAVLFNSYFFSLQLSSLLKSGLSIYDSLTAFKEQS
FLPFYREEAEMLITRLKAGETIESALSGHPCYEKDLAAAVSHGQANGLLHRELYTYSQFMMERLEQNAAKYTGILQPVIY
GVVAGMILIVYMSMLMPMYQMMNQM

Nucleotide


Download         Length: 1038 bp        

>NTDB_id=63174 U471_RS11460 WP_012117984.1 2428269..2429306(-) (comGB) [Bacillus amyloliquefaciens CC178]
ATGAAGCGGATTAACCGAACCTGGCCCTTGAAGGATCAAGCAGCTTTTTTGAAAAGGCTCGGTGAAATGACGGAAAAAGG
ATACAGTCTGATTGAAGGGCTCAGACTGCTGCAACTCCAGCTGCATAAACGTCAGCTTGCAGAGCTGACGGACGGAATCC
GCCGGCTGAGGCAAGGTGATGCGTTTTACCAAGTGCTTGAGGCACTGTCTTTTCACAAAGAAGCCGTCTCTATCTGCTAT
TTCGCTGAGAAACACGGGGAATTGCCGGGGGCGATGAAACAAAGCGGCGAATTATTACAGAGAAAGCTCATGCAGACGGA
TCAAATCAAAAAAATGCTCCGGTATCCGATGTTTTTAATTTCTTCCGTCTGTGTCATGTTTTATATCCTGCAAAGTTTAA
TTATTCCGCAGTTTGCTGGGATTTATCAATCAATGAATATGAATACTTCCGGTGCGACTGCTTTCATCTTTGCGTTTTTT
CGGCATTTTCATGAAGCCTGCGCACTGGCGCTGTCCGCCGCTTTCTGCTTGTTTTTGTACGTTTGGTTCTTATGTAAAAA
AAAATCCCCGCAAGATAAAATGCTGATTGTTGTGAAAATACCACTATTAGGAAAAGCGGCCGTTCTTTTTAACAGCTACT
TTTTCTCTTTACAGCTCAGCAGTCTTCTGAAATCCGGCCTATCTATCTATGACAGCTTAACAGCTTTTAAAGAACAATCT
TTTCTGCCTTTTTATCGTGAAGAAGCAGAGATGCTCATTACACGTCTGAAAGCCGGAGAAACCATAGAATCCGCTCTTTC
CGGGCATCCGTGTTATGAAAAAGACCTTGCGGCTGCAGTCAGCCACGGACAGGCAAACGGCCTGCTTCACCGCGAACTTT
ATACGTACAGCCAGTTTATGATGGAGCGTCTTGAACAAAACGCGGCGAAGTATACCGGCATTCTCCAGCCTGTCATTTAC
GGTGTTGTGGCCGGTATGATTTTAATTGTCTACATGTCAATGCTGATGCCGATGTACCAGATGATGAACCAAATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB Bacillus subtilis subsp. subtilis str. 168

65.635

93.623

0.614


Multiple sequence alignment