Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGB   Type   Machinery gene
Locus tag   EQZ20_RS14730 Genome accession   NZ_CP035232
Coordinates   2857594..2858631 (-) Length   345 a.a.
NCBI ID   WP_046129973.1    Uniprot ID   -
Organism   Bacillus glycinifermentans strain SRCM103574     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2852594..2863631
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQZ20_RS14680 (EQZ20_14680) - 2852634..2853428 (-) 795 WP_046129964.1 TasA family protein -
  EQZ20_RS14685 (EQZ20_14685) - 2853497..2854078 (-) 582 WP_046129965.1 signal peptidase I -
  EQZ20_RS14690 (EQZ20_14690) tapA 2854075..2854800 (-) 726 WP_046129966.1 amyloid fiber anchoring/assembly protein TapA -
  EQZ20_RS14695 (EQZ20_14695) - 2855063..2855386 (+) 324 WP_046129967.1 DUF3889 domain-containing protein -
  EQZ20_RS14700 (EQZ20_14700) - 2855471..2855653 (-) 183 WP_046129968.1 YqzE family protein -
  EQZ20_RS14705 (EQZ20_14705) comGG 2855735..2856100 (-) 366 WP_046129969.1 competence type IV pilus minor pilin ComGG -
  EQZ20_RS14710 (EQZ20_14710) comGF 2856112..2856597 (-) 486 WP_082094061.1 competence type IV pilus minor pilin ComGF -
  EQZ20_RS14715 (EQZ20_14715) comGE 2856512..2856859 (-) 348 WP_046129970.1 competence type IV pilus minor pilin ComGE -
  EQZ20_RS14720 (EQZ20_14720) comGD 2856843..2857286 (-) 444 WP_046129971.1 competence type IV pilus minor pilin ComGD -
  EQZ20_RS14725 (EQZ20_14725) comGC 2857286..2857579 (-) 294 WP_046129972.1 competence type IV pilus major pilin ComGC Machinery gene
  EQZ20_RS14730 (EQZ20_14730) comGB 2857594..2858631 (-) 1038 WP_046129973.1 competence type IV pilus assembly protein ComGB Machinery gene
  EQZ20_RS14735 (EQZ20_14735) comGA 2858618..2859685 (-) 1068 WP_046129974.1 competence type IV pilus ATPase ComGA Machinery gene
  EQZ20_RS14740 (EQZ20_14740) - 2859842..2860681 (-) 840 WP_046129975.1 STAS domain-containing protein -
  EQZ20_RS14750 (EQZ20_14750) - 2860894..2861274 (-) 381 WP_046129976.1 Spx/MgsR family RNA polymerase-binding regulatory protein -
  EQZ20_RS14755 (EQZ20_14755) - 2861478..2861723 (+) 246 WP_046129977.1 DUF2626 domain-containing protein -
  EQZ20_RS14760 (EQZ20_14760) - 2861772..2862410 (-) 639 WP_046129978.1 MBL fold metallo-hydrolase -
  EQZ20_RS14765 (EQZ20_14765) - 2862551..2862724 (+) 174 WP_046129979.1 DUF2759 domain-containing protein -
  EQZ20_RS14770 (EQZ20_14770) - 2862795..2863109 (-) 315 WP_046129980.1 MTH1187 family thiamine-binding protein -

Sequence


Protein


Download         Length: 345 a.a.        Molecular weight: 40075.47 Da        Isoelectric Point: 9.6632

>NTDB_id=336534 EQZ20_RS14730 WP_046129973.1 2857594..2858631(-) (comGB) [Bacillus glycinifermentans strain SRCM103574]
MRLNNIRWPLRKQAELFEKLGEMMMNGYTLLDALNMLELQFDNKQKADISFGRKKLAEGYPVFQVLNMISFHRDAVSIVY
FAEQHGNLPFAFKQSGELLSRKVDQSEKLKKAAKYPSFLVATVCLIAYIMKSAIIPQFSAIYDSMNIETSFLTSFIFLFF
DSLPLFFMCILIIAVVLLTYYVSIFRKKAPEEKMALLVKIPLAGRILKLFNSYFLSLQLSNLLTSGLSVYDSLKAFESQP
FLPFFHKEAKRMIERLKQGEALEQMLDRHPFYEKDLSKAVAHGQLNGHLHRELYSYSQFLIDRLEKKAEKWTSMLQPLIY
GFTAAMILILYLSMLLPMYQMMNQL

Nucleotide


Download         Length: 1038 bp        

>NTDB_id=336534 EQZ20_RS14730 WP_046129973.1 2857594..2858631(-) (comGB) [Bacillus glycinifermentans strain SRCM103574]
ATGAGACTGAATAACATAAGATGGCCCCTTCGGAAGCAGGCGGAGCTGTTTGAGAAGCTCGGCGAAATGATGATGAACGG
GTATACGCTTCTCGACGCTTTGAATATGCTTGAGCTGCAATTCGACAATAAGCAAAAAGCGGACATTTCATTCGGAAGAA
AAAAGCTGGCAGAAGGTTATCCCGTTTTTCAGGTTTTGAATATGATTTCGTTTCATAGGGACGCCGTCAGCATTGTTTAT
TTTGCCGAGCAGCACGGAAATTTGCCATTCGCTTTTAAGCAAAGCGGCGAATTGCTCAGCCGCAAAGTCGACCAGTCCGA
AAAATTGAAGAAGGCTGCAAAATATCCGTCATTCCTGGTTGCAACGGTCTGTCTCATCGCCTATATTATGAAATCGGCGA
TTATTCCGCAGTTTTCCGCGATCTATGACTCCATGAATATCGAAACATCTTTTCTGACCTCCTTCATCTTTTTATTTTTT
GACAGCCTCCCGCTGTTTTTTATGTGCATTCTCATCATTGCTGTGGTTTTGTTAACGTATTATGTTTCGATTTTCCGAAA
AAAAGCCCCGGAAGAAAAAATGGCCCTCCTCGTTAAAATCCCGCTGGCAGGCAGAATCCTCAAGTTATTCAACAGCTATT
TTTTATCGCTTCAGCTAAGCAATCTTTTAACATCGGGATTGTCTGTATATGACAGCTTAAAAGCGTTTGAGAGCCAGCCT
TTTTTGCCGTTTTTTCACAAGGAAGCAAAGCGCATGATCGAAAGGCTGAAACAGGGGGAAGCGCTGGAACAGATGCTGGA
CAGGCATCCGTTTTATGAAAAGGATCTCTCCAAAGCGGTGGCTCACGGTCAATTGAACGGTCATCTCCACAGGGAGCTAT
ATTCATACAGCCAATTTTTGATCGACCGGCTTGAAAAGAAAGCGGAAAAGTGGACAAGCATGCTTCAGCCTCTGATTTAC
GGATTTACCGCAGCCATGATTTTAATCCTCTATTTGTCGATGCTTTTGCCAATGTATCAAATGATGAATCAGTTATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB Bacillus subtilis subsp. subtilis str. 168

54.489

93.623

0.51


Multiple sequence alignment