Detailed information
Overview
| Name | comGA | Type | Machinery gene |
| Locus tag | WF295_RS13630 | Genome accession | NZ_CP150885 |
| Coordinates | 2724492..2725556 (-) | Length | 354 a.a. |
| NCBI ID | WP_020452178.1 | Uniprot ID | - |
| Organism | Bacillus paralicheniformis strain Baplich1 | | |
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology); DNA binding and uptake |
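The coordinates and lengths in the overview are related by simple arithmetic, which can be checked with a short sketch. The function name `cds_lengths` is hypothetical, and the sketch assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length.

```python
from typing import Tuple


def cds_lengths(start: int, end: int) -> Tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and one stop codon at the 3' end.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    aa_len = nt_len // 3 - 1  # drop the stop codon
    return nt_len, aa_len


# comGA (WF295_RS13630): 2724492..2725556 on the minus strand
nt_len, aa_len = cds_lengths(2724492, 2725556)
print(nt_len, aa_len)  # 1065 354
```

Both values agree with the 1065 bp and 354 a.a. lengths given in this record.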
Genomic Context
Location: 2719492..2730556
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| WF295_RS13585 (WF295_13585) | tapA | 2719986..2720714 (-) | 729 | WP_020452169.1 | amyloid fiber anchoring/assembly protein TapA | - |
| WF295_RS13590 (WF295_13590) | - | 2720992..2721312 (+) | 321 | WP_020452170.1 | YqzG/YhdC family protein | - |
| WF295_RS13595 (WF295_13595) | - | 2721342..2721524 (-) | 183 | WP_020452171.1 | YqzE family protein | - |
| WF295_RS13600 (WF295_13600) | comGG | 2721613..2721978 (-) | 366 | WP_020452172.1 | competence type IV pilus minor pilin ComGG | - |
| WF295_RS13605 (WF295_13605) | comGF | 2721990..2722478 (-) | 489 | WP_236613657.1 | competence type IV pilus minor pilin ComGF | - |
| WF295_RS13610 (WF295_13610) | comGE | 2722387..2722734 (-) | 348 | WP_020452174.1 | competence type IV pilus minor pilin ComGE | - |
| WF295_RS13615 (WF295_13615) | comGD | 2722718..2723155 (-) | 438 | WP_020452175.1 | competence type IV pilus minor pilin ComGD | - |
| WF295_RS13620 (WF295_13620) | comGC | 2723161..2723454 (-) | 294 | WP_020452176.1 | competence type IV pilus major pilin ComGC | Machinery gene |
| WF295_RS13625 (WF295_13625) | comGB | 2723468..2724502 (-) | 1035 | WP_041817253.1 | competence type IV pilus assembly protein ComGB | Machinery gene |
| WF295_RS13630 (WF295_13630) | comGA | 2724492..2725556 (-) | 1065 | WP_020452178.1 | competence type IV pilus ATPase ComGA | Machinery gene |
| WF295_RS13635 (WF295_13635) | - | 2725714..2726553 (-) | 840 | WP_020452179.1 | STAS domain-containing protein | - |
| WF295_RS13645 (WF295_13645) | - | 2726795..2727172 (-) | 378 | WP_003183482.1 | Spx/MgsR family RNA polymerase-binding regulatory protein | - |
| WF295_RS13650 (WF295_13650) | - | 2727383..2727628 (+) | 246 | WP_020452180.1 | DUF2626 domain-containing protein | - |
| WF295_RS13655 (WF295_13655) | - | 2727664..2728302 (-) | 639 | WP_020452181.1 | MBL fold metallo-hydrolase | - |
| WF295_RS13660 (WF295_13660) | - | 2728458..2728631 (+) | 174 | WP_003183486.1 | DUF2759 domain-containing protein | - |
| WF295_RS13665 (WF295_13665) | - | 2728692..2729006 (-) | 315 | WP_020452182.1 | MTH1187 family thiamine-binding protein | - |
| WF295_RS13670 (WF295_13670) | - | 2729022..2730131 (-) | 1110 | WP_020452183.1 | hypothetical protein | - |
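The location range above is consistent with a 5,000 bp flank on each side of comGA (2724492 − 5000 = 2719492 and 2725556 + 5000 = 2730556). The sketch below reproduces that window and measures the overlap between adjacent genes. The flank size is inferred from the numbers rather than stated in the record, and the helper names `context_window` and `overlap_bp` are hypothetical.

```python
from typing import Tuple


def context_window(start: int, end: int, flank: int = 5000) -> Tuple[int, int]:
    """Extend a gene's 1-based inclusive coordinates by `flank` bp on each side.

    The 5000 bp default is an assumption inferred from this record.
    """
    return max(1, start - flank), end + flank


def overlap_bp(a_start: int, a_end: int, b_start: int, b_end: int) -> int:
    """Number of positions shared by two 1-based inclusive intervals."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


# Window around comGA (2724492..2725556)
print(context_window(2724492, 2725556))  # (2719492, 2730556)

# comGB (2723468..2724502) and comGA (2724492..2725556) share coordinates
print(overlap_bp(2723468, 2724502, 2724492, 2725556))  # 11
```

The 11 bp overlap between comGB and comGA follows directly from the coordinates listed in the table.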
Sequence
Protein
Length: 354 a.a. Molecular weight: 39806.27 Da Isoelectric Point: 9.1063
>NTDB_id=973422 WF295_RS13630 WP_020452178.1 2724492..2725556(-) (comGA) [Bacillus paralicheniformis strain Baplich1]
MYTIESLSGRLIDEAYRMKASDIHIVPGEKEAVVRFRIDDELFQKDRLTRNECSRLISHFKFLSSMDIGERRLPQSGALT
LYINRQPVHLRMSTLPTIHDESLVIRLLPKVSSKPLTKLSLFPSATFKLLSFLKHSHGLILFTGPTGSGKTTTLYSLIEY
AKRHFKRNIITLEDPVESRSENILQVQVNEKAGMTYSAGLKAVLRHDPDMIILGEIRDAETAQIAVRAALTGHLVLSSMH
AKNAKGAIYRLLEFGVDMTEIEQTLVAISAQRLVGLVCPFCGDNCSLYCRLSRPVRRASIFELLYGKSLNLCIEEAKGRC
GDIKTETLKTLIQKGIALGYLPSNTYERWIGHED
Nucleotide
Length: 1065 bp
>NTDB_id=973422 WF295_RS13630 WP_020452178.1 2724492..2725556(-) (comGA) [Bacillus paralicheniformis strain Baplich1]
TTGTACACGATTGAATCATTAAGCGGAAGATTAATTGACGAGGCATATAGAATGAAGGCGTCAGACATTCATATTGTTCC
CGGTGAAAAAGAGGCGGTGGTCCGCTTCAGGATTGATGATGAACTTTTTCAAAAAGACAGATTAACGAGAAATGAGTGCT
CAAGGCTCATTTCTCACTTTAAATTCCTTTCTTCGATGGATATTGGGGAAAGACGGCTGCCGCAAAGCGGAGCTTTGACT
CTTTATATCAACCGTCAGCCCGTTCATTTAAGAATGTCCACTCTGCCCACCATCCATGATGAAAGCTTGGTCATCAGGCT
TCTTCCCAAGGTGAGCAGCAAGCCGCTCACAAAGCTCTCATTGTTTCCGAGTGCAACATTCAAGCTGCTGTCTTTTCTGA
AGCATTCCCACGGATTAATTCTGTTCACCGGTCCGACAGGCTCGGGGAAAACGACAACACTCTATTCTTTAATCGAATAC
GCTAAGCGGCATTTTAAGCGAAATATCATTACCTTGGAAGACCCTGTTGAATCAAGAAGCGAAAACATTTTGCAAGTGCA
AGTAAACGAAAAAGCTGGCATGACTTATTCTGCGGGGTTGAAGGCGGTGCTTCGCCACGATCCGGACATGATCATCCTCG
GGGAAATTCGCGATGCCGAAACAGCCCAAATCGCAGTCAGGGCAGCTTTGACAGGACACCTTGTTTTATCGAGCATGCAT
GCGAAAAATGCCAAAGGCGCAATATACAGGCTGCTCGAGTTCGGCGTTGATATGACAGAAATTGAACAAACACTGGTCGC
AATCAGCGCCCAGCGCCTCGTCGGTCTCGTCTGCCCGTTTTGCGGCGACAACTGCTCTTTATATTGCAGATTGTCGAGGC
CTGTGAGAAGAGCAAGCATCTTCGAGCTTTTGTACGGAAAAAGCCTCAATCTCTGCATCGAAGAAGCAAAAGGCAGGTGC
GGTGACATAAAAACAGAAACATTGAAAACGCTGATCCAGAAGGGAATAGCACTCGGATATCTGCCGTCGAATACTTATGA
ACGCTGGATAGGCCATGAAGATTAA
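The nucleotide record is given in coding orientation: it begins with the alternative start codon TTG and ends with the stop codon TAA. A minimal sketch of how such a sequence maps to the protein record, assuming the standard codon assignments and that an initiating TTG, GTG, or ATG is read as methionine (as in the bacterial translation table). The function name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order (first, second, third base)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Common bacterial start codons; an assumption, not an exhaustive list
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        codon = cds[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt of the comGA record
print(translate_cds("TTGTACACGATTGAATCATTAAGCGGAAGA"))  # MYTIESLSGR
```

The output matches the first ten residues of the protein sequence listed above.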
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comGA | Bacillus subtilis subsp. subtilis str. 168 | 67.514 | 100 | 0.675 |
| pilB | Vibrio cholerae strain A1552 | 38.671 | 93.503 | 0.362 |
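The Ha-values in this table are numerically consistent with the product of identity and coverage, each expressed as a fraction. The sketch below checks that relationship. It is an observation drawn from the two rows listed here, not a definition confirmed by this record, and the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Identity x coverage as fractions, rounded to three decimals.

    Assumed relationship, inferred from the values in the table.
    """
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


print(ha_value(67.514, 100))     # 0.675  (comGA, Bacillus subtilis 168)
print(ha_value(38.671, 93.503))  # 0.362  (pilB, Vibrio cholerae A1552)
```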