Detailed information
Overview
| Name | comGA | Type | Machinery gene |
| Locus tag | EQZ20_RS14735 | Genome accession | NZ_CP035232 |
| Coordinates | 2858618..2859685 (-) | Length | 355 a.a. |
| NCBI ID | WP_046129974.1 | Uniprot ID | - |
| Organism | Bacillus glycinifermentans strain SRCM103574 | ||
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 2853618..2864685
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EQZ20_RS14690 (EQZ20_14690) | tapA | 2854075..2854800 (-) | 726 | WP_046129966.1 | amyloid fiber anchoring/assembly protein TapA | - |
| EQZ20_RS14695 (EQZ20_14695) | - | 2855063..2855386 (+) | 324 | WP_046129967.1 | DUF3889 domain-containing protein | - |
| EQZ20_RS14700 (EQZ20_14700) | - | 2855471..2855653 (-) | 183 | WP_046129968.1 | YqzE family protein | - |
| EQZ20_RS14705 (EQZ20_14705) | comGG | 2855735..2856100 (-) | 366 | WP_046129969.1 | competence type IV pilus minor pilin ComGG | - |
| EQZ20_RS14710 (EQZ20_14710) | comGF | 2856112..2856597 (-) | 486 | WP_082094061.1 | competence type IV pilus minor pilin ComGF | - |
| EQZ20_RS14715 (EQZ20_14715) | comGE | 2856512..2856859 (-) | 348 | WP_046129970.1 | competence type IV pilus minor pilin ComGE | - |
| EQZ20_RS14720 (EQZ20_14720) | comGD | 2856843..2857286 (-) | 444 | WP_046129971.1 | competence type IV pilus minor pilin ComGD | - |
| EQZ20_RS14725 (EQZ20_14725) | comGC | 2857286..2857579 (-) | 294 | WP_046129972.1 | competence type IV pilus major pilin ComGC | Machinery gene |
| EQZ20_RS14730 (EQZ20_14730) | comGB | 2857594..2858631 (-) | 1038 | WP_046129973.1 | competence type IV pilus assembly protein ComGB | Machinery gene |
| EQZ20_RS14735 (EQZ20_14735) | comGA | 2858618..2859685 (-) | 1068 | WP_046129974.1 | competence type IV pilus ATPase ComGA | Machinery gene |
| EQZ20_RS14740 (EQZ20_14740) | - | 2859842..2860681 (-) | 840 | WP_046129975.1 | STAS domain-containing protein | - |
| EQZ20_RS14750 (EQZ20_14750) | - | 2860894..2861274 (-) | 381 | WP_046129976.1 | Spx/MgsR family RNA polymerase-binding regulatory protein | - |
| EQZ20_RS14755 (EQZ20_14755) | - | 2861478..2861723 (+) | 246 | WP_046129977.1 | DUF2626 domain-containing protein | - |
| EQZ20_RS14760 (EQZ20_14760) | - | 2861772..2862410 (-) | 639 | WP_046129978.1 | MBL fold metallo-hydrolase | - |
| EQZ20_RS14765 (EQZ20_14765) | - | 2862551..2862724 (+) | 174 | WP_046129979.1 | DUF2759 domain-containing protein | - |
| EQZ20_RS14770 (EQZ20_14770) | - | 2862795..2863109 (-) | 315 | WP_046129980.1 | MTH1187 family thiamine-binding protein | - |
| EQZ20_RS14775 (EQZ20_14775) | - | 2863126..2864229 (-) | 1104 | WP_046129981.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 355 a.a. Molecular weight: 39803.27 Da Isoelectric Point: 9.2825
>NTDB_id=336535 EQZ20_RS14735 WP_046129974.1 2858618..2859685(-) (comGA) [Bacillus glycinifermentans strain SRCM103574]
MYAIESLSGKLIEEACAMRASDIHIVPGEKEAVIRFRIDDELFQKGRLTRMECSRLISHFKFLSSMDIGERRQPQSGALT
IKVNNQPVHLRMSTLPTIYDESLVIRVLPQASAPPLRSLSLFPNATAKLLSFLKHSHGLLIFTGPTGSGKTTTLYSLIEY
AKQHFNRNIITLEDPVESRSEHVLQVQVNEKAGMTYSAGLKAVLRHDPDMIILGEIRDAETAKIAVRAALTGHLVLSSMH
AKNAKGAIYRLLEFGIHMTEIEQTLVAISAQRLVNLVCPLCGERCSLYCRMSGNGRRVSVFELLYGKSLNLCIKEAKGAY
VNSRFETLRKLIRKGIALGYLPEETYNRWVHHETE
MYAIESLSGKLIEEACAMRASDIHIVPGEKEAVIRFRIDDELFQKGRLTRMECSRLISHFKFLSSMDIGERRQPQSGALT
IKVNNQPVHLRMSTLPTIYDESLVIRVLPQASAPPLRSLSLFPNATAKLLSFLKHSHGLLIFTGPTGSGKTTTLYSLIEY
AKQHFNRNIITLEDPVESRSEHVLQVQVNEKAGMTYSAGLKAVLRHDPDMIILGEIRDAETAKIAVRAALTGHLVLSSMH
AKNAKGAIYRLLEFGIHMTEIEQTLVAISAQRLVNLVCPLCGERCSLYCRMSGNGRRVSVFELLYGKSLNLCIKEAKGAY
VNSRFETLRKLIRKGIALGYLPEETYNRWVHHETE
Nucleotide
Download Length: 1068 bp
>NTDB_id=336535 EQZ20_RS14735 WP_046129974.1 2858618..2859685(-) (comGA) [Bacillus glycinifermentans strain SRCM103574]
GTGTACGCGATTGAATCATTAAGCGGGAAATTGATCGAAGAGGCGTGTGCGATGAGAGCTTCTGATATTCATATCGTTCC
GGGGGAAAAAGAGGCGGTTATCCGCTTTAGAATTGATGATGAACTATTTCAAAAGGGCAGACTGACGAGAATGGAGTGCT
CAAGGCTCATTTCTCATTTTAAATTTCTTTCTTCAATGGATATCGGGGAGCGAAGGCAGCCGCAAAGCGGTGCTTTAACC
ATTAAAGTGAACAATCAGCCCGTTCACTTGAGAATGTCGACTTTGCCTACCATATACGACGAAAGCCTGGTTATTCGCGT
ATTGCCGCAGGCAAGCGCCCCGCCGCTCAGAAGCCTGTCTTTGTTTCCAAACGCAACGGCAAAGCTGCTGTCTTTTCTGA
AACATTCCCACGGTCTGCTGATCTTCACAGGTCCAACCGGTTCGGGAAAAACGACGACCCTGTACTCGCTGATCGAGTAT
GCCAAACAGCATTTCAACCGCAATATTATCACGCTGGAGGACCCGGTTGAATCCAGAAGCGAGCATGTTCTTCAAGTACA
GGTGAATGAAAAAGCGGGCATGACGTATTCAGCAGGTTTAAAGGCTGTTCTCCGTCATGATCCCGACATGATCATCCTTG
GAGAAATCCGCGATGCCGAAACAGCCAAAATCGCCGTCAGAGCTGCGCTGACGGGACATCTTGTATTATCTAGCATGCAT
GCGAAAAACGCAAAAGGAGCTATATACCGGCTGCTTGAGTTTGGCATTCATATGACCGAAATTGAGCAGACGCTTGTCGC
TATAAGCGCACAGCGTCTCGTCAACCTTGTTTGTCCGTTATGCGGGGAGCGGTGTTCTTTGTATTGCAGGATGTCGGGGA
ACGGAAGAAGAGTGAGCGTTTTTGAGCTTTTGTACGGGAAGAGTCTGAACCTGTGCATAAAAGAGGCAAAAGGCGCATAC
GTGAACAGCCGTTTTGAAACGCTGAGAAAATTGATTCGAAAAGGGATAGCGCTCGGCTATCTCCCGGAGGAAACTTATAA
CAGATGGGTGCATCATGAGACTGAATAA
GTGTACGCGATTGAATCATTAAGCGGGAAATTGATCGAAGAGGCGTGTGCGATGAGAGCTTCTGATATTCATATCGTTCC
GGGGGAAAAAGAGGCGGTTATCCGCTTTAGAATTGATGATGAACTATTTCAAAAGGGCAGACTGACGAGAATGGAGTGCT
CAAGGCTCATTTCTCATTTTAAATTTCTTTCTTCAATGGATATCGGGGAGCGAAGGCAGCCGCAAAGCGGTGCTTTAACC
ATTAAAGTGAACAATCAGCCCGTTCACTTGAGAATGTCGACTTTGCCTACCATATACGACGAAAGCCTGGTTATTCGCGT
ATTGCCGCAGGCAAGCGCCCCGCCGCTCAGAAGCCTGTCTTTGTTTCCAAACGCAACGGCAAAGCTGCTGTCTTTTCTGA
AACATTCCCACGGTCTGCTGATCTTCACAGGTCCAACCGGTTCGGGAAAAACGACGACCCTGTACTCGCTGATCGAGTAT
GCCAAACAGCATTTCAACCGCAATATTATCACGCTGGAGGACCCGGTTGAATCCAGAAGCGAGCATGTTCTTCAAGTACA
GGTGAATGAAAAAGCGGGCATGACGTATTCAGCAGGTTTAAAGGCTGTTCTCCGTCATGATCCCGACATGATCATCCTTG
GAGAAATCCGCGATGCCGAAACAGCCAAAATCGCCGTCAGAGCTGCGCTGACGGGACATCTTGTATTATCTAGCATGCAT
GCGAAAAACGCAAAAGGAGCTATATACCGGCTGCTTGAGTTTGGCATTCATATGACCGAAATTGAGCAGACGCTTGTCGC
TATAAGCGCACAGCGTCTCGTCAACCTTGTTTGTCCGTTATGCGGGGAGCGGTGTTCTTTGTATTGCAGGATGTCGGGGA
ACGGAAGAAGAGTGAGCGTTTTTGAGCTTTTGTACGGGAAGAGTCTGAACCTGTGCATAAAAGAGGCAAAAGGCGCATAC
GTGAACAGCCGTTTTGAAACGCTGAGAAAATTGATTCGAAAAGGGATAGCGCTCGGCTATCTCCCGGAGGAAACTTATAA
CAGATGGGTGCATCATGAGACTGAATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comGA | Bacillus subtilis subsp. subtilis str. 168 |
69.944 |
100 |
0.701 |
| pilB | Glaesserella parasuis strain SC1401 |
39.437 |
100 |
0.394 |