Detailed information
Overview
| Name | comGA | Type | Machinery gene |
| Locus tag | QN340_RS16640 | Genome accession | NZ_CP126496 |
| Coordinates | 3362975..3364039 (-) | Length | 354 a.a. |
| NCBI ID | WP_020452178.1 | Uniprot ID | - |
| Organism | Bacillus paralicheniformis strain DY4 | ||
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 3357975..3369039
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| QN340_RS16595 (QN340_16595) | tapA | 3358469..3359197 (-) | 729 | WP_020452169.1 | amyloid fiber anchoring/assembly protein TapA | - |
| QN340_RS16600 (QN340_16600) | - | 3359475..3359795 (+) | 321 | WP_020452170.1 | YqzG/YhdC family protein | - |
| QN340_RS16605 (QN340_16605) | - | 3359825..3360007 (-) | 183 | WP_020452171.1 | YqzE family protein | - |
| QN340_RS16610 (QN340_16610) | comGG | 3360096..3360461 (-) | 366 | WP_020452172.1 | competence type IV pilus minor pilin ComGG | - |
| QN340_RS16615 (QN340_16615) | comGF | 3360473..3360955 (-) | 483 | WP_228119627.1 | competence type IV pilus minor pilin ComGF | - |
| QN340_RS16620 (QN340_16620) | comGE | 3360870..3361217 (-) | 348 | WP_020452174.1 | competence type IV pilus minor pilin ComGE | - |
| QN340_RS16625 (QN340_16625) | comGD | 3361201..3361638 (-) | 438 | WP_020452175.1 | competence type IV pilus minor pilin ComGD | - |
| QN340_RS16630 (QN340_16630) | comGC | 3361644..3361937 (-) | 294 | WP_020452176.1 | competence type IV pilus major pilin ComGC | Machinery gene |
| QN340_RS16635 (QN340_16635) | comGB | 3361951..3362985 (-) | 1035 | WP_041817253.1 | competence type IV pilus assembly protein ComGB | Machinery gene |
| QN340_RS16640 (QN340_16640) | comGA | 3362975..3364039 (-) | 1065 | WP_020452178.1 | competence type IV pilus ATPase ComGA | Machinery gene |
| QN340_RS16645 (QN340_16645) | - | 3364197..3365036 (-) | 840 | WP_020452179.1 | STAS domain-containing protein | - |
| QN340_RS16655 (QN340_16655) | - | 3365278..3365655 (-) | 378 | WP_003183482.1 | Spx/MgsR family RNA polymerase-binding regulatory protein | - |
| QN340_RS16660 (QN340_16660) | - | 3365866..3366111 (+) | 246 | WP_020452180.1 | DUF2626 domain-containing protein | - |
| QN340_RS16665 (QN340_16665) | - | 3366147..3366785 (-) | 639 | WP_020452181.1 | MBL fold metallo-hydrolase | - |
| QN340_RS16670 (QN340_16670) | - | 3366941..3367114 (+) | 174 | WP_003183486.1 | DUF2759 domain-containing protein | - |
| QN340_RS16675 (QN340_16675) | - | 3367175..3367489 (-) | 315 | WP_020452182.1 | MTH1187 family thiamine-binding protein | - |
| QN340_RS16680 (QN340_16680) | - | 3367505..3368614 (-) | 1110 | WP_020452183.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 354 a.a. Molecular weight: 39806.27 Da Isoelectric Point: 9.1063
>NTDB_id=838171 QN340_RS16640 WP_020452178.1 3362975..3364039(-) (comGA) [Bacillus paralicheniformis strain DY4]
MYTIESLSGRLIDEAYRMKASDIHIVPGEKEAVVRFRIDDELFQKDRLTRNECSRLISHFKFLSSMDIGERRLPQSGALT
LYINRQPVHLRMSTLPTIHDESLVIRLLPKVSSKPLTKLSLFPSATFKLLSFLKHSHGLILFTGPTGSGKTTTLYSLIEY
AKRHFKRNIITLEDPVESRSENILQVQVNEKAGMTYSAGLKAVLRHDPDMIILGEIRDAETAQIAVRAALTGHLVLSSMH
AKNAKGAIYRLLEFGVDMTEIEQTLVAISAQRLVGLVCPFCGDNCSLYCRLSRPVRRASIFELLYGKSLNLCIEEAKGRC
GDIKTETLKTLIQKGIALGYLPSNTYERWIGHED
MYTIESLSGRLIDEAYRMKASDIHIVPGEKEAVVRFRIDDELFQKDRLTRNECSRLISHFKFLSSMDIGERRLPQSGALT
LYINRQPVHLRMSTLPTIHDESLVIRLLPKVSSKPLTKLSLFPSATFKLLSFLKHSHGLILFTGPTGSGKTTTLYSLIEY
AKRHFKRNIITLEDPVESRSENILQVQVNEKAGMTYSAGLKAVLRHDPDMIILGEIRDAETAQIAVRAALTGHLVLSSMH
AKNAKGAIYRLLEFGVDMTEIEQTLVAISAQRLVGLVCPFCGDNCSLYCRLSRPVRRASIFELLYGKSLNLCIEEAKGRC
GDIKTETLKTLIQKGIALGYLPSNTYERWIGHED
Nucleotide
Download Length: 1065 bp
>NTDB_id=838171 QN340_RS16640 WP_020452178.1 3362975..3364039(-) (comGA) [Bacillus paralicheniformis strain DY4]
TTGTACACGATTGAATCATTAAGCGGAAGATTAATTGACGAGGCATATAGAATGAAGGCGTCAGACATTCATATTGTTCC
CGGTGAAAAAGAGGCGGTGGTCCGCTTCAGGATTGATGATGAACTTTTTCAAAAAGACAGATTAACGAGAAATGAGTGCT
CAAGGCTCATTTCTCACTTTAAATTCCTTTCTTCGATGGATATTGGGGAAAGACGGCTGCCGCAAAGCGGAGCTTTGACT
CTTTATATCAACCGTCAGCCCGTTCATTTAAGAATGTCCACTCTGCCCACCATCCATGATGAAAGCTTGGTCATCAGGCT
TCTTCCCAAGGTGAGCAGCAAGCCGCTCACAAAGCTCTCATTGTTTCCGAGTGCAACATTCAAGCTGCTGTCTTTTCTGA
AGCATTCCCACGGATTAATTCTGTTCACCGGTCCGACAGGCTCGGGGAAAACGACAACACTCTATTCTTTAATCGAATAC
GCTAAGCGGCATTTTAAGCGAAATATCATTACCTTGGAAGACCCTGTTGAATCAAGAAGCGAAAACATTTTGCAAGTGCA
AGTAAACGAAAAAGCTGGCATGACTTATTCTGCGGGGTTGAAGGCGGTGCTTCGCCACGATCCGGACATGATCATCCTCG
GGGAAATTCGCGATGCCGAAACAGCCCAAATCGCAGTCAGGGCAGCTTTGACAGGACACCTTGTTTTATCGAGCATGCAT
GCGAAAAATGCCAAAGGCGCAATATACAGGCTGCTCGAGTTCGGCGTTGATATGACAGAAATTGAACAAACACTGGTCGC
AATCAGCGCCCAGCGCCTCGTCGGTCTCGTCTGCCCGTTTTGCGGCGACAACTGCTCTTTATATTGCAGATTGTCGAGGC
CTGTGAGAAGAGCAAGCATCTTCGAGCTTTTGTACGGAAAAAGCCTCAATCTCTGCATCGAAGAAGCAAAAGGCAGGTGC
GGTGACATAAAAACAGAAACATTGAAAACGCTGATCCAGAAGGGAATAGCACTCGGATATCTGCCGTCGAATACTTATGA
ACGCTGGATAGGCCATGAAGATTAA
TTGTACACGATTGAATCATTAAGCGGAAGATTAATTGACGAGGCATATAGAATGAAGGCGTCAGACATTCATATTGTTCC
CGGTGAAAAAGAGGCGGTGGTCCGCTTCAGGATTGATGATGAACTTTTTCAAAAAGACAGATTAACGAGAAATGAGTGCT
CAAGGCTCATTTCTCACTTTAAATTCCTTTCTTCGATGGATATTGGGGAAAGACGGCTGCCGCAAAGCGGAGCTTTGACT
CTTTATATCAACCGTCAGCCCGTTCATTTAAGAATGTCCACTCTGCCCACCATCCATGATGAAAGCTTGGTCATCAGGCT
TCTTCCCAAGGTGAGCAGCAAGCCGCTCACAAAGCTCTCATTGTTTCCGAGTGCAACATTCAAGCTGCTGTCTTTTCTGA
AGCATTCCCACGGATTAATTCTGTTCACCGGTCCGACAGGCTCGGGGAAAACGACAACACTCTATTCTTTAATCGAATAC
GCTAAGCGGCATTTTAAGCGAAATATCATTACCTTGGAAGACCCTGTTGAATCAAGAAGCGAAAACATTTTGCAAGTGCA
AGTAAACGAAAAAGCTGGCATGACTTATTCTGCGGGGTTGAAGGCGGTGCTTCGCCACGATCCGGACATGATCATCCTCG
GGGAAATTCGCGATGCCGAAACAGCCCAAATCGCAGTCAGGGCAGCTTTGACAGGACACCTTGTTTTATCGAGCATGCAT
GCGAAAAATGCCAAAGGCGCAATATACAGGCTGCTCGAGTTCGGCGTTGATATGACAGAAATTGAACAAACACTGGTCGC
AATCAGCGCCCAGCGCCTCGTCGGTCTCGTCTGCCCGTTTTGCGGCGACAACTGCTCTTTATATTGCAGATTGTCGAGGC
CTGTGAGAAGAGCAAGCATCTTCGAGCTTTTGTACGGAAAAAGCCTCAATCTCTGCATCGAAGAAGCAAAAGGCAGGTGC
GGTGACATAAAAACAGAAACATTGAAAACGCTGATCCAGAAGGGAATAGCACTCGGATATCTGCCGTCGAATACTTATGA
ACGCTGGATAGGCCATGAAGATTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comGA | Bacillus subtilis subsp. subtilis str. 168 |
67.514 |
100 |
0.675 |
| pilB | Vibrio cholerae strain A1552 |
38.671 |
93.503 |
0.362 |