Detailed information
Overview
| Name | comGA | Type | Machinery gene |
| Locus tag | P3X35_RS13160 | Genome accession | NZ_CP120920 |
| Coordinates | 2641600..2642664 (-) | Length | 354 a.a. |
| NCBI ID | WP_020452178.1 | Uniprot ID | - |
| Organism | Bacillus paralicheniformis strain SYN-191 | | |
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology); DNA binding and uptake | | |
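The coordinates, nucleotide length, and protein length in the overview are mutually consistent, which a short sketch can confirm. It assumes inclusive, 1-based coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def cds_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# comGA (P3X35_RS13160): 2641600..2642664 on the minus strand
bp = cds_length_bp(2641600, 2642664)
aa = protein_length_aa(bp)
print(bp, aa)
```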
Genomic Context
Location: 2636600..2647664
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| P3X35_RS13115 (P3X35_13115) | tapA | 2637094..2637822 (-) | 729 | WP_020452169.1 | amyloid fiber anchoring/assembly protein TapA | - |
| P3X35_RS13120 (P3X35_13120) | - | 2638100..2638420 (+) | 321 | WP_020452170.1 | YqzG/YhdC family protein | - |
| P3X35_RS13125 (P3X35_13125) | - | 2638450..2638632 (-) | 183 | WP_020452171.1 | YqzE family protein | - |
| P3X35_RS13130 (P3X35_13130) | comGG | 2638721..2639086 (-) | 366 | WP_020452172.1 | competence type IV pilus minor pilin ComGG | - |
| P3X35_RS13135 (P3X35_13135) | comGF | 2639098..2639586 (-) | 489 | WP_236613657.1 | competence type IV pilus minor pilin ComGF | - |
| P3X35_RS13140 (P3X35_13140) | comGE | 2639495..2639842 (-) | 348 | WP_020452174.1 | competence type IV pilus minor pilin ComGE | - |
| P3X35_RS13145 (P3X35_13145) | comGD | 2639826..2640263 (-) | 438 | WP_020452175.1 | competence type IV pilus minor pilin ComGD | - |
| P3X35_RS13150 (P3X35_13150) | comGC | 2640269..2640562 (-) | 294 | WP_020452176.1 | competence type IV pilus major pilin ComGC | Machinery gene |
| P3X35_RS13155 (P3X35_13155) | comGB | 2640576..2641610 (-) | 1035 | WP_277868482.1 | competence type IV pilus assembly protein ComGB | Machinery gene |
| P3X35_RS13160 (P3X35_13160) | comGA | 2641600..2642664 (-) | 1065 | WP_020452178.1 | competence type IV pilus ATPase ComGA | Machinery gene |
| P3X35_RS13165 (P3X35_13165) | - | 2642822..2643661 (-) | 840 | WP_020452179.1 | STAS domain-containing protein | - |
| P3X35_RS13175 (P3X35_13175) | - | 2643903..2644280 (-) | 378 | WP_003183482.1 | Spx/MgsR family RNA polymerase-binding regulatory protein | - |
| P3X35_RS13180 (P3X35_13180) | - | 2644491..2644736 (+) | 246 | WP_020452180.1 | DUF2626 domain-containing protein | - |
| P3X35_RS13185 (P3X35_13185) | - | 2644772..2645410 (-) | 639 | WP_020452181.1 | MBL fold metallo-hydrolase | - |
| P3X35_RS13190 (P3X35_13190) | - | 2645566..2645739 (+) | 174 | WP_003183486.1 | DUF2759 domain-containing protein | - |
| P3X35_RS13195 (P3X35_13195) | - | 2645800..2646114 (-) | 315 | WP_020452182.1 | MTH1187 family thiamine-binding protein | - |
| P3X35_RS13200 (P3X35_13200) | - | 2646130..2647239 (-) | 1110 | WP_020452183.1 | hypothetical protein | - |
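The Location window above appears to be the comGA coordinates extended by 5000 bp on each side (2641600 - 5000 = 2636600 and 2642664 + 5000 = 2647664). That flank size is inferred from the numbers and is not stated in the record. The sketch below, with hypothetical helper names, parses the "Coordinates (strand)" cells for a few rows copied from the table and checks them against the "Size (bp)" column and the window.

```python
import re

# Matches cells such as "2641600..2642664 (-)"
COORD_RE = re.compile(r"(\d+)\.\.(\d+)\s*\(([+-])\)")


def parse_coords(cell: str) -> tuple[int, int, str]:
    """Return (start, end, strand) from a 'start..end (strand)' cell."""
    match = COORD_RE.fullmatch(cell.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate cell: {cell!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Window around a gene; the 5000 bp flank is an assumption."""
    return max(1, start - flank), end + flank


# A few rows from the table: gene name -> (coordinates cell, size in bp)
rows = {
    "tapA": ("2637094..2637822 (-)", 729),
    "comGC": ("2640269..2640562 (-)", 294),
    "comGB": ("2640576..2641610 (-)", 1035),
    "comGA": ("2641600..2642664 (-)", 1065),
}

ga_start, ga_end, _ = parse_coords(rows["comGA"][0])
window = context_window(ga_start, ga_end)

for name, (cell, size) in rows.items():
    start, end, strand = parse_coords(cell)
    inside = window[0] <= start and end <= window[1]
    print(name, strand, end - start + 1 == size, inside)
```

Note that comGB and comGA overlap by 11 bp (2641600..2641610) according to the listed coordinates.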
Sequence
Protein
Length: 354 a.a.; Molecular weight: 39806.27 Da; Isoelectric point: 9.1063
>NTDB_id=809285 P3X35_RS13160 WP_020452178.1 2641600..2642664(-) (comGA) [Bacillus paralicheniformis strain SYN-191]
MYTIESLSGRLIDEAYRMKASDIHIVPGEKEAVVRFRIDDELFQKDRLTRNECSRLISHFKFLSSMDIGERRLPQSGALT
LYINRQPVHLRMSTLPTIHDESLVIRLLPKVSSKPLTKLSLFPSATFKLLSFLKHSHGLILFTGPTGSGKTTTLYSLIEY
AKRHFKRNIITLEDPVESRSENILQVQVNEKAGMTYSAGLKAVLRHDPDMIILGEIRDAETAQIAVRAALTGHLVLSSMH
AKNAKGAIYRLLEFGVDMTEIEQTLVAISAQRLVGLVCPFCGDNCSLYCRLSRPVRRASIFELLYGKSLNLCIEEAKGRC
GDIKTETLKTLIQKGIALGYLPSNTYERWIGHED
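Because the product is annotated as an ATPase, one quick sanity check on the protein sequence is to look for a Walker A (P-loop) nucleotide-binding motif, whose common consensus is GxxxxGK[ST]. The record itself does not annotate motifs, so the sketch below is only an illustration of how the sequence above could be scanned; the variable names are hypothetical.

```python
import re

# Protein sequence copied from the FASTA record above (WP_020452178.1)
COMGA = (
    "MYTIESLSGRLIDEAYRMKASDIHIVPGEKEAVVRFRIDDELFQKDRLTRNECSRLISHFKFLSSMDIGERRLPQSGALT"
    "LYINRQPVHLRMSTLPTIHDESLVIRLLPKVSSKPLTKLSLFPSATFKLLSFLKHSHGLILFTGPTGSGKTTTLYSLIEY"
    "AKRHFKRNIITLEDPVESRSENILQVQVNEKAGMTYSAGLKAVLRHDPDMIILGEIRDAETAQIAVRAALTGHLVLSSMH"
    "AKNAKGAIYRLLEFGVDMTEIEQTLVAISAQRLVGLVCPFCGDNCSLYCRLSRPVRRASIFELLYGKSLNLCIEEAKGRC"
    "GDIKTETLKTLIQKGIALGYLPSNTYERWIGHED"
)

# Walker A consensus: G, any four residues, G, K, then S or T
WALKER_A = re.compile(r"G.{4}GK[ST]")

hit = WALKER_A.search(COMGA)
motif = hit.group(0) if hit else None
print(len(COMGA), motif)
```

A pattern match alone does not demonstrate ATPase activity; it only shows that the sequence is compatible with the annotation.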
Nucleotide
Length: 1065 bp
>NTDB_id=809285 P3X35_RS13160 WP_020452178.1 2641600..2642664(-) (comGA) [Bacillus paralicheniformis strain SYN-191]
TTGTACACGATTGAATCATTAAGCGGAAGATTAATTGACGAGGCATATAGAATGAAGGCGTCAGACATTCATATTGTTCC
CGGTGAAAAAGAGGCGGTGGTCCGCTTCAGGATTGATGATGAACTTTTTCAAAAAGACAGATTAACGAGAAATGAGTGCT
CAAGGCTCATTTCTCACTTTAAATTCCTTTCTTCGATGGATATTGGGGAAAGACGGCTGCCGCAAAGCGGAGCTTTGACT
CTTTATATCAACCGTCAGCCCGTTCATTTAAGAATGTCCACTCTGCCCACCATCCATGATGAAAGCTTGGTCATCAGGCT
TCTTCCCAAGGTGAGCAGCAAGCCGCTCACAAAGCTCTCATTGTTTCCGAGTGCAACATTCAAGCTGCTGTCTTTTCTGA
AGCATTCCCACGGATTAATTCTGTTCACCGGTCCGACAGGCTCGGGGAAAACGACAACACTCTATTCTTTAATCGAATAC
GCTAAGCGGCATTTTAAGCGAAATATCATTACCTTGGAAGACCCTGTTGAATCAAGAAGCGAAAACATTTTGCAAGTGCA
AGTAAACGAAAAAGCTGGCATGACTTATTCTGCGGGGTTGAAGGCGGTGCTTCGCCACGATCCGGACATGATCATCCTCG
GGGAAATTCGCGATGCCGAAACAGCCCAAATCGCAGTCAGGGCAGCTTTGACAGGACACCTTGTTTTATCGAGCATGCAT
GCGAAAAATGCCAAAGGCGCAATATACAGGCTGCTCGAGTTCGGCGTTGATATGACAGAAATTGAACAAACACTGGTCGC
AATCAGCGCCCAGCGCCTCGTCGGTCTCGTCTGCCCGTTTTGCGGCGACAACTGCTCTTTATATTGCAGATTGTCGAGGC
CTGTGAGAAGAGCAAGCATCTTCGAGCTTTTGTACGGAAAAAGCCTCAATCTCTGCATCGAAGAAGCAAAAGGCAGGTGC
GGTGACATAAAAACAGAAACATTGAAAACGCTGATCCAGAAGGGAATAGCACTCGGATATCTGCCGTCGAATACTTATGA
ACGCTGGATAGGCCATGAAGATTAA
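Although comGA lies on the minus strand, the nucleotide record is given in coding orientation: it begins with a TTG start codon and ends with a TAA stop. The sketch below translates only the first 30 and last 24 nucleotides of the record with the standard genetic code and compares them with the ends of the protein sequence. It assumes that a TTG initiation codon is decoded as methionine, as is usual for bacterial start codons; the helper names are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons accepted as initiators here (an assumption of this sketch)
START_CODONS = {"ATG", "TTG", "GTG"}


def translate(dna: str, is_start: bool = False) -> str:
    """Translate an in-frame DNA fragment; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is read as Met
    return "".join(residues)


five_prime = "TTGTACACGATTGAATCATTAAGCGGAAGA"   # first 30 nt of the record
three_prime = "CGCTGGATAGGCCATGAAGATTAA"        # last 24 nt of the record

print(translate(five_prime, is_start=True))
print(translate(three_prime))
```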
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comGA | Bacillus subtilis subsp. subtilis str. 168 | 67.514 | 100 | 0.675 |
| pilB | Vibrio cholerae strain A1552 | 38.671 | 93.503 | 0.362 |
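The record does not define the Ha-value, but for both rows it equals the identity fraction multiplied by the coverage fraction, rounded to three decimals. The sketch below reproduces the listed values under that assumption; the function name is hypothetical and the formula is inferred from these two rows only.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relation: (identity / 100) * (coverage / 100), rounded to 3 places."""
    return round((identity_pct / 100.0) * (coverage_pct / 100.0), 3)


# (protein, organism, identity %, coverage %, listed Ha-value)
hits = [
    ("comGA", "Bacillus subtilis subsp. subtilis str. 168", 67.514, 100.0, 0.675),
    ("pilB", "Vibrio cholerae strain A1552", 38.671, 93.503, 0.362),
]

for protein, organism, identity, coverage, listed in hits:
    computed = ha_value(identity, coverage)
    print(protein, computed, abs(computed - listed) < 1e-9)
```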