Detailed information
Overview
| Name | comGB | Type | Machinery gene |
| Locus tag | R5H20_RS15015 | Genome accession | NZ_CP137345 |
| Coordinates | 2954163..2955200 (-) | Length | 345 a.a. |
| NCBI ID | WP_006637539.1 | Uniprot ID | - |
| Organism | Bacillus sp. KICET-3 | | |
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology); DNA binding and uptake | | |
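The coordinates and lengths listed in the overview are linked by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database tooling.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    bp = end - start + 1          # inclusive span on the genome
    if bp % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    aa = bp // 3 - 1              # drop the stop codon
    return bp, aa


# comGB coordinates from the overview table: 2954163..2955200
print(cds_lengths(2954163, 2955200))  # (1038, 345)
```

The result agrees with the 1038 bp size in the genomic context table and the 345 a.a. length in the overview.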
Genomic Context
Location: 2949163..2960200
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| R5H20_RS14965 (R5H20_14965) | tasA | 2949173..2949967 (-) | 795 | WP_006637529.1 | biofilm matrix protein TasA | - |
| R5H20_RS14970 (R5H20_14970) | sipW | 2950037..2950618 (-) | 582 | WP_006637530.1 | signal peptidase I SipW | - |
| R5H20_RS14975 (R5H20_14975) | tapA | 2950615..2951361 (-) | 747 | WP_006637531.1 | amyloid fiber anchoring/assembly protein TapA | - |
| R5H20_RS14980 (R5H20_14980) | - | 2951624..2951944 (+) | 321 | WP_006637532.1 | DUF3889 domain-containing protein | - |
| R5H20_RS14985 (R5H20_14985) | - | 2952038..2952223 (-) | 186 | WP_006637533.1 | YqzE family protein | - |
| R5H20_RS14990 (R5H20_14990) | comGG | 2952303..2952626 (-) | 324 | WP_224254418.1 | competence type IV pilus minor pilin ComGG | - |
| R5H20_RS14995 (R5H20_14995) | comGF | 2952681..2953172 (-) | 492 | WP_224254419.1 | competence type IV pilus minor pilin ComGF | - |
| R5H20_RS15000 (R5H20_15000) | comGE | 2953081..2953428 (-) | 348 | WP_006637536.1 | competence type IV pilus minor pilin ComGE | - |
| R5H20_RS15005 (R5H20_15005) | comGD | 2953412..2953855 (-) | 444 | WP_006637537.1 | competence type IV pilus minor pilin ComGD | - |
| R5H20_RS15010 (R5H20_15010) | comGC | 2953855..2954148 (-) | 294 | WP_006637538.1 | competence type IV pilus major pilin ComGC | Machinery gene |
| R5H20_RS15015 (R5H20_15015) | comGB | 2954163..2955200 (-) | 1038 | WP_006637539.1 | competence type IV pilus assembly protein ComGB | Machinery gene |
| R5H20_RS15020 (R5H20_15020) | comGA | 2955187..2956254 (-) | 1068 | WP_006637540.1 | competence type IV pilus ATPase ComGA | Machinery gene |
| R5H20_RS15025 (R5H20_15025) | - | 2956393..2957250 (-) | 858 | WP_006637541.1 | STAS domain-containing protein | - |
| R5H20_RS15035 (R5H20_15035) | - | 2957469..2957849 (-) | 381 | WP_006637542.1 | Spx/MgsR family RNA polymerase-binding regulatory protein | - |
| R5H20_RS15040 (R5H20_15040) | - | 2958062..2958307 (+) | 246 | WP_006637543.1 | DUF2626 domain-containing protein | - |
| R5H20_RS15045 (R5H20_15045) | - | 2958357..2958995 (-) | 639 | WP_006637544.1 | MBL fold metallo-hydrolase | - |
| R5H20_RS15050 (R5H20_15050) | - | 2959136..2959309 (+) | 174 | WP_006637545.1 | DUF2759 domain-containing protein | - |
| R5H20_RS15055 (R5H20_15055) | - | 2959367..2959681 (-) | 315 | WP_006637546.1 | MTH1187 family thiamine-binding protein | - |
Sequence
Protein
Length: 345 a.a.; Molecular weight: 40024.46 Da; Isoelectric point: 9.6575
>NTDB_id=897593 R5H20_RS15015 WP_006637539.1 2954163..2955200(-) (comGB) [Bacillus sp. KICET-3]
MKPIKNRWPVGEQAEFLEKLGEMMMNGYTLLDALSMLELQLKRQQKTDIAFGRRKLAEGYPVFQVLNMISFHKAAVSIVY
FAERHGNVPFAFMQSGELLRRKIEQAEKIKKAAHYPAFLILTVCLIVYMMKAAIVPQFSAIYDSMNIETPFLTSFIFLFF
ESFSLLFLCILAAAAVFWAYYLYAFRQKPPEDKMALLIRIPLAGRILKLFNSYFLSLQLSNLLTSGLSIYDSLKAFESQP
FLPFFQKEAKRLIERLKQGEAIEHMLNGHPFYEKDLSKVVAHGQLNGQLHRELYSYSQFLIDRFEKKAEKWTGLLQPLIY
GFTAAMILILYLSMLLPMYQMMNQL
Nucleotide
Length: 1038 bp
>NTDB_id=897593 R5H20_RS15015 WP_006637539.1 2954163..2955200(-) (comGB) [Bacillus sp. KICET-3]
ATGAAGCCGATTAAGAACAGATGGCCTGTCGGGGAACAGGCAGAGTTCCTTGAAAAGCTTGGCGAGATGATGATGAACGG
TTATACGCTTCTTGATGCATTAAGCATGCTGGAACTGCAATTAAAGCGGCAGCAAAAAACGGATATTGCATTCGGGAGGA
GAAAGCTTGCGGAAGGGTATCCTGTTTTTCAAGTTTTAAATATGATTTCATTTCATAAAGCTGCCGTCAGCATCGTTTAT
TTCGCCGAACGTCACGGTAATGTGCCATTTGCTTTTATGCAGAGCGGCGAATTGCTCCGCCGTAAAATCGAACAGGCCGA
AAAAATCAAAAAAGCCGCACATTATCCGGCATTTTTGATTTTGACGGTTTGCCTCATTGTCTATATGATGAAAGCCGCCA
TTGTTCCGCAGTTTTCCGCGATCTATGACTCGATGAACATAGAAACGCCCTTTCTGACATCCTTCATCTTTTTATTTTTT
GAAAGTTTCTCCTTGTTGTTTCTGTGCATACTAGCCGCCGCTGCTGTTTTTTGGGCGTATTATTTGTACGCTTTCCGCCA
AAAGCCCCCTGAAGACAAAATGGCTCTTCTCATCAGAATTCCGCTGGCAGGCAGAATCCTCAAATTGTTTAACAGCTACT
TTTTATCACTTCAGCTGAGCAATCTTCTTACATCCGGTTTGTCTATATATGACAGTTTAAAAGCGTTTGAAAGCCAGCCC
TTTTTGCCGTTTTTCCAAAAGGAAGCTAAACGGCTGATCGAGAGGCTGAAACAGGGGGAAGCGATAGAACATATGTTAAA
CGGACACCCGTTTTATGAAAAGGACCTATCAAAGGTGGTGGCTCACGGCCAATTAAACGGCCAGCTTCACAGAGAGCTTT
ATTCATACAGCCAATTTCTGATCGACCGGTTTGAAAAGAAAGCGGAAAAGTGGACAGGCCTGCTGCAGCCGCTGATTTAC
GGTTTTACCGCGGCCATGATTTTAATACTCTATCTGTCCATGCTGTTGCCAATGTATCAAATGATGAATCAGTTATAA
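The nucleotide record can be checked against the protein record by translating it codon by codon. The sketch below is one possible way to do that, assuming the nucleotide sequence is already given in the coding (sense) orientation, as its ATG start and TAA end suggest, and that the standard codon-to-amino-acid assignments apply. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a sense-strand coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()   # tolerate FASTA line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First nine codons of the comGB nucleotide record above.
print(translate("ATGAAGCCGATTAAGAACAGATGGCCT"))  # MKPIKNRWP
```

Passing the full 1038 bp sequence to `translate` and removing the trailing `*` should give a string that can be compared directly with the protein record.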
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comGB | Bacillus subtilis subsp. subtilis str. 168 | 56.347 | 93.623 | 0.528 |