Detailed information
Overview
| Name | comGA | Type | Machinery gene |
| Locus tag | NST20_RS08915 | Genome accession | NZ_CP151954 |
| Coordinates | 1813221..1814345 (-) | Length | 374 a.a. |
| NCBI ID | WP_013859647.1 | Uniprot ID | - |
| Organism | Weizmannia sp. FSL W8-0676 | ||
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1808221..1819345
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| NST20_RS08870 (NST20_08870) | - | 1808791..1809582 (+) | 792 | WP_342399300.1 | YqhG family protein | - |
| NST20_RS08875 (NST20_08875) | - | 1809616..1809792 (-) | 177 | WP_064485109.1 | YqzE family protein | - |
| NST20_RS08880 (NST20_08880) | - | 1809779..1810336 (-) | 558 | WP_342399301.1 | shikimate kinase | - |
| NST20_RS08885 (NST20_08885) | comGG | 1810333..1810701 (-) | 369 | WP_029142070.1 | competence type IV pilus minor pilin ComGG | - |
| NST20_RS08890 (NST20_08890) | comGF | 1810698..1811180 (-) | 483 | WP_342399302.1 | competence type IV pilus minor pilin ComGF | - |
| NST20_RS08895 (NST20_08895) | - | 1811143..1811451 (-) | 309 | WP_029142068.1 | type II secretion system protein | - |
| NST20_RS08900 (NST20_08900) | comGD | 1811438..1811881 (-) | 444 | WP_029142067.1 | competence type IV pilus minor pilin ComGD | - |
| NST20_RS08905 (NST20_08905) | comGC | 1811878..1812189 (-) | 312 | WP_013859645.1 | competence type IV pilus major pilin ComGC | - |
| NST20_RS08910 (NST20_08910) | comGB | 1812257..1813321 (-) | 1065 | WP_342399303.1 | competence type IV pilus assembly protein ComGB | - |
| NST20_RS08915 (NST20_08915) | comGA | 1813221..1814345 (-) | 1125 | WP_013859647.1 | competence type IV pilus ATPase ComGA | Machinery gene |
| NST20_RS08920 (NST20_08920) | - | 1814540..1815247 (+) | 708 | WP_258921440.1 | helix-turn-helix domain-containing protein | - |
| NST20_RS08925 (NST20_08925) | - | 1815325..1815567 (+) | 243 | WP_014098019.1 | DUF2626 domain-containing protein | - |
| NST20_RS08930 (NST20_08930) | - | 1815637..1816266 (-) | 630 | WP_013859649.1 | MBL fold metallo-hydrolase | - |
| NST20_RS08935 (NST20_08935) | - | 1816396..1816569 (+) | 174 | WP_019720590.1 | DUF2759 domain-containing protein | - |
| NST20_RS08940 (NST20_08940) | - | 1816668..1817636 (-) | 969 | WP_342399304.1 | ROK family glucokinase | - |
| NST20_RS08945 (NST20_08945) | - | 1817629..1817850 (-) | 222 | WP_342399305.1 | YqgQ family protein | - |
| NST20_RS08950 (NST20_08950) | - | 1818159..1819325 (-) | 1167 | WP_041819143.1 | rhomboid family intramembrane serine protease | - |
Sequence
Protein
Download Length: 374 a.a. Molecular weight: 42378.89 Da Isoelectric Point: 9.4407
>NTDB_id=981999 NST20_RS08915 WP_013859647.1 1813221..1814345(-) (comGA) [Weizmannia sp. FSL W8-0676]
MSVEKIAELLIGQAVKNNVTDVHIVPKEKHYHVQFRQYGRLYPHRNLSAKAGERLISHLKFMSSMDISEKRKPQSGSFAM
NVLQQTISLRISTLPTSLSKESLVIRILPHEEQFQLSQISLFPSSTKKLMALLNHSHGMLIFSGPTGSGKSTTMYTLVEH
CAKRFLRNVITLEDPVEKQSDSFLQVQVNEKAGITYSTGLKAILRHDPDIILVGEIRDAETARIAVRASLTGHLVLTTLH
TRDAKGAIYRMMEFGVSIHEMEQTLLAVSAQRLITLRCPVCGGIRCPHDSQDCPRSREKRQTAVYELLYGKNLQRVLKEA
KGEQVYYHYPTLKEWLRKGIVLGYISEEEYHRWIAEEEEADKPQTAGALSYPGR
MSVEKIAELLIGQAVKNNVTDVHIVPKEKHYHVQFRQYGRLYPHRNLSAKAGERLISHLKFMSSMDISEKRKPQSGSFAM
NVLQQTISLRISTLPTSLSKESLVIRILPHEEQFQLSQISLFPSSTKKLMALLNHSHGMLIFSGPTGSGKSTTMYTLVEH
CAKRFLRNVITLEDPVEKQSDSFLQVQVNEKAGITYSTGLKAILRHDPDIILVGEIRDAETARIAVRASLTGHLVLTTLH
TRDAKGAIYRMMEFGVSIHEMEQTLLAVSAQRLITLRCPVCGGIRCPHDSQDCPRSREKRQTAVYELLYGKNLQRVLKEA
KGEQVYYHYPTLKEWLRKGIVLGYISEEEYHRWIAEEEEADKPQTAGALSYPGR
Nucleotide
Download Length: 1125 bp
>NTDB_id=981999 NST20_RS08915 WP_013859647.1 1813221..1814345(-) (comGA) [Weizmannia sp. FSL W8-0676]
ATGTCAGTTGAAAAAATTGCGGAACTTTTGATCGGACAGGCGGTAAAAAACAATGTGACCGATGTCCATATCGTTCCGAA
AGAAAAGCATTACCATGTCCAGTTCCGGCAGTACGGAAGGCTGTACCCCCACCGCAACCTTTCCGCGAAAGCCGGGGAAA
GGCTGATATCCCATTTGAAATTTATGTCTTCCATGGATATCAGTGAAAAACGGAAACCCCAAAGCGGGTCGTTTGCCATG
AACGTTTTGCAGCAGACGATCTCCTTAAGGATTTCGACGCTGCCAACCTCTCTTTCCAAAGAGAGCCTCGTCATCCGGAT
TTTGCCCCACGAAGAACAATTCCAGCTCAGCCAAATTTCACTTTTTCCATCGAGCACGAAAAAATTAATGGCGCTATTAA
ACCATTCGCACGGCATGCTGATTTTCAGCGGTCCGACCGGAAGCGGCAAGTCGACGACGATGTATACGCTTGTTGAGCAC
TGCGCAAAACGGTTTTTACGGAATGTAATTACACTGGAAGACCCTGTTGAAAAACAAAGCGATTCGTTTTTGCAGGTGCA
GGTGAATGAAAAAGCAGGCATTACTTATAGCACAGGGCTGAAAGCAATTTTGCGCCATGATCCGGATATTATCCTGGTGG
GGGAAATCCGCGATGCCGAAACAGCAAGAATTGCGGTCCGCGCATCGCTCACCGGCCATCTCGTGCTGACGACATTGCAT
ACGCGTGACGCAAAGGGCGCGATCTACAGGATGATGGAATTCGGCGTGAGCATCCACGAAATGGAGCAGACTTTGCTCGC
AGTCAGCGCACAGCGGCTGATTACATTGCGATGCCCGGTATGCGGTGGGATCCGCTGTCCGCATGATTCGCAAGATTGCC
CACGCAGCCGAGAAAAGCGGCAAACCGCGGTATATGAGCTGCTGTACGGGAAAAATTTGCAGCGAGTGTTGAAAGAAGCA
AAGGGGGAACAAGTCTACTATCATTATCCGACACTGAAAGAATGGCTCAGAAAGGGGATTGTGCTTGGATACATTTCTGA
AGAAGAGTATCACCGCTGGATTGCCGAAGAGGAAGAAGCGGATAAGCCTCAAACTGCAGGGGCGCTTTCTTACCCGGGTC
GGTGA
ATGTCAGTTGAAAAAATTGCGGAACTTTTGATCGGACAGGCGGTAAAAAACAATGTGACCGATGTCCATATCGTTCCGAA
AGAAAAGCATTACCATGTCCAGTTCCGGCAGTACGGAAGGCTGTACCCCCACCGCAACCTTTCCGCGAAAGCCGGGGAAA
GGCTGATATCCCATTTGAAATTTATGTCTTCCATGGATATCAGTGAAAAACGGAAACCCCAAAGCGGGTCGTTTGCCATG
AACGTTTTGCAGCAGACGATCTCCTTAAGGATTTCGACGCTGCCAACCTCTCTTTCCAAAGAGAGCCTCGTCATCCGGAT
TTTGCCCCACGAAGAACAATTCCAGCTCAGCCAAATTTCACTTTTTCCATCGAGCACGAAAAAATTAATGGCGCTATTAA
ACCATTCGCACGGCATGCTGATTTTCAGCGGTCCGACCGGAAGCGGCAAGTCGACGACGATGTATACGCTTGTTGAGCAC
TGCGCAAAACGGTTTTTACGGAATGTAATTACACTGGAAGACCCTGTTGAAAAACAAAGCGATTCGTTTTTGCAGGTGCA
GGTGAATGAAAAAGCAGGCATTACTTATAGCACAGGGCTGAAAGCAATTTTGCGCCATGATCCGGATATTATCCTGGTGG
GGGAAATCCGCGATGCCGAAACAGCAAGAATTGCGGTCCGCGCATCGCTCACCGGCCATCTCGTGCTGACGACATTGCAT
ACGCGTGACGCAAAGGGCGCGATCTACAGGATGATGGAATTCGGCGTGAGCATCCACGAAATGGAGCAGACTTTGCTCGC
AGTCAGCGCACAGCGGCTGATTACATTGCGATGCCCGGTATGCGGTGGGATCCGCTGTCCGCATGATTCGCAAGATTGCC
CACGCAGCCGAGAAAAGCGGCAAACCGCGGTATATGAGCTGCTGTACGGGAAAAATTTGCAGCGAGTGTTGAAAGAAGCA
AAGGGGGAACAAGTCTACTATCATTATCCGACACTGAAAGAATGGCTCAGAAAGGGGATTGTGCTTGGATACATTTCTGA
AGAAGAGTATCACCGCTGGATTGCCGAAGAGGAAGAAGCGGATAAGCCTCAAACTGCAGGGGCGCTTTCTTACCCGGGTC
GGTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comGA | Bacillus subtilis subsp. subtilis str. 168 |
54.469 |
95.722 |
0.521 |