Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | DI291_RS18990 | Genome accession | NZ_CP068988 |
| Coordinates | 3747782..3749176 (-) | Length | 464 a.a. |
| NCBI ID | WP_128822151.1 | Uniprot ID | - |
| Organism | Bacillus paralicheniformis strain SUBG0010 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 3742782..3754176
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DI291_RS18955 (DI291_19475) | flgL | 3742969..3743880 (-) | 912 | WP_128822149.1 | flagellar hook-associated protein FlgL | - |
| DI291_RS18960 (DI291_19480) | flgK | 3743891..3745414 (-) | 1524 | WP_026580173.1 | flagellar hook-associated protein FlgK | - |
| DI291_RS18965 (DI291_19485) | - | 3745430..3745906 (-) | 477 | WP_128822150.1 | flagellar protein FlgN | - |
| DI291_RS18970 (DI291_19490) | flgM | 3745925..3746191 (-) | 267 | WP_003185713.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| DI291_RS18975 (DI291_19495) | - | 3746271..3746690 (-) | 420 | WP_020453114.1 | TIGR03826 family flagellar region protein | - |
| DI291_RS18980 (DI291_19500) | comFC | 3746749..3747486 (-) | 738 | WP_026580174.1 | ComF family protein | Machinery gene |
| DI291_RS18985 (DI291_19505) | - | 3747443..3747727 (-) | 285 | WP_020453116.1 | late competence development ComFB family protein | - |
| DI291_RS18990 (DI291_19510) | comFA | 3747782..3749176 (-) | 1395 | WP_128822151.1 | DEAD/DEAH box helicase | Machinery gene |
| DI291_RS18995 (DI291_19515) | - | 3749300..3750142 (-) | 843 | WP_020453118.1 | DegV family protein | - |
| DI291_RS19000 (DI291_19520) | degU | 3750262..3750951 (-) | 690 | WP_003185730.1 | two-component system response regulator DegU | Regulator |
| DI291_RS19005 (DI291_19525) | degS | 3751033..3752190 (-) | 1158 | WP_020453119.1 | histidine kinase | Regulator |
| DI291_RS19010 (DI291_19530) | - | 3752413..3753051 (+) | 639 | WP_020453120.1 | YigZ family protein | - |
| DI291_RS19015 (DI291_19535) | - | 3753066..3754145 (+) | 1080 | WP_128822152.1 | LCP family protein | - |
Sequence
Protein
Download Length: 464 a.a. Molecular weight: 53044.82 Da Isoelectric Point: 9.7663
>NTDB_id=531871 DI291_RS18990 WP_128822151.1 3747782..3749176(-) (comFA) [Bacillus paralicheniformis strain SUBG0010]
MIHIEHPASYSCELRSCLEQRHLLKSELPFPEPVIDWHIQEGLIKTEEGIKKTKRGFVCLRCGQHERFCFARYPCYRCNK
CCVYCRACVMMGRVSECTSLLTWRGHGRHEWAPVYTEWKGVLSAGQEKAAKSIIDAIRRKEELLIWAVCGSGKTELLFQG
IEFALNNGLRVCIATPRTDVVLELEPRFRHAFPGMEIAALYGGSPDVGTLSPLVISTTHQLLRYKEAFDVIIIDEVDAFP
YSIDNTLQYAVKKSAKRQSAHIYLTATPSRDMKKRAESGKLHTVRIPARFHRSPLPEPTLIWCGNWERGLKRRKVPLRLK
KWLFKHQELQHPVFLFVPSIPVLKTVVSLLKKETFRAEGVYAEDPERNEKVNRFRKSKLEVLVTTTILERGVTVKKAQVG
VLGAEAAVFSESALVQMAGRAGRHPEHTDGDVCFFHHGRTKSMNAARRHIQYMNKLSKKEMLID
MIHIEHPASYSCELRSCLEQRHLLKSELPFPEPVIDWHIQEGLIKTEEGIKKTKRGFVCLRCGQHERFCFARYPCYRCNK
CCVYCRACVMMGRVSECTSLLTWRGHGRHEWAPVYTEWKGVLSAGQEKAAKSIIDAIRRKEELLIWAVCGSGKTELLFQG
IEFALNNGLRVCIATPRTDVVLELEPRFRHAFPGMEIAALYGGSPDVGTLSPLVISTTHQLLRYKEAFDVIIIDEVDAFP
YSIDNTLQYAVKKSAKRQSAHIYLTATPSRDMKKRAESGKLHTVRIPARFHRSPLPEPTLIWCGNWERGLKRRKVPLRLK
KWLFKHQELQHPVFLFVPSIPVLKTVVSLLKKETFRAEGVYAEDPERNEKVNRFRKSKLEVLVTTTILERGVTVKKAQVG
VLGAEAAVFSESALVQMAGRAGRHPEHTDGDVCFFHHGRTKSMNAARRHIQYMNKLSKKEMLID
Nucleotide
Download Length: 1395 bp
>NTDB_id=531871 DI291_RS18990 WP_128822151.1 3747782..3749176(-) (comFA) [Bacillus paralicheniformis strain SUBG0010]
GTGATTCACATCGAACATCCGGCATCCTATTCCTGTGAATTGCGGTCATGTTTGGAGCAGCGCCACCTTCTGAAAAGCGA
ACTTCCTTTTCCGGAACCTGTCATTGATTGGCATATCCAAGAAGGCCTGATAAAAACGGAGGAAGGCATTAAAAAAACGA
AGAGAGGCTTTGTTTGCTTGAGGTGCGGCCAGCACGAGCGCTTCTGTTTCGCTCGATATCCTTGCTATAGGTGCAATAAA
TGCTGCGTGTACTGCCGAGCCTGTGTCATGATGGGCAGGGTGAGCGAGTGTACGTCTCTGTTGACTTGGCGTGGACATGG
CAGACATGAATGGGCGCCTGTTTACACGGAGTGGAAAGGCGTACTTTCAGCCGGTCAGGAAAAAGCGGCAAAATCTATTA
TCGATGCTATACGCAGGAAAGAAGAGCTGTTGATCTGGGCGGTTTGCGGATCGGGAAAAACGGAACTTCTTTTTCAAGGG
ATCGAATTCGCGCTGAACAACGGTTTGAGAGTATGTATCGCCACTCCGAGAACAGACGTTGTGCTTGAGCTTGAACCGCG
GTTTCGCCACGCCTTTCCCGGGATGGAAATCGCCGCTTTATACGGAGGAAGCCCAGACGTTGGAACTCTCTCGCCGCTTG
TCATTTCAACTACCCACCAGCTGCTCCGCTATAAAGAAGCATTTGACGTGATCATCATAGATGAAGTGGATGCTTTTCCG
TATTCAATTGATAATACGCTGCAATACGCTGTTAAAAAATCGGCAAAACGGCAAAGCGCACATATCTATTTAACCGCCAC
GCCTTCTAGAGATATGAAGAAAAGGGCTGAGAGCGGAAAGCTGCACACCGTCCGTATTCCCGCAAGATTCCACCGCAGCC
CGCTGCCTGAACCTACATTGATTTGGTGCGGAAATTGGGAAAGGGGTTTGAAACGAAGAAAAGTTCCTTTACGCCTCAAA
AAATGGCTTTTTAAGCACCAGGAACTGCAGCATCCGGTTTTTCTGTTTGTTCCCTCTATCCCTGTTCTTAAAACCGTGGT
CAGTCTGCTGAAAAAGGAAACATTTCGCGCAGAAGGAGTCTATGCCGAAGACCCTGAGAGAAATGAAAAAGTGAACCGGT
TTAGGAAAAGTAAGCTTGAGGTGCTCGTGACGACAACGATTTTGGAAAGAGGTGTAACCGTCAAAAAAGCACAGGTTGGA
GTATTGGGGGCAGAGGCGGCTGTTTTTTCGGAAAGCGCTCTTGTCCAAATGGCCGGCAGAGCCGGCAGGCATCCTGAGCA
CACAGATGGAGACGTCTGTTTCTTTCATCACGGTAGGACAAAGTCGATGAATGCGGCACGACGTCATATTCAATATATGA
ATAAACTGTCTAAAAAGGAAATGTTGATTGACTAG
GTGATTCACATCGAACATCCGGCATCCTATTCCTGTGAATTGCGGTCATGTTTGGAGCAGCGCCACCTTCTGAAAAGCGA
ACTTCCTTTTCCGGAACCTGTCATTGATTGGCATATCCAAGAAGGCCTGATAAAAACGGAGGAAGGCATTAAAAAAACGA
AGAGAGGCTTTGTTTGCTTGAGGTGCGGCCAGCACGAGCGCTTCTGTTTCGCTCGATATCCTTGCTATAGGTGCAATAAA
TGCTGCGTGTACTGCCGAGCCTGTGTCATGATGGGCAGGGTGAGCGAGTGTACGTCTCTGTTGACTTGGCGTGGACATGG
CAGACATGAATGGGCGCCTGTTTACACGGAGTGGAAAGGCGTACTTTCAGCCGGTCAGGAAAAAGCGGCAAAATCTATTA
TCGATGCTATACGCAGGAAAGAAGAGCTGTTGATCTGGGCGGTTTGCGGATCGGGAAAAACGGAACTTCTTTTTCAAGGG
ATCGAATTCGCGCTGAACAACGGTTTGAGAGTATGTATCGCCACTCCGAGAACAGACGTTGTGCTTGAGCTTGAACCGCG
GTTTCGCCACGCCTTTCCCGGGATGGAAATCGCCGCTTTATACGGAGGAAGCCCAGACGTTGGAACTCTCTCGCCGCTTG
TCATTTCAACTACCCACCAGCTGCTCCGCTATAAAGAAGCATTTGACGTGATCATCATAGATGAAGTGGATGCTTTTCCG
TATTCAATTGATAATACGCTGCAATACGCTGTTAAAAAATCGGCAAAACGGCAAAGCGCACATATCTATTTAACCGCCAC
GCCTTCTAGAGATATGAAGAAAAGGGCTGAGAGCGGAAAGCTGCACACCGTCCGTATTCCCGCAAGATTCCACCGCAGCC
CGCTGCCTGAACCTACATTGATTTGGTGCGGAAATTGGGAAAGGGGTTTGAAACGAAGAAAAGTTCCTTTACGCCTCAAA
AAATGGCTTTTTAAGCACCAGGAACTGCAGCATCCGGTTTTTCTGTTTGTTCCCTCTATCCCTGTTCTTAAAACCGTGGT
CAGTCTGCTGAAAAAGGAAACATTTCGCGCAGAAGGAGTCTATGCCGAAGACCCTGAGAGAAATGAAAAAGTGAACCGGT
TTAGGAAAAGTAAGCTTGAGGTGCTCGTGACGACAACGATTTTGGAAAGAGGTGTAACCGTCAAAAAAGCACAGGTTGGA
GTATTGGGGGCAGAGGCGGCTGTTTTTTCGGAAAGCGCTCTTGTCCAAATGGCCGGCAGAGCCGGCAGGCATCCTGAGCA
CACAGATGGAGACGTCTGTTTCTTTCATCACGGTAGGACAAAGTCGATGAATGCGGCACGACGTCATATTCAATATATGA
ATAAACTGTCTAAAAAGGAAATGTTGATTGACTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 |
56.987 |
98.707 |
0.563 |