Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | BALI_RS18675 | Genome accession | NC_021362 |
| Coordinates | 3770043..3771437 (-) | Length | 464 a.a. |
| NCBI ID | WP_020453117.1 | Uniprot ID | - |
| Organism | Bacillus paralicheniformis ATCC 9945a | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 3765043..3776437
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| BALI_RS18640 (BaLi_c37830) | flgL | 3765230..3766141 (-) | 912 | WP_020453110.1 | flagellar hook-associated protein FlgL | - |
| BALI_RS18645 (BaLi_c37840) | flgK | 3766152..3767675 (-) | 1524 | WP_020453111.1 | flagellar hook-associated protein FlgK | - |
| BALI_RS18650 (BaLi_c37850) | - | 3767691..3768167 (-) | 477 | WP_020453112.1 | flagellar protein FlgN | - |
| BALI_RS18655 (BaLi_c37860) | flgM | 3768186..3768452 (-) | 267 | WP_020453113.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| BALI_RS18660 (BaLi_c37870) | - | 3768532..3768951 (-) | 420 | WP_020453114.1 | TIGR03826 family flagellar region protein | - |
| BALI_RS18665 (BaLi_c37880) | comFC | 3769010..3769747 (-) | 738 | WP_026580174.1 | ComF family protein | Machinery gene |
| BALI_RS18670 (BaLi_c37890) | - | 3769704..3769988 (-) | 285 | WP_020453116.1 | late competence development ComFB family protein | - |
| BALI_RS18675 (BaLi_c37900) | comFA | 3770043..3771437 (-) | 1395 | WP_020453117.1 | DEAD/DEAH box helicase | Machinery gene |
| BALI_RS18680 (BaLi_c37910) | - | 3771561..3772403 (-) | 843 | WP_020453118.1 | DegV family protein | - |
| BALI_RS18685 (BaLi_c37920) | degU | 3772523..3773212 (-) | 690 | WP_003185730.1 | two-component system response regulator DegU | Regulator |
| BALI_RS18690 (BaLi_c37930) | degS | 3773294..3774451 (-) | 1158 | WP_020453119.1 | sensor histidine kinase | Regulator |
| BALI_RS18695 (BaLi_c37940) | - | 3774674..3775312 (+) | 639 | WP_020453120.1 | YigZ family protein | - |
| BALI_RS18700 (BaLi_c37950) | - | 3775327..3776406 (+) | 1080 | WP_020453121.1 | LCP family protein | - |
Sequence
Protein
Download Length: 464 a.a. Molecular weight: 53050.78 Da Isoelectric Point: 9.7663
>NTDB_id=59029 BALI_RS18675 WP_020453117.1 3770043..3771437(-) (comFA) [Bacillus paralicheniformis ATCC 9945a]
MIHIEHPASYSCELRSCLEQRHLLKSELSFPEPVIDWHIQEGLIKTEEGIKKTKRGFVCLRCGQHERFCFARYPCYRCNK
CCVYCRACVMMGRVSECTSLLTWRGHGRHEWAPVYTEWKGVLSAGQEKAAKSIIDAIRRKEELLIWAVCGSGKTELLFQG
IEFALNNGLRVCIATPRTDVVLELEPRFRHAFPGMEIAALYGGSPDVGTLSPLVISTTHQLLRYKEAFDVIIIDEVDAFP
YSIDNTLQYAVKKSAKRQSAHIYLTATPSRDMKKRAESGKLHTVRIPARFHRSPLPEPTLIWCGNWERGLKRRKVPLRLK
KWLFKHQELQHPVFLFVPSIPVLKTVVSLLKKETFRAEGVYAEDPERNEKVNRFRKSKLEVLVTTTILERGVTVKKAQVG
VLGAESAVFSESALVQMAGRAGRHPEHTDGDVCFFHHGRTKSMNAARRHIQYMNKLSKKEMLID
MIHIEHPASYSCELRSCLEQRHLLKSELSFPEPVIDWHIQEGLIKTEEGIKKTKRGFVCLRCGQHERFCFARYPCYRCNK
CCVYCRACVMMGRVSECTSLLTWRGHGRHEWAPVYTEWKGVLSAGQEKAAKSIIDAIRRKEELLIWAVCGSGKTELLFQG
IEFALNNGLRVCIATPRTDVVLELEPRFRHAFPGMEIAALYGGSPDVGTLSPLVISTTHQLLRYKEAFDVIIIDEVDAFP
YSIDNTLQYAVKKSAKRQSAHIYLTATPSRDMKKRAESGKLHTVRIPARFHRSPLPEPTLIWCGNWERGLKRRKVPLRLK
KWLFKHQELQHPVFLFVPSIPVLKTVVSLLKKETFRAEGVYAEDPERNEKVNRFRKSKLEVLVTTTILERGVTVKKAQVG
VLGAESAVFSESALVQMAGRAGRHPEHTDGDVCFFHHGRTKSMNAARRHIQYMNKLSKKEMLID
Nucleotide
Download Length: 1395 bp
>NTDB_id=59029 BALI_RS18675 WP_020453117.1 3770043..3771437(-) (comFA) [Bacillus paralicheniformis ATCC 9945a]
GTGATTCACATCGAACATCCGGCATCCTATTCCTGTGAATTGCGGTCATGTTTGGAGCAGCGCCACCTTCTGAAAAGCGA
ACTTTCTTTTCCGGAACCTGTCATTGATTGGCATATCCAAGAAGGCCTGATAAAAACGGAGGAAGGCATTAAAAAAACGA
AGAGAGGCTTTGTTTGCTTGAGGTGCGGCCAGCACGAGCGCTTCTGTTTCGCTCGATATCCTTGCTATAGGTGCAATAAA
TGCTGCGTGTACTGCCGAGCCTGTGTCATGATGGGCAGGGTGAGCGAGTGTACGTCTCTGTTGACTTGGCGTGGACATGG
CAGACATGAATGGGCGCCTGTTTACACGGAGTGGAAAGGCGTACTTTCAGCCGGTCAGGAAAAAGCGGCAAAATCTATTA
TCGATGCTATACGCAGGAAAGAAGAGCTGTTGATCTGGGCGGTTTGCGGATCGGGAAAAACGGAACTTCTTTTTCAAGGG
ATCGAATTCGCGCTGAACAACGGTTTGAGAGTATGTATCGCCACTCCGAGAACAGACGTTGTGCTTGAGCTTGAACCGCG
GTTTCGCCACGCCTTTCCCGGGATGGAAATCGCCGCTTTATACGGAGGAAGCCCAGACGTTGGAACTCTCTCGCCGCTTG
TCATTTCAACTACCCACCAGCTGCTCCGCTATAAAGAAGCATTTGACGTGATCATCATAGATGAAGTGGATGCTTTTCCG
TATTCTATTGATAATACGCTGCAATACGCTGTTAAAAAATCGGCAAAACGGCAAAGCGCACATATCTATTTAACCGCCAC
GCCTTCTAGAGATATGAAGAAAAGGGCTGAGAGCGGAAAGCTGCACACCGTCCGTATTCCCGCAAGATTCCACCGCAGCC
CGCTGCCTGAACCTACATTGATTTGGTGCGGAAATTGGGAAAGGGGTTTGAAACGAAGAAAAGTTCCTTTACGCCTCAAA
AAATGGCTTTTTAAGCACCAGGAACTGCAGCATCCGGTTTTTCTGTTTGTTCCCTCTATCCCTGTTCTTAAAACCGTGGT
CAGTCTGCTGAAAAAGGAAACATTTCGCGCAGAAGGAGTCTATGCCGAAGACCCTGAGAGAAATGAAAAAGTGAACCGGT
TTAGGAAAAGTAAGCTTGAGGTGCTCGTGACGACAACGATTTTGGAAAGAGGTGTAACCGTCAAAAAAGCACAGGTTGGA
GTATTGGGGGCAGAGTCGGCTGTTTTTTCGGAAAGCGCTCTTGTCCAAATGGCCGGCAGAGCCGGCAGGCATCCTGAGCA
CACAGATGGAGACGTCTGTTTCTTTCATCACGGTAGGACAAAGTCGATGAATGCGGCACGACGTCATATTCAATATATGA
ATAAACTGTCTAAAAAGGAAATGTTGATTGACTAG
GTGATTCACATCGAACATCCGGCATCCTATTCCTGTGAATTGCGGTCATGTTTGGAGCAGCGCCACCTTCTGAAAAGCGA
ACTTTCTTTTCCGGAACCTGTCATTGATTGGCATATCCAAGAAGGCCTGATAAAAACGGAGGAAGGCATTAAAAAAACGA
AGAGAGGCTTTGTTTGCTTGAGGTGCGGCCAGCACGAGCGCTTCTGTTTCGCTCGATATCCTTGCTATAGGTGCAATAAA
TGCTGCGTGTACTGCCGAGCCTGTGTCATGATGGGCAGGGTGAGCGAGTGTACGTCTCTGTTGACTTGGCGTGGACATGG
CAGACATGAATGGGCGCCTGTTTACACGGAGTGGAAAGGCGTACTTTCAGCCGGTCAGGAAAAAGCGGCAAAATCTATTA
TCGATGCTATACGCAGGAAAGAAGAGCTGTTGATCTGGGCGGTTTGCGGATCGGGAAAAACGGAACTTCTTTTTCAAGGG
ATCGAATTCGCGCTGAACAACGGTTTGAGAGTATGTATCGCCACTCCGAGAACAGACGTTGTGCTTGAGCTTGAACCGCG
GTTTCGCCACGCCTTTCCCGGGATGGAAATCGCCGCTTTATACGGAGGAAGCCCAGACGTTGGAACTCTCTCGCCGCTTG
TCATTTCAACTACCCACCAGCTGCTCCGCTATAAAGAAGCATTTGACGTGATCATCATAGATGAAGTGGATGCTTTTCCG
TATTCTATTGATAATACGCTGCAATACGCTGTTAAAAAATCGGCAAAACGGCAAAGCGCACATATCTATTTAACCGCCAC
GCCTTCTAGAGATATGAAGAAAAGGGCTGAGAGCGGAAAGCTGCACACCGTCCGTATTCCCGCAAGATTCCACCGCAGCC
CGCTGCCTGAACCTACATTGATTTGGTGCGGAAATTGGGAAAGGGGTTTGAAACGAAGAAAAGTTCCTTTACGCCTCAAA
AAATGGCTTTTTAAGCACCAGGAACTGCAGCATCCGGTTTTTCTGTTTGTTCCCTCTATCCCTGTTCTTAAAACCGTGGT
CAGTCTGCTGAAAAAGGAAACATTTCGCGCAGAAGGAGTCTATGCCGAAGACCCTGAGAGAAATGAAAAAGTGAACCGGT
TTAGGAAAAGTAAGCTTGAGGTGCTCGTGACGACAACGATTTTGGAAAGAGGTGTAACCGTCAAAAAAGCACAGGTTGGA
GTATTGGGGGCAGAGTCGGCTGTTTTTTCGGAAAGCGCTCTTGTCCAAATGGCCGGCAGAGCCGGCAGGCATCCTGAGCA
CACAGATGGAGACGTCTGTTTCTTTCATCACGGTAGGACAAAGTCGATGAATGCGGCACGACGTCATATTCAATATATGA
ATAAACTGTCTAAAAAGGAAATGTTGATTGACTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 |
57.424 |
98.707 |
0.567 |