Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | P5637_RS15250 | Genome accession | NZ_CP120601 |
| Coordinates | 2931821..2933215 (-) | Length | 464 a.a. |
| NCBI ID | WP_023856393.1 | Uniprot ID | - |
| Organism | Bacillus paralicheniformis strain PRO109 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 2926821..2938215
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| P5637_RS15215 (P5637_15215) | flgL | 2927008..2927919 (-) | 912 | WP_023856389.1 | flagellar hook-associated protein FlgL | - |
| P5637_RS15220 (P5637_15220) | flgK | 2927930..2929453 (-) | 1524 | WP_025810701.1 | flagellar hook-associated protein FlgK | - |
| P5637_RS15225 (P5637_15225) | - | 2929469..2929945 (-) | 477 | WP_025810700.1 | flagellar protein FlgN | - |
| P5637_RS15230 (P5637_15230) | flgM | 2929964..2930230 (-) | 267 | WP_003185713.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| P5637_RS15235 (P5637_15235) | - | 2930310..2930729 (-) | 420 | WP_020453114.1 | TIGR03826 family flagellar region protein | - |
| P5637_RS15240 (P5637_15240) | comFC | 2930788..2931525 (-) | 738 | WP_026580174.1 | ComF family protein | Machinery gene |
| P5637_RS15245 (P5637_15245) | - | 2931482..2931766 (-) | 285 | WP_145685822.1 | late competence development ComFB family protein | - |
| P5637_RS15250 (P5637_15250) | comFA | 2931821..2933215 (-) | 1395 | WP_023856393.1 | DEAD/DEAH box helicase | Machinery gene |
| P5637_RS15255 (P5637_15255) | - | 2933339..2934181 (-) | 843 | WP_020453118.1 | DegV family protein | - |
| P5637_RS15260 (P5637_15260) | degU | 2934301..2934990 (-) | 690 | WP_003185730.1 | two-component system response regulator DegU | Regulator |
| P5637_RS15265 (P5637_15265) | degS | 2935072..2936229 (-) | 1158 | WP_020453119.1 | histidine kinase | Regulator |
| P5637_RS15270 (P5637_15270) | - | 2936452..2937090 (+) | 639 | WP_023856394.1 | YigZ family protein | - |
| P5637_RS15275 (P5637_15275) | - | 2937105..2938184 (+) | 1080 | WP_023856395.1 | LCP family protein | - |
Sequence
Protein
Download Length: 464 a.a. Molecular weight: 53060.82 Da Isoelectric Point: 9.7663
>NTDB_id=806666 P5637_RS15250 WP_023856393.1 2931821..2933215(-) (comFA) [Bacillus paralicheniformis strain PRO109]
MIHIEHPASYSCELRSCLEQRHLLKSELPFPEPVIDWHIQEGLIKTEEGIKKTKRGFVCLRCGQHERFCFARYPCYRCNK
CCVYCRACVMMGRVSECTSLLTWRGHGRHEWAPVYTEWKGVLSAGQEKAAKSIIDAIRRKEELLIWAVCGSGKTELLFQG
IEFALNNGLRVCIATPRTDVVLELEPRFRHAFPGMEIAALYGGSPDVGTLSPLVISTTHQLLRYKEAFDVIIIDEVDAFP
YSIDNTLQYAVKKSAKRQSAHIYLTATPSRDMKKRAESGKLHTVRIPARFHRSPLPEPTLIWCGNWERGLKRRKVPLRLK
KWLFKHQELQHPVFLFVPSIPVLKTVVSLLKKETFRAEGVYAEDPERNEKVNRFRKSKLEVLVTTTILERGVTVKKAQVG
VLGAESAVFSESALVQMAGRAGRHPEHTDGDVCFFHHGRTKSMNAARRHIQYMNKLSKKEMLID
MIHIEHPASYSCELRSCLEQRHLLKSELPFPEPVIDWHIQEGLIKTEEGIKKTKRGFVCLRCGQHERFCFARYPCYRCNK
CCVYCRACVMMGRVSECTSLLTWRGHGRHEWAPVYTEWKGVLSAGQEKAAKSIIDAIRRKEELLIWAVCGSGKTELLFQG
IEFALNNGLRVCIATPRTDVVLELEPRFRHAFPGMEIAALYGGSPDVGTLSPLVISTTHQLLRYKEAFDVIIIDEVDAFP
YSIDNTLQYAVKKSAKRQSAHIYLTATPSRDMKKRAESGKLHTVRIPARFHRSPLPEPTLIWCGNWERGLKRRKVPLRLK
KWLFKHQELQHPVFLFVPSIPVLKTVVSLLKKETFRAEGVYAEDPERNEKVNRFRKSKLEVLVTTTILERGVTVKKAQVG
VLGAESAVFSESALVQMAGRAGRHPEHTDGDVCFFHHGRTKSMNAARRHIQYMNKLSKKEMLID
Nucleotide
Download Length: 1395 bp
>NTDB_id=806666 P5637_RS15250 WP_023856393.1 2931821..2933215(-) (comFA) [Bacillus paralicheniformis strain PRO109]
GTGATTCACATCGAACATCCGGCATCCTATTCCTGTGAATTGCGGTCATGTTTGGAGCAGCGCCACCTTCTGAAAAGCGA
ACTTCCTTTTCCGGAACCTGTCATTGATTGGCATATCCAAGAAGGCCTGATAAAAACGGAGGAAGGCATTAAAAAAACGA
AGAGAGGCTTTGTTTGCTTGAGGTGCGGCCAGCACGAGCGCTTCTGTTTCGCTCGATATCCTTGCTATAGGTGCAATAAA
TGCTGCGTGTACTGCCGAGCCTGTGTCATGATGGGCAGGGTGAGCGAGTGTACGTCTCTGTTGACTTGGCGTGGACATGG
CAGACATGAATGGGCGCCTGTTTACACGGAGTGGAAAGGCGTACTTTCAGCCGGTCAGGAAAAAGCGGCAAAATCTATTA
TCGATGCTATACGCAGGAAAGAAGAGCTGTTGATCTGGGCGGTTTGCGGATCGGGAAAAACGGAACTTCTTTTTCAAGGG
ATCGAATTCGCGCTGAACAACGGTTTGAGAGTATGTATCGCCACTCCGAGAACAGACGTTGTGCTTGAGCTTGAACCGCG
GTTTCGCCACGCCTTTCCCGGGATGGAAATCGCCGCTTTATACGGAGGAAGCCCAGACGTTGGAACTCTCTCGCCGCTTG
TCATTTCAACTACCCACCAGCTGCTCCGCTATAAAGAAGCATTTGACGTGATCATCATAGATGAAGTGGATGCTTTTCCG
TATTCTATTGATAATACGCTGCAATACGCTGTTAAAAAATCGGCAAAACGGCAAAGCGCACATATCTATTTAACCGCCAC
GCCTTCTAGAGATATGAAGAAAAGGGCTGAGAGCGGAAAGCTGCACACCGTCCGTATTCCCGCAAGATTCCACCGCAGCC
CGCTGCCTGAACCTACATTGATTTGGTGCGGAAATTGGGAAAGGGGTTTGAAACGAAGAAAAGTTCCTTTACGCCTCAAA
AAATGGCTTTTTAAGCACCAGGAACTGCAGCATCCGGTTTTTCTGTTTGTTCCCTCTATCCCTGTTCTTAAAACCGTGGT
CAGTCTGCTGAAAAAGGAAACATTTCGCGCAGAAGGAGTCTATGCCGAAGACCCTGAGAGAAATGAAAAAGTGAACCGGT
TTAGGAAAAGTAAGCTTGAGGTGCTCGTGACGACAACGATTTTGGAAAGAGGTGTAACCGTCAAAAAAGCACAGGTTGGA
GTATTGGGGGCAGAGTCGGCTGTTTTTTCGGAAAGCGCTCTTGTCCAAATGGCCGGCAGAGCCGGCAGGCATCCTGAGCA
CACAGATGGAGACGTCTGTTTCTTTCATCACGGTAGGACAAAGTCGATGAATGCGGCACGACGTCATATTCAATATATGA
ATAAACTGTCTAAAAAGGAAATGTTGATTGACTAG
GTGATTCACATCGAACATCCGGCATCCTATTCCTGTGAATTGCGGTCATGTTTGGAGCAGCGCCACCTTCTGAAAAGCGA
ACTTCCTTTTCCGGAACCTGTCATTGATTGGCATATCCAAGAAGGCCTGATAAAAACGGAGGAAGGCATTAAAAAAACGA
AGAGAGGCTTTGTTTGCTTGAGGTGCGGCCAGCACGAGCGCTTCTGTTTCGCTCGATATCCTTGCTATAGGTGCAATAAA
TGCTGCGTGTACTGCCGAGCCTGTGTCATGATGGGCAGGGTGAGCGAGTGTACGTCTCTGTTGACTTGGCGTGGACATGG
CAGACATGAATGGGCGCCTGTTTACACGGAGTGGAAAGGCGTACTTTCAGCCGGTCAGGAAAAAGCGGCAAAATCTATTA
TCGATGCTATACGCAGGAAAGAAGAGCTGTTGATCTGGGCGGTTTGCGGATCGGGAAAAACGGAACTTCTTTTTCAAGGG
ATCGAATTCGCGCTGAACAACGGTTTGAGAGTATGTATCGCCACTCCGAGAACAGACGTTGTGCTTGAGCTTGAACCGCG
GTTTCGCCACGCCTTTCCCGGGATGGAAATCGCCGCTTTATACGGAGGAAGCCCAGACGTTGGAACTCTCTCGCCGCTTG
TCATTTCAACTACCCACCAGCTGCTCCGCTATAAAGAAGCATTTGACGTGATCATCATAGATGAAGTGGATGCTTTTCCG
TATTCTATTGATAATACGCTGCAATACGCTGTTAAAAAATCGGCAAAACGGCAAAGCGCACATATCTATTTAACCGCCAC
GCCTTCTAGAGATATGAAGAAAAGGGCTGAGAGCGGAAAGCTGCACACCGTCCGTATTCCCGCAAGATTCCACCGCAGCC
CGCTGCCTGAACCTACATTGATTTGGTGCGGAAATTGGGAAAGGGGTTTGAAACGAAGAAAAGTTCCTTTACGCCTCAAA
AAATGGCTTTTTAAGCACCAGGAACTGCAGCATCCGGTTTTTCTGTTTGTTCCCTCTATCCCTGTTCTTAAAACCGTGGT
CAGTCTGCTGAAAAAGGAAACATTTCGCGCAGAAGGAGTCTATGCCGAAGACCCTGAGAGAAATGAAAAAGTGAACCGGT
TTAGGAAAAGTAAGCTTGAGGTGCTCGTGACGACAACGATTTTGGAAAGAGGTGTAACCGTCAAAAAAGCACAGGTTGGA
GTATTGGGGGCAGAGTCGGCTGTTTTTTCGGAAAGCGCTCTTGTCCAAATGGCCGGCAGAGCCGGCAGGCATCCTGAGCA
CACAGATGGAGACGTCTGTTTCTTTCATCACGGTAGGACAAAGTCGATGAATGCGGCACGACGTCATATTCAATATATGA
ATAAACTGTCTAAAAAGGAAATGTTGATTGACTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 |
57.205 |
98.707 |
0.565 |