Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | KI220_RS01705 | Genome accession | NZ_AP023088 |
| Coordinates | 329789..331183 (-) | Length | 464 a.a. |
| NCBI ID | WP_020453117.1 | Uniprot ID | - |
| Organism | Bacillus paralicheniformis strain RSC-1 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 324789..336183
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| KI220_RS01670 (RSC1_00343) | flgL | 324976..325887 (-) | 912 | WP_020453110.1 | flagellar hook-associated protein FlgL | - |
| KI220_RS01675 (RSC1_00344) | flgK | 325898..327421 (-) | 1524 | WP_020453111.1 | flagellar hook-associated protein FlgK | - |
| KI220_RS01680 (RSC1_00345) | - | 327437..327913 (-) | 477 | WP_023856391.1 | flagellar protein FlgN | - |
| KI220_RS01685 (RSC1_00346) | flgM | 327932..328198 (-) | 267 | WP_020453113.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| KI220_RS01690 (RSC1_00347) | - | 328278..328697 (-) | 420 | WP_020453114.1 | TIGR03826 family flagellar region protein | - |
| KI220_RS01695 (RSC1_00348) | comFC | 328756..329493 (-) | 738 | WP_026580174.1 | ComF family protein | Machinery gene |
| KI220_RS01700 (RSC1_00349) | - | 329450..329734 (-) | 285 | WP_020453116.1 | late competence development ComFB family protein | - |
| KI220_RS01705 (RSC1_00350) | comFA | 329789..331183 (-) | 1395 | WP_020453117.1 | DEAD/DEAH box helicase | Machinery gene |
| KI220_RS01710 (RSC1_00351) | - | 331307..332149 (-) | 843 | WP_020453118.1 | DegV family protein | - |
| KI220_RS01715 (RSC1_00352) | degU | 332269..332958 (-) | 690 | WP_003185730.1 | two-component system response regulator DegU | Regulator |
| KI220_RS01720 (RSC1_00353) | degS | 333040..334197 (-) | 1158 | WP_020453119.1 | histidine kinase | Regulator |
| KI220_RS01725 (RSC1_00354) | - | 334420..335058 (+) | 639 | WP_023856394.1 | YigZ family protein | - |
| KI220_RS01730 (RSC1_00355) | - | 335073..336152 (+) | 1080 | WP_145645335.1 | LCP family protein | - |
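The coordinates in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a single terminal stop codon. The 5000 bp flank is inferred from the Location line rather than stated by the record, and the function names are hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    return cds_bp // 3 - 1


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Genomic window around a feature; the flank size is an assumption."""
    return max(1, start - flank), end + flank


# comFA (KI220_RS01705) coordinates from this record
print(feature_length(329789, 331183))
print(protein_length(1395))
print(context_window(329789, 331183))
```

With the comFA coordinates, these calls reproduce the 1395 bp size, the 464 a.a. length, and the 324789..336183 window given in this record.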
Sequence
Protein
Length: 464 a.a. | Molecular weight: 53050.78 Da | Isoelectric point: 9.7663
>NTDB_id=81265 KI220_RS01705 WP_020453117.1 329789..331183(-) (comFA) [Bacillus paralicheniformis strain RSC-1]
MIHIEHPASYSCELRSCLEQRHLLKSELSFPEPVIDWHIQEGLIKTEEGIKKTKRGFVCLRCGQHERFCFARYPCYRCNK
CCVYCRACVMMGRVSECTSLLTWRGHGRHEWAPVYTEWKGVLSAGQEKAAKSIIDAIRRKEELLIWAVCGSGKTELLFQG
IEFALNNGLRVCIATPRTDVVLELEPRFRHAFPGMEIAALYGGSPDVGTLSPLVISTTHQLLRYKEAFDVIIIDEVDAFP
YSIDNTLQYAVKKSAKRQSAHIYLTATPSRDMKKRAESGKLHTVRIPARFHRSPLPEPTLIWCGNWERGLKRRKVPLRLK
KWLFKHQELQHPVFLFVPSIPVLKTVVSLLKKETFRAEGVYAEDPERNEKVNRFRKSKLEVLVTTTILERGVTVKKAQVG
VLGAESAVFSESALVQMAGRAGRHPEHTDGDVCFFHHGRTKSMNAARRHIQYMNKLSKKEMLID
Nucleotide
Length: 1395 bp
>NTDB_id=81265 KI220_RS01705 WP_020453117.1 329789..331183(-) (comFA) [Bacillus paralicheniformis strain RSC-1]
GTGATTCACATCGAACATCCGGCATCCTATTCCTGTGAATTGCGGTCATGTTTGGAGCAGCGCCACCTTCTGAAAAGCGA
ACTTTCTTTTCCGGAACCTGTCATTGATTGGCATATCCAAGAAGGCCTGATAAAAACGGAGGAAGGCATTAAAAAAACGA
AGAGAGGCTTTGTTTGCTTGAGGTGCGGCCAGCACGAGCGCTTCTGTTTCGCTCGATATCCTTGCTATAGGTGCAATAAA
TGCTGCGTGTACTGCCGAGCCTGTGTCATGATGGGCAGGGTGAGCGAGTGTACGTCTCTGTTGACTTGGCGTGGACATGG
CAGACATGAATGGGCGCCTGTTTACACGGAGTGGAAAGGCGTACTTTCAGCCGGTCAGGAAAAAGCGGCAAAATCTATTA
TCGATGCTATACGCAGGAAAGAAGAGCTGTTGATCTGGGCGGTTTGCGGATCGGGAAAAACGGAACTTCTTTTTCAAGGG
ATCGAATTCGCGCTGAACAACGGTTTGAGAGTATGTATCGCCACTCCGAGAACAGACGTTGTGCTTGAGCTTGAACCGCG
GTTTCGCCACGCCTTTCCCGGGATGGAAATCGCCGCTTTATACGGAGGAAGCCCAGACGTTGGAACTCTCTCGCCGCTTG
TCATTTCAACTACCCACCAGCTGCTCCGCTATAAAGAAGCATTTGACGTGATCATCATAGATGAAGTGGATGCTTTTCCG
TATTCTATTGATAATACGCTGCAATACGCTGTTAAAAAATCGGCAAAACGGCAAAGCGCACATATCTATTTAACCGCCAC
GCCTTCTAGAGATATGAAGAAAAGGGCTGAGAGCGGAAAGCTGCACACCGTCCGTATTCCCGCAAGATTCCACCGCAGCC
CGCTGCCTGAACCTACATTGATTTGGTGCGGAAATTGGGAAAGGGGTTTGAAACGAAGAAAAGTTCCTTTACGCCTCAAA
AAATGGCTTTTTAAGCACCAGGAACTGCAGCATCCGGTTTTTCTGTTTGTTCCCTCTATCCCTGTTCTTAAAACCGTGGT
CAGTCTGCTGAAAAAGGAAACATTTCGCGCAGAAGGAGTCTATGCCGAAGACCCTGAGAGAAATGAAAAAGTGAACCGGT
TTAGGAAAAGTAAGCTTGAGGTGCTCGTGACGACAACGATTTTGGAAAGAGGTGTAACCGTCAAAAAAGCACAGGTTGGA
GTATTGGGGGCAGAGTCGGCTGTTTTTTCGGAAAGCGCTCTTGTCCAAATGGCCGGCAGAGCCGGCAGGCATCCTGAGCA
CACAGATGGAGACGTCTGTTTCTTTCATCACGGTAGGACAAAGTCGATGAATGCGGCACGACGTCATATTCAATATATGA
ATAAACTGTCTAAAAAGGAAATGTTGATTGACTAG
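The nucleotide sequence starts with GTG while the protein starts with M, which is consistent with GTG acting as an alternative start codon. The sketch below is one way to check a nucleotide record against its protein record. It assumes the standard codon assignments, treats ATG, GTG, and TTG as possible initiators read as methionine, and uses a hypothetical helper name, `translate_cds`.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: these codons are read as Met when they open the CDS.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence given 5'->3' on its coding strand."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    protein = "".join(CODON_TABLE[c] for c in codons)
    if codons and codons[0] in START_CODONS:
        protein = "M" + protein[1:]  # initiator codon is read as Met
    return protein.rstrip("*")  # drop the terminal stop codon


# First 21 nt of the comFA sequence in this record
print(translate_cds("GTGATTCACATCGAACATCCG"))
```

Passing the full 1395 bp sequence with line breaks removed should give a string that can be compared directly with the 464 a.a. protein record.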
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 | 57.424 | 98.707 | 0.567 |