Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | MHH76_RS18840 | Genome accession | NZ_CP150261 |
| Coordinates | 3730484..3731878 (-) | Length | 464 a.a. |
| NCBI ID | WP_020453117.1 | Uniprot ID | - |
| Organism | Bacillus sp. FSL L8-0193 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 3725484..3736878
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| MHH76_RS18805 (MHH76_18805) | flgL | 3725671..3726582 (-) | 912 | WP_020453110.1 | flagellar hook-associated protein FlgL | - |
| MHH76_RS18810 (MHH76_18810) | flgK | 3726593..3728116 (-) | 1524 | WP_043054491.1 | flagellar hook-associated protein FlgK | - |
| MHH76_RS18815 (MHH76_18815) | - | 3728132..3728608 (-) | 477 | WP_023856391.1 | flagellar protein FlgN | - |
| MHH76_RS18820 (MHH76_18820) | flgM | 3728627..3728893 (-) | 267 | WP_020453113.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| MHH76_RS18825 (MHH76_18825) | - | 3728973..3729392 (-) | 420 | WP_020453114.1 | TIGR03826 family flagellar region protein | - |
| MHH76_RS18830 (MHH76_18830) | comFC | 3729451..3730188 (-) | 738 | WP_026580174.1 | ComF family protein | Machinery gene |
| MHH76_RS18835 (MHH76_18835) | - | 3730145..3730429 (-) | 285 | WP_020453116.1 | late competence development ComFB family protein | - |
| MHH76_RS18840 (MHH76_18840) | comFA | 3730484..3731878 (-) | 1395 | WP_020453117.1 | DEAD/DEAH box helicase | Machinery gene |
| MHH76_RS18845 (MHH76_18845) | - | 3732002..3732844 (-) | 843 | WP_020453118.1 | DegV family protein | - |
| MHH76_RS18850 (MHH76_18850) | degU | 3732964..3733653 (-) | 690 | WP_003185730.1 | two-component system response regulator DegU | Regulator |
| MHH76_RS18855 (MHH76_18855) | degS | 3733735..3734892 (-) | 1158 | WP_020453119.1 | histidine kinase | Regulator |
| MHH76_RS18860 (MHH76_18860) | - | 3735115..3735753 (+) | 639 | WP_023856394.1 | YigZ family protein | - |
| MHH76_RS18865 (MHH76_18865) | - | 3735768..3736847 (+) | 1080 | WP_020453121.1 | LCP family protein | - |
Sequence
Protein
Download Length: 464 a.a. Molecular weight: 53050.78 Da Isoelectric Point: 9.7663
>NTDB_id=968556 MHH76_RS18840 WP_020453117.1 3730484..3731878(-) (comFA) [Bacillus sp. FSL L8-0193]
MIHIEHPASYSCELRSCLEQRHLLKSELSFPEPVIDWHIQEGLIKTEEGIKKTKRGFVCLRCGQHERFCFARYPCYRCNK
CCVYCRACVMMGRVSECTSLLTWRGHGRHEWAPVYTEWKGVLSAGQEKAAKSIIDAIRRKEELLIWAVCGSGKTELLFQG
IEFALNNGLRVCIATPRTDVVLELEPRFRHAFPGMEIAALYGGSPDVGTLSPLVISTTHQLLRYKEAFDVIIIDEVDAFP
YSIDNTLQYAVKKSAKRQSAHIYLTATPSRDMKKRAESGKLHTVRIPARFHRSPLPEPTLIWCGNWERGLKRRKVPLRLK
KWLFKHQELQHPVFLFVPSIPVLKTVVSLLKKETFRAEGVYAEDPERNEKVNRFRKSKLEVLVTTTILERGVTVKKAQVG
VLGAESAVFSESALVQMAGRAGRHPEHTDGDVCFFHHGRTKSMNAARRHIQYMNKLSKKEMLID
MIHIEHPASYSCELRSCLEQRHLLKSELSFPEPVIDWHIQEGLIKTEEGIKKTKRGFVCLRCGQHERFCFARYPCYRCNK
CCVYCRACVMMGRVSECTSLLTWRGHGRHEWAPVYTEWKGVLSAGQEKAAKSIIDAIRRKEELLIWAVCGSGKTELLFQG
IEFALNNGLRVCIATPRTDVVLELEPRFRHAFPGMEIAALYGGSPDVGTLSPLVISTTHQLLRYKEAFDVIIIDEVDAFP
YSIDNTLQYAVKKSAKRQSAHIYLTATPSRDMKKRAESGKLHTVRIPARFHRSPLPEPTLIWCGNWERGLKRRKVPLRLK
KWLFKHQELQHPVFLFVPSIPVLKTVVSLLKKETFRAEGVYAEDPERNEKVNRFRKSKLEVLVTTTILERGVTVKKAQVG
VLGAESAVFSESALVQMAGRAGRHPEHTDGDVCFFHHGRTKSMNAARRHIQYMNKLSKKEMLID
Nucleotide
Download Length: 1395 bp
>NTDB_id=968556 MHH76_RS18840 WP_020453117.1 3730484..3731878(-) (comFA) [Bacillus sp. FSL L8-0193]
GTGATTCACATCGAACATCCGGCATCCTATTCCTGTGAATTGCGGTCATGTTTGGAGCAGCGCCACCTTCTGAAAAGCGA
ACTTTCTTTTCCGGAACCTGTCATTGATTGGCATATCCAAGAAGGCCTGATAAAAACGGAGGAAGGCATTAAAAAAACGA
AGAGAGGCTTTGTTTGCTTGAGGTGCGGCCAGCACGAGCGCTTCTGTTTCGCTCGATATCCTTGCTATAGGTGCAATAAA
TGCTGCGTGTACTGCCGAGCCTGTGTCATGATGGGCAGGGTGAGCGAGTGTACGTCTCTGTTGACTTGGCGTGGACATGG
CAGACATGAATGGGCGCCTGTTTACACGGAGTGGAAAGGCGTACTTTCAGCCGGTCAGGAAAAAGCGGCAAAATCTATTA
TCGATGCTATACGCAGGAAAGAAGAGCTGTTGATCTGGGCGGTTTGCGGATCGGGAAAAACGGAACTTCTTTTTCAAGGG
ATCGAATTCGCGCTGAACAACGGTTTGAGAGTATGTATCGCCACTCCGAGAACAGACGTTGTGCTTGAGCTTGAACCGCG
GTTTCGCCACGCCTTTCCCGGGATGGAAATCGCCGCTTTATACGGAGGAAGCCCAGACGTTGGAACTCTCTCGCCGCTTG
TCATTTCAACTACCCACCAGCTGCTCCGCTATAAAGAAGCATTTGACGTGATCATCATAGATGAAGTGGATGCTTTTCCG
TATTCTATTGATAATACGCTGCAATACGCTGTTAAAAAATCGGCAAAACGGCAAAGCGCACATATCTATTTAACCGCCAC
GCCTTCTAGAGATATGAAGAAAAGGGCTGAGAGCGGAAAGCTGCACACCGTCCGTATTCCCGCAAGATTCCACCGCAGCC
CGCTGCCTGAACCTACATTGATTTGGTGCGGAAATTGGGAAAGGGGTTTGAAACGAAGAAAAGTTCCTTTACGCCTCAAA
AAATGGCTTTTTAAGCACCAGGAACTGCAGCATCCGGTTTTTCTGTTTGTTCCCTCTATCCCTGTTCTTAAAACCGTGGT
CAGTCTGCTGAAAAAGGAAACATTTCGCGCAGAAGGAGTCTATGCCGAAGACCCTGAGAGAAATGAAAAAGTGAACCGGT
TTAGGAAAAGTAAGCTTGAGGTGCTCGTGACGACAACGATTTTGGAAAGAGGTGTAACCGTCAAAAAAGCACAGGTTGGA
GTATTGGGGGCAGAGTCGGCTGTTTTTTCGGAAAGCGCTCTTGTCCAAATGGCCGGCAGAGCCGGCAGGCATCCTGAGCA
CACAGATGGAGACGTCTGTTTCTTTCATCACGGTAGGACAAAGTCGATGAATGCGGCACGACGTCATATTCAATATATGA
ATAAACTGTCTAAAAAGGAAATGTTGATTGACTAG
GTGATTCACATCGAACATCCGGCATCCTATTCCTGTGAATTGCGGTCATGTTTGGAGCAGCGCCACCTTCTGAAAAGCGA
ACTTTCTTTTCCGGAACCTGTCATTGATTGGCATATCCAAGAAGGCCTGATAAAAACGGAGGAAGGCATTAAAAAAACGA
AGAGAGGCTTTGTTTGCTTGAGGTGCGGCCAGCACGAGCGCTTCTGTTTCGCTCGATATCCTTGCTATAGGTGCAATAAA
TGCTGCGTGTACTGCCGAGCCTGTGTCATGATGGGCAGGGTGAGCGAGTGTACGTCTCTGTTGACTTGGCGTGGACATGG
CAGACATGAATGGGCGCCTGTTTACACGGAGTGGAAAGGCGTACTTTCAGCCGGTCAGGAAAAAGCGGCAAAATCTATTA
TCGATGCTATACGCAGGAAAGAAGAGCTGTTGATCTGGGCGGTTTGCGGATCGGGAAAAACGGAACTTCTTTTTCAAGGG
ATCGAATTCGCGCTGAACAACGGTTTGAGAGTATGTATCGCCACTCCGAGAACAGACGTTGTGCTTGAGCTTGAACCGCG
GTTTCGCCACGCCTTTCCCGGGATGGAAATCGCCGCTTTATACGGAGGAAGCCCAGACGTTGGAACTCTCTCGCCGCTTG
TCATTTCAACTACCCACCAGCTGCTCCGCTATAAAGAAGCATTTGACGTGATCATCATAGATGAAGTGGATGCTTTTCCG
TATTCTATTGATAATACGCTGCAATACGCTGTTAAAAAATCGGCAAAACGGCAAAGCGCACATATCTATTTAACCGCCAC
GCCTTCTAGAGATATGAAGAAAAGGGCTGAGAGCGGAAAGCTGCACACCGTCCGTATTCCCGCAAGATTCCACCGCAGCC
CGCTGCCTGAACCTACATTGATTTGGTGCGGAAATTGGGAAAGGGGTTTGAAACGAAGAAAAGTTCCTTTACGCCTCAAA
AAATGGCTTTTTAAGCACCAGGAACTGCAGCATCCGGTTTTTCTGTTTGTTCCCTCTATCCCTGTTCTTAAAACCGTGGT
CAGTCTGCTGAAAAAGGAAACATTTCGCGCAGAAGGAGTCTATGCCGAAGACCCTGAGAGAAATGAAAAAGTGAACCGGT
TTAGGAAAAGTAAGCTTGAGGTGCTCGTGACGACAACGATTTTGGAAAGAGGTGTAACCGTCAAAAAAGCACAGGTTGGA
GTATTGGGGGCAGAGTCGGCTGTTTTTTCGGAAAGCGCTCTTGTCCAAATGGCCGGCAGAGCCGGCAGGCATCCTGAGCA
CACAGATGGAGACGTCTGTTTCTTTCATCACGGTAGGACAAAGTCGATGAATGCGGCACGACGTCATATTCAATATATGA
ATAAACTGTCTAAAAAGGAAATGTTGATTGACTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 |
57.424 |
98.707 |
0.567 |