Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | HC658_RS17950 | Genome accession | NZ_CP051466 |
| Coordinates | 3484119..3485510 (-) | Length | 463 a.a. |
| NCBI ID | WP_089172669.1 | Uniprot ID | - |
| Organism | Bacillus subtilis subsp. subtilis strain UCMB5021 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 3479119..3490510
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HC658_RS17915 (HC658_35290) | flgL | 3479291..3480187 (-) | 897 | WP_003228004.1 | flagellar hook-associated protein FlgL | - |
| HC658_RS17920 (HC658_35300) | flgK | 3480198..3481721 (-) | 1524 | WP_003228001.1 | flagellar hook-associated protein FlgK | - |
| HC658_RS17925 (HC658_35310) | flgN | 3481740..3482222 (-) | 483 | WP_032722609.1 | flagellar protein FlgN | - |
| HC658_RS17930 (HC658_35320) | flgM | 3482238..3482504 (-) | 267 | WP_014478128.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| HC658_RS17935 (HC658_35330) | yvyF | 3482585..3483004 (-) | 420 | WP_003227995.1 | TIGR03826 family flagellar region protein | - |
| HC658_RS17940 (HC658_35340) | comFC | 3483077..3483799 (-) | 723 | WP_014481051.1 | comF operon protein ComFC | Machinery gene |
| HC658_RS17945 (HC658_35350) | comFB | 3483763..3484059 (-) | 297 | WP_015483764.1 | late competence protein ComFB | - |
| HC658_RS17950 (HC658_35360) | comFA | 3484119..3485510 (-) | 1392 | WP_089172669.1 | ATP-dependent helicase ComFA | Machinery gene |
| HC658_RS17955 (HC658_35370) | fakBA | 3485616..3486461 (-) | 846 | WP_003244125.1 | DegV family protein | - |
| HC658_RS17960 (HC658_35380) | degU | 3486559..3487248 (-) | 690 | WP_003219701.1 | two-component system response regulator DegU | Regulator |
| HC658_RS17965 (HC658_35390) | degS | 3487331..3488488 (-) | 1158 | WP_003227983.1 | two-component sensor histidine kinase DegS | Regulator |
| HC658_RS17970 (HC658_35400) | yvyE | 3488705..3489358 (+) | 654 | WP_003227979.1 | YigZ family protein | - |
Sequence
Protein
Download Length: 463 a.a. Molecular weight: 52587.68 Da Isoelectric Point: 10.0153
>NTDB_id=438680 HC658_RS17950 WP_089172669.1 3484119..3485510(-) (comFA) [Bacillus subtilis subsp. subtilis strain UCMB5021]
MNVPVEKNGSFSKELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRRYRCNRCGQTDQRYFSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWKEENESNWKSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHIKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIYFHFGKTKSMLDARKHIKEMNELAAKVECTD
MNVPVEKNGSFSKELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRRYRCNRCGQTDQRYFSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWKEENESNWKSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHIKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIYFHFGKTKSMLDARKHIKEMNELAAKVECTD
Nucleotide
Download Length: 1392 bp
>NTDB_id=438680 HC658_RS17950 WP_089172669.1 3484119..3485510(-) (comFA) [Bacillus subtilis subsp. subtilis strain UCMB5021]
GTGAATGTGCCAGTTGAAAAAAACGGTTCCTTTTCAAAAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGAGATATAGATGTAATAGGTGCGGACAAACTGATCAGCGGTATTTTTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGAAAGAGGAAAATGA
ATCAAACTGGAAGTCTATTAAGCTGACATGGGATGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGACT
CAAGGCTGCCTTTCAGGGTGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCTGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATATAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATTTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTACTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG
GTGAATGTGCCAGTTGAAAAAAACGGTTCCTTTTCAAAAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGAGATATAGATGTAATAGGTGCGGACAAACTGATCAGCGGTATTTTTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGAAAGAGGAAAATGA
ATCAAACTGGAAGTCTATTAAGCTGACATGGGATGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGACT
CAAGGCTGCCTTTCAGGGTGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCTGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATATAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATTTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTACTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 |
99.136 |
100 |
0.991 |