Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | CAH07_RS03070 | Genome accession | NZ_CP021123 |
| Coordinates | 598709..600100 (+) | Length | 463 a.a. |
| NCBI ID | WP_101172232.1 | Uniprot ID | - |
| Organism | Bacillus subtilis strain SEM-9 |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake |
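The coordinates and the two lengths in the overview are related by simple arithmetic. The sketch below assumes the usual 1-based, inclusive coordinate convention and that the coding sequence includes its stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS whose length includes the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the final codon is the stop codon


gene_bp = span_length(598709, 600100)
print(gene_bp, protein_length(gene_bp))  # 1392 463
```

Under these assumptions, 598709..600100 spans 1392 bp, which is 464 codons, or 463 amino acids plus a stop codon.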
Genomic Context
Location: 593709..605100
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| CAH07_RS03050 (CAH07_03010) | yvyE | 594861..595514 (-) | 654 | WP_003227979.1 | YigZ family protein | - |
| CAH07_RS03055 (CAH07_03015) | degS | 595731..596888 (+) | 1158 | WP_003227983.1 | two-component sensor histidine kinase DegS | Regulator |
| CAH07_RS03060 (CAH07_03020) | degU | 596971..597660 (+) | 690 | WP_003219701.1 | two-component system response regulator DegU | Regulator |
| CAH07_RS03065 (CAH07_03025) | fakBA | 597758..598603 (+) | 846 | WP_003244125.1 | DegV family protein | - |
| CAH07_RS03070 (CAH07_03030) | comFA | 598709..600100 (+) | 1392 | WP_101172232.1 | ATP-dependent helicase ComFA | Machinery gene |
| CAH07_RS03075 (CAH07_03035) | comFB | 600160..600456 (+) | 297 | WP_014478130.1 | late competence protein ComFB | - |
| CAH07_RS03080 (CAH07_03040) | comFC | 600420..601142 (+) | 723 | WP_033884179.1 | comF operon protein ComFC | Machinery gene |
| CAH07_RS03085 (CAH07_03045) | yvyF | 601215..601634 (+) | 420 | WP_003227995.1 | TIGR03826 family flagellar region protein | - |
| CAH07_RS03090 (CAH07_03050) | flgM | 601715..601981 (+) | 267 | WP_014478128.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| CAH07_RS03095 (CAH07_03055) | flgN | 601997..602479 (+) | 483 | WP_014478127.1 | flagellar protein FlgN | - |
| CAH07_RS03100 (CAH07_03060) | flgK | 602498..604021 (+) | 1524 | WP_003228001.1 | flagellar hook-associated protein FlgK | - |
| CAH07_RS03105 (CAH07_03065) | flgL | 604032..604928 (+) | 897 | WP_015714797.1 | flagellar hook-associated protein FlgL | - |
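The listed location (593709..605100) matches the comFA coordinates extended by 5000 bp on each side, and the table coordinates can be used to check the listed sizes and the spacing between neighbouring genes. The following is a minimal sketch using a subset of the rows; the 5000 bp flank is inferred from the numbers above rather than stated on the page, and the helper names are hypothetical.

```python
# Subset of the genomic-context table: (gene, start, end, listed size in bp).
GENES = [
    ("fakBA", 597758, 598603, 846),
    ("comFA", 598709, 600100, 1392),
    ("comFB", 600160, 600456, 297),
    ("comFC", 600420, 601142, 723),
    ("yvyF", 601215, 601634, 420),
]
FLANK = 5000  # assumption: inferred from the listed window, not documented


def window(start, end, flank=FLANK):
    """Context window around a gene, clipped at position 1."""
    return max(1, start - flank), end + flank


def spacing(genes):
    """Yield (upstream, downstream, gap_bp); a negative gap is an overlap."""
    for (g1, _, e1, _), (g2, s2, _, _) in zip(genes, genes[1:]):
        yield g1, g2, s2 - e1 - 1


# Each listed size should equal end - start + 1 (inclusive coordinates).
for name, start, end, size in GENES:
    assert end - start + 1 == size, name

print(window(598709, 600100))  # (593709, 605100)
for up, down, gap in spacing(GENES):
    print(f"{up} -> {down}: {gap} bp")
```

With these rows, comFB and comFC come out with a gap of -37 bp, meaning their annotated coordinates overlap by 37 bp (600420..600456).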
Sequence
Protein
Length: 463 a.a.; Molecular weight: 52655.85 Da; Isoelectric point: 10.1764
>NTDB_id=228366 CAH07_RS03070 WP_101172232.1 598709..600100(+) (comFA) [Bacillus subtilis strain SEM-9]
MNVPVEKNSSFSKELQQRLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRRYRCNRCGQTDQRYFSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWKEENESNWKSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGAEISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIKFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD
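The FASTA header packs several of the overview fields into one line. A minimal sketch of pulling them apart with a regular expression is shown below; the pattern is an assumption based on this single header and may need adjusting for other records, and the function and field names are illustrative.

```python
import re

HEADER = (
    ">NTDB_id=228366 CAH07_RS03070 WP_101172232.1 "
    "598709..600100(+) (comFA) [Bacillus subtilis strain SEM-9]"
)

# Assumed layout: id, locus tag, protein ID, start..end(strand), (gene), [organism]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+(?P<locus_tag>\S+)\s+(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header of the assumed layout into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])  # numeric fields as integers
    return fields


record = parse_header(HEADER)
print(record["gene"], record["start"], record["end"], record["strand"])
```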
Nucleotide
Length: 1392 bp
>NTDB_id=228366 CAH07_RS03070 WP_101172232.1 598709..600100(+) (comFA) [Bacillus subtilis strain SEM-9]
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCAAAAGAATTGCAGCAGAGGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGAGATATAGATGTAATAGGTGCGGACAAACTGATCAGCGGTATTTTTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGAAAGAGGAAAATGA
ATCAAACTGGAAGTCTATTAAGCTGACATGGGATGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGGCT
CAAGGCTGCCTTTCAGGGTGCTGAAATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCTGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCATTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAAAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGGGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG
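The nucleotide sequence begins with GTG rather than ATG, while the protein begins with M. In bacteria an alternative start codon such as GTG is read as methionine at the initiation position. The sketch below illustrates this on the first 30 and last 15 nucleotides of the sequence above, using the standard codon table; the set of start codons treated as initiators is an assumption limited to the most common ones, and the function name is illustrative.

```python
from itertools import product

# Standard codon table in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, initiator: bool = True) -> str:
    """Translate in frame 1; with initiator=True, a start codon reads as M."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if initiator and codons and codons[0] in START_CODONS:
        protein[0] = "M"
    return "".join(protein)


head = "GTGAATGTGCCAGTTGAAAAAAACAGTTCC"  # first 30 nt of the CDS
tail = "GAATGTACAGACTAG"                 # last 15 nt of the CDS

print(translate(head))                   # MNVPVEKNSS
print(translate(tail, initiator=False))  # ECTD*
```

Note that the internal GTG (third codon) is still read as valine; only the first codon is treated as an initiator.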
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 | 98.704 | 100 | 0.987 |
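The page does not define the Ha-value. One reading that reproduces the listed figure is the product of the identity and coverage fractions; the sketch below works under that assumption only, and the function name is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction times coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values from the comFA row (B. subtilis subsp. subtilis str. 168).
print(round(ha_value(98.704, 100), 3))  # 0.987
```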