Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | S101441_RS19410 | Genome accession | NZ_CP021507 |
| Coordinates | 3655586..3656977 (+) | Length | 463 a.a. |
| NCBI ID | WP_014481052.1 | Uniprot ID | - |
| Organism | Bacillus subtilis subsp. subtilis strain SRCM101441 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 3650586..3661977
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| S101441_RS19385 (S101441_03916) | - | 3650745..3651737 (-) | 993 | WP_014481056.1 | LCP family protein | - |
| S101441_RS19390 (S101441_03917) | - | 3651737..3652390 (-) | 654 | WP_014481055.1 | YigZ family protein | - |
| S101441_RS19395 (S101441_03918) | degS | 3652607..3653764 (+) | 1158 | WP_003227983.1 | two-component sensor histidine kinase DegS | Regulator |
| S101441_RS19400 (S101441_03919) | degU | 3653847..3654536 (+) | 690 | WP_014481054.1 | two-component system response regulator DegU | Regulator |
| S101441_RS19405 (S101441_03920) | fakBA | 3654634..3655479 (+) | 846 | WP_014481053.1 | DegV family protein | - |
| S101441_RS19410 (S101441_03921) | comFA | 3655586..3656977 (+) | 1392 | WP_014481052.1 | ATP-dependent helicase ComFA | Machinery gene |
| S101441_RS19415 (S101441_03922) | comFB | 3657037..3657333 (+) | 297 | WP_003227989.1 | late competence protein ComFB | - |
| S101441_RS19420 (S101441_03923) | comFC | 3657297..3658019 (+) | 723 | WP_014481051.1 | comF operon protein ComFC | Machinery gene |
| S101441_RS19425 (S101441_03924) | yvyF | 3658092..3658511 (+) | 420 | WP_038429728.1 | TIGR03826 family flagellar region protein | - |
| S101441_RS19430 (S101441_03925) | flgM | 3658592..3658858 (+) | 267 | WP_014481049.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| S101441_RS19435 (S101441_03926) | flgN | 3658874..3659356 (+) | 483 | WP_014481048.1 | flagellar protein FlgN | - |
| S101441_RS19440 (S101441_03927) | flgK | 3659375..3660898 (+) | 1524 | WP_014481047.1 | flagellar hook-associated protein FlgK | - |
| S101441_RS19445 (S101441_03928) | flgL | 3660909..3661805 (+) | 897 | WP_014481046.1 | flagellar hook-associated protein FlgL | - |
Sequence
Protein
Download Length: 463 a.a. Molecular weight: 52543.49 Da Isoelectric Point: 9.7976
>NTDB_id=231503 S101441_RS19410 WP_014481052.1 3655586..3656977(+) (comFA) [Bacillus subtilis subsp. subtilis strain SRCM101441]
MNVPVEKNSSFSRELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRGYRCNRCGQTDQRYYSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWEEENEPNWQSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD
MNVPVEKNSSFSRELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRGYRCNRCGQTDQRYYSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWEEENEPNWQSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD
Nucleotide
Download Length: 1392 bp
>NTDB_id=231503 S101441_RS19410 WP_014481052.1 3655586..3656977(+) (comFA) [Bacillus subtilis subsp. subtilis strain SRCM101441]
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCTAGAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGGGATATAGATGTAATAGGTGCGGCCAAACTGATCAGCGGTATTATTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGGAAGAGGAAAATGA
ACCAAACTGGCAGTCAATTAAACTGACATGGGACGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGGCT
CAAGGCTGCCTTTCAGGGGGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCCGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCTAGAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGGGATATAGATGTAATAGGTGCGGCCAAACTGATCAGCGGTATTATTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGGAAGAGGAAAATGA
ACCAAACTGGCAGTCAATTAAACTGACATGGGACGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGGCT
CAAGGCTGCCTTTCAGGGGGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCCGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 |
98.056 |
100 |
0.981 |