Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | AWV81_RS18890 | Genome accession | NZ_CP014471 |
| Coordinates | 3519992..3521383 (-) | Length | 463 a.a. |
| NCBI ID | WP_014481052.1 | Uniprot ID | - |
| Organism | Bacillus subtilis subsp. natto strain CGMCC 2108 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 3514992..3526383
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| AWV81_RS18855 (AWV81_18845) | flgL | 3515164..3516060 (-) | 897 | WP_014481046.1 | flagellar hook-associated protein FlgL | - |
| AWV81_RS18860 (AWV81_18850) | flgK | 3516071..3517594 (-) | 1524 | WP_014481047.1 | flagellar hook-associated protein FlgK | - |
| AWV81_RS18865 (AWV81_18855) | flgN | 3517613..3518095 (-) | 483 | WP_014481048.1 | flagellar protein FlgN | - |
| AWV81_RS18870 (AWV81_18860) | flgM | 3518111..3518377 (-) | 267 | WP_014481049.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| AWV81_RS18875 (AWV81_18865) | yvyF | 3518458..3518877 (-) | 420 | WP_014481050.1 | TIGR03826 family flagellar region protein | - |
| AWV81_RS18880 (AWV81_18870) | comFC | 3518950..3519672 (-) | 723 | WP_014481051.1 | comF operon protein ComFC | Machinery gene |
| AWV81_RS18885 (AWV81_18875) | comFB | 3519636..3519932 (-) | 297 | WP_003227989.1 | late competence protein ComFB | - |
| AWV81_RS18890 (AWV81_18880) | comFA | 3519992..3521383 (-) | 1392 | WP_014481052.1 | ATP-dependent helicase ComFA | Machinery gene |
| AWV81_RS18895 (AWV81_18885) | fakBA | 3521490..3522335 (-) | 846 | WP_014481053.1 | DegV family protein | - |
| AWV81_RS18900 (AWV81_18890) | degU | 3522433..3523122 (-) | 690 | WP_014481054.1 | two-component system response regulator DegU | Regulator |
| AWV81_RS18905 (AWV81_18895) | degS | 3523205..3524362 (-) | 1158 | WP_003227983.1 | two-component sensor histidine kinase DegS | Regulator |
| AWV81_RS18910 (AWV81_18900) | - | 3524579..3525232 (+) | 654 | WP_014481055.1 | YigZ family protein | - |
| AWV81_RS18915 (AWV81_18905) | - | 3525232..3526224 (+) | 993 | WP_014481056.1 | LCP family protein | - |
Sequence
Protein
Download Length: 463 a.a. Molecular weight: 52543.49 Da Isoelectric Point: 9.7976
>NTDB_id=171755 AWV81_RS18890 WP_014481052.1 3519992..3521383(-) (comFA) [Bacillus subtilis subsp. natto strain CGMCC 2108]
MNVPVEKNSSFSRELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRGYRCNRCGQTDQRYYSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWEEENEPNWQSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD
MNVPVEKNSSFSRELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRGYRCNRCGQTDQRYYSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWEEENEPNWQSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD
Nucleotide
Download Length: 1392 bp
>NTDB_id=171755 AWV81_RS18890 WP_014481052.1 3519992..3521383(-) (comFA) [Bacillus subtilis subsp. natto strain CGMCC 2108]
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCTAGAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGGGATATAGATGTAATAGGTGCGGCCAAACTGATCAGCGGTATTATTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGGAAGAGGAAAATGA
ACCAAACTGGCAGTCAATTAAACTGACATGGGACGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGGCT
CAAGGCTGCCTTTCAGGGGGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCCGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCTAGAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGGGATATAGATGTAATAGGTGCGGCCAAACTGATCAGCGGTATTATTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGGAAGAGGAAAATGA
ACCAAACTGGCAGTCAATTAAACTGACATGGGACGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGGCT
CAAGGCTGCCTTTCAGGGGGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCCGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 |
98.056 |
100 |
0.981 |