Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | NSQ16_RS18435 | Genome accession | NZ_CP150155 |
| Coordinates | 3548660..3550051 (-) | Length | 463 a.a. |
| NCBI ID | WP_106073350.1 | Uniprot ID | - |
| Organism | Bacillus sp. PS108 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 3543660..3555051
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| NSQ16_RS18400 (NSQ16_18400) | flgL | 3543832..3544728 (-) | 897 | WP_015714797.1 | flagellar hook-associated protein FlgL | - |
| NSQ16_RS18405 (NSQ16_18405) | flgK | 3544739..3546262 (-) | 1524 | WP_088327073.1 | flagellar hook-associated protein FlgK | - |
| NSQ16_RS18410 (NSQ16_18410) | flgN | 3546281..3546763 (-) | 483 | WP_003227999.1 | flagellar protein FlgN | - |
| NSQ16_RS18415 (NSQ16_18415) | flgM | 3546779..3547045 (-) | 267 | WP_014478128.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| NSQ16_RS18420 (NSQ16_18420) | - | 3547126..3547545 (-) | 420 | WP_230488622.1 | TIGR03826 family flagellar region protein | - |
| NSQ16_RS18425 (NSQ16_18425) | comFC | 3547618..3548340 (-) | 723 | WP_230488621.1 | comF operon protein ComFC | Machinery gene |
| NSQ16_RS18430 (NSQ16_18430) | comFB | 3548304..3548600 (-) | 297 | WP_015483764.1 | late competence protein ComFB | - |
| NSQ16_RS18435 (NSQ16_18435) | comFA | 3548660..3550051 (-) | 1392 | WP_106073350.1 | ATP-dependent helicase ComFA | Machinery gene |
| NSQ16_RS18440 (NSQ16_18440) | - | 3550159..3551004 (-) | 846 | WP_003227986.1 | DegV family protein | - |
| NSQ16_RS18445 (NSQ16_18445) | degU | 3551102..3551791 (-) | 690 | WP_003219701.1 | two-component system response regulator DegU | Regulator |
| NSQ16_RS18450 (NSQ16_18450) | degS | 3551874..3553031 (-) | 1158 | WP_003227983.1 | two-component sensor histidine kinase DegS | Regulator |
| NSQ16_RS18455 (NSQ16_18455) | - | 3553248..3553901 (+) | 654 | WP_230488620.1 | YigZ family protein | - |
| NSQ16_RS18460 (NSQ16_18460) | - | 3553901..3554893 (+) | 993 | WP_230488619.1 | LCP family protein | - |
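The coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `parse_coords` and `span_length` are hypothetical, and the 5000 bp flank is an assumption inferred from the fact that the Location window equals the comFA coordinates extended by 5000 bp on each side.

```python
def parse_coords(text):
    """Parse 'start..end (strand)' into (start, end, strand); strand may be absent."""
    span, _, rest = text.partition("(")
    start, end = (int(x) for x in span.strip().split(".."))
    strand = rest.strip(") ") or None
    return start, end, strand


def span_length(start, end):
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


# comFA coordinates as listed in the Overview table.
start, end, strand = parse_coords("3548660..3550051 (-)")
gene_bp = span_length(start, end)

# One codon per residue plus one stop codon, so residues = codons - 1.
protein_aa = gene_bp // 3 - 1

# Assumption: the Genomic Context window is the gene span plus a 5000 bp flank.
flank = 5000
window = (start - flank, end + flank)

print(gene_bp, protein_aa, strand, window)
```

The same `span_length` check applies to every row of the Genomic Context table, for example flgL at 3543832..3544728 gives 897 bp.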
Sequence
Protein
Length: 463 a.a. Molecular weight: 52597.67 Da Isoelectric Point: 9.8808
>NTDB_id=963691 NSQ16_RS18435 WP_106073350.1 3548660..3550051(-) (comFA) [Bacillus sp. PS108]
MKVPTEKNISFSRELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRGYRCNRCGQTDQRYFSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWEEENEPNWQSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELVPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD
Nucleotide
Length: 1392 bp
>NTDB_id=963691 NSQ16_RS18435 WP_106073350.1 3548660..3550051(-) (comFA) [Bacillus sp. PS108]
GTGAAGGTGCCAACTGAAAAGAACATTTCCTTTTCTAGAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAACTCTCATTTTCCGATGAAATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGGGATATAGATGTAATAGGTGCGGACAAACTGATCAGCGGTATTTTTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGGAAGAGGAAAATGA
ACCAAACTGGCAGTCAATTAAACTGACATGGGACGGCAAGCTTTCAAGCGGACAACAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGTTCCAAGGCT
CAAGGCTGCCTTTCAGGGTGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCTGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGGGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG
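The nucleotide record begins with GTG while the protein begins with M, which is expected when GTG serves as an initiation codon. As a minimal sketch, assuming the standard genetic code and that an ATG, GTG, or TTG start codon is read as methionine, the function below (a hypothetical helper named `translate_cds`, not part of this database) translates a coding sequence given in the 5' to 3' coding orientation, as the FASTA record above is.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(nt):
    """Translate a coding sequence; whitespace and line breaks are ignored."""
    nt = "".join(nt.split()).upper()
    if len(nt) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [nt[i:i + 3] for i in range(0, len(nt), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # Assumption: an alternative start codon (GTG/TTG) is decoded as Met.
    if codons and codons[0] in {"ATG", "GTG", "TTG"}:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop the terminal stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 24 nt and last 18 nt of the comFA record above.
print(translate_cds("GTGAAGGTGCCAACTGAAAAGAAC"))
print(translate_cds("GTTGAATGTACAGACTAG"))
```

Applied to the full 1392 bp record, a translation of this kind should yield 463 residues, since the final TAG stop codon is not counted.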
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 | 97.408 | 100 | 0.974 |
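The page does not define the Ha-value. The listed figure is numerically consistent with the product of fractional identity and fractional coverage, so the sketch below uses that relationship purely as an assumption; the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values from the comFA row of the Similar proteins table.
value = ha_value(97.408, 100)
print(f"{value:.3f}")
```

Under this assumed definition, a hit with full coverage has an Ha-value equal to its fractional identity, which matches the single row listed here.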