Detailed information
Overview
| Field | Value |
|---|---|
| Name | comFA |
| Type | Machinery gene |
| Locus tag | Q4I20_RS19165 |
| Genome accession | NZ_CP130597 |
| Coordinates | 3642206..3643597 (-) |
| Length | 463 a.a. |
| NCBI ID | WP_003243962.1 |
| UniProt ID | A0AAE2SL32 |
| Organism | Bacillus subtilis strain NCIB_3610 |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake |
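The coordinates and the protein length in the overview are linked by simple arithmetic, which the following minimal Python sketch checks. The helper name `span_length` is hypothetical, and the sketch assumes that coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


# Coordinates of comFA as listed in the overview.
gene_bp = span_length(3642206, 3643597)

# One codon is three bases; subtract one codon for the stop codon,
# which is part of the gene but not of the protein.
protein_aa = gene_bp // 3 - 1

print(gene_bp, protein_aa)  # 1392 463
```

The result agrees with the 1392 bp gene size and the 463 a.a. protein length reported elsewhere in this entry.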
Genomic Context
Location: 3637206..3648597
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| Q4I20_RS19130 (Q4I20_19115) | flgL | 3637377..3638273 (-) | 897 | WP_003228004.1 | flagellar hook-associated protein FlgL | - |
| Q4I20_RS19135 (Q4I20_19120) | flgK | 3638284..3639807 (-) | 1524 | WP_003228001.1 | flagellar hook-associated protein FlgK | - |
| Q4I20_RS19140 (Q4I20_19125) | flgN | 3639826..3640308 (-) | 483 | WP_003227999.1 | flagellar protein FlgN | - |
| Q4I20_RS19145 (Q4I20_19130) | flgM | 3640324..3640590 (-) | 267 | WP_003227997.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| Q4I20_RS19150 (Q4I20_19135) | yvyF | 3640671..3641090 (-) | 420 | WP_003227995.1 | TIGR03826 family flagellar region protein | - |
| Q4I20_RS19155 (Q4I20_19140) | comFC | 3641164..3641886 (-) | 723 | WP_003227991.1 | comF operon protein ComFC | Machinery gene |
| Q4I20_RS19160 (Q4I20_19145) | comFB | 3641850..3642146 (-) | 297 | WP_003227989.1 | late competence protein ComFB | - |
| Q4I20_RS19165 (Q4I20_19150) | comFA | 3642206..3643597 (-) | 1392 | WP_003243962.1 | ATP-dependent helicase ComFA | Machinery gene |
| Q4I20_RS19170 (Q4I20_19155) | fakBA | 3643703..3644548 (-) | 846 | WP_003244125.1 | DegV family protein | - |
| Q4I20_RS19175 (Q4I20_19160) | degU | 3644646..3645335 (-) | 690 | WP_003219701.1 | two-component system response regulator DegU | Regulator |
| Q4I20_RS19180 (Q4I20_19165) | degS | 3645418..3646575 (-) | 1158 | WP_003227983.1 | two-component sensor histidine kinase DegS | Regulator |
| Q4I20_RS19185 (Q4I20_19170) | yvyE | 3646792..3647445 (+) | 654 | WP_003227979.1 | YigZ family protein | - |
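The coordinate strings in the table above can be parsed to cross-check the listed sizes and the context window. The sketch below is illustrative only: `parse_coords` is a hypothetical helper, and the 5000 bp flank is an assumption inferred from the listed Location, not something the entry states.

```python
import re
from typing import Tuple


def parse_coords(text: str) -> Tuple[int, int, str]:
    """Parse a string such as '3642206..3643597 (-)' into (start, end, strand)."""
    match = re.fullmatch(r"(\d+)\.\.(\d+) \(([+-])\)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


# The three comF genes, copied from the table.
com_f = {
    "comFC": "3641164..3641886 (-)",
    "comFB": "3641850..3642146 (-)",
    "comFA": "3642206..3643597 (-)",
}
genes = {name: parse_coords(coords) for name, coords in com_f.items()}

# Size in bp of each gene (1-based, inclusive coordinates assumed).
sizes = {name: end - start + 1 for name, (start, end, _) in genes.items()}

# Context window: comFA extended by an assumed 5000 bp flank on each side.
fa_start, fa_end, _ = genes["comFA"]
window = (fa_start - 5000, fa_end + 5000)

# comFC and comFB overlap: comFB starts before comFC ends.
overlap_fc_fb = genes["comFC"][1] - genes["comFB"][0] + 1

print(sizes, window, overlap_fc_fb)
```

Under these assumptions the computed sizes match the Size (bp) column, the window matches the listed Location, and comFC and comFB overlap by 37 bp.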
Sequence
Protein
Length: 463 a.a.; Molecular weight: 52565.63 Da; Isoelectric point: 10.0082
>NTDB_id=860486 Q4I20_RS19165 WP_003243962.1 3642206..3643597(-) (comFA) [Bacillus subtilis strain NCIB_3610]
MNVPVEKNSSFSKELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRRYRCNRCGQTDQRYFSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWKEENESNWKSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFKGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESSIFTESALVQIAGRTGRHKEYADGDVIYFHFGKTKSMLDARKHIKEMNELAAKVECTD
Nucleotide
Length: 1392 bp
>NTDB_id=860486 Q4I20_RS19165 WP_003243962.1 3642206..3643597(-) (comFA) [Bacillus subtilis strain NCIB_3610]
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCAAAAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGAGATATAGATGTAATAGGTGCGGACAAACTGATCAGCGGTATTTTTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGAAAGAGGAAAATGA
ATCAAACTGGAAGTCTATTAAGCTGACATGGGATGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGACT
CAAGGCTGCCTTTCAGGGTGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCTGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAAAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCGTCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTACTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG
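The nucleotide record begins with GTG rather than ATG, yet the protein begins with M, because an initiating GTG codon is read as methionine. The minimal Python sketch below translates the first and last codons of the sequence to show this. The function name `translate` and its `is_start` flag are hypothetical, the amino-acid assignments are those of the standard genetic code, and the set of start codons is an assumption limited to common bacterial ones.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # An initiating GTG or TTG is read as methionine, not valine or leucine.
    if is_start and codons and codons[0] in START_CODONS:
        protein[0] = "M"
    return "".join(protein)


# First ten and last six codons of the comFA sequence above.
cds_start = "GTG AAT GTG CCA GTT GAA AAA AAC AGT TCC".replace(" ", "")
cds_end = "GTT GAA TGT ACA GAC TAG".replace(" ", "")

print(translate(cds_start))               # MNVPVEKNSS
print(translate(cds_end, is_start=False))  # VECTD*
```

Both fragments agree with the start (MNVPVEKNSS) and end (VECTD) of the protein sequence in this entry, with TAG as the stop codon.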
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 | 100 | 100 | 1 |