Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | BSA41_RS14870 | Genome accession | NZ_CP015610 |
| Coordinates | 2820202..2821599 (-) | Length | 465 a.a. |
| NCBI ID | WP_269199592.1 | Uniprot ID | - |
| Organism | Bacillus safensis strain U41 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 2815202..2826599
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| BSA41_RS14835 (BSA41_14520) | flgL | 2815402..2816310 (-) | 909 | WP_041088047.1 | flagellar hook-associated protein FlgL | - |
| BSA41_RS14840 (BSA41_14525) | flgK | 2816320..2817840 (-) | 1521 | WP_058837539.1 | flagellar hook-associated protein FlgK | - |
| BSA41_RS14845 (BSA41_14530) | - | 2817856..2818338 (-) | 483 | WP_075611997.1 | flagellar protein FlgN | - |
| BSA41_RS14850 (BSA41_14535) | flgM | 2818357..2818620 (-) | 264 | WP_075611998.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| BSA41_RS14855 (BSA41_14540) | - | 2818700..2819119 (-) | 420 | WP_075611999.1 | TIGR03826 family flagellar region protein | - |
| BSA41_RS14860 (BSA41_14545) | comFC | 2819176..2819832 (-) | 657 | WP_231120176.1 | ComF family protein | Machinery gene |
| BSA41_RS14865 (BSA41_14550) | - | 2819859..2820152 (-) | 294 | WP_041088040.1 | late competence development ComFB family protein | - |
| BSA41_RS14870 (BSA41_14555) | comFA | 2820202..2821599 (-) | 1398 | WP_269199592.1 | DEAD/DEAH box helicase | Machinery gene |
| BSA41_RS14875 (BSA41_14560) | - | 2821720..2822562 (-) | 843 | WP_041088038.1 | DegV family protein | - |
| BSA41_RS14880 (BSA41_14565) | degU | 2822778..2823467 (-) | 690 | WP_008348273.1 | two-component system response regulator DegU | Regulator |
| BSA41_RS14885 (BSA41_14570) | degS | 2823532..2824707 (-) | 1176 | WP_003213166.1 | sensor histidine kinase | Regulator |
| BSA41_RS14890 (BSA41_14575) | - | 2824929..2825597 (+) | 669 | WP_012011434.1 | YigZ family protein | - |
Sequence
Protein
Download Length: 465 a.a. Molecular weight: 53154.96 Da Isoelectric Point: 9.8002
>NTDB_id=181404 BSA41_RS14870 WP_269199592.1 2820202..2821599(-) (comFA) [Bacillus safensis strain U41]
MNMDKAMELMRQLHSRHLLTAETRCPQSNLDWLEEKGLVNRTPAIERKANGFTCCRCGVSHKRYFAHSPCEVCQKDCIYC
RSCIMMGKATECGFLYEWTGPQMVETYRAELTWQGELTKGQKRASEGMIEAIKNKFDLLVWAVCGAGKTEVLFHGIEYAL
NQGMRVCIATPRTDVVLELEPRLRKAFQGMTIAVLYGGSSQRFQIAPLVIATTHQLMRYKNAFDVLIVDEVDAFPYSMDE
RLQFAVLKAMSRNGGVRIYLSATPSKKMTREVSSGKLEAIKIPLRFHKQPLPVPTFHWIGHWKKKLKKNQLPRKVMNWMQ
RHIVNKRRVLLFVPSIATMKKVTKILREHSLNVEGVSADDPDRKQKVQYFRDYEYDVLVTTTILERGVTIPNVQVGVLGS
ESTIFTESALVQISGRVGRHPDYCTGDVFLFHFGLTRSMKQAKKHIVKMNDTAANEFSEKQCGFN
MNMDKAMELMRQLHSRHLLTAETRCPQSNLDWLEEKGLVNRTPAIERKANGFTCCRCGVSHKRYFAHSPCEVCQKDCIYC
RSCIMMGKATECGFLYEWTGPQMVETYRAELTWQGELTKGQKRASEGMIEAIKNKFDLLVWAVCGAGKTEVLFHGIEYAL
NQGMRVCIATPRTDVVLELEPRLRKAFQGMTIAVLYGGSSQRFQIAPLVIATTHQLMRYKNAFDVLIVDEVDAFPYSMDE
RLQFAVLKAMSRNGGVRIYLSATPSKKMTREVSSGKLEAIKIPLRFHKQPLPVPTFHWIGHWKKKLKKNQLPRKVMNWMQ
RHIVNKRRVLLFVPSIATMKKVTKILREHSLNVEGVSADDPDRKQKVQYFRDYEYDVLVTTTILERGVTIPNVQVGVLGS
ESTIFTESALVQISGRVGRHPDYCTGDVFLFHFGLTRSMKQAKKHIVKMNDTAANEFSEKQCGFN
Nucleotide
Download Length: 1398 bp
>NTDB_id=181404 BSA41_RS14870 WP_269199592.1 2820202..2821599(-) (comFA) [Bacillus safensis strain U41]
TTGAATATGGACAAAGCAATGGAACTAATGCGGCAGCTGCATTCACGCCATCTGCTCACCGCCGAAACAAGGTGCCCACA
GTCCAATTTGGATTGGTTGGAGGAGAAGGGCTTAGTCAATCGAACACCTGCCATTGAGAGGAAAGCAAATGGTTTTACAT
GCTGCCGGTGCGGTGTATCACACAAGCGGTACTTTGCTCACTCTCCATGTGAAGTGTGTCAAAAGGATTGTATATACTGT
AGATCATGCATCATGATGGGAAAAGCAACTGAATGTGGGTTTCTTTATGAATGGACAGGTCCACAAATGGTAGAAACATA
TCGAGCTGAATTAACATGGCAGGGAGAGCTGACTAAAGGGCAAAAAAGAGCGTCAGAAGGAATGATTGAAGCTATCAAAA
ACAAATTTGATCTACTTGTTTGGGCGGTTTGCGGAGCAGGGAAAACGGAGGTGCTTTTTCACGGAATCGAATATGCGTTA
AATCAAGGAATGAGAGTCTGTATTGCGACACCTAGAACAGATGTTGTGCTTGAACTCGAACCACGACTTAGAAAAGCATT
TCAAGGGATGACAATAGCCGTACTTTATGGAGGCAGTTCTCAAAGGTTTCAGATTGCACCGCTTGTGATCGCCACAACCC
ATCAGCTGATGAGGTACAAAAATGCATTTGATGTCCTCATTGTAGATGAAGTCGATGCCTTCCCTTATTCAATGGATGAG
CGTCTCCAATTTGCTGTTCTAAAGGCGATGAGTAGGAACGGAGGGGTTAGGATTTATTTAAGTGCGACACCCTCTAAAAA
AATGACGAGAGAGGTTTCTAGTGGAAAACTGGAAGCGATAAAAATTCCTCTGCGTTTTCATAAACAACCATTACCCGTAC
CGACCTTTCATTGGATTGGACATTGGAAAAAGAAATTAAAAAAGAATCAGCTGCCCCGTAAAGTGATGAATTGGATGCAG
AGACATATTGTAAATAAAAGAAGAGTATTACTTTTTGTTCCTTCAATTGCTACCATGAAGAAGGTAACGAAGATTCTTCG
AGAACACTCCTTAAACGTGGAGGGAGTATCTGCTGATGATCCAGATAGGAAACAAAAGGTTCAGTACTTTAGAGATTACG
AATACGATGTACTAGTTACAACCACTATTCTAGAAAGAGGCGTAACCATTCCAAATGTTCAAGTCGGGGTTTTAGGTTCG
GAATCTACTATTTTTACAGAGAGTGCACTTGTTCAGATTTCTGGAAGAGTAGGCAGACACCCGGATTATTGTACAGGAGA
CGTTTTCCTTTTTCATTTTGGTTTAACGAGAAGTATGAAACAAGCAAAGAAGCATATCGTAAAAATGAATGATACGGCTG
CAAATGAGTTTTCTGAAAAACAGTGTGGGTTCAACTAA
TTGAATATGGACAAAGCAATGGAACTAATGCGGCAGCTGCATTCACGCCATCTGCTCACCGCCGAAACAAGGTGCCCACA
GTCCAATTTGGATTGGTTGGAGGAGAAGGGCTTAGTCAATCGAACACCTGCCATTGAGAGGAAAGCAAATGGTTTTACAT
GCTGCCGGTGCGGTGTATCACACAAGCGGTACTTTGCTCACTCTCCATGTGAAGTGTGTCAAAAGGATTGTATATACTGT
AGATCATGCATCATGATGGGAAAAGCAACTGAATGTGGGTTTCTTTATGAATGGACAGGTCCACAAATGGTAGAAACATA
TCGAGCTGAATTAACATGGCAGGGAGAGCTGACTAAAGGGCAAAAAAGAGCGTCAGAAGGAATGATTGAAGCTATCAAAA
ACAAATTTGATCTACTTGTTTGGGCGGTTTGCGGAGCAGGGAAAACGGAGGTGCTTTTTCACGGAATCGAATATGCGTTA
AATCAAGGAATGAGAGTCTGTATTGCGACACCTAGAACAGATGTTGTGCTTGAACTCGAACCACGACTTAGAAAAGCATT
TCAAGGGATGACAATAGCCGTACTTTATGGAGGCAGTTCTCAAAGGTTTCAGATTGCACCGCTTGTGATCGCCACAACCC
ATCAGCTGATGAGGTACAAAAATGCATTTGATGTCCTCATTGTAGATGAAGTCGATGCCTTCCCTTATTCAATGGATGAG
CGTCTCCAATTTGCTGTTCTAAAGGCGATGAGTAGGAACGGAGGGGTTAGGATTTATTTAAGTGCGACACCCTCTAAAAA
AATGACGAGAGAGGTTTCTAGTGGAAAACTGGAAGCGATAAAAATTCCTCTGCGTTTTCATAAACAACCATTACCCGTAC
CGACCTTTCATTGGATTGGACATTGGAAAAAGAAATTAAAAAAGAATCAGCTGCCCCGTAAAGTGATGAATTGGATGCAG
AGACATATTGTAAATAAAAGAAGAGTATTACTTTTTGTTCCTTCAATTGCTACCATGAAGAAGGTAACGAAGATTCTTCG
AGAACACTCCTTAAACGTGGAGGGAGTATCTGCTGATGATCCAGATAGGAAACAAAAGGTTCAGTACTTTAGAGATTACG
AATACGATGTACTAGTTACAACCACTATTCTAGAAAGAGGCGTAACCATTCCAAATGTTCAAGTCGGGGTTTTAGGTTCG
GAATCTACTATTTTTACAGAGAGTGCACTTGTTCAGATTTCTGGAAGAGTAGGCAGACACCCGGATTATTGTACAGGAGA
CGTTTTCCTTTTTCATTTTGGTTTAACGAGAAGTATGAAACAAGCAAAGAAGCATATCGTAAAAATGAATGATACGGCTG
CAAATGAGTTTTCTGAAAAACAGTGTGGGTTCAACTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 |
51.868 |
97.849 |
0.508 |
| comFA/cflA | Streptococcus pneumoniae Rx1 |
38.549 |
94.839 |
0.366 |
| comFA/cflA | Streptococcus pneumoniae D39 |
38.549 |
94.839 |
0.366 |
| comFA/cflA | Streptococcus pneumoniae R6 |
38.549 |
94.839 |
0.366 |
| comFA/cflA | Streptococcus pneumoniae TIGR4 |
38.497 |
94.409 |
0.363 |