Detailed information
Overview
| Name | comFA | Type | Machinery gene |
| Locus tag | SOZ35_RS18550 | Genome accession | NZ_CP150480 |
| Coordinates | 3654081..3655478 (-) | Length | 465 a.a. |
| NCBI ID | WP_212063894.1 | Uniprot ID | - |
| Organism | Bacillus atrophaeus strain TL401 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 3649081..3660478
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SOZ35_RS18515 (SOZ35_18515) | flgL | 3649265..3650161 (-) | 897 | WP_088118067.1 | flagellar hook-associated protein FlgL | - |
| SOZ35_RS18520 (SOZ35_18520) | flgK | 3650173..3651696 (-) | 1524 | WP_268476347.1 | flagellar hook-associated protein FlgK | - |
| SOZ35_RS18525 (SOZ35_18525) | - | 3651713..3652195 (-) | 483 | WP_340639879.1 | flagellar protein FlgN | - |
| SOZ35_RS18530 (SOZ35_18530) | flgM | 3652211..3652471 (-) | 261 | WP_340639880.1 | flagellar biosynthesis anti-sigma factor FlgM | - |
| SOZ35_RS18535 (SOZ35_18535) | - | 3652551..3652970 (-) | 420 | WP_003326721.1 | TIGR03826 family flagellar region protein | - |
| SOZ35_RS18540 (SOZ35_18540) | comFC | 3653042..3653731 (-) | 690 | WP_340640751.1 | double zinc ribbon domain-containing protein | Machinery gene |
| SOZ35_RS18545 (SOZ35_18545) | - | 3653728..3654024 (-) | 297 | WP_106033805.1 | late competence development ComFB family protein | - |
| SOZ35_RS18550 (SOZ35_18550) | comFA | 3654081..3655478 (-) | 1398 | WP_212063894.1 | ATP-dependent helicase ComFA | Machinery gene |
| SOZ35_RS18555 (SOZ35_18555) | - | 3655585..3656427 (-) | 843 | WP_003326716.1 | DegV family protein | - |
| SOZ35_RS18560 (SOZ35_18560) | degU | 3656534..3657223 (-) | 690 | WP_003219701.1 | two-component system response regulator DegU | Regulator |
| SOZ35_RS18565 (SOZ35_18565) | degS | 3657300..3658463 (-) | 1164 | WP_003326714.1 | two-component sensor histidine kinase DegS | Regulator |
| SOZ35_RS18570 (SOZ35_18570) | - | 3658681..3659334 (+) | 654 | WP_003326713.1 | YigZ family protein | - |
Sequence
Protein
Download Length: 465 a.a. Molecular weight: 53036.62 Da Isoelectric Point: 10.1151
>NTDB_id=970516 SOZ35_RS18550 WP_212063894.1 3654081..3655478(-) (comFA) [Bacillus atrophaeus strain TL401]
MQTDLKKKPLFSADLQQFLHQRHLLRTEIPFSEEIINWHIEHGFISAEKSIIKNKKGYLCNRCGQNDKRYFSSYWSCSDE
KNQMYCRSCVMMGRVSENIFLYSWIKEVEASWQPIKLTWEGKLSLGQQKAADILIDAITKREELLIWAVCGAGKTEILFP
GIEFALNQGFRVCIATPRTDVVLELTPRLKTAFQTTKISALYGGSEDKGSLSPLMISTTHQLLRYKEAFDVIVIDEVDAF
PYSADQTLQFAVQKARKKNSTLIYLSATPSKELKKKAHIGKLNSVRIPARHHRKPLPEPRFCWCGNWQKKLAGSKIPKQV
KIWVEQHVKVGRPVFLFVPSVSVLEKVTACFAGMRYRTAGVHAEDKNRKEKVQQFRDGRLDVLITTTILERGVTVPMVQT
GVLGAESPIFTESALVQIAGRTGRHKKHAQGDVIYFHFGKTKSMIDARNHINEMNKLARKNELID
MQTDLKKKPLFSADLQQFLHQRHLLRTEIPFSEEIINWHIEHGFISAEKSIIKNKKGYLCNRCGQNDKRYFSSYWSCSDE
KNQMYCRSCVMMGRVSENIFLYSWIKEVEASWQPIKLTWEGKLSLGQQKAADILIDAITKREELLIWAVCGAGKTEILFP
GIEFALNQGFRVCIATPRTDVVLELTPRLKTAFQTTKISALYGGSEDKGSLSPLMISTTHQLLRYKEAFDVIVIDEVDAF
PYSADQTLQFAVQKARKKNSTLIYLSATPSKELKKKAHIGKLNSVRIPARHHRKPLPEPRFCWCGNWQKKLAGSKIPKQV
KIWVEQHVKVGRPVFLFVPSVSVLEKVTACFAGMRYRTAGVHAEDKNRKEKVQQFRDGRLDVLITTTILERGVTVPMVQT
GVLGAESPIFTESALVQIAGRTGRHKKHAQGDVIYFHFGKTKSMIDARNHINEMNKLARKNELID
Nucleotide
Download Length: 1398 bp
>NTDB_id=970516 SOZ35_RS18550 WP_212063894.1 3654081..3655478(-) (comFA) [Bacillus atrophaeus strain TL401]
ATGCAGACTGATTTGAAAAAGAAACCTTTGTTCTCTGCTGATCTGCAGCAGTTCCTTCATCAGCGCCACTTGCTTAGAAC
TGAAATCCCATTCTCTGAGGAAATTATCAACTGGCATATTGAGCATGGTTTTATTTCCGCTGAAAAATCAATCATCAAAA
ATAAAAAGGGTTATTTGTGTAACAGGTGCGGCCAAAATGATAAGCGGTATTTTTCCTCCTACTGGTCATGCTCAGATGAA
AAAAATCAAATGTATTGCCGTTCGTGTGTCATGATGGGAAGAGTAAGCGAGAACATTTTCTTATATTCATGGATAAAGGA
GGTGGAAGCAAGCTGGCAGCCTATAAAGCTTACTTGGGAAGGTAAACTCTCTCTTGGACAGCAAAAAGCGGCGGATATTT
TAATTGACGCGATAACTAAAAGGGAAGAGCTTCTTATTTGGGCTGTTTGCGGCGCAGGTAAAACCGAAATATTATTCCCG
GGCATTGAGTTTGCGTTAAATCAGGGATTTCGTGTATGTATTGCAACACCGCGCACCGATGTCGTACTAGAGCTTACTCC
AAGGCTGAAAACCGCCTTTCAGACCACAAAGATCTCAGCCCTTTACGGAGGGAGTGAAGATAAAGGGAGCTTGTCCCCGC
TGATGATTTCAACGACACATCAGCTATTACGTTACAAAGAAGCATTTGACGTCATCGTCATAGACGAAGTTGATGCTTTT
CCTTATTCTGCTGATCAAACTCTCCAGTTTGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCATTTATTTAAGTGC
AACACCTTCCAAAGAATTAAAAAAGAAAGCGCATATAGGGAAGTTAAACTCGGTTCGCATACCCGCAAGACATCACCGGA
AACCATTGCCTGAACCACGTTTTTGCTGGTGCGGAAACTGGCAAAAAAAATTAGCCGGAAGCAAAATTCCGAAACAGGTG
AAAATATGGGTAGAACAGCATGTGAAAGTAGGCAGACCTGTTTTTTTATTTGTGCCGTCTGTTTCCGTTTTAGAGAAGGT
TACTGCCTGTTTCGCCGGCATGAGATATCGGACAGCGGGTGTGCATGCGGAAGATAAGAACCGTAAAGAGAAAGTGCAGC
AATTCAGGGACGGCCGGCTTGATGTATTAATCACAACGACCATATTAGAAAGAGGAGTTACTGTTCCAATGGTGCAGACT
GGCGTATTGGGAGCTGAATCGCCTATATTTACTGAGAGCGCACTCGTCCAAATCGCCGGGAGGACAGGGCGGCATAAAAA
ACATGCTCAGGGTGACGTCATATATTTTCACTTTGGAAAAACAAAGAGTATGATAGATGCCAGAAATCATATAAACGAAA
TGAATAAATTGGCAAGAAAAAACGAATTAATAGACTAG
ATGCAGACTGATTTGAAAAAGAAACCTTTGTTCTCTGCTGATCTGCAGCAGTTCCTTCATCAGCGCCACTTGCTTAGAAC
TGAAATCCCATTCTCTGAGGAAATTATCAACTGGCATATTGAGCATGGTTTTATTTCCGCTGAAAAATCAATCATCAAAA
ATAAAAAGGGTTATTTGTGTAACAGGTGCGGCCAAAATGATAAGCGGTATTTTTCCTCCTACTGGTCATGCTCAGATGAA
AAAAATCAAATGTATTGCCGTTCGTGTGTCATGATGGGAAGAGTAAGCGAGAACATTTTCTTATATTCATGGATAAAGGA
GGTGGAAGCAAGCTGGCAGCCTATAAAGCTTACTTGGGAAGGTAAACTCTCTCTTGGACAGCAAAAAGCGGCGGATATTT
TAATTGACGCGATAACTAAAAGGGAAGAGCTTCTTATTTGGGCTGTTTGCGGCGCAGGTAAAACCGAAATATTATTCCCG
GGCATTGAGTTTGCGTTAAATCAGGGATTTCGTGTATGTATTGCAACACCGCGCACCGATGTCGTACTAGAGCTTACTCC
AAGGCTGAAAACCGCCTTTCAGACCACAAAGATCTCAGCCCTTTACGGAGGGAGTGAAGATAAAGGGAGCTTGTCCCCGC
TGATGATTTCAACGACACATCAGCTATTACGTTACAAAGAAGCATTTGACGTCATCGTCATAGACGAAGTTGATGCTTTT
CCTTATTCTGCTGATCAAACTCTCCAGTTTGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCATTTATTTAAGTGC
AACACCTTCCAAAGAATTAAAAAAGAAAGCGCATATAGGGAAGTTAAACTCGGTTCGCATACCCGCAAGACATCACCGGA
AACCATTGCCTGAACCACGTTTTTGCTGGTGCGGAAACTGGCAAAAAAAATTAGCCGGAAGCAAAATTCCGAAACAGGTG
AAAATATGGGTAGAACAGCATGTGAAAGTAGGCAGACCTGTTTTTTTATTTGTGCCGTCTGTTTCCGTTTTAGAGAAGGT
TACTGCCTGTTTCGCCGGCATGAGATATCGGACAGCGGGTGTGCATGCGGAAGATAAGAACCGTAAAGAGAAAGTGCAGC
AATTCAGGGACGGCCGGCTTGATGTATTAATCACAACGACCATATTAGAAAGAGGAGTTACTGTTCCAATGGTGCAGACT
GGCGTATTGGGAGCTGAATCGCCTATATTTACTGAGAGCGCACTCGTCCAAATCGCCGGGAGGACAGGGCGGCATAAAAA
ACATGCTCAGGGTGACGTCATATATTTTCACTTTGGAAAAACAAAGAGTATGATAGATGCCAGAAATCATATAAACGAAA
TGAATAAATTGGCAAGAAAAAACGAATTAATAGACTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA | Bacillus subtilis subsp. subtilis str. 168 |
76.129 |
100 |
0.761 |