Detailed information
Overview
| Name | comFA/cflA | Type | Machinery gene |
| Locus tag | MGGS36055_RS08505 | Genome accession | NZ_CP117286 |
| Coordinates | 1689837..1691162 (-) | Length | 441 a.a. |
| NCBI ID | WP_015017291.1 | Uniprot ID | A0A9X8T4I6 |
| Organism | Streptococcus dysgalactiae subsp. equisimilis strain MGGS36055 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1684837..1696162
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| MGGS36055_RS08465 | - | 1685478..1685675 (-) | 198 | WP_000208122.1 | helix-turn-helix transcriptional regulator | - |
| MGGS36055_RS08470 (MGGS36055_03406) | - | 1685672..1685812 (-) | 141 | WP_017645165.1 | hypothetical protein | - |
| MGGS36055_RS08475 (MGGS36055_03408) | - | 1685809..1686330 (-) | 522 | WP_225045582.1 | hypothetical protein | - |
| MGGS36055_RS08480 (MGGS36055_03410) | - | 1686492..1687526 (-) | 1035 | WP_000822470.1 | replication initiator protein A | - |
| MGGS36055_RS08485 (MGGS36055_03412) | - | 1687528..1687692 (-) | 165 | WP_000166480.1 | hypothetical protein | - |
| MGGS36055_RS08490 | - | 1687799..1688241 (-) | 443 | Protein_1608 | hypothetical protein | - |
| MGGS36055_RS08495 (MGGS36055_03418) | - | 1688720..1689091 (-) | 372 | Protein_1609 | hypothetical protein | - |
| MGGS36055_RS10965 (MGGS36055_03420) | - | 1689200..1689865 (-) | 666 | WP_012767419.1 | ComF family protein | - |
| MGGS36055_RS08505 (MGGS36055_03422) | comFA/cflA | 1689837..1691162 (-) | 1326 | WP_015017291.1 | DEAD/DEAH box helicase | Machinery gene |
| MGGS36055_RS08510 (MGGS36055_03424) | - | 1691218..1691856 (+) | 639 | WP_015017292.1 | YigZ family protein | - |
| MGGS36055_RS08515 (MGGS36055_03426) | cysK | 1691960..1692895 (+) | 936 | WP_003062335.1 | cysteine synthase A | - |
| MGGS36055_RS08520 (MGGS36055_03428) | - | 1693047..1693406 (-) | 360 | WP_015017293.1 | S1 RNA-binding domain-containing protein | - |
| MGGS36055_RS08525 (MGGS36055_03430) | - | 1693406..1694806 (-) | 1401 | WP_042357970.1 | bifunctional Cof-type HAD-IIB family hydrolase/peptidylprolyl isomerase | - |
| MGGS36055_RS08530 (MGGS36055_03432) | - | 1694843..1695484 (-) | 642 | WP_014612503.1 | response regulator transcription factor | - |
Sequence
Protein
Download Length: 441 a.a. Molecular weight: 49283.45 Da Isoelectric Point: 9.7069
>NTDB_id=783279 MGGS36055_RS08505 WP_015017291.1 1689837..1691162(-) (comFA/cflA) [Streptococcus dysgalactiae subsp. equisimilis strain MGGS36055]
MEGIENSYGRLFLKHQLPKEVNHLAKTLESIVAIKGKVTCQRCHYQITAEARLPSGTYYCRFCLVFGRNQADRPLYYIPP
HPFPIANYLQWKGILTPYQESISNQLVKNVHAKKPTLVHAVTGAGKTEMIYGAIAAVVNAGGWVCLASPRVDVCIELANR
LKAAFSCRVTLLHADSEAYQRSPIIVATTHQLLTFYKAFDLLIIDEVDAFPFVNNHQLQYAAQQAAKEGASSIVLTATST
KELEEQVKSGELEKLTLARRFHDNPLILPKFIRSFGLLKKIHCQKLPRALVKSISQQRKTGCPLLIFLPIIAMAELVTEL
LKLAFPEEQIACVSSQSADRVKDIDAFRQGKKGILVTTTILERGVTFPGVDVFVLMAQHRGYSSQSLVQIAGRVGRSIER
PTGKVYFFHDGISQAMRNARKEIKEMNQKGYLNDMPPMSTI
MEGIENSYGRLFLKHQLPKEVNHLAKTLESIVAIKGKVTCQRCHYQITAEARLPSGTYYCRFCLVFGRNQADRPLYYIPP
HPFPIANYLQWKGILTPYQESISNQLVKNVHAKKPTLVHAVTGAGKTEMIYGAIAAVVNAGGWVCLASPRVDVCIELANR
LKAAFSCRVTLLHADSEAYQRSPIIVATTHQLLTFYKAFDLLIIDEVDAFPFVNNHQLQYAAQQAAKEGASSIVLTATST
KELEEQVKSGELEKLTLARRFHDNPLILPKFIRSFGLLKKIHCQKLPRALVKSISQQRKTGCPLLIFLPIIAMAELVTEL
LKLAFPEEQIACVSSQSADRVKDIDAFRQGKKGILVTTTILERGVTFPGVDVFVLMAQHRGYSSQSLVQIAGRVGRSIER
PTGKVYFFHDGISQAMRNARKEIKEMNQKGYLNDMPPMSTI
Nucleotide
Download Length: 1326 bp
>NTDB_id=783279 MGGS36055_RS08505 WP_015017291.1 1689837..1691162(-) (comFA/cflA) [Streptococcus dysgalactiae subsp. equisimilis strain MGGS36055]
ATGGAAGGTATCGAAAACAGTTACGGTCGTTTATTTTTAAAACATCAATTGCCAAAAGAAGTAAATCATCTGGCAAAGAC
CTTGGAAAGTATAGTAGCCATAAAAGGTAAAGTCACTTGTCAACGCTGTCATTATCAGATAACCGCAGAAGCGAGATTAC
CAAGTGGCACTTACTACTGCCGCTTCTGTCTCGTTTTTGGCCGAAATCAAGCTGATAGGCCTCTTTATTATATACCTCCA
CATCCTTTTCCTATAGCCAATTATCTTCAATGGAAGGGGATATTAACGCCCTACCAAGAAAGTATTTCAAACCAATTAGT
CAAGAATGTTCATGCTAAAAAACCTACTTTAGTGCATGCTGTTACCGGAGCCGGCAAGACAGAGATGATTTATGGAGCCA
TAGCAGCAGTTGTTAATGCTGGAGGTTGGGTTTGTTTAGCCAGCCCAAGGGTCGATGTTTGTATAGAACTTGCTAATCGC
TTAAAAGCTGCTTTTTCTTGCCGGGTTACTCTTTTACATGCTGACTCAGAAGCTTATCAAAGAAGTCCTATTATAGTAGC
TACTACCCATCAATTACTGACCTTTTATAAGGCTTTTGATCTTTTAATCATTGATGAAGTTGATGCTTTTCCTTTTGTCA
ATAATCATCAACTACAATATGCAGCGCAGCAAGCCGCTAAAGAAGGCGCTAGTAGTATCGTATTGACCGCGACATCGACC
AAAGAACTGGAAGAGCAGGTCAAAAGTGGGGAACTAGAGAAGTTGACATTAGCTAGACGATTCCACGATAATCCTTTGAT
TCTTCCGAAATTTATCAGGAGTTTTGGTCTATTAAAAAAAATTCACTGTCAAAAACTTCCTCGAGCTCTTGTTAAATCTA
TCAGTCAACAAAGGAAAACAGGATGTCCACTGCTAATTTTCCTTCCTATTATTGCCATGGCAGAATTAGTTACTGAATTA
TTAAAGTTAGCTTTTCCAGAGGAACAAATTGCCTGTGTCTCTAGCCAATCAGCTGACAGAGTTAAAGACATTGATGCCTT
TCGTCAAGGAAAGAAAGGTATTTTGGTGACCACCACAATATTGGAAAGGGGTGTTACTTTCCCAGGCGTGGATGTCTTTG
TGCTTATGGCACAGCATCGAGGATACAGTTCTCAAAGCTTGGTTCAAATCGCAGGTCGTGTGGGGAGATCTATCGAAAGG
CCAACAGGGAAGGTGTATTTCTTTCACGATGGGATTAGCCAAGCAATGCGCAATGCAAGAAAAGAAATTAAAGAAATGAA
TCAGAAAGGTTATTTGAATGATATGCCTCCTATGTCAACAATTTAG
ATGGAAGGTATCGAAAACAGTTACGGTCGTTTATTTTTAAAACATCAATTGCCAAAAGAAGTAAATCATCTGGCAAAGAC
CTTGGAAAGTATAGTAGCCATAAAAGGTAAAGTCACTTGTCAACGCTGTCATTATCAGATAACCGCAGAAGCGAGATTAC
CAAGTGGCACTTACTACTGCCGCTTCTGTCTCGTTTTTGGCCGAAATCAAGCTGATAGGCCTCTTTATTATATACCTCCA
CATCCTTTTCCTATAGCCAATTATCTTCAATGGAAGGGGATATTAACGCCCTACCAAGAAAGTATTTCAAACCAATTAGT
CAAGAATGTTCATGCTAAAAAACCTACTTTAGTGCATGCTGTTACCGGAGCCGGCAAGACAGAGATGATTTATGGAGCCA
TAGCAGCAGTTGTTAATGCTGGAGGTTGGGTTTGTTTAGCCAGCCCAAGGGTCGATGTTTGTATAGAACTTGCTAATCGC
TTAAAAGCTGCTTTTTCTTGCCGGGTTACTCTTTTACATGCTGACTCAGAAGCTTATCAAAGAAGTCCTATTATAGTAGC
TACTACCCATCAATTACTGACCTTTTATAAGGCTTTTGATCTTTTAATCATTGATGAAGTTGATGCTTTTCCTTTTGTCA
ATAATCATCAACTACAATATGCAGCGCAGCAAGCCGCTAAAGAAGGCGCTAGTAGTATCGTATTGACCGCGACATCGACC
AAAGAACTGGAAGAGCAGGTCAAAAGTGGGGAACTAGAGAAGTTGACATTAGCTAGACGATTCCACGATAATCCTTTGAT
TCTTCCGAAATTTATCAGGAGTTTTGGTCTATTAAAAAAAATTCACTGTCAAAAACTTCCTCGAGCTCTTGTTAAATCTA
TCAGTCAACAAAGGAAAACAGGATGTCCACTGCTAATTTTCCTTCCTATTATTGCCATGGCAGAATTAGTTACTGAATTA
TTAAAGTTAGCTTTTCCAGAGGAACAAATTGCCTGTGTCTCTAGCCAATCAGCTGACAGAGTTAAAGACATTGATGCCTT
TCGTCAAGGAAAGAAAGGTATTTTGGTGACCACCACAATATTGGAAAGGGGTGTTACTTTCCCAGGCGTGGATGTCTTTG
TGCTTATGGCACAGCATCGAGGATACAGTTCTCAAAGCTTGGTTCAAATCGCAGGTCGTGTGGGGAGATCTATCGAAAGG
CCAACAGGGAAGGTGTATTTCTTTCACGATGGGATTAGCCAAGCAATGCGCAATGCAAGAAAAGAAATTAAAGAAATGAA
TCAGAAAGGTTATTTGAATGATATGCCTCCTATGTCAACAATTTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA/cflA | Streptococcus pneumoniae Rx1 |
53.302 |
96.145 |
0.512 |
| comFA/cflA | Streptococcus pneumoniae D39 |
53.302 |
96.145 |
0.512 |
| comFA/cflA | Streptococcus pneumoniae R6 |
53.302 |
96.145 |
0.512 |
| comFA/cflA | Streptococcus pneumoniae TIGR4 |
53.302 |
96.145 |
0.512 |
| comFA/cflA | Streptococcus mitis NCTC 12261 |
51.852 |
97.959 |
0.508 |
| comFA/cflA | Streptococcus mitis SK321 |
52.214 |
97.279 |
0.508 |
| comFA | Lactococcus lactis subsp. cremoris KW2 |
46.465 |
89.796 |
0.417 |
| comFA | Latilactobacillus sakei subsp. sakei 23K |
37.587 |
97.732 |
0.367 |