Detailed information
Overview
| Name | comFA/cflA | Type | Machinery gene |
| Locus tag | GGS_RS08000 | Genome accession | NC_018712 |
| Coordinates | 1636965..1638290 (-) | Length | 441 a.a. |
| NCBI ID | WP_015017291.1 | Uniprot ID | A0A9X8T4I6 |
| Organism | Streptococcus dysgalactiae subsp. equisimilis RE378 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1631965..1643290
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GGS_RS07960 (GGS_1511) | - | 1632337..1632636 (-) | 300 | WP_000129105.1 | hypothetical protein | - |
| GGS_RS07965 | - | 1633019..1633216 (-) | 198 | WP_000208122.1 | helix-turn-helix transcriptional regulator | - |
| GGS_RS07970 | - | 1633213..1633398 (-) | 186 | WP_000938679.1 | hypothetical protein | - |
| GGS_RS07975 (GGS_1512) | - | 1633400..1634425 (-) | 1026 | WP_000822468.1 | replication initiator protein A | - |
| GGS_RS11035 | - | 1634427..1634591 (-) | 165 | WP_000166480.1 | hypothetical protein | - |
| GGS_RS07980 (GGS_1513) | - | 1634698..1635141 (-) | 444 | WP_000447421.1 | hypothetical protein | - |
| GGS_RS07985 | - | 1635407..1635664 (-) | 258 | WP_000191926.1 | DUF3850 domain-containing protein | - |
| GGS_RS07990 (GGS_1514) | - | 1635848..1636219 (-) | 372 | WP_001282475.1 | hypothetical protein | - |
| GGS_RS11340 (GGS_1515) | - | 1636328..1636993 (-) | 666 | WP_012767419.1 | ComF family protein | - |
| GGS_RS08000 (GGS_1516) | comFA/cflA | 1636965..1638290 (-) | 1326 | WP_015017291.1 | DEAD/DEAH box helicase | Machinery gene |
| GGS_RS08005 (GGS_1517) | - | 1638346..1638984 (+) | 639 | WP_015017292.1 | YigZ family protein | - |
| GGS_RS08010 (GGS_1518) | cysK | 1639088..1640023 (+) | 936 | WP_003062335.1 | cysteine synthase A | - |
| GGS_RS08015 (GGS_1519) | - | 1640175..1640534 (-) | 360 | WP_015017293.1 | S1 RNA-binding domain-containing protein | - |
| GGS_RS08020 (GGS_1520) | - | 1640534..1641934 (-) | 1401 | WP_042357970.1 | bifunctional Cof-type HAD-IIB family hydrolase/peptidylprolyl isomerase | - |
| GGS_RS08025 (GGS_1521) | - | 1641971..1642612 (-) | 642 | WP_014612503.1 | response regulator transcription factor | - |
Sequence
Protein
Download Length: 441 a.a. Molecular weight: 49283.45 Da Isoelectric Point: 9.7069
>NTDB_id=53970 GGS_RS08000 WP_015017291.1 1636965..1638290(-) (comFA/cflA) [Streptococcus dysgalactiae subsp. equisimilis RE378]
MEGIENSYGRLFLKHQLPKEVNHLAKTLESIVAIKGKVTCQRCHYQITAEARLPSGTYYCRFCLVFGRNQADRPLYYIPP
HPFPIANYLQWKGILTPYQESISNQLVKNVHAKKPTLVHAVTGAGKTEMIYGAIAAVVNAGGWVCLASPRVDVCIELANR
LKAAFSCRVTLLHADSEAYQRSPIIVATTHQLLTFYKAFDLLIIDEVDAFPFVNNHQLQYAAQQAAKEGASSIVLTATST
KELEEQVKSGELEKLTLARRFHDNPLILPKFIRSFGLLKKIHCQKLPRALVKSISQQRKTGCPLLIFLPIIAMAELVTEL
LKLAFPEEQIACVSSQSADRVKDIDAFRQGKKGILVTTTILERGVTFPGVDVFVLMAQHRGYSSQSLVQIAGRVGRSIER
PTGKVYFFHDGISQAMRNARKEIKEMNQKGYLNDMPPMSTI
MEGIENSYGRLFLKHQLPKEVNHLAKTLESIVAIKGKVTCQRCHYQITAEARLPSGTYYCRFCLVFGRNQADRPLYYIPP
HPFPIANYLQWKGILTPYQESISNQLVKNVHAKKPTLVHAVTGAGKTEMIYGAIAAVVNAGGWVCLASPRVDVCIELANR
LKAAFSCRVTLLHADSEAYQRSPIIVATTHQLLTFYKAFDLLIIDEVDAFPFVNNHQLQYAAQQAAKEGASSIVLTATST
KELEEQVKSGELEKLTLARRFHDNPLILPKFIRSFGLLKKIHCQKLPRALVKSISQQRKTGCPLLIFLPIIAMAELVTEL
LKLAFPEEQIACVSSQSADRVKDIDAFRQGKKGILVTTTILERGVTFPGVDVFVLMAQHRGYSSQSLVQIAGRVGRSIER
PTGKVYFFHDGISQAMRNARKEIKEMNQKGYLNDMPPMSTI
Nucleotide
Download Length: 1326 bp
>NTDB_id=53970 GGS_RS08000 WP_015017291.1 1636965..1638290(-) (comFA/cflA) [Streptococcus dysgalactiae subsp. equisimilis RE378]
ATGGAAGGTATCGAAAACAGTTACGGTCGTTTATTTTTAAAACATCAATTGCCAAAAGAAGTAAATCATCTGGCAAAGAC
CTTGGAAAGTATAGTAGCCATAAAAGGTAAAGTCACTTGTCAACGCTGTCATTATCAGATAACCGCAGAAGCGAGATTAC
CAAGTGGCACTTACTACTGCCGCTTCTGTCTCGTTTTTGGCCGAAATCAAGCTGATAGGCCTCTTTATTATATACCTCCA
CATCCTTTTCCTATAGCCAATTATCTTCAATGGAAGGGGATATTAACGCCCTACCAAGAAAGTATTTCAAACCAATTAGT
CAAGAATGTTCATGCTAAAAAACCTACTTTAGTGCATGCTGTTACCGGAGCCGGCAAGACAGAGATGATTTATGGAGCCA
TAGCAGCAGTTGTTAATGCTGGAGGTTGGGTTTGTTTAGCCAGCCCAAGGGTCGATGTTTGTATAGAACTTGCTAATCGC
TTAAAAGCTGCTTTTTCTTGCCGGGTTACTCTTTTACATGCTGACTCAGAAGCTTATCAAAGAAGTCCTATTATAGTAGC
TACTACCCATCAATTACTGACCTTTTATAAGGCTTTTGATCTTTTAATCATTGATGAAGTTGATGCTTTTCCTTTTGTCA
ATAATCATCAACTACAATATGCAGCGCAGCAAGCCGCTAAAGAAGGCGCTAGTAGTATCGTATTGACCGCGACATCGACC
AAAGAACTGGAAGAGCAGGTCAAAAGTGGGGAACTAGAGAAGTTGACATTAGCTAGACGATTCCACGATAATCCTTTGAT
TCTTCCGAAATTTATCAGGAGTTTTGGTCTATTAAAAAAAATTCACTGTCAAAAACTTCCTCGAGCTCTTGTTAAATCTA
TCAGTCAACAAAGGAAAACAGGATGTCCACTGCTAATTTTCCTTCCTATTATTGCCATGGCAGAATTAGTTACTGAATTA
TTAAAGTTAGCTTTTCCAGAGGAACAAATTGCCTGTGTCTCTAGCCAATCAGCTGACAGAGTTAAAGACATTGATGCCTT
TCGTCAAGGAAAGAAAGGTATTTTGGTGACCACCACAATATTGGAAAGGGGTGTTACTTTCCCAGGCGTGGATGTCTTTG
TGCTTATGGCACAGCATCGAGGATACAGTTCTCAAAGCTTGGTTCAAATCGCAGGTCGTGTGGGGAGATCTATCGAAAGG
CCAACAGGGAAGGTGTATTTCTTTCACGATGGGATTAGCCAAGCAATGCGCAATGCAAGAAAAGAAATTAAAGAAATGAA
TCAGAAAGGTTATTTGAATGATATGCCTCCTATGTCAACAATTTAG
ATGGAAGGTATCGAAAACAGTTACGGTCGTTTATTTTTAAAACATCAATTGCCAAAAGAAGTAAATCATCTGGCAAAGAC
CTTGGAAAGTATAGTAGCCATAAAAGGTAAAGTCACTTGTCAACGCTGTCATTATCAGATAACCGCAGAAGCGAGATTAC
CAAGTGGCACTTACTACTGCCGCTTCTGTCTCGTTTTTGGCCGAAATCAAGCTGATAGGCCTCTTTATTATATACCTCCA
CATCCTTTTCCTATAGCCAATTATCTTCAATGGAAGGGGATATTAACGCCCTACCAAGAAAGTATTTCAAACCAATTAGT
CAAGAATGTTCATGCTAAAAAACCTACTTTAGTGCATGCTGTTACCGGAGCCGGCAAGACAGAGATGATTTATGGAGCCA
TAGCAGCAGTTGTTAATGCTGGAGGTTGGGTTTGTTTAGCCAGCCCAAGGGTCGATGTTTGTATAGAACTTGCTAATCGC
TTAAAAGCTGCTTTTTCTTGCCGGGTTACTCTTTTACATGCTGACTCAGAAGCTTATCAAAGAAGTCCTATTATAGTAGC
TACTACCCATCAATTACTGACCTTTTATAAGGCTTTTGATCTTTTAATCATTGATGAAGTTGATGCTTTTCCTTTTGTCA
ATAATCATCAACTACAATATGCAGCGCAGCAAGCCGCTAAAGAAGGCGCTAGTAGTATCGTATTGACCGCGACATCGACC
AAAGAACTGGAAGAGCAGGTCAAAAGTGGGGAACTAGAGAAGTTGACATTAGCTAGACGATTCCACGATAATCCTTTGAT
TCTTCCGAAATTTATCAGGAGTTTTGGTCTATTAAAAAAAATTCACTGTCAAAAACTTCCTCGAGCTCTTGTTAAATCTA
TCAGTCAACAAAGGAAAACAGGATGTCCACTGCTAATTTTCCTTCCTATTATTGCCATGGCAGAATTAGTTACTGAATTA
TTAAAGTTAGCTTTTCCAGAGGAACAAATTGCCTGTGTCTCTAGCCAATCAGCTGACAGAGTTAAAGACATTGATGCCTT
TCGTCAAGGAAAGAAAGGTATTTTGGTGACCACCACAATATTGGAAAGGGGTGTTACTTTCCCAGGCGTGGATGTCTTTG
TGCTTATGGCACAGCATCGAGGATACAGTTCTCAAAGCTTGGTTCAAATCGCAGGTCGTGTGGGGAGATCTATCGAAAGG
CCAACAGGGAAGGTGTATTTCTTTCACGATGGGATTAGCCAAGCAATGCGCAATGCAAGAAAAGAAATTAAAGAAATGAA
TCAGAAAGGTTATTTGAATGATATGCCTCCTATGTCAACAATTTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA/cflA | Streptococcus pneumoniae Rx1 |
53.302 |
96.145 |
0.512 |
| comFA/cflA | Streptococcus pneumoniae D39 |
53.302 |
96.145 |
0.512 |
| comFA/cflA | Streptococcus pneumoniae R6 |
53.302 |
96.145 |
0.512 |
| comFA/cflA | Streptococcus pneumoniae TIGR4 |
53.302 |
96.145 |
0.512 |
| comFA/cflA | Streptococcus mitis NCTC 12261 |
51.852 |
97.959 |
0.508 |
| comFA/cflA | Streptococcus mitis SK321 |
52.214 |
97.279 |
0.508 |
| comFA | Lactococcus lactis subsp. cremoris KW2 |
46.465 |
89.796 |
0.417 |
| comFA | Latilactobacillus sakei subsp. sakei 23K |
37.587 |
97.732 |
0.367 |