Detailed information
Overview
| Name | comFA/cflA | Type | Machinery gene |
| Locus tag | I6J14_RS10435 | Genome accession | NZ_CP068057 |
| Coordinates | 2090169..2091494 (+) | Length | 441 a.a. |
| NCBI ID | WP_003052142.1 | Uniprot ID | - |
| Organism | Streptococcus dysgalactiae strain FDAARGOS_1157 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
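The coordinates and the two lengths in the Overview are linked by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene ends in a single stop codon; the function name `span_length` is hypothetical and introduced only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Coordinates of I6J14_RS10435 as listed in the Overview.
gene_bp = span_length(2090169, 2091494)

# One codon is the stop codon, which does not appear in the protein.
protein_aa = gene_bp // 3 - 1

print(gene_bp, protein_aa)  # 1326 441
```

Both values agree with the gene size (1326 bp) and protein length (441 a.a.) reported for this entry.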
Genomic Context
Location: 2085169..2096494
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| I6J14_RS10410 (I6J14_10410) | - | 2085848..2086489 (+) | 642 | WP_003052152.1 | response regulator transcription factor | - |
| I6J14_RS10415 (I6J14_10415) | - | 2086526..2087926 (+) | 1401 | WP_115253070.1 | bifunctional Cof-type HAD-IIB family hydrolase/peptidylprolyl isomerase | - |
| I6J14_RS10420 (I6J14_10420) | - | 2087926..2088285 (+) | 360 | WP_003052148.1 | S1 RNA-binding domain-containing protein | - |
| I6J14_RS10425 (I6J14_10425) | cysK | 2088436..2089371 (-) | 936 | WP_003052146.1 | cysteine synthase A | - |
| I6J14_RS10430 (I6J14_10430) | - | 2089475..2090113 (-) | 639 | WP_003052144.1 | YigZ family protein | - |
| I6J14_RS10435 (I6J14_10435) | comFA/cflA | 2090169..2091494 (+) | 1326 | WP_003052142.1 | DEAD/DEAH box helicase | Machinery gene |
| I6J14_RS11130 (I6J14_10440) | - | 2091466..2092131 (+) | 666 | WP_115253069.1 | ComF family protein | - |
| I6J14_RS10445 (I6J14_10445) | raiA | 2092211..2092759 (+) | 549 | WP_003052138.1 | ribosome-associated translation inhibitor RaiA | - |
| I6J14_RS10450 (I6J14_10450) | - | 2092714..2093651 (-) | 938 | Protein_2022 | IS30 family transposase | - |
| I6J14_RS10455 (I6J14_10455) | - | 2093861..2094208 (+) | 348 | WP_003052134.1 | hypothetical protein | - |
| I6J14_RS10460 (I6J14_10460) | - | 2094213..2094401 (+) | 189 | WP_003052131.1 | helix-turn-helix transcriptional regulator | - |
| I6J14_RS10465 (I6J14_10465) | - | 2094414..2094809 (+) | 396 | WP_115253068.1 | ATP-binding cassette domain-containing protein | - |
| I6J14_RS10470 (I6J14_10470) | - | 2094831..2095109 (+) | 279 | WP_115253067.1 | hypothetical protein | - |
| I6J14_RS10475 (I6J14_10475) | - | 2095120..2095470 (+) | 351 | WP_115253066.1 | hypothetical protein | - |
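The spacing between neighbouring genes can be read off the coordinates in the table above. The sketch below, assuming 1-based inclusive coordinates, computes the gap on either side of comFA/cflA and the flanks of the displayed window; the helper `gap_bp` and the variable names are hypothetical.

```python
# (locus tag, start, end, strand) copied from three rows of the table above.
genes = [
    ("I6J14_RS10430", 2089475, 2090113, "-"),
    ("I6J14_RS10435", 2090169, 2091494, "+"),  # comFA/cflA
    ("I6J14_RS11130", 2091466, 2092131, "+"),  # ComF family protein
]


def gap_bp(left, right):
    """Bases between two neighbouring genes; a negative value means overlap."""
    return right[1] - left[2] - 1


gaps = {(a[0], b[0]): gap_bp(a, b) for a, b in zip(genes, genes[1:])}

# Flanks of the displayed window (Location: 2085169..2096494) around comFA/cflA.
flank_left = 2090169 - 2085169
flank_right = 2096494 - 2091494

print(gaps)
print(flank_left, flank_right)  # 5000 5000
```

With these coordinates, comFA/cflA lies 55 bp downstream of the preceding gene and overlaps the following ComF family gene by 29 bp, and the window extends 5000 bp on each side of the gene.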
Sequence
Protein
Length: 441 a.a.; Molecular weight: 49359.45 Da; Isoelectric point: 9.6283
>NTDB_id=526763 I6J14_RS10435 WP_003052142.1 2090169..2091494(+) (comFA/cflA) [Streptococcus dysgalactiae strain FDAARGOS_1157]
MEGIENSYGRLFLKHQLPKEVNHLAKTLESIVIIKGKVTCQRCHYQITAEARLPSGTYYCRFCLVFGRNQADRPLYYIPP
HPFPIANYLQWKGILTPYQESISNQLVKNVHAKKPTLVHAVTGAGKTEMIYGAIAAVVNAGGWVCLASPRVDVCIELANR
LKAAFSCRVTLLHADSEAYQRSPIIVATTHQLLTFYKAFDLLIIDEVDAFPFVNNHQLQYASQQAAKEGASSIVLTATST
KELEEQVKSGELEKLTLARRFHDNPLILPKFIRSFGLLKKIHCQKLPRALVKSISQQRKTGCPLLIFLPIIAMAELVTEL
LKLAFPEEQIACVSSQSADRVKDIDDFRQGKKGILVTTTILERGVTFPGVDVFVLMAQHRGYSSQSLVQIAGRVGRSIER
PTGKVYFFHDGISQAMRNARKEIKEMNQKGYSNDMPPMSTI
Nucleotide
Length: 1326 bp
>NTDB_id=526763 I6J14_RS10435 WP_003052142.1 2090169..2091494(+) (comFA/cflA) [Streptococcus dysgalactiae strain FDAARGOS_1157]
ATGGAAGGTATTGAAAACAGTTACGGTCGTTTATTTTTAAAACATCAATTGCCAAAAGAAGTAAATCATCTGGCAAAGAC
CTTGGAAAGTATAGTAATCATAAAAGGTAAAGTCACTTGTCAACGCTGTCATTATCAGATAACCGCAGAAGCGAGATTAC
CAAGCGGCACTTACTACTGTCGCTTCTGTCTCGTTTTTGGCCGAAATCAAGCTGATAGGCCTCTTTATTATATACCTCCA
CATCCTTTTCCTATAGCCAATTATCTTCAATGGAAGGGGATATTAACACCCTACCAAGAAAGTATTTCAAACCAATTAGT
CAAGAATGTTCATGCTAAAAAACCTACTTTAGTGCATGCTGTTACCGGAGCCGGCAAGACAGAGATGATTTATGGTGCCA
TAGCAGCAGTTGTTAATGCTGGAGGTTGGGTTTGTTTAGCTAGCCCAAGGGTCGATGTTTGTATAGAACTTGCTAATCGC
TTAAAAGCTGCTTTTTCTTGCCGAGTTACTCTTTTACACGCTGACTCAGAAGCTTATCAAAGAAGTCCTATTATAGTAGC
TACTACCCATCAATTACTGACCTTTTATAAGGCTTTTGATCTTTTAATCATTGATGAAGTTGATGCTTTTCCTTTTGTCA
ATAATCATCAACTACAATATGCATCGCAGCAAGCCGCTAAAGAAGGCGCTAGTAGTATCGTATTGACCGCGACATCGACC
AAAGAACTGGAAGAGCAGGTCAAAAGTGGGGAATTAGAGAAGTTGACATTAGCTAGACGATTTCATGATAATCCTTTGAT
TCTTCCGAAATTTATCAGGAGTTTTGGCCTATTAAAAAAAATTCACTGTCAAAAACTTCCTCGAGCTCTTGTTAAATCTA
TCAGTCAACAAAGGAAAACAGGATGTCCACTGCTAATTTTTCTTCCTATTATTGCCATGGCAGAATTAGTTACTGAATTA
TTAAAGTTAGCTTTTCCAGAGGAGCAAATTGCCTGTGTCTCTAGCCAATCAGCTGACAGAGTTAAAGACATTGATGACTT
TCGTCAAGGAAAGAAAGGCATTTTGGTGACCACCACAATATTGGAAAGGGGTGTTACTTTCCCAGGTGTGGATGTCTTTG
TGCTTATGGCACAGCATCGAGGATACAGTTCTCAAAGCTTGGTTCAAATCGCAGGTCGTGTCGGGAGATCTATCGAAAGG
CCAACAGGAAAGGTGTATTTCTTTCACGATGGGATTAGCCAAGCAATGCGCAATGCAAGAAAAGAAATTAAAGAAATGAA
TCAGAAAGGTTATTCGAATGATATGCCTCCTATGTCAACAATTTAG
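The nucleotide record can be checked against the protein record by translating it. The following is a minimal sketch in plain Python, assuming the standard codon assignments (which bacterial translation table 11 shares for elongation); `translate` and `CODON_TABLE` are hypothetical names, and only the first 30 and last 15 bases of the sequence above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


head = translate("ATGGAAGGTATTGAAAACAGTTACGGTCGT")  # first 30 bases
tail = translate("ATGTCAACAATTTAG")                 # last 15 bases

print(head)  # MEGIENSYGR
print(tail)  # MSTI*
```

The translated ends match the start (`MEGIENSYGR`) and end (`MSTI`) of the protein sequence, with the final `TAG` read as the stop codon.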
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA/cflA | Streptococcus pneumoniae Rx1 | 53.081 | 95.692 | 0.508 |
| comFA/cflA | Streptococcus pneumoniae D39 | 53.081 | 95.692 | 0.508 |
| comFA/cflA | Streptococcus pneumoniae R6 | 53.081 | 95.692 | 0.508 |
| comFA/cflA | Streptococcus pneumoniae TIGR4 | 53.081 | 95.692 | 0.508 |
| comFA/cflA | Streptococcus mitis NCTC 12261 | 51.628 | 97.506 | 0.503 |
| comFA/cflA | Streptococcus mitis SK321 | 51.765 | 96.372 | 0.499 |
| comFA | Lactococcus lactis subsp. cremoris KW2 | 45.366 | 92.971 | 0.422 |
| comFA | Latilactobacillus sakei subsp. sakei 23K | 37.355 | 97.732 | 0.365 |
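In the rows listed here, the Ha-value appears to equal the identity fraction multiplied by the coverage fraction. This is an observation from the numbers on this page, not a definition taken from the database, so the sketch below should be read as a consistency check under that assumption; the function name `identity_times_coverage` is hypothetical.

```python
# (organism, identities %, coverage %, listed Ha-value) from the Similar proteins table.
hits = [
    ("Streptococcus pneumoniae Rx1", 53.081, 95.692, 0.508),
    ("Streptococcus mitis NCTC 12261", 51.628, 97.506, 0.503),
    ("Streptococcus mitis SK321", 51.765, 96.372, 0.499),
    ("Lactococcus lactis subsp. cremoris KW2", 45.366, 92.971, 0.422),
    ("Latilactobacillus sakei subsp. sakei 23K", 37.355, 97.732, 0.365),
]


def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Product of the two percentages expressed as fractions (assumed relation)."""
    return (identity_pct / 100) * (coverage_pct / 100)


for organism, identity, coverage, listed in hits:
    computed = identity_times_coverage(identity, coverage)
    # Compare at the three-decimal precision shown in the table.
    print(f"{organism}: computed {computed:.3f}, listed {listed:.3f}")
```

Under this assumption, a hit with high identity but low coverage would score lower than the raw identity suggests, which is one plausible reason for reporting a combined value.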