Detailed information
Overview
| Name | comFA/cflA | Type | Machinery gene |
| Locus tag | AB6M97_RS08010 | Genome accession | NZ_CP163514 |
| Coordinates | 1665621..1666913 (-) | Length | 430 a.a. |
| NCBI ID | WP_121835775.1 | Uniprot ID | A0A3L9DUS7 |
| Organism | Streptococcus hillyeri strain S23-3001-1 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 1645128..1665624 | 1665621..1666913 | flank | -3 |
Gene organization within MGE regions
Location: 1645128..1666913
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| AB6M97_RS07870 (AB6M97_07870) | - | 1645128..1645865 (-) | 738 | WP_121835751.1 | helix-turn-helix domain-containing protein | - |
| AB6M97_RS07875 (AB6M97_07875) | - | 1646050..1646235 (+) | 186 | WP_121835752.1 | helix-turn-helix transcriptional regulator | - |
| AB6M97_RS07880 (AB6M97_07880) | - | 1646325..1646558 (+) | 234 | WP_121835753.1 | hypothetical protein | - |
| AB6M97_RS07885 (AB6M97_07885) | - | 1646727..1647035 (+) | 309 | WP_121835754.1 | hypothetical protein | - |
| AB6M97_RS07890 (AB6M97_07890) | - | 1647016..1647222 (+) | 207 | WP_121835755.1 | hypothetical protein | - |
| AB6M97_RS07895 (AB6M97_07895) | - | 1647209..1647688 (+) | 480 | WP_245962708.1 | hypothetical protein | - |
| AB6M97_RS07900 (AB6M97_07900) | - | 1647681..1648088 (+) | 408 | WP_121835756.1 | hypothetical protein | - |
| AB6M97_RS07905 (AB6M97_07905) | - | 1648217..1648480 (+) | 264 | WP_121835757.1 | hypothetical protein | - |
| AB6M97_RS07910 (AB6M97_07910) | - | 1648910..1649140 (+) | 231 | WP_121835758.1 | hypothetical protein | - |
| AB6M97_RS07915 (AB6M97_07915) | - | 1649125..1649319 (+) | 195 | WP_121835759.1 | hypothetical protein | - |
| AB6M97_RS07920 (AB6M97_07920) | - | 1649333..1649671 (+) | 339 | WP_121835760.1 | HTH domain-containing protein | - |
| AB6M97_RS07925 (AB6M97_07925) | - | 1649880..1650218 (+) | 339 | WP_142925595.1 | hypothetical protein | - |
| AB6M97_RS07930 (AB6M97_07930) | - | 1650175..1651701 (+) | 1527 | WP_121835762.1 | phage/plasmid primase, P4 family | - |
| AB6M97_RS07935 (AB6M97_07935) | - | 1652222..1652763 (+) | 542 | Protein_1548 | site-specific integrase | - |
| AB6M97_RS07940 (AB6M97_07940) | - | 1653553..1654221 (+) | 669 | WP_121835765.1 | Fic family protein | - |
| AB6M97_RS07945 (AB6M97_07945) | - | 1654842..1654994 (+) | 153 | WP_183121669.1 | hypothetical protein | - |
| AB6M97_RS07950 (AB6M97_07950) | - | 1655013..1656515 (+) | 1503 | WP_369351206.1 | ATP-binding cassette domain-containing protein | - |
| AB6M97_RS07955 (AB6M97_07955) | - | 1656531..1656701 (-) | 171 | WP_121835767.1 | hypothetical protein | - |
| AB6M97_RS07960 (AB6M97_07960) | - | 1656718..1657272 (-) | 555 | WP_121835768.1 | GNAT family N-acetyltransferase | - |
| AB6M97_RS07965 (AB6M97_07965) | obgE | 1657467..1658780 (-) | 1314 | WP_121835769.1 | GTPase ObgE | - |
| AB6M97_RS07970 (AB6M97_07970) | - | 1659047..1659175 (-) | 129 | WP_083959937.1 | DUF4044 domain-containing protein | - |
| AB6M97_RS07975 (AB6M97_07975) | - | 1659318..1660568 (+) | 1251 | WP_369350515.1 | aminopeptidase | - |
| AB6M97_RS07980 (AB6M97_07980) | asnA | 1660681..1661673 (-) | 993 | WP_121835771.1 | aspartate--ammonia ligase | - |
| AB6M97_RS07985 (AB6M97_07985) | - | 1661754..1662818 (-) | 1065 | WP_121835772.1 | MmcQ/YjbR family DNA-binding protein | - |
| AB6M97_RS07995 (AB6M97_07995) | rplS | 1663217..1663564 (-) | 348 | WP_027971744.1 | 50S ribosomal protein L19 | - |
| AB6M97_RS08000 (AB6M97_08000) | raiA | 1664327..1664875 (-) | 549 | WP_121835773.1 | ribosome-associated translation inhibitor RaiA | - |
| AB6M97_RS08005 (AB6M97_08005) | comFC/cflB | 1664962..1665624 (-) | 663 | WP_121835774.1 | ComF family protein | Machinery gene |
| AB6M97_RS08010 (AB6M97_08010) | comFA/cflA | 1665621..1666913 (-) | 1293 | WP_121835775.1 | DEAD/DEAH box helicase | Machinery gene |
Sequence
Protein
Download Length: 430 a.a. Molecular weight: 48745.72 Da Isoelectric Point: 8.6825
>NTDB_id=1030359 AB6M97_RS08010 WP_121835775.1 1665621..1666913(-) (comFA/cflA) [Streptococcus hillyeri strain S23-3001-1]
MEDFYGRLFVAHQLSDEEKSRAKQLPAMLEQKGKLCCQRCGSHILTDWLLPAGDYYCRECLMLGRNRQSQPLYYFPAQPF
SKEKYLIWQGTLTSYQQEISDGMKTAVANKENILIHAVTGAGKTEMIYETVASVLEQGGQVCLASPRIDVCLELHKRLSR
DFSCPIALLHGESEPYTRAPLVIATTHQLLKFYQAFDLLIIDEVDAFPFVDNAMLYHGLNQAVKIDGVKVFLTATSTDAL
DKQVKQGKLKKLDLARRFHANPLVVPKMKWLSGLLFSLQKDKLPSKLLKQVASQRETGYPLLIFFPHIEIGQRFTEILTK
TFPQETIGFVASTTENRLELVQQFRDKKLTVLVSTTILERGVTFPCVDVFVLWSNHKLYTRSSLVQIGGRVGRAMERPTG
ELIFFHDGMTIEMARAIAEIRLMNQKGGFA
MEDFYGRLFVAHQLSDEEKSRAKQLPAMLEQKGKLCCQRCGSHILTDWLLPAGDYYCRECLMLGRNRQSQPLYYFPAQPF
SKEKYLIWQGTLTSYQQEISDGMKTAVANKENILIHAVTGAGKTEMIYETVASVLEQGGQVCLASPRIDVCLELHKRLSR
DFSCPIALLHGESEPYTRAPLVIATTHQLLKFYQAFDLLIIDEVDAFPFVDNAMLYHGLNQAVKIDGVKVFLTATSTDAL
DKQVKQGKLKKLDLARRFHANPLVVPKMKWLSGLLFSLQKDKLPSKLLKQVASQRETGYPLLIFFPHIEIGQRFTEILTK
TFPQETIGFVASTTENRLELVQQFRDKKLTVLVSTTILERGVTFPCVDVFVLWSNHKLYTRSSLVQIGGRVGRAMERPTG
ELIFFHDGMTIEMARAIAEIRLMNQKGGFA
Nucleotide
Download Length: 1293 bp
>NTDB_id=1030359 AB6M97_RS08010 WP_121835775.1 1665621..1666913(-) (comFA/cflA) [Streptococcus hillyeri strain S23-3001-1]
ATGGAAGATTTTTACGGACGTTTATTTGTAGCGCATCAACTTTCCGATGAGGAAAAAAGCAGAGCTAAACAGTTACCAGC
CATGCTAGAGCAAAAAGGGAAACTCTGTTGTCAGCGATGTGGCAGTCACATATTAACAGACTGGCTGTTGCCAGCTGGTG
ACTATTATTGCAGGGAGTGTCTTATGTTAGGGAGAAATCGACAAAGCCAGCCTCTCTATTATTTCCCAGCGCAGCCTTTT
TCTAAAGAAAAGTATCTGATCTGGCAAGGAACTCTGACGTCGTACCAACAGGAAATCTCAGATGGCATGAAAACTGCTGT
GGCTAATAAGGAAAATATCCTTATTCATGCGGTGACAGGGGCTGGAAAGACCGAAATGATTTATGAGACAGTGGCAAGTG
TTTTGGAGCAAGGTGGACAGGTTTGTCTAGCTAGTCCACGGATTGATGTCTGTCTAGAACTTCATAAACGCCTTAGCAGA
GATTTTTCTTGTCCCATAGCGCTCTTGCATGGTGAGAGTGAGCCTTATACTAGAGCGCCCTTGGTTATTGCAACAACCCA
TCAGTTGCTTAAATTTTACCAAGCCTTTGATTTACTGATTATAGATGAGGTGGACGCTTTTCCTTTCGTTGATAATGCCA
TGCTTTATCATGGTCTTAACCAAGCGGTTAAAATTGATGGCGTCAAAGTTTTTTTGACAGCTACCTCAACGGATGCATTG
GATAAACAAGTCAAACAAGGAAAGCTTAAAAAGTTAGATTTAGCCAGACGTTTCCACGCCAATCCTCTCGTGGTTCCCAA
AATGAAGTGGCTGAGTGGACTGTTATTTAGTCTTCAAAAAGACAAACTCCCTTCAAAATTGTTGAAACAGGTAGCCTCAC
AAAGAGAAACGGGTTATCCACTTTTGATTTTCTTTCCTCACATTGAGATAGGACAAAGGTTTACTGAAATCTTAACGAAA
ACATTTCCACAGGAAACTATTGGGTTTGTTGCCTCAACAACAGAAAATCGTTTAGAATTAGTTCAGCAGTTTAGAGATAA
AAAGCTGACTGTTTTGGTATCAACCACTATCTTAGAGAGGGGAGTTACCTTTCCCTGTGTAGATGTTTTTGTACTTTGGA
GCAACCATAAACTCTATACGCGGTCCTCTCTTGTGCAGATTGGTGGCCGTGTCGGAAGAGCAATGGAGAGACCAACAGGA
GAATTGATTTTCTTTCACGATGGCATGACTATAGAGATGGCAAGAGCCATTGCTGAGATTCGCCTGATGAATCAAAAAGG
AGGCTTTGCATGA
ATGGAAGATTTTTACGGACGTTTATTTGTAGCGCATCAACTTTCCGATGAGGAAAAAAGCAGAGCTAAACAGTTACCAGC
CATGCTAGAGCAAAAAGGGAAACTCTGTTGTCAGCGATGTGGCAGTCACATATTAACAGACTGGCTGTTGCCAGCTGGTG
ACTATTATTGCAGGGAGTGTCTTATGTTAGGGAGAAATCGACAAAGCCAGCCTCTCTATTATTTCCCAGCGCAGCCTTTT
TCTAAAGAAAAGTATCTGATCTGGCAAGGAACTCTGACGTCGTACCAACAGGAAATCTCAGATGGCATGAAAACTGCTGT
GGCTAATAAGGAAAATATCCTTATTCATGCGGTGACAGGGGCTGGAAAGACCGAAATGATTTATGAGACAGTGGCAAGTG
TTTTGGAGCAAGGTGGACAGGTTTGTCTAGCTAGTCCACGGATTGATGTCTGTCTAGAACTTCATAAACGCCTTAGCAGA
GATTTTTCTTGTCCCATAGCGCTCTTGCATGGTGAGAGTGAGCCTTATACTAGAGCGCCCTTGGTTATTGCAACAACCCA
TCAGTTGCTTAAATTTTACCAAGCCTTTGATTTACTGATTATAGATGAGGTGGACGCTTTTCCTTTCGTTGATAATGCCA
TGCTTTATCATGGTCTTAACCAAGCGGTTAAAATTGATGGCGTCAAAGTTTTTTTGACAGCTACCTCAACGGATGCATTG
GATAAACAAGTCAAACAAGGAAAGCTTAAAAAGTTAGATTTAGCCAGACGTTTCCACGCCAATCCTCTCGTGGTTCCCAA
AATGAAGTGGCTGAGTGGACTGTTATTTAGTCTTCAAAAAGACAAACTCCCTTCAAAATTGTTGAAACAGGTAGCCTCAC
AAAGAGAAACGGGTTATCCACTTTTGATTTTCTTTCCTCACATTGAGATAGGACAAAGGTTTACTGAAATCTTAACGAAA
ACATTTCCACAGGAAACTATTGGGTTTGTTGCCTCAACAACAGAAAATCGTTTAGAATTAGTTCAGCAGTTTAGAGATAA
AAAGCTGACTGTTTTGGTATCAACCACTATCTTAGAGAGGGGAGTTACCTTTCCCTGTGTAGATGTTTTTGTACTTTGGA
GCAACCATAAACTCTATACGCGGTCCTCTCTTGTGCAGATTGGTGGCCGTGTCGGAAGAGCAATGGAGAGACCAACAGGA
GAATTGATTTTCTTTCACGATGGCATGACTATAGAGATGGCAAGAGCCATTGCTGAGATTCGCCTGATGAATCAAAAAGG
AGGCTTTGCATGA
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA/cflA | Streptococcus mitis NCTC 12261 |
61.593 |
99.302 |
0.612 |
| comFA/cflA | Streptococcus pneumoniae Rx1 |
61.593 |
99.302 |
0.612 |
| comFA/cflA | Streptococcus pneumoniae D39 |
61.593 |
99.302 |
0.612 |
| comFA/cflA | Streptococcus pneumoniae R6 |
61.593 |
99.302 |
0.612 |
| comFA/cflA | Streptococcus pneumoniae TIGR4 |
61.358 |
99.302 |
0.609 |
| comFA/cflA | Streptococcus mitis SK321 |
61.358 |
99.302 |
0.609 |
| comFA | Lactococcus lactis subsp. cremoris KW2 |
51.256 |
92.558 |
0.474 |
| comFA | Latilactobacillus sakei subsp. sakei 23K |
36.891 |
100 |
0.37 |
| comFA | Bacillus subtilis subsp. subtilis str. 168 |
37.805 |
95.349 |
0.36 |