Detailed information
Overview
| Name | comFA/cflA | Type | Machinery gene |
| Locus tag | GOM47_RS09475 | Genome accession | NZ_CP046524 |
| Coordinates | 1911560..1912858 (-) | Length | 432 a.a. |
| NCBI ID | WP_235080647.1 | Uniprot ID | - |
| Organism | Streptococcus oralis strain SOT | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 1889668..1929478 | 1911560..1912858 | within | 0 |
Gene organization within MGE regions
Location: 1889668..1929478
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GOM47_RS09345 (GOM47_09350) | - | 1889668..1890819 (-) | 1152 | WP_414930277.1 | phage major capsid protein | - |
| GOM47_RS09350 (GOM47_09355) | - | 1890912..1891058 (-) | 147 | WP_001003898.1 | DUF2292 domain-containing protein | - |
| GOM47_RS09355 (GOM47_09360) | - | 1891389..1892843 (-) | 1455 | WP_235080635.1 | virulence-associated E family protein | - |
| GOM47_RS09360 (GOM47_09365) | - | 1892836..1893351 (-) | 516 | WP_084856430.1 | hypothetical protein | - |
| GOM47_RS09365 (GOM47_09370) | - | 1893342..1893611 (-) | 270 | WP_084856428.1 | hypothetical protein | - |
| GOM47_RS09370 (GOM47_09375) | - | 1893613..1893822 (-) | 210 | WP_084856425.1 | hypothetical protein | - |
| GOM47_RS09375 (GOM47_09380) | - | 1893819..1894250 (-) | 432 | WP_084856423.1 | hypothetical protein | - |
| GOM47_RS09380 | - | 1894483..1894632 (-) | 150 | WP_180383029.1 | hypothetical protein | - |
| GOM47_RS09385 (GOM47_09385) | - | 1894636..1895064 (-) | 429 | WP_084856421.1 | hypothetical protein | - |
| GOM47_RS09390 (GOM47_09390) | - | 1895183..1895431 (-) | 249 | WP_084856419.1 | hypothetical protein | - |
| GOM47_RS09395 (GOM47_09395) | - | 1895446..1895646 (-) | 201 | WP_084856417.1 | helix-turn-helix transcriptional regulator | - |
| GOM47_RS09400 (GOM47_09400) | - | 1895849..1896307 (+) | 459 | WP_084856416.1 | helix-turn-helix domain-containing protein | - |
| GOM47_RS09405 (GOM47_09405) | - | 1896458..1897624 (+) | 1167 | WP_084856415.1 | tyrosine-type recombinase/integrase | - |
| GOM47_RS09410 (GOM47_09410) | - | 1897781..1898509 (-) | 729 | WP_235080636.1 | ABC transporter ATP-binding protein | - |
| GOM47_RS09415 (GOM47_09415) | - | 1898509..1899516 (-) | 1008 | WP_235080637.1 | ABC transporter substrate-binding protein | - |
| GOM47_RS09420 (GOM47_09420) | - | 1899555..1900313 (-) | 759 | WP_235080638.1 | ABC transporter permease | - |
| GOM47_RS09425 (GOM47_09425) | - | 1900276..1900566 (-) | 291 | WP_235080639.1 | MTH1187 family thiamine-binding protein | - |
| GOM47_RS09430 (GOM47_09430) | polA | 1900843..1903476 (-) | 2634 | WP_235080640.1 | DNA polymerase I | - |
| GOM47_RS09435 (GOM47_09435) | - | 1903561..1904670 (-) | 1110 | WP_235080641.1 | SH3 domain-containing protein | - |
| GOM47_RS09440 (GOM47_09440) | - | 1904761..1905036 (-) | 276 | WP_235080642.1 | Veg family protein | - |
| GOM47_RS09445 (GOM47_09445) | dnaB | 1905038..1906390 (-) | 1353 | WP_235080643.1 | replicative DNA helicase | - |
| GOM47_RS09450 (GOM47_09450) | rplI | 1906434..1906886 (-) | 453 | WP_195215772.1 | 50S ribosomal protein L9 | - |
| GOM47_RS09455 (GOM47_09455) | - | 1906883..1908856 (-) | 1974 | WP_235080644.1 | DHH family phosphoesterase | - |
| GOM47_RS09460 (GOM47_09460) | - | 1909013..1910191 (-) | 1179 | WP_235080645.1 | acetyl-CoA C-acetyltransferase | - |
| GOM47_RS09465 (GOM47_09465) | hpf | 1910273..1910821 (-) | 549 | WP_000599113.1 | ribosome hibernation-promoting factor, HPF/YfiA family | - |
| GOM47_RS09470 (GOM47_09470) | comFC/cflB | 1910901..1911563 (-) | 663 | WP_235080646.1 | ComF family protein | Machinery gene |
| GOM47_RS09475 (GOM47_09475) | comFA/cflA | 1911560..1912858 (-) | 1299 | WP_235080647.1 | DEAD/DEAH box helicase | Machinery gene |
| GOM47_RS09480 (GOM47_09480) | - | 1912915..1913550 (+) | 636 | WP_235080648.1 | YigZ family protein | - |
| GOM47_RS09485 (GOM47_09485) | - | 1913565..1914008 (+) | 444 | WP_235080649.1 | PH domain-containing protein | - |
| GOM47_RS09490 (GOM47_09490) | cysK | 1914105..1915031 (+) | 927 | WP_235080650.1 | cysteine synthase A | - |
| GOM47_RS09495 (GOM47_09495) | tsf | 1915146..1916186 (-) | 1041 | WP_084974703.1 | translation elongation factor Ts | - |
| GOM47_RS09500 (GOM47_09500) | rpsB | 1916265..1917044 (-) | 780 | WP_235080651.1 | 30S ribosomal protein S2 | - |
| GOM47_RS09505 (GOM47_09505) | pcsB | 1917267..1918472 (-) | 1206 | WP_235080652.1 | peptidoglycan hydrolase PcsB | - |
| GOM47_RS09510 (GOM47_09510) | mreD | 1918566..1919060 (-) | 495 | WP_235080653.1 | rod shape-determining protein MreD | - |
| GOM47_RS09515 (GOM47_09515) | mreC | 1919063..1919878 (-) | 816 | WP_235080654.1 | rod shape-determining protein MreC | - |
| GOM47_RS09520 (GOM47_09520) | - | 1919939..1920733 (-) | 795 | WP_235080655.1 | energy-coupling factor transporter transmembrane component T family protein | - |
| GOM47_RS09525 (GOM47_09525) | - | 1920726..1921565 (-) | 840 | WP_235080656.1 | energy-coupling factor transporter ATPase | - |
| GOM47_RS09530 (GOM47_09530) | - | 1921550..1922377 (-) | 828 | WP_235080657.1 | energy-coupling factor ABC transporter ATP-binding protein | - |
| GOM47_RS09535 (GOM47_09535) | pgsA | 1922374..1922919 (-) | 546 | WP_235080658.1 | CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase | - |
| GOM47_RS09540 (GOM47_09540) | rodZ | 1922930..1923754 (-) | 825 | WP_235080659.1 | cytoskeleton protein RodZ | - |
| GOM47_RS09545 (GOM47_09545) | yfmH | 1923790..1925073 (-) | 1284 | WP_235080660.1 | EF-P 5-aminopentanol modification-associated protein YfmH | - |
| GOM47_RS09550 (GOM47_09550) | yfmF | 1925070..1926320 (-) | 1251 | WP_235080661.1 | EF-P 5-aminopentanol modification-associated protein YfmF | - |
| GOM47_RS09555 (GOM47_09555) | yaaA | 1926481..1926849 (+) | 369 | WP_195215749.1 | S4 domain-containing protein YaaA | - |
| GOM47_RS09560 (GOM47_09560) | recF | 1926852..1927949 (+) | 1098 | WP_235080662.1 | DNA replication/repair protein RecF | - |
| GOM47_RS09565 (GOM47_09565) | guaB | 1928000..1929478 (-) | 1479 | WP_235080663.1 | IMP dehydrogenase | - |
Sequence
Protein
Download Length: 432 a.a. Molecular weight: 49654.81 Da Isoelectric Point: 9.0664
>NTDB_id=405022 GOM47_RS09475 WP_235080647.1 1911560..1912858(-) (comFA/cflA) [Streptococcus oralis strain SOT]
MKVNPNYLGRLFTEKEITEEERQVAVKLPAMRKEKGKLFCQRCNSSILEEWYLPIGAYYCRECLLMKRVRSDQALYYFPQ
EDFPKQDVLKWRGKLTPFQEKVSEGLIRAVEKKEPTLVHAVTGAGKTEMIYQVVAKVINNGGAVCLASPRIDVCLELYKR
LQNDFACEIALLHGESEPYFRTPLVVATTHQLLKFHHAFDLLIVDEVDAFPYVDNPILYYAVNQCVKEEGLKIFLTATST
DELDKKVRTGELKRLSLPRRFHGNPLIIPKLVWLSDFNRYIEKSQLSPKLKSYIKKQRRTSYPLLIFASEIKKGEKLKEL
LQEQFPNENIGFVSSITENRLEQVQAFRDGELTILISTTILERGVTFPCVDVFVVEANHRLFTKSSLIQIGGRVGRSMDR
PTGELLFFHDGLNVSIKKAIKEIKQMNKEAGL
MKVNPNYLGRLFTEKEITEEERQVAVKLPAMRKEKGKLFCQRCNSSILEEWYLPIGAYYCRECLLMKRVRSDQALYYFPQ
EDFPKQDVLKWRGKLTPFQEKVSEGLIRAVEKKEPTLVHAVTGAGKTEMIYQVVAKVINNGGAVCLASPRIDVCLELYKR
LQNDFACEIALLHGESEPYFRTPLVVATTHQLLKFHHAFDLLIVDEVDAFPYVDNPILYYAVNQCVKEEGLKIFLTATST
DELDKKVRTGELKRLSLPRRFHGNPLIIPKLVWLSDFNRYIEKSQLSPKLKSYIKKQRRTSYPLLIFASEIKKGEKLKEL
LQEQFPNENIGFVSSITENRLEQVQAFRDGELTILISTTILERGVTFPCVDVFVVEANHRLFTKSSLIQIGGRVGRSMDR
PTGELLFFHDGLNVSIKKAIKEIKQMNKEAGL
Nucleotide
Download Length: 1299 bp
>NTDB_id=405022 GOM47_RS09475 WP_235080647.1 1911560..1912858(-) (comFA/cflA) [Streptococcus oralis strain SOT]
ATGAAAGTAAATCCAAATTATCTCGGTCGTTTGTTTACTGAGAAAGAAATAACTGAAGAAGAACGACAGGTAGCAGTGAA
ACTGCCAGCAATGAGAAAAGAGAAGGGGAAACTGTTTTGTCAACGTTGTAATAGTTCGATTTTAGAAGAATGGTATTTGC
CTATTGGCGCATACTATTGTAGGGAGTGTTTGCTGATGAAGCGAGTCAGGAGTGATCAAGCCTTATACTATTTTCCGCAG
GAAGATTTTCCTAAACAAGATGTTCTTAAGTGGCGTGGTAAGTTAACCCCTTTTCAGGAGAAGGTGTCAGAGGGATTGAT
TCGAGCAGTCGAAAAGAAAGAACCGACCTTGGTTCATGCTGTAACAGGAGCTGGAAAGACGGAGATGATTTATCAAGTTG
TGGCCAAGGTAATCAATAATGGTGGTGCAGTGTGTTTGGCCAGTCCTCGAATTGATGTATGTTTGGAATTGTATAAGCGA
CTGCAGAATGACTTTGCTTGTGAGATAGCACTGCTTCATGGCGAATCAGAGCCCTATTTTCGAACTCCACTAGTTGTTGC
AACGACTCACCAGTTGTTAAAATTTCATCATGCTTTTGATTTGCTGATAGTAGATGAAGTAGATGCCTTTCCTTATGTTG
ACAACCCTATACTTTACTACGCTGTAAACCAATGTGTAAAGGAGGAGGGGTTAAAGATATTCCTTACAGCGACCTCTACA
GATGAGTTAGATAAGAAGGTTCGCACTGGAGAATTAAAACGATTGAGCTTGCCAAGACGATTTCATGGAAATCCATTGAT
TATTCCAAAGCTAGTTTGGTTATCAGATTTTAATCGCTATATAGAAAAGAGTCAGTTGTCTCCAAAGTTAAAGTCCTACA
TTAAGAAGCAGAGAAGAACAAGTTATCCGTTGTTAATCTTTGCATCTGAGATTAAGAAAGGCGAGAAACTAAAAGAACTC
TTGCAGGAACAGTTTCCAAATGAAAACATCGGCTTTGTGTCCTCTATCACAGAAAATCGATTAGAGCAGGTACAAGCTTT
TCGAGATGGAGAGTTGACAATCCTTATTAGTACAACAATTTTGGAGCGTGGGGTCACCTTTCCTTGTGTGGATGTTTTTG
TTGTAGAAGCTAATCATCGTCTCTTTACCAAGTCTAGCTTGATTCAGATTGGAGGGCGAGTTGGGCGCAGTATGGATAGA
CCGACTGGTGAACTGCTCTTCTTTCATGATGGATTAAATGTTTCGATCAAAAAAGCAATCAAGGAAATAAAGCAGATGAA
CAAGGAGGCAGGCTTATGA
ATGAAAGTAAATCCAAATTATCTCGGTCGTTTGTTTACTGAGAAAGAAATAACTGAAGAAGAACGACAGGTAGCAGTGAA
ACTGCCAGCAATGAGAAAAGAGAAGGGGAAACTGTTTTGTCAACGTTGTAATAGTTCGATTTTAGAAGAATGGTATTTGC
CTATTGGCGCATACTATTGTAGGGAGTGTTTGCTGATGAAGCGAGTCAGGAGTGATCAAGCCTTATACTATTTTCCGCAG
GAAGATTTTCCTAAACAAGATGTTCTTAAGTGGCGTGGTAAGTTAACCCCTTTTCAGGAGAAGGTGTCAGAGGGATTGAT
TCGAGCAGTCGAAAAGAAAGAACCGACCTTGGTTCATGCTGTAACAGGAGCTGGAAAGACGGAGATGATTTATCAAGTTG
TGGCCAAGGTAATCAATAATGGTGGTGCAGTGTGTTTGGCCAGTCCTCGAATTGATGTATGTTTGGAATTGTATAAGCGA
CTGCAGAATGACTTTGCTTGTGAGATAGCACTGCTTCATGGCGAATCAGAGCCCTATTTTCGAACTCCACTAGTTGTTGC
AACGACTCACCAGTTGTTAAAATTTCATCATGCTTTTGATTTGCTGATAGTAGATGAAGTAGATGCCTTTCCTTATGTTG
ACAACCCTATACTTTACTACGCTGTAAACCAATGTGTAAAGGAGGAGGGGTTAAAGATATTCCTTACAGCGACCTCTACA
GATGAGTTAGATAAGAAGGTTCGCACTGGAGAATTAAAACGATTGAGCTTGCCAAGACGATTTCATGGAAATCCATTGAT
TATTCCAAAGCTAGTTTGGTTATCAGATTTTAATCGCTATATAGAAAAGAGTCAGTTGTCTCCAAAGTTAAAGTCCTACA
TTAAGAAGCAGAGAAGAACAAGTTATCCGTTGTTAATCTTTGCATCTGAGATTAAGAAAGGCGAGAAACTAAAAGAACTC
TTGCAGGAACAGTTTCCAAATGAAAACATCGGCTTTGTGTCCTCTATCACAGAAAATCGATTAGAGCAGGTACAAGCTTT
TCGAGATGGAGAGTTGACAATCCTTATTAGTACAACAATTTTGGAGCGTGGGGTCACCTTTCCTTGTGTGGATGTTTTTG
TTGTAGAAGCTAATCATCGTCTCTTTACCAAGTCTAGCTTGATTCAGATTGGAGGGCGAGTTGGGCGCAGTATGGATAGA
CCGACTGGTGAACTGCTCTTCTTTCATGATGGATTAAATGTTTCGATCAAAAAAGCAATCAAGGAAATAAAGCAGATGAA
CAAGGAGGCAGGCTTATGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA/cflA | Streptococcus mitis NCTC 12261 |
89.815 |
100 |
0.898 |
| comFA/cflA | Streptococcus pneumoniae Rx1 |
88.657 |
100 |
0.887 |
| comFA/cflA | Streptococcus pneumoniae D39 |
88.657 |
100 |
0.887 |
| comFA/cflA | Streptococcus pneumoniae R6 |
88.657 |
100 |
0.887 |
| comFA/cflA | Streptococcus pneumoniae TIGR4 |
88.426 |
100 |
0.884 |
| comFA/cflA | Streptococcus mitis SK321 |
88.426 |
100 |
0.884 |
| comFA | Lactococcus lactis subsp. cremoris KW2 |
50.976 |
94.907 |
0.484 |
| comFA | Latilactobacillus sakei subsp. sakei 23K |
37.558 |
100 |
0.377 |