Detailed information
Overview
| Name | comFA/cflA | Type | Machinery gene |
| Locus tag | YB51_RS02395 | Genome accession | NC_022516 |
| Coordinates | 452502..453794 (+) | Length | 430 a.a. |
| NCBI ID | WP_004194121.1 | Uniprot ID | A0A140EWW0 |
| Organism | Streptococcus suis YB51 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| ICE | 433399..453794 | 452502..453794 | within | 0 |
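The "Relative position" and "Distance (bp)" values in the row above follow from comparing the two coordinate ranges. A minimal sketch in Python, assuming 1-based inclusive coordinates; the function name `relative_position` and every label other than "within" are hypothetical and are not taken from VRprofile2:

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples with 1-based inclusive
    coordinates. Only the "within" label appears in this record; the
    other labels are illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        # Gene lies entirely inside the MGE, so the distance is zero.
        return "within", 0
    if g_end < m_start:
        # Gene ends before the MGE begins.
        return "upstream", m_start - g_end
    if g_start > m_end:
        # Gene starts after the MGE ends.
        return "downstream", g_start - m_end
    # Gene straddles one of the MGE boundaries.
    return "overlapping", 0


position, distance = relative_position((452502, 453794), (433399, 453794))
print(position, distance)  # within 0
```

Here the gene's end coordinate (453794) coincides with the end of the predicted ICE, so the gene sits at the boundary of the element but still counts as contained.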
Gene organization within MGE regions
Location: 433399..453794
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| YB51_RS02245 (YB51_2120) | - | 433399..434556 (-) | 1158 | WP_013730163.1 | tyrosine-type recombinase/integrase | - |
| YB51_RS02250 (YB51_2125) | - | 434753..435484 (-) | 732 | WP_013730164.1 | hypothetical protein | - |
| YB51_RS02255 (YB51_2130) | - | 435532..435915 (-) | 384 | WP_013730165.1 | ImmA/IrrE family metallo-endopeptidase | - |
| YB51_RS02260 (YB51_2135) | - | 435922..436470 (-) | 549 | WP_022540560.1 | helix-turn-helix domain-containing protein | - |
| YB51_RS02265 (YB51_2140) | - | 436487..436687 (+) | 201 | WP_002937001.1 | helix-turn-helix domain-containing protein | - |
| YB51_RS02270 (YB51_2145) | - | 436757..437083 (+) | 327 | WP_013730166.1 | hypothetical protein | - |
| YB51_RS02275 | - | 437157..437411 (+) | 255 | WP_013730167.1 | hypothetical protein | - |
| YB51_RS02280 (YB51_2150) | - | 437452..437700 (+) | 249 | WP_013730168.1 | hypothetical protein | - |
| YB51_RS10650 (YB51_2155) | - | 437690..437842 (+) | 153 | WP_022540561.1 | hypothetical protein | - |
| YB51_RS02285 (YB51_2160) | - | 437847..438188 (+) | 342 | WP_013730169.1 | hypothetical protein | - |
| YB51_RS02290 (YB51_2165) | - | 438188..438667 (+) | 480 | WP_013730170.1 | siphovirus Gp157 family protein | - |
| YB51_RS02295 (YB51_2170) | - | 438664..438909 (+) | 246 | WP_013730171.1 | hypothetical protein | - |
| YB51_RS02300 (YB51_2175) | - | 438890..439354 (+) | 465 | WP_013730172.1 | hypothetical protein | - |
| YB51_RS02305 (YB51_2180) | - | 439323..440495 (+) | 1173 | WP_013730173.1 | DEAD/DEAH box helicase family protein | - |
| YB51_RS02310 (YB51_2185) | - | 440570..440836 (+) | 267 | WP_226960629.1 | hypothetical protein | - |
| YB51_RS02315 (YB51_2190) | - | 440846..441544 (+) | 699 | WP_013730175.1 | ERF family protein | - |
| YB51_RS02320 (YB51_2195) | - | 441544..441840 (+) | 297 | WP_022540564.1 | hypothetical protein | - |
| YB51_RS02325 (YB51_2200) | ssbA | 441837..442235 (+) | 399 | WP_022540565.1 | single-stranded DNA-binding protein | Machinery gene |
| YB51_RS02330 (YB51_2205) | - | 442246..443073 (+) | 828 | WP_013730177.1 | bifunctional DNA primase/polymerase | - |
| YB51_RS02335 (YB51_2210) | - | 443057..444436 (+) | 1380 | WP_079253058.1 | virulence-associated E family protein | - |
| YB51_RS10655 (YB51_2215) | - | 444814..444969 (+) | 156 | WP_013730179.1 | hypothetical protein | - |
| YB51_RS02340 (YB51_2220) | - | 444966..445274 (+) | 309 | WP_013730180.1 | DUF1372 family protein | - |
| YB51_RS02345 (YB51_2225) | - | 445276..445491 (+) | 216 | WP_013730181.1 | hypothetical protein | - |
| YB51_RS02350 (YB51_2230) | - | 445680..445994 (+) | 315 | WP_013730182.1 | hypothetical protein | - |
| YB51_RS02355 (YB51_2235) | - | 446067..446501 (+) | 435 | WP_002937915.1 | DUF1492 domain-containing protein | - |
| YB51_RS02360 (YB51_2240) | - | 446598..446945 (+) | 348 | WP_013730183.1 | HNH endonuclease | - |
| YB51_RS02365 (YB51_2245) | - | 447150..447395 (+) | 246 | WP_022540566.1 | hypothetical protein | - |
| YB51_RS02370 (YB51_2250) | - | 447392..448657 (+) | 1266 | WP_013730185.1 | phage portal protein | - |
| YB51_RS10660 (YB51_2255) | - | 448650..449870 (+) | 1221 | WP_013730186.1 | hypothetical protein | - |
| YB51_RS02380 (YB51_2260) | - | 449875..450108 (+) | 234 | WP_022540567.1 | hypothetical protein | - |
| YB51_RS02385 (YB51_2270) | - | 450809..451033 (-) | 225 | Protein_430 | DUF1492 domain-containing protein | - |
| YB51_RS02390 (YB51_2275) | - | 451813..452445 (-) | 633 | WP_004194123.1 | YigZ family protein | - |
| YB51_RS02395 (YB51_2280) | comFA/cflA | 452502..453794 (+) | 1293 | WP_004194121.1 | DEAD/DEAH box helicase | Machinery gene |
Sequence
Protein
Length: 430 a.a.; Molecular weight: 48868.45 Da; Isoelectric point: 9.2762
>NTDB_id=62441 YB51_RS02395 WP_004194121.1 452502..453794(+) (comFA/cflA) [Streptococcus suis YB51]
MKELENYYGRLFTKYQLTAKEREIAEKVPSITKKNNCFRCGTTFKEENKLPNDAYYCRACLLLGRVRSDEKLYHFPQKDF
PITKCLKWKGQLTDWQQRISDGLVANVENNRATLVHAVTGAGKTEMIYHTVASVIDKGGAVCLASPRIDVCIELYKRLQN
DFSVPISLLHGESEPYFRTPLVVATTHQLLKFYQAFDLVLIDEVDAFPYADNPMLYQAADNAVKEAGVQVFLTATSTDEL
DKKVRTGKLSRLSLPRRFHGNPLVVPQKVWFSKFDDTLKKNRLVPKLKKAIEEQRKSGFPLLIFVPEISKGQEFTKIMKK
TFPEETIGFVSSQTENRLEIVEGFRKREITVLISTTILERGVTFPCVDVFVVQANHYLYTASSLVQIAGRVGRSIERPTG
LLQFYHEGSTGAIEKAIAEIKQMNKEAGYV
Nucleotide
Length: 1293 bp
>NTDB_id=62441 YB51_RS02395 WP_004194121.1 452502..453794(+) (comFA/cflA) [Streptococcus suis YB51]
ATGAAAGAATTAGAAAATTATTATGGAAGATTATTTACCAAATACCAATTGACAGCAAAAGAAAGAGAAATAGCAGAAAA
AGTGCCAAGTATTACAAAAAAGAATAACTGCTTTCGCTGTGGAACAACTTTTAAAGAAGAAAACAAATTGCCAAACGATG
CTTATTACTGTCGAGCCTGCTTGCTTCTAGGCAGAGTACGGTCAGACGAAAAACTCTATCATTTTCCTCAGAAAGATTTT
CCAATCACTAAGTGTTTAAAGTGGAAAGGTCAACTAACTGATTGGCAACAAAGAATTTCAGATGGACTAGTTGCAAACGT
GGAAAATAATCGTGCGACATTGGTTCATGCAGTAACAGGAGCAGGTAAGACAGAAATGATCTACCACACCGTTGCCTCAG
TGATTGATAAAGGCGGAGCGGTTTGCCTAGCCAGTCCTCGAATTGATGTTTGTATCGAACTCTATAAACGTCTGCAAAAT
GACTTTTCAGTTCCAATTAGTTTACTACATGGAGAGTCTGAACCCTATTTCCGAACCCCATTAGTTGTAGCAACCACACA
TCAGTTATTAAAATTTTATCAGGCCTTTGATTTGGTTTTGATTGATGAAGTAGACGCCTTTCCCTATGCAGATAATCCCA
TGCTCTATCAAGCAGCAGACAATGCGGTCAAGGAAGCCGGTGTTCAAGTTTTTCTGACAGCGACTTCAACAGATGAATTG
GATAAAAAAGTCAGAACAGGTAAATTAAGTCGTCTTAGTTTGCCAAGGCGCTTTCATGGCAACCCACTTGTTGTCCCGCA
AAAAGTCTGGTTTAGTAAATTCGATGATACCCTAAAGAAAAATAGACTAGTCCCAAAGTTGAAAAAAGCGATTGAAGAAC
AGAGAAAGTCGGGCTTTCCCTTACTCATTTTTGTCCCAGAAATCTCCAAAGGTCAAGAATTTACCAAGATAATGAAAAAA
ACATTCCCAGAAGAAACAATTGGCTTTGTATCCAGTCAAACAGAAAATCGCCTTGAAATAGTTGAAGGGTTTCGCAAGAG
AGAAATCACAGTCTTAATCTCGACTACTATTCTTGAACGTGGGGTGACCTTCCCATGTGTAGACGTCTTTGTTGTTCAAG
CTAATCATTACCTCTACACAGCGTCAAGTCTTGTTCAGATTGCAGGCCGGGTCGGAAGGAGTATAGAACGTCCGACTGGT
TTACTTCAGTTTTATCATGAGGGAAGTACAGGAGCCATTGAAAAGGCAATCGCTGAAATTAAACAGATGAACAAGGAGGC
TGGTTATGTCTAA
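The lengths reported in this record can be cross-checked against the coordinates. A short sketch in Python, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon (the sequence above ends in TAA); the helper names are hypothetical:

```python
def span_length(start, end):
    """Length in bp of a 1-based inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp):
    """Number of amino acids encoded by a CDS that includes its stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_bp // 3 - 1


cds_bp = span_length(452502, 453794)
print(cds_bp, protein_length(cds_bp))  # 1293 430
```

Both values agree with the stated nucleotide length (1293 bp) and protein length (430 a.a.).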
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA/cflA | Streptococcus mitis NCTC 12261 | 67.053 | 100 | 0.672 |
| comFA/cflA | Streptococcus pneumoniae Rx1 | 66.357 | 100 | 0.665 |
| comFA/cflA | Streptococcus pneumoniae D39 | 66.357 | 100 | 0.665 |
| comFA/cflA | Streptococcus pneumoniae R6 | 66.357 | 100 | 0.665 |
| comFA/cflA | Streptococcus pneumoniae TIGR4 | 66.357 | 100 | 0.665 |
| comFA/cflA | Streptococcus mitis SK321 | 65.893 | 100 | 0.66 |
| comFA | Lactococcus lactis subsp. cremoris KW2 | 54.156 | 92.326 | 0.5 |
| comFA | Latilactobacillus sakei subsp. sakei 23K | 38.051 | 100 | 0.381 |
| comFA | Bacillus subtilis subsp. subtilis str. 168 | 37.59 | 96.512 | 0.363 |