Detailed information
Overview
| Name | comFA/cflA | Type | Machinery gene |
| Locus tag | PUW49_RS02355 | Genome accession | NZ_CP118054 |
| Coordinates | 462683..463984 (+) | Length | 433 a.a. |
| NCBI ID | WP_024052084.1 | Uniprot ID | A0AAW5TE88 |
| Organism | Streptococcus anginosus strain VSI37 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
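The coordinates and lengths in the overview can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention, and consistent with the 1302 bp reported for this gene) and that the coding sequence ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def span_length_bp(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


gene_bp = span_length_bp(462683, 463984)
print(gene_bp)                     # 1302
print(protein_length_aa(gene_bp))  # 433
```

Both values agree with the lengths given in this record (1302 bp and 433 a.a.).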
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| ICE | 393823..467219 | 462683..463984 | within | 0 |
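The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. The following is a minimal sketch, assuming 1-based inclusive coordinates; only the "within" label appears in this record, so the other labels ("upstream", "downstream", "overlapping") and the distance convention for non-contained genes are hypothetical and may differ from what VRprofile2 or the database actually reports.

```python
from typing import Tuple

Interval = Tuple[int, int]  # (start, end), 1-based inclusive (assumed)


def relative_position(gene: Interval, mge: Interval) -> Tuple[str, int]:
    """Classify a gene relative to an MGE and return (label, distance in bp)."""
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0                    # gene fully contained in the MGE
    if g_end < m_start:
        return "upstream", m_start - g_end    # hypothetical label
    if g_start > m_end:
        return "downstream", g_start - m_end  # hypothetical label
    return "overlapping", 0                   # hypothetical label: partial overlap


print(relative_position((462683, 463984), (393823, 467219)))  # ('within', 0)
```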
Gene organization within MGE regions
Location: 393823..467219
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| PUW49_RS02050 (PUW49_02050) | - | 394462..394779 (+) | 318 | WP_070583991.1 | hypothetical protein | - |
| PUW49_RS02055 (PUW49_02055) | - | 394772..395104 (+) | 333 | WP_003030124.1 | hypothetical protein | - |
| PUW49_RS02060 (PUW49_02060) | - | 395120..395905 (+) | 786 | WP_070583994.1 | DNA/RNA non-specific endonuclease | - |
| PUW49_RS02065 (PUW49_02065) | - | 395927..396385 (+) | 459 | WP_070583996.1 | hypothetical protein | - |
| PUW49_RS02070 (PUW49_02070) | mobP2 | 396397..398751 (+) | 2355 | WP_070583998.1 | MobP2 family relaxase | - |
| PUW49_RS02075 (PUW49_02075) | - | 399014..399274 (+) | 261 | WP_024052239.1 | hypothetical protein | - |
| PUW49_RS02080 (PUW49_02080) | - | 399295..405309 (+) | 6015 | WP_224785017.1 | PBECR4 domain-containing protein | - |
| PUW49_RS02085 (PUW49_02085) | - | 405681..407435 (+) | 1755 | WP_101750451.1 | DNA topoisomerase | - |
| PUW49_RS02090 (PUW49_02090) | - | 407432..409267 (+) | 1836 | WP_024052235.1 | ATP-dependent Clp protease ATP-binding subunit | - |
| PUW49_RS02095 (PUW49_02095) | - | 409356..409703 (+) | 348 | WP_274994864.1 | TrbC/VirB2 family protein | - |
| PUW49_RS02100 (PUW49_02100) | - | 409717..410775 (+) | 1059 | WP_070584005.1 | hypothetical protein | - |
| PUW49_RS02105 (PUW49_02105) | - | 410794..411546 (+) | 753 | WP_150888908.1 | peptidase | - |
| PUW49_RS02110 (PUW49_02110) | - | 411571..415050 (+) | 3480 | WP_150888910.1 | SspB-related isopeptide-forming adhesin | - |
| PUW49_RS02115 (PUW49_02115) | - | 415205..415390 (+) | 186 | WP_024052230.1 | helix-turn-helix domain-containing protein | - |
| PUW49_RS02120 (PUW49_02120) | - | 415453..418008 (+) | 2556 | WP_274994871.1 | SEC10/PgrA surface exclusion domain-containing protein | - |
| PUW49_RS02125 (PUW49_02125) | - | 418010..418432 (+) | 423 | WP_003030094.1 | single-stranded DNA-binding protein | - |
| PUW49_RS02130 (PUW49_02130) | - | 418694..418957 (+) | 264 | WP_003030093.1 | hypothetical protein | - |
| PUW49_RS02135 (PUW49_02135) | - | 418941..422090 (+) | 3150 | WP_150890706.1 | type IV secretory system conjugative DNA transfer family protein | - |
| PUW49_RS02140 (PUW49_02140) | - | 422225..422704 (+) | 480 | WP_024052227.1 | hypothetical protein | - |
| PUW49_RS02145 (PUW49_02145) | - | 422848..424842 (+) | 1995 | WP_224785025.1 | hypothetical protein | - |
| PUW49_RS02150 (PUW49_02150) | - | 424847..425128 (+) | 282 | WP_042754043.1 | BRCT domain-containing protein | - |
| PUW49_RS02155 (PUW49_02155) | - | 425292..425600 (+) | 309 | WP_003078005.1 | DUF5592 family protein | - |
| PUW49_RS02160 (PUW49_02160) | - | 425600..426250 (+) | 651 | WP_101750458.1 | hypothetical protein | - |
| PUW49_RS02165 (PUW49_02165) | - | 426262..428211 (+) | 1950 | WP_070654276.1 | virulence factor | - |
| PUW49_RS02170 (PUW49_02170) | - | 428228..428785 (+) | 558 | WP_024052221.1 | hypothetical protein | - |
| PUW49_RS02175 (PUW49_02175) | - | 428805..429083 (+) | 279 | WP_070811260.1 | hypothetical protein | - |
| PUW49_RS02180 (PUW49_02180) | - | 429099..430427 (+) | 1329 | WP_024052219.1 | CHAP domain-containing protein | - |
| PUW49_RS02185 (PUW49_02185) | - | 430451..431035 (+) | 585 | WP_070584026.1 | hypothetical protein | - |
| PUW49_RS02190 (PUW49_02190) | - | 431228..432046 (+) | 819 | WP_101750456.1 | ParA family protein | - |
| PUW49_RS02195 (PUW49_02195) | - | 432114..432365 (+) | 252 | WP_024052216.1 | hypothetical protein | - |
| PUW49_RS02200 (PUW49_02200) | - | 432381..432707 (+) | 327 | WP_024052215.1 | hypothetical protein | - |
| PUW49_RS02205 (PUW49_02205) | - | 432853..434160 (+) | 1308 | WP_070654275.1 | ISLre2 family transposase | - |
| PUW49_RS02210 (PUW49_02210) | gmk | 434574..435200 (+) | 627 | WP_274995243.1 | guanylate kinase | - |
| PUW49_RS02215 (PUW49_02215) | rpoZ | 435226..435540 (+) | 315 | WP_003031464.1 | DNA-directed RNA polymerase subunit omega | - |
| PUW49_RS02220 (PUW49_02220) | - | 435599..437980 (+) | 2382 | WP_274994882.1 | primosomal protein N' | - |
| PUW49_RS02225 (PUW49_02225) | fmt | 438054..438989 (+) | 936 | WP_024052110.1 | methionyl-tRNA formyltransferase | - |
| PUW49_RS02230 (PUW49_02230) | rsmB | 438979..440292 (+) | 1314 | WP_070583972.1 | 16S rRNA (cytosine(967)-C(5))-methyltransferase RsmB | - |
| PUW49_RS02235 (PUW49_02235) | - | 440311..441051 (+) | 741 | WP_022526377.1 | Stp1/IreP family PP2C-type Ser/Thr phosphatase | - |
| PUW49_RS02240 (PUW49_02240) | stkP | 441048..442922 (+) | 1875 | WP_024052108.1 | Stk1 family PASTA domain-containing Ser/Thr kinase | Regulator |
| PUW49_RS02245 (PUW49_02245) | liaF | 443183..443881 (+) | 699 | WP_024052107.1 | cell wall-active antibiotics response protein LiaF | - |
| PUW49_RS02250 (PUW49_02250) | - | 443878..444873 (+) | 996 | WP_024052106.1 | sensor histidine kinase | - |
| PUW49_RS02255 (PUW49_02255) | - | 444875..445501 (+) | 627 | WP_274994886.1 | response regulator transcription factor | - |
| PUW49_RS02260 (PUW49_02260) | - | 445548..446948 (+) | 1401 | WP_070241554.1 | Cof-type HAD-IIB family hydrolase | - |
| PUW49_RS02265 (PUW49_02265) | - | 446950..447321 (+) | 372 | WP_024052103.1 | S1 RNA-binding domain-containing protein | - |
| PUW49_RS02270 (PUW49_02270) | - | 447455..447772 (-) | 318 | WP_024052102.1 | DUF960 domain-containing protein | - |
| PUW49_RS02275 (PUW49_02275) | - | 447892..448425 (-) | 534 | WP_024052101.1 | DUF402 domain-containing protein | - |
| PUW49_RS02280 (PUW49_02280) | recX | 448503..449288 (-) | 786 | WP_042754014.1 | recombination regulator RecX | - |
| PUW49_RS02285 (PUW49_02285) | - | 449352..450128 (-) | 777 | WP_274994889.1 | aminoglycoside 3'-phosphotransferase | - |
| PUW49_RS02290 (PUW49_02290) | rlmD | 450185..451543 (+) | 1359 | WP_024052098.1 | 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD | - |
| PUW49_RS02295 (PUW49_02295) | - | 451545..451975 (+) | 431 | Protein_411 | cytidine deaminase | - |
| PUW49_RS02300 (PUW49_02300) | - | 452266..453198 (-) | 933 | WP_070241556.1 | nitronate monooxygenase | - |
| PUW49_RS02305 (PUW49_02305) | - | 453511..453744 (+) | 234 | WP_003025778.1 | DUF1858 domain-containing protein | - |
| PUW49_RS02310 (PUW49_02310) | - | 453744..455108 (+) | 1365 | WP_274994891.1 | DUF438 domain-containing protein | - |
| PUW49_RS02315 (PUW49_02315) | - | 455119..455373 (+) | 255 | WP_024052094.1 | DUF1912 family protein | - |
| PUW49_RS02320 (PUW49_02320) | - | 455775..456293 (+) | 519 | WP_224783676.1 | GNAT family protein | - |
| PUW49_RS02325 (PUW49_02325) | - | 456293..456856 (+) | 564 | WP_070241557.1 | shikimate kinase | - |
| PUW49_RS02330 (PUW49_02330) | - | 456869..459520 (+) | 2652 | WP_070815195.1 | valine--tRNA ligase | - |
| PUW49_RS02335 (PUW49_02335) | - | 459622..460038 (+) | 417 | WP_024052089.1 | hypothetical protein | - |
| PUW49_RS02340 (PUW49_02340) | dcm | 460195..460640 (+) | 446 | Protein_420 | DNA (cytosine-5-)-methyltransferase | - |
| PUW49_RS02345 (PUW49_02345) | cysK | 460962..461891 (-) | 930 | WP_024052086.1 | cysteine synthase A | - |
| PUW49_RS02350 (PUW49_02350) | - | 461991..462626 (-) | 636 | WP_024052085.1 | YigZ family protein | - |
| PUW49_RS02355 (PUW49_02355) | comFA/cflA | 462683..463984 (+) | 1302 | WP_024052084.1 | DEAD/DEAH box helicase | Machinery gene |
| PUW49_RS02360 (PUW49_02360) | comFC/cflB | 463981..464646 (+) | 666 | WP_024052083.1 | ComF family protein | Machinery gene |
| PUW49_RS02365 (PUW49_02365) | raiA | 464724..465266 (+) | 543 | WP_003025754.1 | ribosome-associated translation inhibitor RaiA | - |
| PUW49_RS02370 (PUW49_02370) | rsmD | 465455..466003 (+) | 549 | WP_171021853.1 | 16S rRNA (guanine(966)-N(2))-methyltransferase RsmD | - |
| PUW49_RS02375 (PUW49_02375) | coaD | 465993..466490 (+) | 498 | WP_021001374.1 | pantetheine-phosphate adenylyltransferase | - |
Sequence
Protein
Length: 433 a.a.; Molecular weight: 49050.81 Da; Isoelectric point: 8.0447
>NTDB_id=789657 PUW49_RS02355 WP_024052084.1 462683..463984(+) (comFA/cflA) [Streptococcus anginosus strain VSI37]
MTELQDCLGRIFTKNQLSPELQLQAQTLTGMVEEKGRLSCNRCGQAIDKEKHQLPIGAYYCRSCLILGRVRSDEDLYYFP
QEEFPKANVLKWQGKLTEFQAKVSQGLVEAVTKRKDSLVHAVTGAGKTEMIYQVVAQVINQGGAVCLASPRIDVCLELYR
RLKVDFTCDISLLHGESEAYSRSPLVIATTHQLIKFYRAFDLLIVDEVDAFPYVDNPMLYHAVHQAVKVEGTKIFLTATS
TDELDKKVAKGELTRLSLPRRFHGNPLIVPQKIWLADFQKNLGQKKLVPKLRQFIQKQRKTGFPLLIFASEIRKGQELAE
ILQSTFPNEEVDFVASTTENRLEIVEKFRQKEITILVTTTILERGVTFPCVDVFVVEANHRLFSRSALVQIAGRVGRSME
RPTGELIFFHDGTTMAIEKAIKEIQEMNQEAGL
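The FASTA header used for the sequences in this record packs several fields into one line. As a rough sketch, the header can be split into its parts with a regular expression. The field names (`ntdb_id`, `locus_tag`, and so on) are labels chosen here for illustration, and the pattern is inferred from this single header rather than from a documented format.

```python
import re

# Pattern inferred from the header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the expected layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=789657 PUW49_RS02355 WP_024052084.1 462683..463984(+) "
    "(comFA/cflA) [Streptococcus anginosus strain VSI37]"
)
info = parse_header(header)
print(info["locus_tag"], info["gene"], info["start"], info["end"], info["strand"])
```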
Nucleotide
Length: 1302 bp
>NTDB_id=789657 PUW49_RS02355 WP_024052084.1 462683..463984(+) (comFA/cflA) [Streptococcus anginosus strain VSI37]
ATGACAGAATTACAAGATTGTTTAGGTCGCATTTTTACAAAAAATCAACTGTCACCAGAATTGCAATTGCAAGCACAAAC
CTTAACTGGAATGGTAGAAGAAAAAGGGAGATTAAGCTGTAATCGCTGTGGACAAGCCATTGACAAAGAAAAACACCAAC
TGCCAATCGGTGCTTATTATTGTAGGTCTTGCTTGATCTTAGGAAGAGTTAGGAGTGATGAAGATCTCTACTATTTTCCA
CAGGAAGAATTTCCCAAAGCGAATGTCTTGAAATGGCAAGGAAAGTTGACAGAATTTCAAGCCAAGGTTTCTCAAGGACT
TGTAGAGGCGGTTACCAAACGCAAAGATAGCTTGGTTCACGCAGTCACGGGAGCCGGAAAGACGGAAATGATCTATCAAG
TGGTGGCACAAGTCATCAATCAGGGCGGAGCAGTCTGCTTGGCTAGCCCCAGAATTGATGTCTGCTTAGAACTTTATCGC
AGACTGAAAGTGGATTTTACTTGTGATATTTCACTCCTGCACGGTGAATCAGAAGCATATTCTCGCAGTCCTCTCGTGAT
TGCCACCACACATCAGCTTATCAAATTTTACCGAGCATTTGATCTTCTTATTGTTGATGAGGTAGATGCCTTTCCTTATG
TGGACAATCCGATGCTTTATCATGCAGTTCATCAGGCAGTCAAAGTAGAGGGGACGAAGATTTTCTTAACAGCAACTTCC
ACAGATGAGCTGGATAAAAAAGTGGCTAAAGGAGAATTAACTCGTTTGAGTCTACCCAGACGTTTTCATGGCAATCCTTT
GATTGTTCCGCAAAAAATTTGGTTGGCGGATTTTCAAAAAAATCTTGGTCAAAAGAAGCTAGTTCCCAAGTTAAGGCAGT
TTATTCAAAAGCAGAGAAAAACAGGATTTCCACTTCTTATTTTTGCTTCCGAGATCAGAAAAGGGCAGGAGCTGGCAGAG
ATTCTTCAAAGCACTTTTCCTAATGAAGAAGTTGATTTTGTAGCCTCGACGACTGAAAATCGACTAGAAATTGTAGAGAA
ATTTCGTCAAAAAGAAATCACGATTTTGGTAACGACGACAATTTTGGAGCGTGGGGTGACTTTTCCTTGTGTAGATGTTT
TCGTGGTGGAAGCCAACCACCGTCTGTTTAGTCGCAGCGCTCTGGTACAAATTGCTGGACGCGTTGGTCGCAGTATGGAG
CGACCGACAGGCGAGTTAATCTTTTTTCATGACGGTACGACTATGGCGATAGAAAAAGCGATTAAAGAAATTCAGGAGAT
GAATCAGGAGGCCGGTTTATGA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA/cflA | Streptococcus pneumoniae Rx1 | 70.794 | 98.845 | 0.7 |
| comFA/cflA | Streptococcus pneumoniae D39 | 70.794 | 98.845 | 0.7 |
| comFA/cflA | Streptococcus pneumoniae R6 | 70.794 | 98.845 | 0.7 |
| comFA/cflA | Streptococcus pneumoniae TIGR4 | 70.561 | 98.845 | 0.697 |
| comFA/cflA | Streptococcus mitis NCTC 12261 | 70.892 | 98.383 | 0.697 |
| comFA/cflA | Streptococcus mitis SK321 | 70.188 | 98.383 | 0.691 |
| comFA | Lactococcus lactis subsp. cremoris KW2 | 55.276 | 91.917 | 0.508 |
| comFA | Latilactobacillus sakei subsp. sakei 23K | 40.092 | 100 | 0.402 |
| comFA | Bacillus subtilis subsp. subtilis str. 168 | 40.247 | 93.533 | 0.376 |