Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | G3M63_RS05525 | Genome accession | NZ_CP048629 |
| Coordinates | 1197452..1198957 (-) | Length | 501 a.a. |
| NCBI ID | WP_163982104.1 | Uniprot ID | A0A6C0TNR6 |
| Organism | Pseudomonas sp. OIL-1 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| ICE | 1193568..1204586 | 1197452..1198957 | within | 0 |
Gene organization within MGE regions
Location: 1193568..1204586
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| G3M63_RS05515 (G3M63_05515) | rep | 1193568..1195574 (-) | 2007 | WP_150300044.1 | DNA helicase Rep | - |
| G3M63_RS05520 (G3M63_05520) | - | 1195755..1197443 (+) | 1689 | WP_150300045.1 | bifunctional diguanylate cyclase/phosphodiesterase | - |
| G3M63_RS05525 (G3M63_05525) | comM | 1197452..1198957 (-) | 1506 | WP_163982104.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| G3M63_RS05530 (G3M63_05530) | - | 1198988..1199245 (-) | 258 | WP_150300047.1 | accessory factor UbiK family protein | - |
| G3M63_RS05535 (G3M63_05535) | glnK | 1199788..1200126 (+) | 339 | WP_093396590.1 | P-II family nitrogen regulator | - |
| G3M63_RS05540 (G3M63_05540) | - | 1200173..1201417 (+) | 1245 | WP_178100951.1 | ammonium transporter | - |
| G3M63_RS05545 (G3M63_05545) | - | 1201511..1201927 (+) | 417 | WP_163982105.1 | secondary thiamine-phosphate synthase enzyme YjbQ | - |
| G3M63_RS05550 (G3M63_05550) | - | 1201979..1202434 (+) | 456 | WP_150300050.1 | hypothetical protein | - |
| G3M63_RS05555 (G3M63_05555) | - | 1202611..1202922 (+) | 312 | WP_150300051.1 | hypothetical protein | - |
| G3M63_RS05560 (G3M63_05560) | - | 1202926..1203648 (-) | 723 | WP_163982106.1 | HAD family hydrolase | - |
| G3M63_RS05565 (G3M63_05565) | xerC | 1203645..1204568 (-) | 924 | WP_163984191.1 | tyrosine recombinase XerC | - |
Sequence
Protein
Download Length: 501 a.a. Molecular weight: 54028.91 Da Isoelectric Point: 7.5388
>NTDB_id=421965 G3M63_RS05525 WP_163982104.1 1197452..1198957(-) (comM) [Pseudomonas sp. OIL-1]
MSLAIVHSRAKLGVQAPAVTVEVHLANGMPSLTLVGLPETAVRESKDRVRSALQNARLEFPGLKRITINLAPADLPKEGG
RFDLAIALGLLSASGQIPADVLDQCECLGELALSGELRPVQGVLPAALACRDQGRTLLLPRANAMEAALVRGLHIVAADH
LLELCEHLRGERTLEPVSPNPAPELPRYHADLDEVQGQAQAKRALIVAASGGHNLLLFGPPGTGKTLLASRLPGILPQLV
EGEALEVAAIQSVCGQGPLQTWPTRPFRAPHHGASAAALVGGGSQPRPGEISQAHRGVLFLDELPEFDRKVLEMLREPME
SGEIHIARAREQLSFPARFQLIAAMNPCPCGYQGDPSGRCRCSAEQVERYRGKLSGPLLDRIDLHVAMHNEAFSLANKEP
GKPNSAEAAAQVAAARQQQQQRQGRLNAHLDLAGLRDHCPLKDEDARWMERAIEQLGLSHRASHRALKVARTLADLDNQT
AIGRAHLAEALQYRQLDRRPS
MSLAIVHSRAKLGVQAPAVTVEVHLANGMPSLTLVGLPETAVRESKDRVRSALQNARLEFPGLKRITINLAPADLPKEGG
RFDLAIALGLLSASGQIPADVLDQCECLGELALSGELRPVQGVLPAALACRDQGRTLLLPRANAMEAALVRGLHIVAADH
LLELCEHLRGERTLEPVSPNPAPELPRYHADLDEVQGQAQAKRALIVAASGGHNLLLFGPPGTGKTLLASRLPGILPQLV
EGEALEVAAIQSVCGQGPLQTWPTRPFRAPHHGASAAALVGGGSQPRPGEISQAHRGVLFLDELPEFDRKVLEMLREPME
SGEIHIARAREQLSFPARFQLIAAMNPCPCGYQGDPSGRCRCSAEQVERYRGKLSGPLLDRIDLHVAMHNEAFSLANKEP
GKPNSAEAAAQVAAARQQQQQRQGRLNAHLDLAGLRDHCPLKDEDARWMERAIEQLGLSHRASHRALKVARTLADLDNQT
AIGRAHLAEALQYRQLDRRPS
Nucleotide
Download Length: 1506 bp
>NTDB_id=421965 G3M63_RS05525 WP_163982104.1 1197452..1198957(-) (comM) [Pseudomonas sp. OIL-1]
ATGTCCCTGGCGATTGTCCACAGCCGCGCCAAACTAGGCGTGCAGGCGCCTGCCGTCACCGTTGAGGTGCATCTGGCCAA
TGGAATGCCCAGCCTGACACTGGTCGGGTTGCCTGAAACCGCCGTGCGCGAAAGCAAGGATCGCGTACGCAGCGCCCTGC
AGAATGCCCGCCTCGAATTCCCCGGCCTGAAGCGCATCACCATCAATCTCGCGCCCGCTGACCTGCCCAAGGAAGGCGGA
CGCTTCGATCTGGCGATTGCACTGGGCCTGTTATCCGCCAGCGGTCAGATTCCCGCCGACGTGCTGGACCAGTGCGAATG
CCTGGGCGAGCTGGCGCTGTCCGGCGAGTTACGCCCGGTTCAGGGCGTGTTGCCAGCGGCGCTGGCGTGTCGGGACCAGG
GCCGCACGCTGTTGTTACCACGCGCCAATGCCATGGAAGCGGCATTAGTCCGGGGCTTGCACATTGTCGCCGCCGACCAC
CTGTTGGAGCTGTGCGAACACCTGCGCGGCGAACGCACCCTCGAACCAGTATCACCAAACCCTGCGCCAGAGCTGCCTCG
CTACCATGCCGACCTTGACGAGGTGCAGGGTCAGGCGCAGGCCAAACGCGCGCTGATCGTCGCGGCCAGCGGCGGTCACA
ATTTACTTTTATTTGGCCCTCCGGGTACGGGGAAAACCCTGCTGGCGAGTCGCCTACCCGGAATCCTCCCGCAGCTGGTC
GAGGGCGAAGCACTGGAAGTGGCGGCCATCCAATCGGTGTGTGGGCAGGGCCCGCTACAGACCTGGCCAACACGGCCCTT
CCGTGCGCCCCATCATGGTGCATCTGCTGCAGCGCTAGTCGGCGGCGGCAGTCAGCCGCGCCCCGGAGAGATCAGCCAGG
CCCACCGGGGCGTACTGTTTCTCGACGAATTGCCCGAGTTTGATCGCAAGGTCCTTGAGATGCTGCGTGAGCCGATGGAG
TCAGGCGAGATACACATCGCTCGCGCACGTGAACAACTGAGCTTCCCTGCCCGTTTTCAATTGATCGCAGCGATGAACCC
ATGCCCCTGCGGATACCAGGGAGACCCATCCGGACGCTGTCGCTGCTCCGCTGAGCAAGTGGAACGCTACCGCGGCAAAC
TGTCCGGCCCGCTGCTGGACCGGATCGACCTGCACGTCGCCATGCACAATGAAGCCTTCAGCCTTGCCAACAAAGAGCCT
GGCAAACCCAACAGCGCGGAAGCCGCCGCCCAGGTAGCCGCTGCACGCCAACAGCAGCAACAGCGCCAGGGTAGATTGAA
CGCGCATCTGGATCTGGCGGGCTTGCGTGACCATTGCCCGTTGAAGGATGAGGATGCACGCTGGATGGAGCGCGCTATAG
AGCAGCTGGGTTTATCCCACCGCGCCAGCCATCGGGCATTGAAAGTGGCGCGGACATTGGCGGATCTTGATAATCAGACG
GCTATCGGCCGAGCGCATCTGGCGGAGGCGTTGCAATACCGGCAACTGGATAGAAGGCCCAGCTAG
ATGTCCCTGGCGATTGTCCACAGCCGCGCCAAACTAGGCGTGCAGGCGCCTGCCGTCACCGTTGAGGTGCATCTGGCCAA
TGGAATGCCCAGCCTGACACTGGTCGGGTTGCCTGAAACCGCCGTGCGCGAAAGCAAGGATCGCGTACGCAGCGCCCTGC
AGAATGCCCGCCTCGAATTCCCCGGCCTGAAGCGCATCACCATCAATCTCGCGCCCGCTGACCTGCCCAAGGAAGGCGGA
CGCTTCGATCTGGCGATTGCACTGGGCCTGTTATCCGCCAGCGGTCAGATTCCCGCCGACGTGCTGGACCAGTGCGAATG
CCTGGGCGAGCTGGCGCTGTCCGGCGAGTTACGCCCGGTTCAGGGCGTGTTGCCAGCGGCGCTGGCGTGTCGGGACCAGG
GCCGCACGCTGTTGTTACCACGCGCCAATGCCATGGAAGCGGCATTAGTCCGGGGCTTGCACATTGTCGCCGCCGACCAC
CTGTTGGAGCTGTGCGAACACCTGCGCGGCGAACGCACCCTCGAACCAGTATCACCAAACCCTGCGCCAGAGCTGCCTCG
CTACCATGCCGACCTTGACGAGGTGCAGGGTCAGGCGCAGGCCAAACGCGCGCTGATCGTCGCGGCCAGCGGCGGTCACA
ATTTACTTTTATTTGGCCCTCCGGGTACGGGGAAAACCCTGCTGGCGAGTCGCCTACCCGGAATCCTCCCGCAGCTGGTC
GAGGGCGAAGCACTGGAAGTGGCGGCCATCCAATCGGTGTGTGGGCAGGGCCCGCTACAGACCTGGCCAACACGGCCCTT
CCGTGCGCCCCATCATGGTGCATCTGCTGCAGCGCTAGTCGGCGGCGGCAGTCAGCCGCGCCCCGGAGAGATCAGCCAGG
CCCACCGGGGCGTACTGTTTCTCGACGAATTGCCCGAGTTTGATCGCAAGGTCCTTGAGATGCTGCGTGAGCCGATGGAG
TCAGGCGAGATACACATCGCTCGCGCACGTGAACAACTGAGCTTCCCTGCCCGTTTTCAATTGATCGCAGCGATGAACCC
ATGCCCCTGCGGATACCAGGGAGACCCATCCGGACGCTGTCGCTGCTCCGCTGAGCAAGTGGAACGCTACCGCGGCAAAC
TGTCCGGCCCGCTGCTGGACCGGATCGACCTGCACGTCGCCATGCACAATGAAGCCTTCAGCCTTGCCAACAAAGAGCCT
GGCAAACCCAACAGCGCGGAAGCCGCCGCCCAGGTAGCCGCTGCACGCCAACAGCAGCAACAGCGCCAGGGTAGATTGAA
CGCGCATCTGGATCTGGCGGGCTTGCGTGACCATTGCCCGTTGAAGGATGAGGATGCACGCTGGATGGAGCGCGCTATAG
AGCAGCTGGGTTTATCCCACCGCGCCAGCCATCGGGCATTGAAAGTGGCGCGGACATTGGCGGATCTTGATAATCAGACG
GCTATCGGCCGAGCGCATCTGGCGGAGGCGTTGCAATACCGGCAACTGGATAGAAGGCCCAGCTAG
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio campbellii strain DS40M4 |
54.871 |
100 |
0.551 |
| comM | Vibrio cholerae strain A1552 |
54.8 |
99.8 |
0.547 |
| comM | Glaesserella parasuis strain SC1401 |
52.838 |
100 |
0.539 |
| comM | Haemophilus influenzae Rd KW20 |
52.871 |
100 |
0.533 |
| comM | Legionella pneumophila str. Paris |
49.602 |
100 |
0.497 |
| comM | Legionella pneumophila strain ERS1305867 |
49.602 |
100 |
0.497 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
46.365 |
100 |
0.471 |