Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | clem_RS03375 | Genome accession | NZ_CP016397 |
| Coordinates | 767974..769485 (-) | Length | 503 a.a. |
| NCBI ID | WP_094090325.1 | UniProt ID | A0A222P0D1 |
| Organism | Legionella clemsonensis strain CDC-D5610 | | |
| Function | DNA binding (predicted from homology); DNA binding and uptake | | |
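The coordinates, nucleotide length, and protein length listed above are linked by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates of clem_RS03375 (comM) as given in the Overview table.
cds_bp = span_length(767974, 769485)
print(cds_bp, protein_length(cds_bp))  # prints: 1512 503
```

Both values agree with the Size (1512 bp) and Length (503 a.a.) reported for this locus.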
Genomic Context
Location: 762974..774485
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| clem_RS03350 (clem_03410) | - | 763884..764309 (+) | 426 | WP_094090320.1 | HIT domain-containing protein | - |
| clem_RS03355 (clem_03415) | - | 764283..765815 (-) | 1533 | WP_094090321.1 | FMN-binding glutamate synthase family protein | - |
| clem_RS03360 (clem_03420) | - | 766054..766617 (+) | 564 | WP_094090322.1 | YqgE/AlgH family protein | - |
| clem_RS03365 (clem_03425) | ruvX | 766640..767059 (+) | 420 | WP_094090323.1 | Holliday junction resolvase RuvX | - |
| clem_RS03370 (clem_03430) | - | 767063..767959 (+) | 897 | WP_094090324.1 | aspartate carbamoyltransferase catalytic subunit | - |
| clem_RS03375 (clem_03435) | comM | 767974..769485 (-) | 1512 | WP_094090325.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| clem_RS03380 (clem_03440) | ubiK | 769571..769825 (-) | 255 | WP_094090326.1 | ubiquinone biosynthesis accessory factor UbiK | - |
| clem_RS03385 (clem_03445) | - | 769919..770257 (+) | 339 | WP_094090327.1 | P-II family nitrogen regulator | - |
| clem_RS03390 (clem_03450) | - | 770263..770733 (-) | 471 | WP_232505548.1 | EVE domain-containing protein | - |
| clem_RS03395 (clem_03455) | - | 770730..771302 (-) | 573 | WP_094090329.1 | 5-formyltetrahydrofolate cyclo-ligase | - |
| clem_RS03400 (clem_03460) | - | 771496..771675 (+) | 180 | WP_094090330.1 | PA3496 family putative envelope integrity protein | - |
| clem_RS03405 (clem_03465) | - | 771678..772496 (+) | 819 | WP_094090331.1 | aminotransferase class IV | - |
| clem_RS03410 (clem_03470) | - | 772438..773130 (-) | 693 | WP_094090332.1 | TVP38/TMEM64 family protein | - |
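The Location range appears to be the comM coordinates extended by 5,000 bp on each side. That flank size is inferred from the numbers on this page and is not stated by the database. Under that assumption, a short sketch with a hypothetical helper reproduces the window and checks which of a few genes from the table lie entirely inside it:

```python
FLANK = 5000  # assumed flank size, inferred from the coordinates on this page


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a 1-based inclusive range by `flank` bp on each side."""
    return max(1, start - flank), end + flank


# (locus tag, start, end) for three rows copied from the table above.
genes = [
    ("clem_RS03350", 763884, 764309),
    ("clem_RS03375", 767974, 769485),
    ("clem_RS03410", 772438, 773130),
]

window = context_window(767974, 769485)
inside = [tag for tag, start, end in genes if window[0] <= start and end <= window[1]]
print(window)  # prints: (762974, 774485)
print(inside)
```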
Sequence
Protein
Length: 503 a.a.; Molecular weight: 54818.04 Da; Isoelectric point: 8.4888
>NTDB_id=187963 clem_RS03375 WP_094090325.1 767974..769485(-) (comM) [Legionella clemsonensis strain CDC-D5610]
MNLAFSKTRSTVGILAQSVSVEVHLSNGLPSFTIVGLAETAVKESKDRVRSAIINSQFEFPCRKITVNLAPADLPKSGSG
FDLPIAVGILAASGQLPVDKLATHEFISELALSGNLRGVSAIIPAVLAVRRDNQKLVIATANAAEASLAGYNDVFSANNL
REVCSYLCQNTPLKVLPARPETTYVNGKMDWSDIKGQYHAKRAMEIAACGGHSILLSGPPGSGKTMLAKRFATLLPDLSE
TQALECAAIKSIRGRLPDFNSWRSPPFRSPHHTASQVALVGGGNPPKPGEISLAHNGVLFLDELPEFHKQVLETLREPLE
SGNIWISRAATQIEFPAQFQLVAAMNPCPCGQWGNPQASCLCSPERITRYLAKLSAPLLDRIDMQITLQALTQEELIKPN
LTTAGESKRIRQTVEQVRARQLSRQNCINAQLDAKDCEEFCQLSQAEQGFLSEVMNQLKLSARAYHRLLKVARTIADMNN
LEKVDLSALQQALSFRQNLQLPK
Nucleotide
Length: 1512 bp
>NTDB_id=187963 clem_RS03375 WP_094090325.1 767974..769485(-) (comM) [Legionella clemsonensis strain CDC-D5610]
ATGAATCTCGCTTTTAGCAAAACGCGTAGTACTGTAGGTATACTCGCGCAGTCTGTTTCTGTCGAAGTCCATTTATCCAA
TGGCTTGCCCAGCTTCACAATTGTGGGGCTTGCCGAAACTGCTGTTAAGGAAAGCAAAGACAGAGTTCGTAGTGCAATCA
TTAATAGTCAATTTGAGTTTCCCTGTCGTAAAATCACAGTCAATCTTGCTCCTGCCGATTTACCCAAATCAGGGAGTGGT
TTTGATTTACCTATTGCCGTAGGTATTCTTGCAGCTTCAGGCCAGCTACCTGTAGATAAGTTAGCTACACATGAATTTAT
TAGTGAACTCGCCTTGAGTGGTAATTTGCGTGGTGTATCCGCTATCATTCCTGCAGTCCTGGCTGTGCGGCGGGATAATC
AAAAATTAGTAATTGCTACAGCTAATGCTGCAGAAGCCTCACTGGCAGGCTATAATGACGTGTTTAGTGCCAATAACTTG
CGCGAGGTGTGCAGTTATCTATGTCAAAATACACCGCTTAAAGTCCTACCTGCGCGTCCTGAAACTACCTATGTAAATGG
GAAAATGGATTGGTCTGATATTAAGGGTCAGTATCATGCAAAGCGAGCGATGGAAATTGCGGCTTGTGGAGGTCATAGTA
TTTTATTAAGCGGACCTCCCGGGAGTGGTAAAACCATGTTGGCCAAACGCTTCGCTACCCTCCTTCCAGATCTTAGCGAA
ACTCAAGCACTTGAATGTGCTGCCATTAAGTCCATTCGTGGACGGCTTCCAGATTTTAATAGCTGGCGCTCTCCACCATT
TCGTTCGCCACATCACACAGCCTCCCAAGTTGCACTAGTAGGTGGAGGTAATCCACCAAAGCCAGGGGAGATTTCACTGG
CCCATAATGGCGTATTATTTCTTGATGAATTGCCTGAGTTTCATAAGCAAGTACTGGAAACCTTACGTGAACCCCTGGAA
TCAGGGAATATCTGGATTTCCCGGGCAGCGACTCAAATTGAATTTCCTGCCCAATTTCAACTTGTTGCTGCGATGAATCC
TTGCCCTTGTGGTCAGTGGGGGAATCCTCAAGCAAGCTGTCTTTGTAGTCCTGAACGCATTACTCGTTATTTGGCAAAAT
TATCAGCTCCACTGCTTGACAGAATTGATATGCAAATAACCTTGCAAGCATTAACACAAGAGGAATTAATTAAACCCAAT
CTTACTACTGCAGGAGAAAGCAAACGGATCAGACAAACTGTTGAACAAGTTAGAGCGCGTCAGCTAAGCCGACAAAATTG
TATTAATGCCCAACTTGACGCTAAAGATTGTGAAGAATTCTGTCAATTAAGCCAGGCAGAACAAGGGTTTTTAAGTGAAG
TCATGAACCAGCTTAAATTATCGGCACGTGCCTACCACCGTCTTCTGAAAGTGGCAAGAACTATTGCCGATATGAATAAT
CTGGAGAAAGTAGACTTAAGTGCCCTGCAGCAAGCTTTATCGTTCAGGCAAAATTTACAACTACCGAAATGA
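As a consistency check, the nucleotide record can be translated and compared with the protein record. The sketch below assumes the standard genetic code and that the sequence is shown in coding orientation, which is plausible here because it starts with ATG and ends with TGA even though the gene lies on the minus strand. `translate` is an illustrative helper, not a database tool, and only the first and last few codons are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


head = translate("ATGAATCTCGCTTTTAGC")  # first 18 nt of the record
tail = translate("CTACCGAAATGA")        # last 12 nt of the record
print(head, tail)  # prints: MNLAFS LPK*
```

The translated ends match the protein record, which begins with MNLAFS and ends with LPK.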
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Legionella pneumophila str. Paris | 75.746 | 100 | 0.757 |
| comM | Legionella pneumophila strain ERS1305867 | 75.746 | 100 | 0.757 |
| comM | Vibrio cholerae strain A1552 | 51.509 | 98.807 | 0.509 |
| comM | Haemophilus influenzae Rd KW20 | 50.198 | 100 | 0.505 |
| comM | Vibrio campbellii strain DS40M4 | 50 | 99.404 | 0.497 |
| comM | Glaesserella parasuis strain SC1401 | 48.6 | 99.404 | 0.483 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 43.227 | 99.801 | 0.431 |