Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | GH742_RS04210 | Genome accession | NZ_CP045732 |
| Coordinates | 889854..891365 (-) | Length | 503 a.a. |
| NCBI ID | WP_203456237.1 | Uniprot ID | - |
| Organism | Legionella sp. MW5194 | ||
| Function | DNA binding (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 884854..896365
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GH742_RS04175 (GH742_04160) | - | 885360..885581 (-) | 222 | WP_203456230.1 | cold-shock protein | - |
| GH742_RS04180 (GH742_04165) | - | 885830..886792 (-) | 963 | WP_203456231.1 | metal-dependent hydrolase | - |
| GH742_RS04185 (GH742_04170) | - | 886911..887336 (+) | 426 | WP_203456232.1 | HIT domain-containing protein | - |
| GH742_RS04190 (GH742_04175) | - | 887442..888005 (+) | 564 | WP_203456233.1 | YqgE/AlgH family protein | - |
| GH742_RS04195 (GH742_04180) | ruvX | 888038..888457 (+) | 420 | WP_203456234.1 | Holliday junction resolvase RuvX | - |
| GH742_RS04200 (GH742_04185) | - | 888468..889388 (+) | 921 | WP_203456235.1 | aspartate carbamoyltransferase catalytic subunit | - |
| GH742_RS04205 (GH742_04190) | - | 889371..889781 (-) | 411 | WP_203456236.1 | YidH family protein | - |
| GH742_RS04210 (GH742_04195) | comM | 889854..891365 (-) | 1512 | WP_203456237.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| GH742_RS04215 (GH742_04200) | - | 891440..891691 (-) | 252 | WP_203456238.1 | accessory factor UbiK family protein | - |
| GH742_RS04220 (GH742_04205) | - | 891791..892132 (+) | 342 | WP_058525665.1 | P-II family nitrogen regulator | - |
| GH742_RS04225 (GH742_04210) | - | 892129..892596 (-) | 468 | WP_203456239.1 | EVE domain-containing protein | - |
| GH742_RS04230 (GH742_04215) | - | 892596..893168 (-) | 573 | WP_203456240.1 | 5-formyltetrahydrofolate cyclo-ligase | - |
| GH742_RS04235 (GH742_04220) | - | 893362..893556 (+) | 195 | WP_108292377.1 | PA3496 family putative envelope integrity protein | - |
| GH742_RS04240 (GH742_04225) | - | 893559..894371 (+) | 813 | WP_203456241.1 | aminotransferase class IV | - |
| GH742_RS04245 (GH742_04230) | - | 894804..895049 (-) | 246 | WP_203456242.1 | hypothetical protein | - |
| GH742_RS04250 (GH742_04235) | - | 895321..895593 (+) | 273 | WP_239005275.1 | hypothetical protein | - |
| GH742_RS04255 (GH742_04240) | - | 895594..895974 (+) | 381 | WP_203456243.1 | hypothetical protein | - |
| GH742_RS04260 (GH742_04245) | - | 895946..896263 (+) | 318 | WP_203456244.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 503 a.a. Molecular weight: 55077.52 Da Isoelectric Point: 8.5586
>NTDB_id=396424 GH742_RS04210 WP_203456237.1 889854..891365(-) (comM) [Legionella sp. MW5194]
MNLALTYTRSAQGIHARAVHVEVHLSNGLPQFTIVGLAETAVKESKDRVRSAIINSQFEFPCRKITVNLAPADLPKTGSG
FDLPIALGILAASGQIPSEGLTSHEFISELALSGELRGHTPIIPSVMAARREQRRLIIAEANAREAALTAYDQVFSAGNL
RQVCDYLLHQTPLNAMPALPPLPAVEPGLDWSDVKGQYHAKQAMAIAACGGHSLLLSGPPGSGKTMLAKRFTTLLPELSE
TQALECAAIHSLRGKVPPYEHWRIPPFRSPHHTASPVALVGGGSPPKPGEISLAHHGILFLDELPEFPKQVLETLRQPLE
SGLISISRAAMQTDFPAEFQFIAAMNPCPCGQWGNPRANCLCTPERIKRYLGKLSAPLLDRIDMQVNVQALSQQELLKAS
PTREGESQRIRLQVQESRAVQISRQGKLNARLDSKTCEQVCYLGREEKVFLSQVLDTLQLSARAYHRFLKVARTIADMNG
EEQVKRPALQQALSFKQCLQMPQ
MNLALTYTRSAQGIHARAVHVEVHLSNGLPQFTIVGLAETAVKESKDRVRSAIINSQFEFPCRKITVNLAPADLPKTGSG
FDLPIALGILAASGQIPSEGLTSHEFISELALSGELRGHTPIIPSVMAARREQRRLIIAEANAREAALTAYDQVFSAGNL
RQVCDYLLHQTPLNAMPALPPLPAVEPGLDWSDVKGQYHAKQAMAIAACGGHSLLLSGPPGSGKTMLAKRFTTLLPELSE
TQALECAAIHSLRGKVPPYEHWRIPPFRSPHHTASPVALVGGGSPPKPGEISLAHHGILFLDELPEFPKQVLETLRQPLE
SGLISISRAAMQTDFPAEFQFIAAMNPCPCGQWGNPRANCLCTPERIKRYLGKLSAPLLDRIDMQVNVQALSQQELLKAS
PTREGESQRIRLQVQESRAVQISRQGKLNARLDSKTCEQVCYLGREEKVFLSQVLDTLQLSARAYHRFLKVARTIADMNG
EEQVKRPALQQALSFKQCLQMPQ
Nucleotide
Download Length: 1512 bp
>NTDB_id=396424 GH742_RS04210 WP_203456237.1 889854..891365(-) (comM) [Legionella sp. MW5194]
ATGAATCTCGCGTTAACCTATACCCGTAGCGCACAGGGCATTCATGCCAGGGCGGTACATGTCGAAGTGCATTTATCCAA
TGGTCTGCCGCAATTTACCATCGTCGGTCTTGCTGAAACTGCCGTGAAGGAAAGCAAGGATCGCGTTCGCAGCGCCATCA
TTAACAGTCAATTCGAATTTCCCTGCCGTAAAATCACCGTCAATCTTGCGCCGGCTGATTTACCCAAAACCGGCAGCGGC
TTTGATTTACCCATTGCCTTGGGCATCCTGGCTGCATCCGGGCAAATTCCCTCGGAAGGATTGACCTCCCATGAGTTTAT
CAGTGAGTTGGCGTTAAGCGGTGAATTACGCGGCCATACACCCATTATTCCCAGCGTGATGGCTGCCCGCCGCGAGCAGC
GGCGCTTGATCATTGCCGAAGCAAATGCACGTGAAGCCGCGCTTACCGCCTACGATCAGGTATTCAGTGCGGGCAACCTG
CGGCAGGTGTGTGATTATCTTCTTCACCAGACACCGCTTAATGCGATGCCGGCTCTGCCTCCCCTTCCTGCCGTTGAGCC
GGGATTGGATTGGTCAGATGTTAAAGGGCAGTACCACGCCAAACAGGCCATGGCCATCGCAGCCTGCGGCGGGCATAGTC
TTTTATTAAGCGGTCCACCGGGCAGCGGTAAAACCATGCTGGCGAAACGCTTCACCACCCTGCTTCCAGAATTGAGCGAA
ACGCAGGCTTTAGAATGCGCTGCGATCCATTCGCTTCGCGGCAAAGTGCCACCCTATGAACACTGGCGCATTCCTCCTTT
CCGCTCCCCTCATCATACAGCCTCACCCGTCGCGCTGGTGGGCGGAGGCAGTCCGCCCAAACCCGGGGAAATTTCATTAG
CACACCACGGCATCCTTTTTCTTGATGAGTTGCCTGAGTTCCCTAAACAGGTGTTGGAGACCCTGCGCCAACCCCTTGAA
TCAGGCCTTATTTCCATTTCACGTGCCGCCATGCAGACTGATTTTCCGGCCGAGTTCCAGTTCATTGCCGCCATGAATCC
CTGCCCCTGTGGCCAATGGGGCAATCCCAGGGCCAATTGCCTGTGTACCCCGGAACGAATTAAACGCTACCTCGGGAAAT
TGTCCGCTCCGCTTCTCGATCGCATTGACATGCAGGTGAATGTGCAGGCATTGTCACAACAGGAATTACTCAAGGCGAGC
CCCACCCGGGAAGGCGAAAGTCAGCGCATTCGGCTTCAGGTGCAAGAGTCACGCGCCGTGCAAATCAGTCGGCAGGGCAA
GCTCAATGCCCGGCTTGACAGCAAAACCTGTGAACAGGTTTGTTATCTTGGCAGGGAAGAAAAGGTGTTCCTGTCACAGG
TACTGGACACACTGCAACTGTCTGCCCGCGCTTACCATCGCTTTTTGAAGGTAGCCCGTACCATTGCCGACATGAACGGT
GAAGAACAGGTTAAACGGCCCGCCCTGCAACAAGCCCTGTCGTTTAAACAGTGTTTGCAGATGCCGCAGTAA
ATGAATCTCGCGTTAACCTATACCCGTAGCGCACAGGGCATTCATGCCAGGGCGGTACATGTCGAAGTGCATTTATCCAA
TGGTCTGCCGCAATTTACCATCGTCGGTCTTGCTGAAACTGCCGTGAAGGAAAGCAAGGATCGCGTTCGCAGCGCCATCA
TTAACAGTCAATTCGAATTTCCCTGCCGTAAAATCACCGTCAATCTTGCGCCGGCTGATTTACCCAAAACCGGCAGCGGC
TTTGATTTACCCATTGCCTTGGGCATCCTGGCTGCATCCGGGCAAATTCCCTCGGAAGGATTGACCTCCCATGAGTTTAT
CAGTGAGTTGGCGTTAAGCGGTGAATTACGCGGCCATACACCCATTATTCCCAGCGTGATGGCTGCCCGCCGCGAGCAGC
GGCGCTTGATCATTGCCGAAGCAAATGCACGTGAAGCCGCGCTTACCGCCTACGATCAGGTATTCAGTGCGGGCAACCTG
CGGCAGGTGTGTGATTATCTTCTTCACCAGACACCGCTTAATGCGATGCCGGCTCTGCCTCCCCTTCCTGCCGTTGAGCC
GGGATTGGATTGGTCAGATGTTAAAGGGCAGTACCACGCCAAACAGGCCATGGCCATCGCAGCCTGCGGCGGGCATAGTC
TTTTATTAAGCGGTCCACCGGGCAGCGGTAAAACCATGCTGGCGAAACGCTTCACCACCCTGCTTCCAGAATTGAGCGAA
ACGCAGGCTTTAGAATGCGCTGCGATCCATTCGCTTCGCGGCAAAGTGCCACCCTATGAACACTGGCGCATTCCTCCTTT
CCGCTCCCCTCATCATACAGCCTCACCCGTCGCGCTGGTGGGCGGAGGCAGTCCGCCCAAACCCGGGGAAATTTCATTAG
CACACCACGGCATCCTTTTTCTTGATGAGTTGCCTGAGTTCCCTAAACAGGTGTTGGAGACCCTGCGCCAACCCCTTGAA
TCAGGCCTTATTTCCATTTCACGTGCCGCCATGCAGACTGATTTTCCGGCCGAGTTCCAGTTCATTGCCGCCATGAATCC
CTGCCCCTGTGGCCAATGGGGCAATCCCAGGGCCAATTGCCTGTGTACCCCGGAACGAATTAAACGCTACCTCGGGAAAT
TGTCCGCTCCGCTTCTCGATCGCATTGACATGCAGGTGAATGTGCAGGCATTGTCACAACAGGAATTACTCAAGGCGAGC
CCCACCCGGGAAGGCGAAAGTCAGCGCATTCGGCTTCAGGTGCAAGAGTCACGCGCCGTGCAAATCAGTCGGCAGGGCAA
GCTCAATGCCCGGCTTGACAGCAAAACCTGTGAACAGGTTTGTTATCTTGGCAGGGAAGAAAAGGTGTTCCTGTCACAGG
TACTGGACACACTGCAACTGTCTGCCCGCGCTTACCATCGCTTTTTGAAGGTAGCCCGTACCATTGCCGACATGAACGGT
GAAGAACAGGTTAAACGGCCCGCCCTGCAACAAGCCCTGTCGTTTAAACAGTGTTTGCAGATGCCGCAGTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Legionella pneumophila str. Paris |
70.775 |
100 |
0.708 |
| comM | Legionella pneumophila strain ERS1305867 |
70.775 |
100 |
0.708 |
| comM | Vibrio cholerae strain A1552 |
51.509 |
98.807 |
0.509 |
| comM | Haemophilus influenzae Rd KW20 |
49.704 |
100 |
0.501 |
| comM | Vibrio campbellii strain DS40M4 |
50.602 |
99.006 |
0.501 |
| comM | Glaesserella parasuis strain SC1401 |
49.9 |
99.205 |
0.495 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
43.083 |
100 |
0.433 |