Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | GKQ51_RS20045 | Genome accession | NZ_CP066310 |
| Coordinates | 4234914..4236407 (+) | Length | 497 a.a. |
| NCBI ID | WP_198866785.1 | Uniprot ID | A0AAQ0BYZ0 |
| Organism | Azotobacter chroococcum strain HR1 |
| Function | DNA uptake (predicted from homology); DNA binding and uptake |
Genomic Context
Location: 4229914..4241407
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GKQ51_RS20010 (GKQ51_20010) | - | 4229969..4230670 (+) | 702 | WP_089169477.1 | HAD family hydrolase | - |
| GKQ51_RS20015 (GKQ51_20015) | - | 4230676..4231488 (+) | 813 | WP_198866783.1 | CPBP family intramembrane glutamic endopeptidase | - |
| GKQ51_RS20020 (GKQ51_20020) | sutA | 4231557..4231877 (-) | 321 | WP_039806498.1 | transcriptional regulator SutA | - |
| GKQ51_RS20025 (GKQ51_20025) | - | 4232000..4232425 (-) | 426 | WP_198866784.1 | secondary thiamine-phosphate synthase enzyme YjbQ | - |
| GKQ51_RS20030 (GKQ51_20030) | - | 4232582..4233895 (-) | 1314 | WP_089169480.1 | ammonium transporter | - |
| GKQ51_RS20035 (GKQ51_20035) | glnK | 4233928..4234266 (-) | 339 | WP_012703245.1 | P-II family nitrogen regulator | - |
| GKQ51_RS20040 (GKQ51_20040) | - | 4234607..4234882 (+) | 276 | WP_039806510.1 | accessory factor UbiK family protein | - |
| GKQ51_RS20045 (GKQ51_20045) | comM | 4234914..4236407 (+) | 1494 | WP_198866785.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| GKQ51_RS20050 (GKQ51_20050) | - | 4236513..4237376 (-) | 864 | WP_089169482.1 | SPFH domain-containing protein | - |
| GKQ51_RS20055 (GKQ51_20055) | - | 4237405..4237842 (-) | 438 | WP_198866786.1 | NfeD family protein | - |
| GKQ51_RS20060 (GKQ51_20060) | - | 4237953..4238834 (-) | 882 | WP_198866787.1 | LysR family transcriptional regulator | - |
| GKQ51_RS20065 (GKQ51_20065) | - | 4238952..4239374 (+) | 423 | WP_198866788.1 | DoxX family protein | - |
| GKQ51_RS20070 (GKQ51_20070) | - | 4239423..4240151 (+) | 729 | WP_198866789.1 | pirin family protein | - |
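The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the database itself: `parse_coords` is a hypothetical helper, and the 5000 bp flank is an assumption inferred from the fact that the listed Location (4229914..4241407) extends exactly 5000 bp beyond each end of the comM coordinates.

```python
import re


def parse_coords(text):
    """Parse a 'start..end (strand)' string into (start, end, strand).

    Hypothetical helper for illustration; coordinates are 1-based, inclusive.
    """
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)


# comM coordinates as listed in the Overview and Genomic Context tables.
start, end, strand = parse_coords("4234914..4236407 (+)")

# Inclusive coordinates, so the gene length is end - start + 1.
gene_bp = end - start + 1

# One codon (the stop codon) is not translated into an amino acid.
protein_aa = gene_bp // 3 - 1

# Assumed flank size, inferred from the Location line of this record.
FLANK_BP = 5000
window = (start - FLANK_BP, end + FLANK_BP)

print(gene_bp, protein_aa, window)
```

With the values in this record, the computed gene length, protein length, and window agree with the Size (bp), Length, and Location fields shown above.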
Sequence
Protein
Length: 497 a.a.; Molecular weight: 52151.93 Da; Isoelectric point: 7.3127
>NTDB_id=519263 GKQ51_RS20045 WP_198866785.1 4234914..4236407(+) (comM) [Azotobacter chroococcum strain HR1]
MSLAIVHSRAQVGVEAPAVTVEAHLANGLPALTLVGLPETAVRESKDRVRSAILTSGFDFPARRITLNLAPADLPKDGGR
FDLAIALGILAASEQLPAEALGNLECLGELALSGSLRPVRGVLPAALAARAAGRTLVVPRANAEEASLASGLNVLAVDHL
LELAAHLNGQSPLAPYQAQGLLRQTLPYPDLADVQGQAAAKRALLVAAAGSHNLLLSGPPGTGKTLLASRLPGLLPPLDE
GEALEVAAIHSVAGSAPLAAWPQRPFRQPHHSASGPALVGGGSRPRPGEITLAHQGVLFLDELPEFDRKVLEVLREPLES
GEIVIARASDKVRFPARFQLVAAMNPCPCGYLGDPAGRCRCTPEQIQRYRAKLSGPLLDRIDLHIGVTREATALGAPRLD
GPDSAGAAAQVAAARTLQLARQGCPNAFLDLPGLHQHCALDNEDRQWLERACERLGLSLRAAHRILKVARTLADLEAAPE
IARAHLAEALQYRASSA
Nucleotide
Length: 1494 bp
>NTDB_id=519263 GKQ51_RS20045 WP_198866785.1 4234914..4236407(+) (comM) [Azotobacter chroococcum strain HR1]
ATGTCCCTGGCCATCGTCCACAGCCGCGCCCAGGTGGGCGTCGAGGCGCCCGCCGTCACCGTCGAGGCGCATCTGGCCAA
CGGCCTGCCGGCGCTGACCCTGGTCGGCCTGCCGGAAACCGCGGTCCGCGAGAGCAAGGACCGCGTGCGCAGCGCCATCC
TCACCTCCGGCTTCGACTTCCCGGCGCGGCGCATCACCCTCAACCTGGCCCCCGCCGACCTGCCCAAGGACGGCGGACGC
TTCGACCTGGCCATCGCCCTGGGCATCCTCGCCGCCAGTGAGCAGTTGCCCGCCGAGGCCCTCGGCAACCTGGAGTGCCT
CGGCGAGCTGGCCCTCTCCGGCAGCCTGCGGCCGGTCCGGGGCGTGCTGCCCGCCGCGCTGGCCGCCCGTGCCGCCGGAC
GCACCCTGGTGGTGCCACGGGCCAACGCCGAGGAAGCCAGCCTGGCCTCGGGGCTGAACGTGCTGGCGGTCGACCACCTG
CTGGAGCTGGCCGCCCACCTGAACGGCCAGTCCCCGCTGGCGCCCTACCAGGCCCAGGGCCTGCTGCGCCAGACGCTGCC
CTACCCCGACCTTGCCGACGTGCAGGGCCAGGCCGCGGCCAAGCGCGCCCTGCTGGTGGCCGCCGCCGGCAGCCACAACC
TGCTGCTCAGCGGCCCGCCGGGAACCGGCAAGACCCTGCTGGCCAGCCGCCTGCCGGGACTGCTGCCACCGCTGGACGAG
GGCGAGGCGCTGGAGGTGGCGGCGATCCATTCGGTGGCCGGCAGCGCGCCGCTCGCCGCCTGGCCGCAGCGGCCGTTCCG
CCAGCCGCACCACAGCGCCTCGGGACCGGCGCTGGTCGGCGGCGGCAGCCGGCCGCGTCCCGGCGAGATCACCCTGGCGC
ACCAGGGCGTACTGTTCCTCGACGAGTTGCCGGAGTTCGACCGCAAGGTGCTGGAAGTGCTGCGCGAACCGCTGGAAAGC
GGCGAGATCGTCATCGCCCGGGCCAGCGACAAGGTGCGCTTTCCGGCACGCTTCCAGCTGGTGGCGGCCATGAACCCCTG
CCCCTGCGGCTACCTGGGCGACCCTGCCGGCCGCTGCCGCTGTACCCCGGAGCAGATCCAGCGCTACCGTGCCAAGCTGT
CCGGCCCGCTGCTCGACCGCATCGACCTGCACATCGGCGTCACCCGCGAGGCCACCGCCCTGGGCGCACCGCGCCTGGAC
GGTCCGGACAGCGCTGGCGCCGCGGCCCAGGTGGCGGCGGCGCGCACCCTTCAGCTGGCGCGCCAGGGCTGCCCCAATGC
GTTCCTCGATCTGCCCGGATTGCACCAGCACTGTGCACTGGACAACGAGGACCGCCAGTGGCTGGAACGCGCCTGCGAGC
GCCTCGGCCTGTCGCTGCGCGCCGCCCACCGCATTCTCAAGGTGGCGCGCACCCTGGCCGATCTCGAGGCGGCGCCGGAG
ATCGCCCGCGCTCACCTGGCCGAAGCCCTGCAGTACCGGGCCAGCAGCGCCTGA
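As a quick consistency check between the nucleotide and protein records, the first and last codons of the sequence above can be translated by hand or with a short script. This is a minimal sketch assuming the standard genetic code; `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 18 nt and last 9 nt of the comM nucleotide sequence listed above.
first_codons = "ATGTCCCTGGCCATCGTC"
last_codons = "AGCGCCTGA"

print(translate(first_codons))  # start of the protein
print(translate(last_codons))   # end of the protein plus the stop codon
```

The two fragments translate to the same residues that open (MSLAIV) and close (SA, followed by the stop) the protein sequence in this record.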
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio campbellii strain DS40M4 | 57.43 | 100 | 0.575 |
| comM | Vibrio cholerae strain A1552 | 56.827 | 100 | 0.569 |
| comM | Haemophilus influenzae Rd KW20 | 54.89 | 100 | 0.553 |
| comM | Glaesserella parasuis strain SC1401 | 54.6 | 100 | 0.549 |
| comM | Legionella pneumophila str. Paris | 50 | 100 | 0.515 |
| comM | Legionella pneumophila strain ERS1305867 | 50 | 100 | 0.515 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 45.527 | 100 | 0.461 |