Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | ACG10_RS19530 | Genome accession | NZ_CP011835 |
| Coordinates | 4167427..4168920 (+) | Length | 497 a.a. |
| NCBI ID | WP_089169481.1 | Uniprot ID | - |
| Organism | Azotobacter chroococcum strain B3 | | |
| Function | DNA uptake (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 4162427..4173920
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ACG10_RS19495 (ACG10_19115) | - | 4162530..4163231 (+) | 702 | WP_089169477.1 | HAD family hydrolase | - |
| ACG10_RS19500 (ACG10_19120) | - | 4163237..4164001 (+) | 765 | WP_089169478.1 | CPBP family intramembrane glutamic endopeptidase | - |
| ACG10_RS19505 (ACG10_19125) | sutA | 4164070..4164390 (-) | 321 | WP_039806498.1 | transcriptional regulator SutA | - |
| ACG10_RS19510 (ACG10_19130) | - | 4164513..4164938 (-) | 426 | WP_089169479.1 | secondary thiamine-phosphate synthase enzyme YjbQ | - |
| ACG10_RS19515 (ACG10_19135) | - | 4165095..4166408 (-) | 1314 | WP_089169480.1 | ammonium transporter | - |
| ACG10_RS19520 (ACG10_19140) | glnK | 4166441..4166779 (-) | 339 | WP_012703245.1 | P-II family nitrogen regulator | - |
| ACG10_RS19525 (ACG10_19145) | - | 4167120..4167395 (+) | 276 | WP_039806510.1 | accessory factor UbiK family protein | - |
| ACG10_RS19530 (ACG10_19150) | comM | 4167427..4168920 (+) | 1494 | WP_089169481.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| ACG10_RS19535 (ACG10_19155) | - | 4169026..4169889 (-) | 864 | WP_089169482.1 | SPFH domain-containing protein | - |
| ACG10_RS19540 (ACG10_19160) | - | 4169918..4170355 (-) | 438 | WP_089169483.1 | NfeD family protein | - |
| ACG10_RS19545 (ACG10_19165) | - | 4170466..4171347 (-) | 882 | WP_089169484.1 | LysR family transcriptional regulator | - |
| ACG10_RS19550 (ACG10_19170) | - | 4171465..4171887 (+) | 423 | WP_089169485.1 | DoxX family protein | - |
| ACG10_RS19555 (ACG10_19175) | - | 4171936..4172664 (+) | 729 | WP_089169486.1 | pirin family protein | - |
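The coordinates reported above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the database itself. It assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the coding sequence ends in a single stop codon. The 5000 bp flank is an assumption inferred from the listed Location window, and the helper names `span_length` and `protein_length` are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS that ends in exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# Values copied from the Overview table for comM (ACG10_RS19530).
GENE_START, GENE_END = 4167427, 4168920

# Assumption: the Genomic Context window is the gene plus a fixed flank
# on each side; 5000 bp is inferred from the listed Location.
FLANK = 5000

gene_bp = span_length(GENE_START, GENE_END)
gene_aa = protein_length(gene_bp)
window = (GENE_START - FLANK, GENE_END + FLANK)

print(f"gene length: {gene_bp} bp, protein length: {gene_aa} a.a.")
print(f"context window: {window[0]}..{window[1]}")
```

Under these assumptions the results agree with the record: 1494 bp, 497 a.a., and a context window of 4162427..4173920.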
Sequence
Protein
Length: 497 a.a.; Molecular weight: 52079.86 Da; Isoelectric point: 7.5093
>NTDB_id=148930 ACG10_RS19530 WP_089169481.1 4167427..4168920(+) (comM) [Azotobacter chroococcum strain B3]
MSLAIVHSRAQVGVEAPAVTVEAHLANGLPALTLVGLPETAVRESKDRVRSAILTSGFDFPARRITLNLAPADLPKDGGR
FDLAIALGILAASEQLPAEALGNLECLGELALSGSLRPVRGVLPAALAARAAGRTLVVPRANAEEASLASGLNVLAVDHL
LELAAHLNGQSPLAPYQAQGLLRQTLPYPDLADVQGQAAAKRALLVAAAGSHNLLLSGPPGTGKTLLASRLPGLLPPLDE
GEALEVAAIHSVAGSAPLAAWPQRPFRQPHHSASGPALVGGGSRPRPGEITLAHQGVLFLDELPEFDRKVLEVLREPLES
GEIVIARASDKVRFPARFQLVAAMNPCPCGYLGDPAGRCRCTPEQIQRYRAKLSGPLLDRIDLHIGVTREATALGAPRLD
GPDSAGAAAQVAAARTLQLARQGCPNAFLDLPGLHQHCALDNEDRQWLERACERLGLSLRAAHRILKVARTLADLEAAPG
IARAHLAEALQYRASSA
Nucleotide
Length: 1494 bp
>NTDB_id=148930 ACG10_RS19530 WP_089169481.1 4167427..4168920(+) (comM) [Azotobacter chroococcum strain B3]
ATGTCCCTGGCCATCGTCCACAGCCGCGCCCAGGTGGGCGTCGAGGCGCCCGCCGTCACCGTCGAGGCGCATCTGGCCAA
CGGCCTGCCGGCGCTGACCCTGGTCGGCCTGCCGGAAACCGCGGTCCGCGAGAGCAAGGACCGCGTGCGCAGCGCCATCC
TCACCTCCGGCTTCGACTTCCCGGCGCGGCGCATCACCCTCAACCTGGCCCCCGCCGACCTGCCCAAGGACGGCGGACGC
TTCGACCTGGCCATCGCCCTGGGCATCCTCGCCGCCAGTGAGCAGTTGCCCGCCGAGGCCCTCGGCAACCTGGAGTGCCT
CGGCGAGCTGGCCCTCTCCGGCAGCCTGCGGCCGGTCCGGGGCGTGCTGCCCGCCGCGCTGGCCGCCCGTGCCGCCGGAC
GCACCCTGGTGGTGCCACGGGCCAACGCCGAGGAAGCCAGCCTGGCCTCGGGGCTGAACGTGCTGGCGGTCGACCACCTG
CTGGAGCTGGCCGCCCACCTGAACGGCCAGTCCCCGCTGGCGCCCTACCAGGCCCAGGGCCTGCTGCGCCAGACGCTGCC
CTACCCCGACCTTGCCGACGTGCAGGGCCAGGCCGCGGCCAAGCGCGCCCTGCTGGTGGCCGCTGCCGGCAGCCACAACC
TGCTGCTCAGCGGCCCGCCGGGAACCGGCAAGACCCTGCTGGCCAGCCGCCTGCCGGGACTGCTGCCACCGCTGGACGAG
GGCGAGGCGCTGGAGGTGGCGGCGATCCATTCGGTGGCCGGCAGCGCGCCGCTCGCCGCCTGGCCGCAGCGGCCGTTCCG
CCAGCCGCACCACAGCGCCTCGGGACCGGCGCTGGTCGGCGGCGGCAGCCGGCCGCGTCCCGGCGAGATCACCCTGGCGC
ACCAGGGCGTACTGTTCCTCGACGAGTTGCCGGAGTTCGACCGCAAGGTGCTGGAAGTGCTGCGCGAGCCCCTGGAAAGC
GGCGAGATCGTCATCGCCCGGGCCAGCGACAAGGTGCGCTTCCCGGCGCGCTTCCAGCTGGTGGCGGCCATGAACCCCTG
CCCCTGCGGCTACCTGGGCGACCCTGCCGGCCGCTGCCGCTGCACCCCGGAGCAGATCCAGCGCTACCGCGCCAAGCTGT
CCGGCCCGCTGCTCGACCGCATCGACCTGCACATCGGCGTCACCCGCGAGGCCACCGCCCTGGGCGCACCGCGCCTGGAC
GGTCCGGACAGCGCCGGTGCCGCGGCCCAGGTGGCGGCGGCGCGCACCCTTCAGCTGGCGCGCCAGGGCTGCCCCAATGC
GTTCCTCGATCTGCCCGGATTGCACCAGCACTGTGCACTGGACAACGAGGACCGCCAGTGGCTGGAGCGCGCCTGCGAGC
GCCTCGGCCTGTCGCTGCGCGCCGCCCACCGCATTCTCAAGGTGGCGCGCACCCTGGCCGATCTCGAGGCGGCGCCGGGG
ATCGCCCGTGCCCACCTGGCCGAAGCCCTGCAGTACCGGGCCAGCAGCGCCTGA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio campbellii strain DS40M4 | 57.43 | 100 | 0.575 |
| comM | Vibrio cholerae strain A1552 | 56.827 | 100 | 0.569 |
| comM | Haemophilus influenzae Rd KW20 | 54.89 | 100 | 0.553 |
| comM | Glaesserella parasuis strain SC1401 | 54.6 | 100 | 0.549 |
| comM | Legionella pneumophila str. Paris | 50 | 100 | 0.515 |
| comM | Legionella pneumophila strain ERS1305867 | 50 | 100 | 0.515 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 45.527 | 100 | 0.461 |