Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | CDA09_RS21245 | Genome accession | NZ_CP021731 |
| Coordinates | 4576399..4577898 (-) | Length | 499 a.a. |
| NCBI ID | WP_121430472.1 | Uniprot ID | - |
| Organism | Azoarcus sp. DN11 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 4571399..4582898
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| CDA09_RS21220 (CDA09_21195) | - | 4571818..4572918 (+) | 1101 | WP_121430467.1 | ABC transporter ATP-binding protein | - |
| CDA09_RS21225 (CDA09_21200) | - | 4572915..4573691 (+) | 777 | WP_121430468.1 | ABC transporter ATP-binding protein | - |
| CDA09_RS21230 (CDA09_21205) | - | 4573681..4574418 (+) | 738 | WP_121430469.1 | ABC transporter ATP-binding protein | - |
| CDA09_RS21235 (CDA09_21210) | - | 4574539..4574964 (+) | 426 | WP_121430470.1 | aldehyde-activating protein | - |
| CDA09_RS21240 (CDA09_21215) | - | 4575128..4576321 (-) | 1194 | WP_121430471.1 | multidrug effflux MFS transporter | - |
| CDA09_RS21245 (CDA09_21220) | comM | 4576399..4577898 (-) | 1500 | WP_121430472.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| CDA09_RS21250 (CDA09_21225) | - | 4577998..4578261 (-) | 264 | WP_121430473.1 | accessory factor UbiK family protein | - |
| CDA09_RS21255 (CDA09_21230) | - | 4578590..4579333 (+) | 744 | WP_121430474.1 | TorF family putative porin | - |
| CDA09_RS21260 (CDA09_21235) | glnK | 4579384..4579722 (+) | 339 | WP_121430475.1 | P-II family nitrogen regulator | - |
| CDA09_RS21265 (CDA09_21240) | amt | 4579733..4581196 (+) | 1464 | WP_121430476.1 | ammonium transporter | - |
| CDA09_RS21270 (CDA09_21245) | purU | 4581293..4582180 (-) | 888 | WP_121430477.1 | formyltetrahydrofolate deformylase | - |
| CDA09_RS21275 (CDA09_21250) | thrH | 4582185..4582799 (-) | 615 | WP_121430478.1 | bifunctional phosphoserine phosphatase/homoserine phosphotransferase ThrH | - |
Sequence
Protein
Download Length: 499 a.a. Molecular weight: 53248.87 Da Isoelectric Point: 8.5251
>NTDB_id=232934 CDA09_RS21245 WP_121430472.1 4576399..4577898(-) (comM) [Azoarcus sp. DN11]
MSLALVRTRALAGLGAPEVTVEVHLANGLPAFNLVGLPDTEVREARERVRAAIATSQFEFPQRRITVNLAPADLPKEGGR
FDLPIALGILAASGQVDAAALARHEFVGELSLDGSLRPVRGGLAMALESGRAGRALVLPAANADEAALARDASVLPAPSL
LAVCAHLNGHTPLTRRLAPPVAEGRDDEPDLAEVKGQLQARRALEVAAAGQHSLLMFGPPGTGKSMLARRLPGLLPPLDE
AEAIESASIQSLEGAFDARRWGRRPYRAPHHSASAPAVVGGGASPRPGEISLAHHGVLFLDELPEFERRVLEALREPLET
GTVTVSRARQRAEFPARFQLVAAMNPCPCGHAGDKNGRCRCTPDQVARYRGRLSGPLLDRMDIVIEVPLLDHADMLGQPA
GEPSAAVRERVTQAWAVQRERQGRANSHLAPGRVDALCAPDEQGKALLDHAIRRLNLSARGYHRILKVARTIADLAGAER
VGPAHLAEAIQYRRGLDSR
MSLALVRTRALAGLGAPEVTVEVHLANGLPAFNLVGLPDTEVREARERVRAAIATSQFEFPQRRITVNLAPADLPKEGGR
FDLPIALGILAASGQVDAAALARHEFVGELSLDGSLRPVRGGLAMALESGRAGRALVLPAANADEAALARDASVLPAPSL
LAVCAHLNGHTPLTRRLAPPVAEGRDDEPDLAEVKGQLQARRALEVAAAGQHSLLMFGPPGTGKSMLARRLPGLLPPLDE
AEAIESASIQSLEGAFDARRWGRRPYRAPHHSASAPAVVGGGASPRPGEISLAHHGVLFLDELPEFERRVLEALREPLET
GTVTVSRARQRAEFPARFQLVAAMNPCPCGHAGDKNGRCRCTPDQVARYRGRLSGPLLDRMDIVIEVPLLDHADMLGQPA
GEPSAAVRERVTQAWAVQRERQGRANSHLAPGRVDALCAPDEQGKALLDHAIRRLNLSARGYHRILKVARTIADLAGAER
VGPAHLAEAIQYRRGLDSR
Nucleotide
Download Length: 1500 bp
>NTDB_id=232934 CDA09_RS21245 WP_121430472.1 4576399..4577898(-) (comM) [Azoarcus sp. DN11]
ATGTCGCTGGCTCTCGTACGCACCCGCGCGCTCGCGGGTCTGGGCGCACCCGAGGTGACGGTGGAAGTGCACCTCGCCAA
CGGCCTGCCGGCCTTCAACCTCGTCGGTCTGCCTGATACCGAGGTGCGCGAGGCGCGCGAGCGCGTGCGCGCCGCGATCG
CCACGTCGCAGTTCGAATTCCCGCAGCGCCGGATCACCGTCAATCTCGCCCCGGCCGACCTGCCCAAGGAAGGCGGGCGC
TTCGACCTGCCGATCGCGCTGGGCATCCTCGCCGCGTCGGGCCAGGTCGATGCGGCGGCGCTGGCCCGTCACGAATTCGT
CGGCGAGCTGTCGCTCGACGGCAGCCTGCGCCCGGTGCGCGGCGGTCTCGCGATGGCCCTCGAGAGCGGCCGCGCCGGCC
GCGCGCTGGTACTGCCCGCGGCGAATGCCGACGAAGCCGCGCTCGCGCGCGACGCGAGCGTGCTGCCGGCACCGAGCCTG
CTCGCCGTGTGTGCCCACCTCAACGGCCACACGCCGCTCACGCGCCGTCTTGCGCCGCCGGTGGCGGAAGGGCGGGACGA
CGAACCGGATCTCGCCGAGGTGAAGGGGCAGCTGCAGGCGCGCCGCGCGCTCGAAGTCGCCGCCGCCGGCCAGCACTCGC
TGCTGATGTTCGGCCCCCCGGGAACCGGCAAGTCCATGCTCGCGCGCCGCTTGCCGGGGCTCCTGCCGCCGCTCGACGAG
GCCGAGGCGATCGAGAGCGCGTCGATCCAGTCGCTCGAAGGGGCCTTCGATGCCCGCCGCTGGGGGCGGCGCCCCTACCG
CGCGCCGCACCACTCCGCCTCGGCCCCCGCGGTGGTCGGCGGCGGCGCCAGTCCGCGCCCCGGCGAGATCAGCCTCGCGC
ACCACGGCGTGCTGTTCCTCGACGAGCTGCCGGAGTTCGAGCGCCGCGTGCTCGAAGCCCTGCGCGAGCCGCTCGAGACC
GGCACTGTCACGGTGTCGCGGGCGCGGCAGCGCGCCGAGTTCCCCGCGCGCTTCCAGCTCGTCGCCGCGATGAATCCGTG
CCCGTGCGGGCATGCCGGCGACAAGAATGGGCGCTGCCGCTGCACGCCGGACCAGGTTGCACGCTACCGCGGGCGCCTGT
CGGGCCCGCTGCTCGACCGCATGGACATCGTCATCGAGGTGCCGCTGCTCGATCACGCCGACATGCTCGGTCAGCCCGCG
GGCGAGCCGAGTGCCGCGGTGCGCGAGCGCGTCACGCAGGCGTGGGCCGTGCAGCGCGAGCGCCAGGGGCGCGCCAACAG
CCATCTCGCGCCCGGCCGCGTCGATGCGCTGTGCGCGCCCGATGAGCAGGGCAAGGCGCTGCTCGATCACGCGATCCGCC
GGCTGAACCTGTCCGCGCGGGGGTATCATCGCATCCTCAAGGTTGCCCGCACGATCGCGGACCTGGCCGGCGCCGAGCGC
GTCGGCCCGGCGCATCTGGCCGAGGCGATCCAGTACCGGCGCGGCCTCGATTCCCGCTGA
ATGTCGCTGGCTCTCGTACGCACCCGCGCGCTCGCGGGTCTGGGCGCACCCGAGGTGACGGTGGAAGTGCACCTCGCCAA
CGGCCTGCCGGCCTTCAACCTCGTCGGTCTGCCTGATACCGAGGTGCGCGAGGCGCGCGAGCGCGTGCGCGCCGCGATCG
CCACGTCGCAGTTCGAATTCCCGCAGCGCCGGATCACCGTCAATCTCGCCCCGGCCGACCTGCCCAAGGAAGGCGGGCGC
TTCGACCTGCCGATCGCGCTGGGCATCCTCGCCGCGTCGGGCCAGGTCGATGCGGCGGCGCTGGCCCGTCACGAATTCGT
CGGCGAGCTGTCGCTCGACGGCAGCCTGCGCCCGGTGCGCGGCGGTCTCGCGATGGCCCTCGAGAGCGGCCGCGCCGGCC
GCGCGCTGGTACTGCCCGCGGCGAATGCCGACGAAGCCGCGCTCGCGCGCGACGCGAGCGTGCTGCCGGCACCGAGCCTG
CTCGCCGTGTGTGCCCACCTCAACGGCCACACGCCGCTCACGCGCCGTCTTGCGCCGCCGGTGGCGGAAGGGCGGGACGA
CGAACCGGATCTCGCCGAGGTGAAGGGGCAGCTGCAGGCGCGCCGCGCGCTCGAAGTCGCCGCCGCCGGCCAGCACTCGC
TGCTGATGTTCGGCCCCCCGGGAACCGGCAAGTCCATGCTCGCGCGCCGCTTGCCGGGGCTCCTGCCGCCGCTCGACGAG
GCCGAGGCGATCGAGAGCGCGTCGATCCAGTCGCTCGAAGGGGCCTTCGATGCCCGCCGCTGGGGGCGGCGCCCCTACCG
CGCGCCGCACCACTCCGCCTCGGCCCCCGCGGTGGTCGGCGGCGGCGCCAGTCCGCGCCCCGGCGAGATCAGCCTCGCGC
ACCACGGCGTGCTGTTCCTCGACGAGCTGCCGGAGTTCGAGCGCCGCGTGCTCGAAGCCCTGCGCGAGCCGCTCGAGACC
GGCACTGTCACGGTGTCGCGGGCGCGGCAGCGCGCCGAGTTCCCCGCGCGCTTCCAGCTCGTCGCCGCGATGAATCCGTG
CCCGTGCGGGCATGCCGGCGACAAGAATGGGCGCTGCCGCTGCACGCCGGACCAGGTTGCACGCTACCGCGGGCGCCTGT
CGGGCCCGCTGCTCGACCGCATGGACATCGTCATCGAGGTGCCGCTGCTCGATCACGCCGACATGCTCGGTCAGCCCGCG
GGCGAGCCGAGTGCCGCGGTGCGCGAGCGCGTCACGCAGGCGTGGGCCGTGCAGCGCGAGCGCCAGGGGCGCGCCAACAG
CCATCTCGCGCCCGGCCGCGTCGATGCGCTGTGCGCGCCCGATGAGCAGGGCAAGGCGCTGCTCGATCACGCGATCCGCC
GGCTGAACCTGTCCGCGCGGGGGTATCATCGCATCCTCAAGGTTGCCCGCACGATCGCGGACCTGGCCGGCGCCGAGCGC
GTCGGCCCGGCGCATCTGGCCGAGGCGATCCAGTACCGGCGCGGCCTCGATTCCCGCTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
53.6 |
100 |
0.537 |
| comM | Vibrio campbellii strain DS40M4 |
53.131 |
99.198 |
0.527 |
| comM | Vibrio cholerae strain A1552 |
53.131 |
99.198 |
0.527 |
| comM | Glaesserella parasuis strain SC1401 |
51.4 |
100 |
0.515 |
| comM | Legionella pneumophila str. Paris |
48.898 |
100 |
0.489 |
| comM | Legionella pneumophila strain ERS1305867 |
48.898 |
100 |
0.489 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
44.466 |
100 |
0.451 |