Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | ABL847_RS21055 | Genome accession | NZ_CP157616 |
| Coordinates | 4557906..4559441 (-) | Length | 511 a.a. |
| NCBI ID | WP_077000248.1 | Uniprot ID | - |
| Organism | Variovorax sp. KK3 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 4552906..4564441
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ABL847_RS21035 (ABL847_21000) | - | 4553442..4554554 (+) | 1113 | WP_077000251.1 | ABC transporter substrate-binding protein | - |
| ABL847_RS21040 (ABL847_21005) | - | 4554680..4555843 (+) | 1164 | WP_077000250.1 | saccharopine dehydrogenase family protein | - |
| ABL847_RS21045 (ABL847_21010) | - | 4555861..4557366 (+) | 1506 | WP_077000249.1 | aldehyde dehydrogenase family protein | - |
| ABL847_RS21050 (ABL847_21015) | - | 4557423..4557884 (+) | 462 | WP_077000299.1 | Lrp/AsnC family transcriptional regulator | - |
| ABL847_RS21055 (ABL847_21020) | comM | 4557906..4559441 (-) | 1536 | WP_077000248.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| ABL847_RS21060 (ABL847_21025) | - | 4559599..4560432 (+) | 834 | WP_077000247.1 | TorF family putative porin | - |
| ABL847_RS21065 (ABL847_21030) | glnK | 4560503..4560841 (+) | 339 | WP_077000246.1 | P-II family nitrogen regulator | - |
| ABL847_RS21070 (ABL847_21035) | - | 4560868..4562397 (+) | 1530 | WP_077000245.1 | ammonium transporter | - |
| ABL847_RS21075 (ABL847_21040) | - | 4562634..4563533 (+) | 900 | WP_077000298.1 | SMP-30/gluconolactonase/LRE family protein | - |
Sequence
Protein
Download Length: 511 a.a. Molecular weight: 53439.14 Da Isoelectric Point: 7.8527
>NTDB_id=1007726 ABL847_RS21055 WP_077000248.1 4557906..4559441(-) (comM) [Variovorax sp. KK3]
MSLSLVQSRALIGLEAADVTVEVHLANGLPSFTLVGLADVEVKEARERVRSALQNAGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGILAASGQIEAARLAGHEFAGELSLSGHLRPVRGALAMALALHGRGVATRLVLPAESAQEAALVPGAEIYGA
AHLLDVVRQFVPGGPAPAGADDGWHRAVQAVAAEPSEALADLADVKGHAGARRVLEIAAAGQHSLLMVGPPGAGKSMLAQ
RFAGLLPQMSVDEALEAAAVASLQGRFAVGRWRQRPTCSPHHSASAVALVGGGSPPRPGEISLAHNGVLFLDEFPEFQRS
ALEALREPLETGSITIARAARRAEFPARFQLIAAMNPCPCGYLGSTLKACRCSPDQVTRYQGKLSGPLLDRIDLQIEVPA
VPTTELLDVPAGEASATVRERVAEARGRALERQGKANQALQGAEIDRHARPEAAALQLLHGAAARLGWSARGIHRALKVA
RTIADLAATDTVQAAHVAEAVQYRRALRAAA
MSLSLVQSRALIGLEAADVTVEVHLANGLPSFTLVGLADVEVKEARERVRSALQNAGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGILAASGQIEAARLAGHEFAGELSLSGHLRPVRGALAMALALHGRGVATRLVLPAESAQEAALVPGAEIYGA
AHLLDVVRQFVPGGPAPAGADDGWHRAVQAVAAEPSEALADLADVKGHAGARRVLEIAAAGQHSLLMVGPPGAGKSMLAQ
RFAGLLPQMSVDEALEAAAVASLQGRFAVGRWRQRPTCSPHHSASAVALVGGGSPPRPGEISLAHNGVLFLDEFPEFQRS
ALEALREPLETGSITIARAARRAEFPARFQLIAAMNPCPCGYLGSTLKACRCSPDQVTRYQGKLSGPLLDRIDLQIEVPA
VPTTELLDVPAGEASATVRERVAEARGRALERQGKANQALQGAEIDRHARPEAAALQLLHGAAARLGWSARGIHRALKVA
RTIADLAATDTVQAAHVAEAVQYRRALRAAA
Nucleotide
Download Length: 1536 bp
>NTDB_id=1007726 ABL847_RS21055 WP_077000248.1 4557906..4559441(-) (comM) [Variovorax sp. KK3]
ATGAGTTTGTCTTTGGTGCAAAGCCGTGCGCTGATCGGCTTGGAGGCGGCCGATGTCACGGTCGAGGTGCATCTGGCCAA
CGGCCTGCCCAGCTTCACGCTGGTCGGGTTGGCCGATGTGGAAGTCAAGGAAGCGCGCGAACGGGTGCGCTCGGCCCTCC
AGAACGCCGGCCTCGAATTCCCCAGCAACAAGCGCATCACGGTCAACCTGGCGCCGGCCGACCTGCCCAAGGATTCCGGC
CGCTTCGACCTGCCGATCGCGCTGGGCATCCTGGCGGCCAGCGGCCAGATCGAGGCGGCGCGGCTCGCTGGCCACGAATT
CGCGGGCGAGCTCTCGCTTTCGGGGCATCTGCGGCCCGTGCGTGGTGCACTGGCGATGGCGCTGGCCCTGCATGGCCGCG
GCGTCGCCACGCGGCTGGTGCTGCCGGCCGAGAGCGCACAGGAAGCCGCGCTGGTGCCTGGCGCCGAAATCTACGGCGCA
GCGCACCTGCTCGACGTGGTGCGCCAGTTCGTGCCGGGCGGCCCCGCGCCGGCCGGGGCCGACGATGGCTGGCACCGGGC
CGTGCAGGCCGTCGCCGCCGAGCCGTCCGAAGCGCTGGCCGACCTGGCCGACGTCAAGGGCCATGCCGGCGCACGGCGCG
TGCTCGAGATCGCCGCCGCCGGCCAGCACAGCCTGCTGATGGTCGGGCCGCCGGGCGCCGGCAAGTCGATGCTGGCCCAG
CGCTTCGCCGGCCTGCTGCCGCAGATGAGCGTGGACGAAGCGTTGGAAGCCGCGGCGGTGGCCAGCCTGCAAGGCCGGTT
CGCCGTCGGCCGATGGCGCCAGCGGCCGACCTGCAGCCCGCACCACAGCGCGAGCGCGGTCGCGCTGGTGGGTGGCGGCA
GTCCGCCGCGGCCCGGCGAGATCTCGCTGGCGCACAACGGCGTGCTGTTCCTCGACGAGTTTCCCGAGTTCCAGCGCTCG
GCCCTCGAGGCCCTGCGCGAGCCCCTGGAGACCGGCAGCATCACCATCGCGCGGGCTGCCCGGCGCGCCGAATTTCCGGC
GCGTTTCCAGCTGATCGCGGCGATGAACCCCTGCCCTTGCGGCTACCTGGGCTCGACGCTCAAGGCCTGCCGCTGCTCGC
CCGACCAGGTCACCCGATATCAAGGAAAGCTCAGCGGCCCGCTGCTGGACCGCATCGACCTGCAGATCGAGGTGCCGGCC
GTGCCCACCACTGAGCTGCTCGACGTGCCGGCCGGCGAAGCCAGCGCCACGGTGCGCGAGCGCGTGGCCGAGGCGCGCGG
CCGGGCCCTGGAGCGCCAGGGCAAGGCCAACCAGGCCTTGCAGGGCGCAGAGATCGACCGCCACGCCCGGCCCGAAGCGG
CTGCCTTGCAGCTGCTGCACGGCGCCGCAGCGCGGCTGGGCTGGTCGGCGCGCGGCATCCACCGGGCGCTGAAGGTCGCG
CGAACCATTGCGGACCTGGCCGCCACCGACACGGTGCAGGCGGCGCACGTGGCGGAGGCGGTGCAGTACCGCCGGGCACT
GCGCGCAGCAGCCTGA
ATGAGTTTGTCTTTGGTGCAAAGCCGTGCGCTGATCGGCTTGGAGGCGGCCGATGTCACGGTCGAGGTGCATCTGGCCAA
CGGCCTGCCCAGCTTCACGCTGGTCGGGTTGGCCGATGTGGAAGTCAAGGAAGCGCGCGAACGGGTGCGCTCGGCCCTCC
AGAACGCCGGCCTCGAATTCCCCAGCAACAAGCGCATCACGGTCAACCTGGCGCCGGCCGACCTGCCCAAGGATTCCGGC
CGCTTCGACCTGCCGATCGCGCTGGGCATCCTGGCGGCCAGCGGCCAGATCGAGGCGGCGCGGCTCGCTGGCCACGAATT
CGCGGGCGAGCTCTCGCTTTCGGGGCATCTGCGGCCCGTGCGTGGTGCACTGGCGATGGCGCTGGCCCTGCATGGCCGCG
GCGTCGCCACGCGGCTGGTGCTGCCGGCCGAGAGCGCACAGGAAGCCGCGCTGGTGCCTGGCGCCGAAATCTACGGCGCA
GCGCACCTGCTCGACGTGGTGCGCCAGTTCGTGCCGGGCGGCCCCGCGCCGGCCGGGGCCGACGATGGCTGGCACCGGGC
CGTGCAGGCCGTCGCCGCCGAGCCGTCCGAAGCGCTGGCCGACCTGGCCGACGTCAAGGGCCATGCCGGCGCACGGCGCG
TGCTCGAGATCGCCGCCGCCGGCCAGCACAGCCTGCTGATGGTCGGGCCGCCGGGCGCCGGCAAGTCGATGCTGGCCCAG
CGCTTCGCCGGCCTGCTGCCGCAGATGAGCGTGGACGAAGCGTTGGAAGCCGCGGCGGTGGCCAGCCTGCAAGGCCGGTT
CGCCGTCGGCCGATGGCGCCAGCGGCCGACCTGCAGCCCGCACCACAGCGCGAGCGCGGTCGCGCTGGTGGGTGGCGGCA
GTCCGCCGCGGCCCGGCGAGATCTCGCTGGCGCACAACGGCGTGCTGTTCCTCGACGAGTTTCCCGAGTTCCAGCGCTCG
GCCCTCGAGGCCCTGCGCGAGCCCCTGGAGACCGGCAGCATCACCATCGCGCGGGCTGCCCGGCGCGCCGAATTTCCGGC
GCGTTTCCAGCTGATCGCGGCGATGAACCCCTGCCCTTGCGGCTACCTGGGCTCGACGCTCAAGGCCTGCCGCTGCTCGC
CCGACCAGGTCACCCGATATCAAGGAAAGCTCAGCGGCCCGCTGCTGGACCGCATCGACCTGCAGATCGAGGTGCCGGCC
GTGCCCACCACTGAGCTGCTCGACGTGCCGGCCGGCGAAGCCAGCGCCACGGTGCGCGAGCGCGTGGCCGAGGCGCGCGG
CCGGGCCCTGGAGCGCCAGGGCAAGGCCAACCAGGCCTTGCAGGGCGCAGAGATCGACCGCCACGCCCGGCCCGAAGCGG
CTGCCTTGCAGCTGCTGCACGGCGCCGCAGCGCGGCTGGGCTGGTCGGCGCGCGGCATCCACCGGGCGCTGAAGGTCGCG
CGAACCATTGCGGACCTGGCCGCCACCGACACGGTGCAGGCGGCGCACGTGGCGGAGGCGGTGCAGTACCGCCGGGCACT
GCGCGCAGCAGCCTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
51.663 |
100 |
0.517 |
| comM | Vibrio cholerae strain A1552 |
51.272 |
100 |
0.513 |
| comM | Glaesserella parasuis strain SC1401 |
51.282 |
99.217 |
0.509 |
| comM | Vibrio campbellii strain DS40M4 |
49.902 |
99.609 |
0.497 |
| comM | Legionella pneumophila str. Paris |
46.693 |
100 |
0.47 |
| comM | Legionella pneumophila strain ERS1305867 |
46.693 |
100 |
0.47 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
42.717 |
99.413 |
0.425 |