Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | E5CHR_RS01525 | Genome accession | NZ_LR594675 |
| Coordinates | 330431..331966 (-) | Length | 511 a.a. |
| NCBI ID | WP_162578062.1 | Uniprot ID | A0A6P2E8S5 |
| Organism | Variovorax sp. PBS-H4 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 325431..336966
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| E5CHR_RS01505 (H4CHR_00301) | - | 326242..327051 (+) | 810 | WP_162578059.1 | helix-turn-helix transcriptional regulator | - |
| E5CHR_RS01510 (H4CHR_00302) | - | 327144..328163 (+) | 1020 | WP_162578060.1 | aromatic ring-hydroxylating dioxygenase subunit alpha | - |
| E5CHR_RS01515 (H4CHR_00303) | - | 328208..329191 (+) | 984 | WP_232061918.1 | tripartite tricarboxylate transporter substrate-binding protein | - |
| E5CHR_RS01520 (H4CHR_00304) | - | 329321..330430 (+) | 1110 | WP_162578061.1 | ABC transporter substrate-binding protein | - |
| E5CHR_RS01525 (H4CHR_00305) | comM | 330431..331966 (-) | 1536 | WP_162578062.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| E5CHR_RS01530 (H4CHR_00306) | - | 332127..332948 (+) | 822 | WP_162583510.1 | TorF family putative porin | - |
| E5CHR_RS01535 (H4CHR_00307) | glnK | 333013..333351 (+) | 339 | WP_162578063.1 | P-II family nitrogen regulator | - |
| E5CHR_RS01540 (H4CHR_00308) | amt | 333378..334904 (+) | 1527 | WP_174255679.1 | ammonium transporter | - |
| E5CHR_RS01545 (H4CHR_00309) | - | 335045..335968 (+) | 924 | WP_162578064.1 | SMP-30/gluconolactonase/LRE family protein | - |
Sequence
Protein
Download Length: 511 a.a. Molecular weight: 53564.25 Da Isoelectric Point: 8.9412
>NTDB_id=1128239 E5CHR_RS01525 WP_162578062.1 330431..331966(-) (comM) [Variovorax sp. PBS-H4]
MSLSLVQSRALLGLEAASVTVEVHLANGLPSFTLVGLADVEVKEARERVRSAIQNAGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGILAAAGQIDAARLAGHEFAGELSLSGHLRPVRGALAMALALHGRGVATKLVLPAESAREAALVPGADIYGA
AHLLDVVRQFLPGGPAASDASEDGWHRAQPAEPGAVAAGADMADVKGHASARRALEVAAAGQHSLLMVGPPGSGKSMLAQ
RFAGLLPPMSVDEALESAAVASLHGRFAVERWRVRPTCSPHHSASAVALVGGGSPPRPGEISLAHNGVLFLDEFPEFQRA
ALEALREPLETGTITIARAARRAEFPARFQLIAAMNPCPCGYLGSSLKSCRCTPDQVARYQGKLSGPLLDRIDLHIEVPA
LPATQLLETPAGEASADVRARVAEGRERAIRRQGKANQALQGAEIDRHAKLDAAARQFLQTAATRLGWSARGTHRTLKLA
RTIADLAGAGAVQAAHVAEAVQYRRALQRVE
MSLSLVQSRALLGLEAASVTVEVHLANGLPSFTLVGLADVEVKEARERVRSAIQNAGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGILAAAGQIDAARLAGHEFAGELSLSGHLRPVRGALAMALALHGRGVATKLVLPAESAREAALVPGADIYGA
AHLLDVVRQFLPGGPAASDASEDGWHRAQPAEPGAVAAGADMADVKGHASARRALEVAAAGQHSLLMVGPPGSGKSMLAQ
RFAGLLPPMSVDEALESAAVASLHGRFAVERWRVRPTCSPHHSASAVALVGGGSPPRPGEISLAHNGVLFLDEFPEFQRA
ALEALREPLETGTITIARAARRAEFPARFQLIAAMNPCPCGYLGSSLKSCRCTPDQVARYQGKLSGPLLDRIDLHIEVPA
LPATQLLETPAGEASADVRARVAEGRERAIRRQGKANQALQGAEIDRHAKLDAAARQFLQTAATRLGWSARGTHRTLKLA
RTIADLAGAGAVQAAHVAEAVQYRRALQRVE
Nucleotide
Download Length: 1536 bp
>NTDB_id=1128239 E5CHR_RS01525 WP_162578062.1 330431..331966(-) (comM) [Variovorax sp. PBS-H4]
ATGAGCTTATCTTTGGTGCAAAGCCGTGCTTTGCTGGGCCTCGAAGCGGCAAGCGTCACGGTCGAGGTGCATCTGGCCAA
CGGGTTGCCGAGCTTCACGCTGGTGGGACTGGCGGACGTGGAAGTGAAGGAGGCGCGCGAGCGGGTGCGTTCTGCCATTC
AGAATGCCGGCCTCGAATTCCCGAGCAACAAGCGCATCACGGTCAACCTGGCGCCGGCCGATCTGCCGAAGGATTCGGGG
CGCTTCGACCTGCCAATCGCACTGGGCATCCTGGCGGCAGCCGGGCAGATCGACGCGGCCCGGCTGGCGGGCCACGAATT
TGCGGGGGAGCTCTCGCTTTCAGGCCATCTGCGGCCCGTGCGCGGTGCGCTCGCAATGGCGCTGGCGCTGCATGGCCGCG
GCGTTGCGACGAAGCTGGTGCTGCCGGCCGAGAGCGCGCGGGAGGCCGCCCTGGTACCCGGCGCCGACATCTACGGTGCG
GCCCACCTGCTGGACGTGGTGCGGCAGTTCCTGCCGGGCGGCCCGGCCGCGAGCGATGCGTCGGAAGACGGCTGGCACCG
TGCGCAGCCGGCCGAACCCGGCGCGGTGGCGGCCGGGGCCGACATGGCGGACGTCAAGGGGCACGCGAGCGCAAGACGCG
CGCTCGAGGTCGCGGCGGCCGGCCAGCACAGCCTGCTGATGGTGGGCCCGCCGGGTTCCGGCAAGTCGATGCTGGCGCAG
CGCTTTGCGGGCCTGCTGCCGCCGATGAGCGTCGACGAGGCGCTGGAAAGCGCCGCCGTAGCCAGCCTGCACGGCCGCTT
CGCCGTCGAGCGTTGGCGTGTGCGGCCGACCTGCAGCCCGCACCACAGCGCCAGTGCCGTGGCGCTGGTGGGCGGCGGCT
CGCCGCCGCGGCCTGGCGAGATCTCGCTGGCGCACAACGGCGTGCTGTTCCTGGACGAATTTCCCGAGTTCCAGCGTGCC
GCGCTCGAGGCGCTGCGCGAGCCGCTGGAGACCGGAACCATCACGATCGCGCGGGCCGCGCGGCGTGCCGAGTTCCCGGC
CCGCTTCCAGTTGATCGCCGCCATGAATCCGTGCCCTTGCGGCTACCTGGGCTCATCCCTCAAGTCCTGCCGTTGCACGC
CGGACCAGGTGGCCCGCTACCAGGGCAAGCTCAGCGGGCCGCTGCTCGACCGGATCGACCTGCACATCGAAGTGCCCGCC
CTACCCGCCACCCAGTTGCTGGAGACGCCCGCCGGCGAAGCCAGCGCAGACGTCCGTGCGCGCGTGGCCGAAGGGCGCGA
GCGCGCAATCCGCCGACAGGGCAAGGCCAACCAGGCGCTGCAAGGCGCAGAGATCGACCGCCACGCGAAGCTCGATGCGG
CGGCCCGGCAGTTCCTGCAGACTGCAGCGACGCGACTGGGCTGGTCGGCGCGCGGCACGCACCGCACGCTCAAGCTGGCC
CGAACGATAGCGGACCTGGCCGGAGCCGGCGCGGTGCAGGCGGCGCATGTGGCGGAGGCGGTGCAGTACCGGCGTGCACT
GCAGAGGGTGGAGTGA
ATGAGCTTATCTTTGGTGCAAAGCCGTGCTTTGCTGGGCCTCGAAGCGGCAAGCGTCACGGTCGAGGTGCATCTGGCCAA
CGGGTTGCCGAGCTTCACGCTGGTGGGACTGGCGGACGTGGAAGTGAAGGAGGCGCGCGAGCGGGTGCGTTCTGCCATTC
AGAATGCCGGCCTCGAATTCCCGAGCAACAAGCGCATCACGGTCAACCTGGCGCCGGCCGATCTGCCGAAGGATTCGGGG
CGCTTCGACCTGCCAATCGCACTGGGCATCCTGGCGGCAGCCGGGCAGATCGACGCGGCCCGGCTGGCGGGCCACGAATT
TGCGGGGGAGCTCTCGCTTTCAGGCCATCTGCGGCCCGTGCGCGGTGCGCTCGCAATGGCGCTGGCGCTGCATGGCCGCG
GCGTTGCGACGAAGCTGGTGCTGCCGGCCGAGAGCGCGCGGGAGGCCGCCCTGGTACCCGGCGCCGACATCTACGGTGCG
GCCCACCTGCTGGACGTGGTGCGGCAGTTCCTGCCGGGCGGCCCGGCCGCGAGCGATGCGTCGGAAGACGGCTGGCACCG
TGCGCAGCCGGCCGAACCCGGCGCGGTGGCGGCCGGGGCCGACATGGCGGACGTCAAGGGGCACGCGAGCGCAAGACGCG
CGCTCGAGGTCGCGGCGGCCGGCCAGCACAGCCTGCTGATGGTGGGCCCGCCGGGTTCCGGCAAGTCGATGCTGGCGCAG
CGCTTTGCGGGCCTGCTGCCGCCGATGAGCGTCGACGAGGCGCTGGAAAGCGCCGCCGTAGCCAGCCTGCACGGCCGCTT
CGCCGTCGAGCGTTGGCGTGTGCGGCCGACCTGCAGCCCGCACCACAGCGCCAGTGCCGTGGCGCTGGTGGGCGGCGGCT
CGCCGCCGCGGCCTGGCGAGATCTCGCTGGCGCACAACGGCGTGCTGTTCCTGGACGAATTTCCCGAGTTCCAGCGTGCC
GCGCTCGAGGCGCTGCGCGAGCCGCTGGAGACCGGAACCATCACGATCGCGCGGGCCGCGCGGCGTGCCGAGTTCCCGGC
CCGCTTCCAGTTGATCGCCGCCATGAATCCGTGCCCTTGCGGCTACCTGGGCTCATCCCTCAAGTCCTGCCGTTGCACGC
CGGACCAGGTGGCCCGCTACCAGGGCAAGCTCAGCGGGCCGCTGCTCGACCGGATCGACCTGCACATCGAAGTGCCCGCC
CTACCCGCCACCCAGTTGCTGGAGACGCCCGCCGGCGAAGCCAGCGCAGACGTCCGTGCGCGCGTGGCCGAAGGGCGCGA
GCGCGCAATCCGCCGACAGGGCAAGGCCAACCAGGCGCTGCAAGGCGCAGAGATCGACCGCCACGCGAAGCTCGATGCGG
CGGCCCGGCAGTTCCTGCAGACTGCAGCGACGCGACTGGGCTGGTCGGCGCGCGGCACGCACCGCACGCTCAAGCTGGCC
CGAACGATAGCGGACCTGGCCGGAGCCGGCGCGGTGCAGGCGGCGCATGTGGCGGAGGCGGTGCAGTACCGGCGTGCACT
GCAGAGGGTGGAGTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
51.744 |
100 |
0.523 |
| comM | Glaesserella parasuis strain SC1401 |
51.064 |
100 |
0.517 |
| comM | Vibrio cholerae strain A1552 |
51.581 |
99.022 |
0.511 |
| comM | Vibrio campbellii strain DS40M4 |
50 |
99.804 |
0.499 |
| comM | Legionella pneumophila str. Paris |
47.368 |
100 |
0.476 |
| comM | Legionella pneumophila strain ERS1305867 |
47.368 |
100 |
0.476 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
43.307 |
99.413 |
0.431 |