Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | E5P3_RS01785 | Genome accession | NZ_LR594662 |
| Coordinates | 389349..390884 (-) | Length | 511 a.a. |
| NCBI ID | WP_162584429.1 | Uniprot ID | A0A6P2DYQ9 |
| Organism | Variovorax sp. RA8 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 384349..395884
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| E5P3_RS01765 (G3W91_RS01765) | - | 385155..385964 (+) | 810 | WP_162584425.1 | helix-turn-helix transcriptional regulator | - |
| E5P3_RS01770 (G3W91_RS01770) | - | 386058..387077 (+) | 1020 | WP_162584426.1 | aromatic ring-hydroxylating dioxygenase subunit alpha | - |
| E5P3_RS01775 (G3W91_RS01775) | - | 387126..388109 (+) | 984 | WP_162584427.1 | tripartite tricarboxylate transporter substrate-binding protein | - |
| E5P3_RS01780 (G3W91_RS01780) | - | 388239..389348 (+) | 1110 | WP_162584428.1 | ABC transporter substrate-binding protein | - |
| E5P3_RS01785 (G3W91_RS01785) | comM | 389349..390884 (-) | 1536 | WP_162584429.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| E5P3_RS01790 (G3W91_RS01790) | - | 391044..391865 (+) | 822 | WP_162589481.1 | TorF family putative porin | - |
| E5P3_RS01795 (G3W91_RS01795) | glnK | 391932..392270 (+) | 339 | WP_068686416.1 | P-II family nitrogen regulator | - |
| E5P3_RS01800 (G3W91_RS01800) | amt | 392297..393823 (+) | 1527 | WP_174263023.1 | ammonium transporter | - |
| E5P3_RS01805 (G3W91_RS01805) | - | 393977..394900 (+) | 924 | WP_162584430.1 | SMP-30/gluconolactonase/LRE family protein | - |
Sequence
Protein
Download Length: 511 a.a. Molecular weight: 53401.16 Da Isoelectric Point: 8.5296
>NTDB_id=1128179 E5P3_RS01785 WP_162584429.1 389349..390884(-) (comM) [Variovorax sp. RA8]
MSLSLVQSRALLGLEAASVTVEVHLANGLPSFTLVGLADVEVKEARERVRCAIQNAGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGILAAAGQIEAARLAGHEFAGELSLSGHLRPVRGALAMALALHGRGVATKLVLPAESAREAALVPGAEIYGA
AHLLDVVRQFLPGGPAPGDAAEDGWHRAQAAAAGPAAAEADLADVKGHAGARRALEIAAAGQHSLLMVGPPGSGKSMLAQ
RFAGLLPSMSVDEALESAAVASLHGRFAVERWRLRPTCSPHHSASAVALVGGGSPPRPGEISLAHNGVLFLDEFPEFQRA
ALEALREPLETGSITIARAARRAEFPARFQLVAAMNPCPCGHLGSSLKPCRCTPDQVARYQGKLSGPLLDRIDLHIEVPA
VPATQLLETPTGEASAEVRARVVEARERALRRQGKANQALQGAEIDRHAQPGAAALQFLQAAATRLGWSARGTHRTLKLA
RTIADLAGAGTVQAAHVAEAVQYRRALQKVE
MSLSLVQSRALLGLEAASVTVEVHLANGLPSFTLVGLADVEVKEARERVRCAIQNAGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGILAAAGQIEAARLAGHEFAGELSLSGHLRPVRGALAMALALHGRGVATKLVLPAESAREAALVPGAEIYGA
AHLLDVVRQFLPGGPAPGDAAEDGWHRAQAAAAGPAAAEADLADVKGHAGARRALEIAAAGQHSLLMVGPPGSGKSMLAQ
RFAGLLPSMSVDEALESAAVASLHGRFAVERWRLRPTCSPHHSASAVALVGGGSPPRPGEISLAHNGVLFLDEFPEFQRA
ALEALREPLETGSITIARAARRAEFPARFQLVAAMNPCPCGHLGSSLKPCRCTPDQVARYQGKLSGPLLDRIDLHIEVPA
VPATQLLETPTGEASAEVRARVVEARERALRRQGKANQALQGAEIDRHAQPGAAALQFLQAAATRLGWSARGTHRTLKLA
RTIADLAGAGTVQAAHVAEAVQYRRALQKVE
Nucleotide
Download Length: 1536 bp
>NTDB_id=1128179 E5P3_RS01785 WP_162584429.1 389349..390884(-) (comM) [Variovorax sp. RA8]
ATGAGCTTATCTTTGGTGCAGAGCCGTGCTTTGCTGGGCCTGGAAGCGGCAAGCGTCACGGTCGAGGTGCATCTGGCCAA
CGGGCTGCCCAGCTTCACGCTGGTGGGATTGGCCGACGTGGAGGTGAAGGAAGCCCGGGAGCGGGTGCGTTGCGCCATCC
AGAACGCCGGCCTCGAATTCCCGAGCAACAAGCGGATCACGGTCAACCTGGCGCCGGCCGACCTGCCGAAGGACTCGGGC
CGCTTCGACCTGCCGATTGCCCTGGGCATCCTGGCGGCGGCCGGGCAGATCGAGGCGGCCCGGCTGGCGGGCCACGAATT
CGCGGGGGAGCTCTCGCTTTCAGGGCACCTGAGGCCCGTGCGTGGTGCGCTCGCGATGGCGCTGGCGCTGCATGGCCGCG
GTGTCGCGACCAAGCTGGTGCTGCCGGCAGAGAGTGCGAGGGAGGCCGCCCTGGTGCCGGGCGCCGAAATCTACGGTGCA
GCCCACCTGCTCGATGTGGTGCGGCAGTTCCTGCCGGGCGGCCCGGCACCCGGCGATGCGGCGGAAGATGGCTGGCATCG
TGCGCAGGCCGCCGCCGCCGGCCCGGCGGCGGCGGAGGCCGACCTGGCGGACGTCAAGGGCCACGCGGGCGCCAGGCGCG
CGCTCGAGATCGCGGCGGCCGGCCAGCACAGCCTGCTGATGGTGGGCCCGCCGGGGTCCGGCAAGTCGATGCTGGCCCAG
CGCTTCGCCGGCCTGCTGCCGTCGATGAGCGTGGACGAGGCGCTGGAAAGCGCCGCCGTCGCCAGCCTGCACGGCCGCTT
CGCCGTCGAGCGCTGGCGCCTGCGGCCGACCTGCAGCCCGCACCACAGCGCCAGTGCGGTAGCGCTGGTGGGCGGCGGCT
CGCCGCCGCGGCCGGGCGAAATCTCGCTGGCGCACAACGGCGTGCTGTTCCTGGACGAGTTCCCGGAGTTCCAGCGCGCC
GCGCTCGAAGCGCTGCGCGAGCCGCTGGAGACCGGCAGCATCACCATCGCGCGGGCCGCACGGCGTGCCGAGTTCCCGGC
CCGCTTCCAGTTGGTCGCGGCCATGAACCCCTGCCCTTGCGGGCACCTGGGCTCCTCGCTCAAGCCCTGCCGCTGCACGC
CGGACCAGGTGGCCCGCTACCAGGGCAAGCTCAGCGGGCCGCTGCTGGACCGCATCGACCTGCACATCGAGGTACCTGCG
GTGCCGGCCACCCAGTTGCTGGAGACACCCACCGGCGAAGCCAGCGCCGAGGTTCGCGCGCGCGTGGTCGAGGCGCGCGA
GCGCGCCCTGCGGCGCCAGGGCAAGGCCAACCAGGCGCTGCAGGGCGCGGAGATCGACCGCCACGCGCAGCCCGGTGCCG
CGGCCCTGCAGTTCCTGCAGGCCGCCGCGACGCGGCTGGGCTGGTCGGCGCGCGGGACGCATCGCACGCTCAAGCTGGCC
CGCACGATCGCGGACCTGGCCGGCGCCGGCACGGTGCAGGCGGCGCATGTGGCGGAGGCGGTGCAGTACCGGAGAGCGCT
GCAGAAGGTCGAGTGA
ATGAGCTTATCTTTGGTGCAGAGCCGTGCTTTGCTGGGCCTGGAAGCGGCAAGCGTCACGGTCGAGGTGCATCTGGCCAA
CGGGCTGCCCAGCTTCACGCTGGTGGGATTGGCCGACGTGGAGGTGAAGGAAGCCCGGGAGCGGGTGCGTTGCGCCATCC
AGAACGCCGGCCTCGAATTCCCGAGCAACAAGCGGATCACGGTCAACCTGGCGCCGGCCGACCTGCCGAAGGACTCGGGC
CGCTTCGACCTGCCGATTGCCCTGGGCATCCTGGCGGCGGCCGGGCAGATCGAGGCGGCCCGGCTGGCGGGCCACGAATT
CGCGGGGGAGCTCTCGCTTTCAGGGCACCTGAGGCCCGTGCGTGGTGCGCTCGCGATGGCGCTGGCGCTGCATGGCCGCG
GTGTCGCGACCAAGCTGGTGCTGCCGGCAGAGAGTGCGAGGGAGGCCGCCCTGGTGCCGGGCGCCGAAATCTACGGTGCA
GCCCACCTGCTCGATGTGGTGCGGCAGTTCCTGCCGGGCGGCCCGGCACCCGGCGATGCGGCGGAAGATGGCTGGCATCG
TGCGCAGGCCGCCGCCGCCGGCCCGGCGGCGGCGGAGGCCGACCTGGCGGACGTCAAGGGCCACGCGGGCGCCAGGCGCG
CGCTCGAGATCGCGGCGGCCGGCCAGCACAGCCTGCTGATGGTGGGCCCGCCGGGGTCCGGCAAGTCGATGCTGGCCCAG
CGCTTCGCCGGCCTGCTGCCGTCGATGAGCGTGGACGAGGCGCTGGAAAGCGCCGCCGTCGCCAGCCTGCACGGCCGCTT
CGCCGTCGAGCGCTGGCGCCTGCGGCCGACCTGCAGCCCGCACCACAGCGCCAGTGCGGTAGCGCTGGTGGGCGGCGGCT
CGCCGCCGCGGCCGGGCGAAATCTCGCTGGCGCACAACGGCGTGCTGTTCCTGGACGAGTTCCCGGAGTTCCAGCGCGCC
GCGCTCGAAGCGCTGCGCGAGCCGCTGGAGACCGGCAGCATCACCATCGCGCGGGCCGCACGGCGTGCCGAGTTCCCGGC
CCGCTTCCAGTTGGTCGCGGCCATGAACCCCTGCCCTTGCGGGCACCTGGGCTCCTCGCTCAAGCCCTGCCGCTGCACGC
CGGACCAGGTGGCCCGCTACCAGGGCAAGCTCAGCGGGCCGCTGCTGGACCGCATCGACCTGCACATCGAGGTACCTGCG
GTGCCGGCCACCCAGTTGCTGGAGACACCCACCGGCGAAGCCAGCGCCGAGGTTCGCGCGCGCGTGGTCGAGGCGCGCGA
GCGCGCCCTGCGGCGCCAGGGCAAGGCCAACCAGGCGCTGCAGGGCGCGGAGATCGACCGCCACGCGCAGCCCGGTGCCG
CGGCCCTGCAGTTCCTGCAGGCCGCCGCGACGCGGCTGGGCTGGTCGGCGCGCGGGACGCATCGCACGCTCAAGCTGGCC
CGCACGATCGCGGACCTGGCCGGCGCCGGCACGGTGCAGGCGGCGCATGTGGCGGAGGCGGTGCAGTACCGGAGAGCGCT
GCAGAAGGTCGAGTGA
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
51.456 |
100 |
0.519 |
| comM | Glaesserella parasuis strain SC1401 |
50.677 |
100 |
0.513 |
| comM | Vibrio cholerae strain A1552 |
51.272 |
100 |
0.513 |
| comM | Vibrio campbellii strain DS40M4 |
49.902 |
100 |
0.499 |
| comM | Legionella pneumophila str. Paris |
46.899 |
100 |
0.474 |
| comM | Legionella pneumophila strain ERS1305867 |
46.899 |
100 |
0.474 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
43.418 |
99.609 |
0.432 |