Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | HELO_RS00525 | Genome accession | NC_014532 |
| Coordinates | 116155..117660 (+) | Length | 501 a.a. |
| NCBI ID | WP_013330860.1 | Uniprot ID | - |
| Organism | Halomonas elongata DSM 2581 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 109032..128090 | 116155..117660 | within | 0 |
Gene organization within MGE regions
Location: 109032..128090
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HELO_RS00490 (HELO_1095) | - | 109032..109556 (+) | 525 | WP_013330853.1 | TRAP transporter small permease | - |
| HELO_RS00495 (HELO_1096) | - | 109553..110851 (+) | 1299 | WP_013330854.1 | TRAP transporter large permease | - |
| HELO_RS00500 (HELO_1097) | - | 110875..113244 (+) | 2370 | WP_013330855.1 | TIM-barrel domain-containing protein | - |
| HELO_RS00505 (HELO_1098) | glnK | 113309..113647 (-) | 339 | WP_013330856.1 | P-II family nitrogen regulator | - |
| HELO_RS00510 (HELO_1099) | - | 113803..115044 (-) | 1242 | WP_013330857.1 | ammonium transporter | - |
| HELO_RS00515 (HELO_1100) | - | 115090..115428 (-) | 339 | WP_013330858.1 | P-II family nitrogen regulator | - |
| HELO_RS00520 (HELO_1101) | - | 115721..116062 (+) | 342 | WP_013330859.1 | accessory factor UbiK family protein | - |
| HELO_RS00525 (HELO_1102) | comM | 116155..117660 (+) | 1506 | WP_013330860.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| HELO_RS00530 (HELO_1103) | cas1f | 117923..118900 (+) | 978 | WP_041601826.1 | type I-F CRISPR-associated endonuclease Cas1f | - |
| HELO_RS00535 (HELO_1104) | cas3f | 118897..122238 (+) | 3342 | WP_013330862.1 | type I-F CRISPR-associated helicase Cas3f | - |
| HELO_RS00540 (HELO_1105) | csy1 | 122631..124007 (+) | 1377 | WP_013330863.1 | type I-F CRISPR-associated protein Csy1 | - |
| HELO_RS00545 (HELO_1106) | csy2 | 124000..124959 (+) | 960 | WP_013330864.1 | type I-F CRISPR-associated protein Csy2 | - |
| HELO_RS00550 (HELO_1107) | csy3 | 124977..126008 (+) | 1032 | WP_013330865.1 | type I-F CRISPR-associated protein Csy3 | - |
| HELO_RS00555 (HELO_1107A) | cas6f | 126012..126581 (+) | 570 | WP_109637282.1 | type I-F CRISPR-associated endoribonuclease Cas6/Csy4 | - |
Sequence
Protein
Download Length: 501 a.a. Molecular weight: 53806.82 Da Isoelectric Point: 6.9349
>NTDB_id=38403 HELO_RS00525 WP_013330860.1 116155..117660(+) (comM) [Halomonas elongata DSM 2581]
MTLAIIRTRAGLGLEAPEVLVEVHLTNGLPGITLVGLPETAVKESRERVRSALVNAGFEFPLRRITLNLAPADLPKDGGR
FDLPIALGLLVASGQIPPEALAEVECVGELALDGGLRPASGVLPLAMATRQAGRRLIVPRANADEAALAGDLEVLPAEHL
LEVVAHLLGQETIAAHRLQAPPRRDTSEPDLREVRGQHQARRALEVAAAGGHNLLFAGPPGTGKTMLASRLPGILPPLGE
DEALEVAAVRSVSGLPLAEQWGRRPFRAPHHTASAVALVGGGSRPKPGEISLAHHGVLFLDELPEFSRQVLEVMREPMES
GQIHIARANHERRYPARFQLVAAMNPCPCGHLGDPRQACHCTAAQIQRYQARLSGPLLDRIDLQVEVPALPAEQLTSRES
GEDSATVRERVLAARERQWSRGALNAYLAGPDLEAACALGADDRAWLAEVLERLQLSARAFHRVLRVALTLADLAGAPRP
TREHLIEAIGYRQLDRLLKGG
MTLAIIRTRAGLGLEAPEVLVEVHLTNGLPGITLVGLPETAVKESRERVRSALVNAGFEFPLRRITLNLAPADLPKDGGR
FDLPIALGLLVASGQIPPEALAEVECVGELALDGGLRPASGVLPLAMATRQAGRRLIVPRANADEAALAGDLEVLPAEHL
LEVVAHLLGQETIAAHRLQAPPRRDTSEPDLREVRGQHQARRALEVAAAGGHNLLFAGPPGTGKTMLASRLPGILPPLGE
DEALEVAAVRSVSGLPLAEQWGRRPFRAPHHTASAVALVGGGSRPKPGEISLAHHGVLFLDELPEFSRQVLEVMREPMES
GQIHIARANHERRYPARFQLVAAMNPCPCGHLGDPRQACHCTAAQIQRYQARLSGPLLDRIDLQVEVPALPAEQLTSRES
GEDSATVRERVLAARERQWSRGALNAYLAGPDLEAACALGADDRAWLAEVLERLQLSARAFHRVLRVALTLADLAGAPRP
TREHLIEAIGYRQLDRLLKGG
Nucleotide
Download Length: 1506 bp
>NTDB_id=38403 HELO_RS00525 WP_013330860.1 116155..117660(+) (comM) [Halomonas elongata DSM 2581]
ATGACGCTGGCGATCATTCGCACCCGGGCGGGCCTCGGCCTGGAGGCGCCCGAGGTGCTTGTCGAGGTACACCTGACCAA
CGGCCTGCCTGGCATCACGCTGGTCGGGCTGCCCGAAACCGCCGTCAAGGAAAGCCGGGAGAGGGTGCGCAGCGCCCTGG
TCAATGCCGGTTTCGAATTTCCGCTGCGGCGTATCACCCTGAATCTGGCGCCCGCCGATCTTCCCAAGGACGGCGGGCGC
TTTGATCTCCCCATCGCACTGGGCCTGCTGGTCGCTTCCGGACAGATTCCGCCCGAGGCCCTGGCCGAGGTGGAGTGTGT
GGGCGAACTGGCGTTGGACGGCGGCCTGCGCCCGGCGAGCGGGGTGCTACCGCTGGCCATGGCCACGCGGCAAGCGGGGC
GGCGCTTGATCGTGCCCCGAGCCAACGCCGACGAAGCGGCCCTGGCCGGTGATCTCGAGGTTCTGCCCGCCGAGCATCTG
CTGGAGGTGGTGGCCCATCTTCTCGGGCAGGAAACCATTGCCGCCCATCGGCTACAGGCGCCGCCACGTCGCGATACCTC
GGAGCCGGATTTACGCGAGGTGAGAGGGCAGCACCAGGCGCGTCGTGCCCTGGAAGTCGCGGCGGCGGGAGGCCACAACC
TGTTGTTCGCCGGCCCGCCCGGCACCGGCAAGACCATGCTGGCCAGTCGTCTGCCCGGCATCCTGCCGCCGCTCGGCGAG
GACGAGGCCCTGGAGGTCGCGGCGGTACGTTCGGTCAGTGGATTGCCGCTGGCCGAGCAGTGGGGACGTCGCCCCTTTCG
AGCCCCACATCACACTGCGAGTGCCGTAGCTCTGGTCGGCGGCGGCTCGCGTCCCAAGCCGGGGGAGATCTCCCTGGCGC
ACCACGGCGTACTGTTTCTCGACGAACTGCCGGAGTTCTCGCGGCAGGTTCTGGAAGTGATGCGCGAGCCCATGGAATCC
GGACAGATCCACATTGCCCGCGCCAACCACGAGCGTCGTTATCCGGCGCGTTTCCAACTGGTGGCGGCCATGAATCCCTG
CCCCTGCGGTCATCTTGGCGACCCGCGCCAGGCCTGTCACTGCACGGCCGCCCAGATTCAGCGCTATCAGGCGCGACTGT
CAGGCCCCTTGCTGGATCGCATCGACCTGCAGGTGGAAGTGCCAGCCCTGCCAGCAGAGCAATTGACCTCGCGGGAGTCG
GGAGAGGATTCGGCGACGGTACGCGAACGGGTTTTGGCGGCGCGTGAGCGCCAATGGTCGAGAGGAGCGCTCAACGCCTA
CCTGGCAGGCCCCGATCTGGAAGCTGCTTGCGCGCTGGGTGCCGATGACCGTGCCTGGCTCGCCGAAGTGCTGGAGCGAC
TGCAGCTTTCGGCACGAGCCTTCCATCGCGTGCTCCGGGTGGCCCTGACCCTCGCCGACCTGGCCGGTGCCCCCAGGCCG
ACCCGCGAACATCTGATCGAGGCCATCGGTTATCGCCAGCTCGACCGCTTGCTCAAGGGCGGCTGA
ATGACGCTGGCGATCATTCGCACCCGGGCGGGCCTCGGCCTGGAGGCGCCCGAGGTGCTTGTCGAGGTACACCTGACCAA
CGGCCTGCCTGGCATCACGCTGGTCGGGCTGCCCGAAACCGCCGTCAAGGAAAGCCGGGAGAGGGTGCGCAGCGCCCTGG
TCAATGCCGGTTTCGAATTTCCGCTGCGGCGTATCACCCTGAATCTGGCGCCCGCCGATCTTCCCAAGGACGGCGGGCGC
TTTGATCTCCCCATCGCACTGGGCCTGCTGGTCGCTTCCGGACAGATTCCGCCCGAGGCCCTGGCCGAGGTGGAGTGTGT
GGGCGAACTGGCGTTGGACGGCGGCCTGCGCCCGGCGAGCGGGGTGCTACCGCTGGCCATGGCCACGCGGCAAGCGGGGC
GGCGCTTGATCGTGCCCCGAGCCAACGCCGACGAAGCGGCCCTGGCCGGTGATCTCGAGGTTCTGCCCGCCGAGCATCTG
CTGGAGGTGGTGGCCCATCTTCTCGGGCAGGAAACCATTGCCGCCCATCGGCTACAGGCGCCGCCACGTCGCGATACCTC
GGAGCCGGATTTACGCGAGGTGAGAGGGCAGCACCAGGCGCGTCGTGCCCTGGAAGTCGCGGCGGCGGGAGGCCACAACC
TGTTGTTCGCCGGCCCGCCCGGCACCGGCAAGACCATGCTGGCCAGTCGTCTGCCCGGCATCCTGCCGCCGCTCGGCGAG
GACGAGGCCCTGGAGGTCGCGGCGGTACGTTCGGTCAGTGGATTGCCGCTGGCCGAGCAGTGGGGACGTCGCCCCTTTCG
AGCCCCACATCACACTGCGAGTGCCGTAGCTCTGGTCGGCGGCGGCTCGCGTCCCAAGCCGGGGGAGATCTCCCTGGCGC
ACCACGGCGTACTGTTTCTCGACGAACTGCCGGAGTTCTCGCGGCAGGTTCTGGAAGTGATGCGCGAGCCCATGGAATCC
GGACAGATCCACATTGCCCGCGCCAACCACGAGCGTCGTTATCCGGCGCGTTTCCAACTGGTGGCGGCCATGAATCCCTG
CCCCTGCGGTCATCTTGGCGACCCGCGCCAGGCCTGTCACTGCACGGCCGCCCAGATTCAGCGCTATCAGGCGCGACTGT
CAGGCCCCTTGCTGGATCGCATCGACCTGCAGGTGGAAGTGCCAGCCCTGCCAGCAGAGCAATTGACCTCGCGGGAGTCG
GGAGAGGATTCGGCGACGGTACGCGAACGGGTTTTGGCGGCGCGTGAGCGCCAATGGTCGAGAGGAGCGCTCAACGCCTA
CCTGGCAGGCCCCGATCTGGAAGCTGCTTGCGCGCTGGGTGCCGATGACCGTGCCTGGCTCGCCGAAGTGCTGGAGCGAC
TGCAGCTTTCGGCACGAGCCTTCCATCGCGTGCTCCGGGTGGCCCTGACCCTCGCCGACCTGGCCGGTGCCCCCAGGCCG
ACCCGCGAACATCTGATCGAGGCCATCGGTTATCGCCAGCTCGACCGCTTGCTCAAGGGCGGCTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio campbellii strain DS40M4 |
55.556 |
100 |
0.559 |
| comM | Vibrio cholerae strain A1552 |
54.582 |
100 |
0.547 |
| comM | Haemophilus influenzae Rd KW20 |
53.557 |
100 |
0.541 |
| comM | Glaesserella parasuis strain SC1401 |
53.346 |
100 |
0.541 |
| comM | Legionella pneumophila str. Paris |
51.098 |
100 |
0.511 |
| comM | Legionella pneumophila strain ERS1305867 |
51.098 |
100 |
0.511 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
47.686 |
99.202 |
0.473 |