Detailed information
Overview
| Name | comE1/comEA | Type | Machinery gene |
| Locus tag | O1Q79_RS00995 | Genome accession | NZ_CP170386 |
| Coordinates | 193945..194298 (+) | Length | 117 a.a. |
| NCBI ID | WP_386692529.1 | Uniprot ID | - |
| Organism | Lonepinella sp. MS14434 | | |
| Function | dsDNA binding (predicted from homology); DNA binding and uptake | | |
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in this genome, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 184373..196442 | 193945..194298 | within | 0 |
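The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate pairs. The following is a minimal sketch, assuming 1-based inclusive coordinates; the function name `relative_position` and the labels other than `within` are hypothetical and do not come from VRprofile2 or this database.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples, assumed 1-based and inclusive.
    Returns a (label, distance_in_bp) tuple.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end      # hypothetical label
    if g_start > m_end:
        return "downstream", g_start - m_end    # hypothetical label
    return "overlapping", 0                     # hypothetical label


# comE1/comEA gene versus the predicted prophage region from the table above.
print(relative_position((193945, 194298), (184373, 196442)))
```

For this entry the gene lies entirely inside the prophage interval, which matches the `within` / `0` row in the summary table.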
Gene organization within MGE regions
Location: 184373..196442
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| O1Q79_RS00930 (O1Q79_00186) | - | 184373..185569 (+) | 1197 | WP_386692516.1 | tyrosine-type recombinase/integrase | - |
| O1Q79_RS00935 (O1Q79_00187) | - | 185995..186273 (+) | 279 | WP_386692517.1 | helix-turn-helix transcriptional regulator | - |
| O1Q79_RS00940 (O1Q79_00188) | - | 186270..186986 (+) | 717 | WP_386692518.1 | hypothetical protein | - |
| O1Q79_RS00945 (O1Q79_00189) | - | 187000..187686 (+) | 687 | WP_386692519.1 | P2 family phage major capsid protein | - |
| O1Q79_RS00950 (O1Q79_00190) | - | 187683..187931 (+) | 249 | WP_386692520.1 | P2 family phage major capsid protein | - |
| O1Q79_RS00955 (O1Q79_00191) | - | 188083..188946 (+) | 864 | WP_386692521.1 | hypothetical protein | - |
| O1Q79_RS00960 (O1Q79_00192) | - | 189091..189300 (+) | 210 | WP_386692522.1 | helix-turn-helix transcriptional regulator | - |
| O1Q79_RS00965 (O1Q79_00193) | - | 189300..189545 (+) | 246 | WP_386692523.1 | hypothetical protein | - |
| O1Q79_RS00970 (O1Q79_00194) | - | 189626..190222 (+) | 597 | WP_386692524.1 | ash family protein | - |
| O1Q79_RS00975 (O1Q79_00195) | - | 190215..190436 (+) | 222 | WP_386692525.1 | hypothetical protein | - |
| O1Q79_RS00980 (O1Q79_00196) | - | 190426..190770 (+) | 345 | WP_386692526.1 | hypothetical protein | - |
| O1Q79_RS00985 (O1Q79_00197) | - | 190770..190979 (+) | 210 | WP_386692527.1 | hypothetical protein | - |
| O1Q79_RS00990 (O1Q79_00198) | - | 191259..193163 (+) | 1905 | WP_386692528.1 | DUF5906 domain-containing protein | - |
| O1Q79_RS00995 (O1Q79_00199) | comE1/comEA | 193945..194298 (+) | 354 | WP_386692529.1 | ComEA family DNA-binding protein | Machinery gene |
| O1Q79_RS01000 (O1Q79_00200) | - | 194393..194665 (+) | 273 | WP_386692530.1 | type II toxin-antitoxin system RelB/DinJ family antitoxin | - |
| O1Q79_RS01005 (O1Q79_00201) | purN | 194710..195348 (-) | 639 | WP_386692531.1 | phosphoribosylglycinamide formyltransferase | - |
| O1Q79_RS01010 (O1Q79_00202) | purM | 195405..196442 (-) | 1038 | WP_386692532.1 | phosphoribosylformylglycinamidine cyclo-ligase | - |
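The "Size (bp)" column follows from the coordinates as `end - start + 1`, assuming inclusive 1-based positions. A minimal sketch that checks a few rows from the table; the helper name `span_length` is hypothetical.

```python
import re


def span_length(coords):
    """Return the length in bp of a 'start..end (strand)' coordinate string."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*(?:\(([+-])\))?\s*", coords)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {coords!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1  # inclusive on both ends


# (coordinates, size listed in the table) for selected loci
rows = {
    "O1Q79_RS00930": ("184373..185569 (+)", 1197),
    "O1Q79_RS00995": ("193945..194298 (+)", 354),
    "O1Q79_RS01005": ("194710..195348 (-)", 639),
    "O1Q79_RS01010": ("195405..196442 (-)", 1038),
}

for locus, (coords, listed) in rows.items():
    print(locus, span_length(coords), listed)

# A 354 bp coding sequence has 118 codons; dropping the stop codon leaves 117 residues.
protein_length = span_length("193945..194298 (+)") // 3 - 1
```

The same arithmetic ties the gene size (354 bp) to the protein length reported in the overview (117 a.a.).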
Sequence
Protein
Length: 117 a.a.; Molecular weight: 12174.88 Da; Isoelectric point: 7.0098
>NTDB_id=1056510 O1Q79_RS00995 WP_386692529.1 193945..194298(+) (comE1/comEA) [Lonepinella sp. MS14434]
MKSIKALFGTVVLAGSLLGSANVFAETAQPATTAVQQVTAQTVATETQATVVGDKLNINTASAAEIQKALTGIGAKKAEA
IVQYRETHGAFTSLDQLLDVQGIGQATLDKNKDRIIF
Nucleotide
Length: 354 bp
>NTDB_id=1056510 O1Q79_RS00995 WP_386692529.1 193945..194298(+) (comE1/comEA) [Lonepinella sp. MS14434]
ATGAAATCCATTAAAGCATTATTCGGCACAGTGGTACTAGCTGGCTCATTATTAGGCAGTGCAAACGTATTCGCAGAAAC
CGCACAACCAGCCACAACAGCCGTTCAGCAAGTTACTGCACAAACGGTTGCAACTGAAACACAAGCTACTGTAGTTGGTG
ATAAATTAAATATTAACACCGCTTCCGCCGCAGAAATTCAAAAAGCCTTAACGGGGATCGGAGCTAAAAAAGCTGAAGCC
ATTGTGCAATATCGTGAAACGCATGGTGCTTTCACTTCGTTAGATCAGTTATTAGACGTACAAGGTATTGGTCAAGCTAC
TTTAGATAAAAATAAAGATCGGATTATTTTCTAG
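The nucleotide and protein records can be cross-checked by translating the coding sequence. This is a minimal sketch using the standard codon table built in plain Python (the names `CODON_TABLE` and `translate` are illustrative); it assumes the sequence is in frame from the first base and ends with a stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

CDS = (
    "ATGAAATCCATTAAAGCATTATTCGGCACAGTGGTACTAGCTGGCTCATTATTAGGCAGTGCAAACGTATTCGCAGAAAC"
    "CGCACAACCAGCCACAACAGCCGTTCAGCAAGTTACTGCACAAACGGTTGCAACTGAAACACAAGCTACTGTAGTTGGTG"
    "ATAAATTAAATATTAACACCGCTTCCGCCGCAGAAATTCAAAAAGCCTTAACGGGGATCGGAGCTAAAAAAGCTGAAGCC"
    "ATTGTGCAATATCGTGAAACGCATGGTGCTTTCACTTCGTTAGATCAGTTATTAGACGTACAAGGTATTGGTCAAGCTAC"
    "TTTAGATAAAAATAAAGATCGGATTATTTTCTAG"
)

PROTEIN = (
    "MKSIKALFGTVVLAGSLLGSANVFAETAQPATTAVQQVTAQTVATETQATVVGDKLNINTASAAEIQKALTGIGAKKAEA"
    "IVQYRETHGAFTSLDQLLDVQGIGQATLDKNKDRIIF"
)


def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


translated = translate(CDS)
print(len(CDS), len(PROTEIN), translated == PROTEIN + "*")
```

The comparison appends `*` to the protein because the nucleotide record includes the terminal stop codon while the protein record does not.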
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comE1/comEA | Haemophilus influenzae Rd KW20 | 62.281 | 97.436 | 0.607 |
| comEA/comE1 | Glaesserella parasuis strain SC1401 | 52.525 | 84.615 | 0.444 |
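The Ha-values listed for both hits are numerically consistent with the product of fractional identity and fractional coverage. That relationship is inferred from the two rows here and is an assumption, not a definition stated on this page. A minimal sketch (the function name `ha_value` is hypothetical):

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed relationship: fractional identity multiplied by fractional coverage."""
    return (identity_pct / 100) * (coverage_pct / 100)


# (protein, identities %, coverage %, Ha-value listed in the table)
hits = [
    ("comE1/comEA (Haemophilus influenzae Rd KW20)", 62.281, 97.436, 0.607),
    ("comEA/comE1 (Glaesserella parasuis strain SC1401)", 52.525, 84.615, 0.444),
]

for name, identity, coverage, listed in hits:
    print(name, round(ha_value(identity, coverage), 3), listed)
```

Under this assumption, a hit scores highly only when it is both similar and aligned over most of the sequence, which is why the second hit, with lower coverage, scores markedly lower.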