Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | EHF33_RS08740 | Genome accession | NZ_CP034183 |
| Coordinates | 1775057..1777246 (-) | Length | 729 a.a. |
| NCBI ID | WP_241191106.1 | Uniprot ID | - |
| Organism | Deinococcus psychrotolerans strain S14-83 | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1770057..1782246
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EHF33_RS08715 (EHF33_08715) | - | 1770281..1771234 (-) | 954 | WP_124870156.1 | NAD-dependent epimerase/dehydratase family protein | - |
| EHF33_RS08720 (EHF33_08720) | - | 1771251..1771790 (+) | 540 | WP_124870159.1 | 2'-5' RNA ligase family protein | - |
| EHF33_RS08725 (EHF33_08725) | - | 1771863..1772390 (+) | 528 | WP_124870161.1 | multidrug DMT transporter | - |
| EHF33_RS08730 (EHF33_08730) | - | 1772572..1774224 (+) | 1653 | WP_124870164.1 | GMC family oxidoreductase | - |
| EHF33_RS08735 (EHF33_08735) | - | 1774228..1774965 (-) | 738 | WP_124870167.1 | GNAT family N-acetyltransferase | - |
| EHF33_RS08740 (EHF33_08740) | comEC | 1775057..1777246 (-) | 2190 | WP_241191106.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| EHF33_RS08745 (EHF33_08745) | comEA | 1777293..1777676 (-) | 384 | WP_124870173.1 | ComEA family DNA-binding protein | Machinery gene |
| EHF33_RS08750 (EHF33_08750) | - | 1777931..1779604 (+) | 1674 | WP_124870176.1 | long-chain fatty acid--CoA ligase | - |
| EHF33_RS08755 (EHF33_08755) | - | 1779638..1780075 (+) | 438 | WP_124870180.1 | GlcG/HbpS family heme-binding protein | - |
| EHF33_RS08760 (EHF33_08760) | - | 1780104..1780637 (-) | 534 | WP_124870181.1 | DUF4385 domain-containing protein | - |
| EHF33_RS08765 (EHF33_08765) | - | 1780690..1781364 (-) | 675 | WP_241191107.1 | Dps family protein | - |
Sequence
Protein
Download Length: 729 a.a. Molecular weight: 77786.20 Da Isoelectric Point: 9.9213
>NTDB_id=329721 EHF33_RS08740 WP_241191106.1 1775057..1777246(-) (comEC) [Deinococcus psychrotolerans strain S14-83]
MQADATYSPRRWPVPAYPVPAALAVIGGVLLGFGALWGAAVLVVALALSWPAGKGWLLGACLVLAVCGFARERAWQAAPD
ALLSWVGAQATLIGDWDGQLLHLSSPAAAVALAPKPKGPAGRLTVRGRLVRPEGRRIPGGFDYAFWLRMQGVREVLVAAE
IKHLKPEGGVRGWFRRGLSAGLSPRESALLRAVELGERNDISQESFNDGLNVRDAFTRAGLAHLMALSGQNVALLVGALT
WLLARLLPLKWTNARYPLMLTVLAGFLWLVGPTPSITRAVLMGVLVLLSLWLGRGKLDVYGVLGLAAIASLLYQPAWLFD
VGFQLSFLAVLGLSVSNRVAALLPAKWPLWLRLALVATPCAELATLPVVLHTFGQLPLLSLPANLAAAALMAVLVPLGYL
AGLLGPLAVAVNWALGPLSEALLKVVDIFGRAPVLTWGMISPAGFAVYAVFAAAAVLTLYRLLPLWVLPLTALLGGVSTA
LPSLLNPPSEIVYLDVGQGDSSLIRTPKLTMLIDGGGTPRGDYDVGKGTVLPALRAMNVRSLDVMVATHADADHIEGLIS
VLLGLPVGELWIGQRKDDPNLNMLLRAAEARHVPVREVRRGDSVQAGKATFTVLWPTGKVWSDADNENSVVIKFDTPKFH
TVFLGDLPDPLESELGVGKLDVLKTAHHGSRFSSGQAFLDETRPHDAVISVGRSNSYGHPNPQVLDRLQASGVKVWRTDQ
VGTVRWPLP
MQADATYSPRRWPVPAYPVPAALAVIGGVLLGFGALWGAAVLVVALALSWPAGKGWLLGACLVLAVCGFARERAWQAAPD
ALLSWVGAQATLIGDWDGQLLHLSSPAAAVALAPKPKGPAGRLTVRGRLVRPEGRRIPGGFDYAFWLRMQGVREVLVAAE
IKHLKPEGGVRGWFRRGLSAGLSPRESALLRAVELGERNDISQESFNDGLNVRDAFTRAGLAHLMALSGQNVALLVGALT
WLLARLLPLKWTNARYPLMLTVLAGFLWLVGPTPSITRAVLMGVLVLLSLWLGRGKLDVYGVLGLAAIASLLYQPAWLFD
VGFQLSFLAVLGLSVSNRVAALLPAKWPLWLRLALVATPCAELATLPVVLHTFGQLPLLSLPANLAAAALMAVLVPLGYL
AGLLGPLAVAVNWALGPLSEALLKVVDIFGRAPVLTWGMISPAGFAVYAVFAAAAVLTLYRLLPLWVLPLTALLGGVSTA
LPSLLNPPSEIVYLDVGQGDSSLIRTPKLTMLIDGGGTPRGDYDVGKGTVLPALRAMNVRSLDVMVATHADADHIEGLIS
VLLGLPVGELWIGQRKDDPNLNMLLRAAEARHVPVREVRRGDSVQAGKATFTVLWPTGKVWSDADNENSVVIKFDTPKFH
TVFLGDLPDPLESELGVGKLDVLKTAHHGSRFSSGQAFLDETRPHDAVISVGRSNSYGHPNPQVLDRLQASGVKVWRTDQ
VGTVRWPLP
Nucleotide
Download Length: 2190 bp
>NTDB_id=329721 EHF33_RS08740 WP_241191106.1 1775057..1777246(-) (comEC) [Deinococcus psychrotolerans strain S14-83]
GTGCAGGCGGACGCGACCTACTCGCCGCGCCGCTGGCCCGTTCCGGCTTATCCGGTTCCAGCGGCGCTGGCAGTCATTGG
CGGCGTGCTGCTGGGCTTCGGAGCGCTCTGGGGCGCGGCGGTACTTGTTGTGGCCCTGGCGCTGAGCTGGCCTGCCGGCA
AGGGTTGGCTGCTGGGGGCCTGTCTGGTGTTGGCGGTGTGCGGCTTCGCGCGCGAGCGGGCCTGGCAAGCCGCTCCCGAC
GCGCTTTTAAGCTGGGTGGGTGCTCAAGCCACCCTGATTGGCGACTGGGACGGCCAACTGCTGCACCTCAGTAGCCCTGC
GGCGGCAGTGGCGCTGGCACCCAAGCCCAAGGGGCCAGCCGGACGCCTGACCGTCCGTGGGCGGCTGGTTCGCCCAGAGG
GTCGGCGCATTCCTGGCGGCTTCGATTACGCGTTTTGGCTGCGGATGCAGGGTGTGCGCGAAGTCTTGGTGGCCGCCGAG
ATCAAGCATTTGAAGCCTGAAGGCGGGGTGCGCGGTTGGTTTCGGCGGGGCCTGAGCGCCGGTCTCTCACCGCGTGAGTC
GGCCCTGCTACGGGCTGTGGAATTGGGCGAGCGCAACGATATCTCGCAGGAAAGCTTCAATGACGGCCTGAATGTCCGCG
ACGCTTTCACGCGGGCGGGTTTGGCACACCTGATGGCGCTCAGCGGGCAAAACGTGGCGCTGCTGGTGGGAGCGCTGACT
TGGCTCCTGGCCCGTTTATTGCCGCTCAAATGGACAAACGCCCGCTACCCGCTGATGCTCACGGTGCTGGCGGGTTTTTT
ATGGTTGGTCGGCCCCACGCCCAGCATCACCCGCGCCGTGCTGATGGGCGTCTTGGTGCTGCTCTCGTTGTGGCTGGGGC
GCGGTAAGCTCGACGTCTACGGTGTGTTGGGCTTGGCGGCCATCGCCAGCTTGCTGTATCAACCGGCTTGGTTGTTTGAC
GTGGGCTTTCAACTTAGTTTTCTGGCGGTGCTGGGCCTGAGTGTGTCAAACAGGGTCGCTGCTCTGCTGCCCGCAAAGTG
GCCGCTGTGGCTGCGCCTCGCGCTGGTCGCCACCCCCTGCGCCGAACTCGCCACTTTGCCGGTGGTGCTGCACACCTTCG
GCCAGTTGCCGCTGCTGAGCCTCCCCGCCAACCTCGCCGCCGCCGCGCTGATGGCCGTGCTGGTGCCGCTCGGCTACCTC
GCCGGACTGCTGGGGCCTCTGGCGGTGGCGGTCAATTGGGCGCTCGGCCCGCTGTCGGAAGCGCTGCTGAAAGTGGTGGA
TATTTTTGGCCGCGCTCCGGTGCTGACCTGGGGAATGATCAGTCCGGCAGGGTTCGCGGTTTACGCGGTATTTGCCGCTG
CCGCTGTGCTGACCCTTTACCGCCTCCTGCCGCTGTGGGTCTTGCCGCTCACCGCCTTGCTGGGCGGCGTGTCCACTGCT
TTGCCGAGCCTGCTCAACCCACCCAGCGAAATCGTCTATCTGGACGTCGGACAGGGCGACAGTTCGCTGATTCGCACTCC
CAAGCTCACCATGCTGATTGATGGCGGCGGCACTCCCAGAGGCGATTACGACGTCGGCAAAGGCACGGTGCTGCCTGCCT
TGCGGGCCATGAACGTGCGCTCCCTCGACGTGATGGTCGCCACCCACGCCGACGCCGATCACATCGAAGGCCTGATCAGC
GTGCTGCTGGGCTTGCCAGTAGGCGAGTTGTGGATCGGTCAGCGCAAAGACGACCCCAATCTCAATATGCTGCTCAGAGC
CGCCGAGGCCCGCCACGTGCCGGTGCGCGAAGTACGGCGCGGTGACAGCGTACAAGCCGGAAAAGCCACCTTCACGGTGC
TGTGGCCCACTGGCAAAGTCTGGTCGGATGCCGACAACGAAAATAGCGTGGTCATCAAGTTCGATACGCCCAAGTTTCAC
ACCGTCTTTCTGGGCGATCTTCCCGACCCGCTGGAAAGTGAGCTGGGCGTCGGCAAGTTGGATGTTCTCAAAACGGCCCA
CCACGGCAGCCGCTTTTCGAGCGGTCAAGCGTTTTTGGACGAAACCCGCCCGCATGACGCCGTCATCAGTGTCGGGCGAT
CTAACAGTTACGGTCACCCCAACCCGCAAGTGCTCGACCGCCTTCAAGCGTCCGGTGTAAAAGTCTGGCGCACCGATCAG
GTGGGAACCGTGCGCTGGCCCTTGCCTTGA
GTGCAGGCGGACGCGACCTACTCGCCGCGCCGCTGGCCCGTTCCGGCTTATCCGGTTCCAGCGGCGCTGGCAGTCATTGG
CGGCGTGCTGCTGGGCTTCGGAGCGCTCTGGGGCGCGGCGGTACTTGTTGTGGCCCTGGCGCTGAGCTGGCCTGCCGGCA
AGGGTTGGCTGCTGGGGGCCTGTCTGGTGTTGGCGGTGTGCGGCTTCGCGCGCGAGCGGGCCTGGCAAGCCGCTCCCGAC
GCGCTTTTAAGCTGGGTGGGTGCTCAAGCCACCCTGATTGGCGACTGGGACGGCCAACTGCTGCACCTCAGTAGCCCTGC
GGCGGCAGTGGCGCTGGCACCCAAGCCCAAGGGGCCAGCCGGACGCCTGACCGTCCGTGGGCGGCTGGTTCGCCCAGAGG
GTCGGCGCATTCCTGGCGGCTTCGATTACGCGTTTTGGCTGCGGATGCAGGGTGTGCGCGAAGTCTTGGTGGCCGCCGAG
ATCAAGCATTTGAAGCCTGAAGGCGGGGTGCGCGGTTGGTTTCGGCGGGGCCTGAGCGCCGGTCTCTCACCGCGTGAGTC
GGCCCTGCTACGGGCTGTGGAATTGGGCGAGCGCAACGATATCTCGCAGGAAAGCTTCAATGACGGCCTGAATGTCCGCG
ACGCTTTCACGCGGGCGGGTTTGGCACACCTGATGGCGCTCAGCGGGCAAAACGTGGCGCTGCTGGTGGGAGCGCTGACT
TGGCTCCTGGCCCGTTTATTGCCGCTCAAATGGACAAACGCCCGCTACCCGCTGATGCTCACGGTGCTGGCGGGTTTTTT
ATGGTTGGTCGGCCCCACGCCCAGCATCACCCGCGCCGTGCTGATGGGCGTCTTGGTGCTGCTCTCGTTGTGGCTGGGGC
GCGGTAAGCTCGACGTCTACGGTGTGTTGGGCTTGGCGGCCATCGCCAGCTTGCTGTATCAACCGGCTTGGTTGTTTGAC
GTGGGCTTTCAACTTAGTTTTCTGGCGGTGCTGGGCCTGAGTGTGTCAAACAGGGTCGCTGCTCTGCTGCCCGCAAAGTG
GCCGCTGTGGCTGCGCCTCGCGCTGGTCGCCACCCCCTGCGCCGAACTCGCCACTTTGCCGGTGGTGCTGCACACCTTCG
GCCAGTTGCCGCTGCTGAGCCTCCCCGCCAACCTCGCCGCCGCCGCGCTGATGGCCGTGCTGGTGCCGCTCGGCTACCTC
GCCGGACTGCTGGGGCCTCTGGCGGTGGCGGTCAATTGGGCGCTCGGCCCGCTGTCGGAAGCGCTGCTGAAAGTGGTGGA
TATTTTTGGCCGCGCTCCGGTGCTGACCTGGGGAATGATCAGTCCGGCAGGGTTCGCGGTTTACGCGGTATTTGCCGCTG
CCGCTGTGCTGACCCTTTACCGCCTCCTGCCGCTGTGGGTCTTGCCGCTCACCGCCTTGCTGGGCGGCGTGTCCACTGCT
TTGCCGAGCCTGCTCAACCCACCCAGCGAAATCGTCTATCTGGACGTCGGACAGGGCGACAGTTCGCTGATTCGCACTCC
CAAGCTCACCATGCTGATTGATGGCGGCGGCACTCCCAGAGGCGATTACGACGTCGGCAAAGGCACGGTGCTGCCTGCCT
TGCGGGCCATGAACGTGCGCTCCCTCGACGTGATGGTCGCCACCCACGCCGACGCCGATCACATCGAAGGCCTGATCAGC
GTGCTGCTGGGCTTGCCAGTAGGCGAGTTGTGGATCGGTCAGCGCAAAGACGACCCCAATCTCAATATGCTGCTCAGAGC
CGCCGAGGCCCGCCACGTGCCGGTGCGCGAAGTACGGCGCGGTGACAGCGTACAAGCCGGAAAAGCCACCTTCACGGTGC
TGTGGCCCACTGGCAAAGTCTGGTCGGATGCCGACAACGAAAATAGCGTGGTCATCAAGTTCGATACGCCCAAGTTTCAC
ACCGTCTTTCTGGGCGATCTTCCCGACCCGCTGGAAAGTGAGCTGGGCGTCGGCAAGTTGGATGTTCTCAAAACGGCCCA
CCACGGCAGCCGCTTTTCGAGCGGTCAAGCGTTTTTGGACGAAACCCGCCCGCATGACGCCGTCATCAGTGTCGGGCGAT
CTAACAGTTACGGTCACCCCAACCCGCAAGTGCTCGACCGCCTTCAAGCGTCCGGTGTAAAAGTCTGGCGCACCGATCAG
GTGGGAACCGTGCGCTGGCCCTTGCCTTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539 |
59.398 |
91.221 |
0.542 |