Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | SY84_RS13895 | Genome accession | NZ_CP011389 |
| Coordinates | 2862595..2864733 (-) | Length | 712 a.a. |
| NCBI ID | WP_081424610.1 | Uniprot ID | - |
| Organism | Deinococcus soli (ex Cha et al. 2016) strain N5 |
| Function | ssDNA transport through the inner membrane (predicted from homology); DNA binding and uptake |
Genomic Context
Location: 2857595..2869733
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SY84_RS13870 (SY84_13875) | - | 2857896..2858945 (-) | 1050 | WP_046844495.1 | thiamine ABC transporter substrate-binding protein | - |
| SY84_RS13875 (SY84_13880) | - | 2859183..2859629 (+) | 447 | WP_046844496.1 | NUDIX domain-containing protein | - |
| SY84_RS13880 (SY84_13885) | - | 2859626..2860087 (+) | 462 | WP_046844497.1 | NUDIX domain-containing protein | - |
| SY84_RS13885 (SY84_13890) | - | 2860403..2860888 (+) | 486 | WP_046844498.1 | MarR family winged helix-turn-helix transcriptional regulator | - |
| SY84_RS13890 (SY84_13895) | - | 2860885..2862507 (+) | 1623 | WP_046844499.1 | MDR family MFS transporter | - |
| SY84_RS13895 (SY84_13900) | comEC | 2862595..2864733 (-) | 2139 | WP_081424610.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| SY84_RS13900 (SY84_13905) | comEA | 2864781..2865158 (-) | 378 | WP_046845262.1 | ComEA family DNA-binding protein | Machinery gene |
| SY84_RS13905 (SY84_13910) | rpmH | 2865486..2865629 (+) | 144 | WP_046844501.1 | 50S ribosomal protein L34 | - |
| SY84_RS13910 (SY84_13915) | rnpA | 2865700..2866200 (+) | 501 | WP_046844502.1 | ribonuclease P protein component | - |
| SY84_RS13915 (SY84_13920) | yidD | 2866218..2866451 (+) | 234 | WP_046845263.1 | membrane protein insertion efficiency factor YidD | - |
| SY84_RS13920 (SY84_13925) | yidC | 2866448..2868055 (+) | 1608 | WP_046844503.1 | YidC/Oxa1 family membrane protein insertase | - |
| SY84_RS13925 (SY84_13930) | - | 2868117..2868443 (-) | 327 | WP_046844504.1 | hypothetical protein | - |
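The coordinate columns above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which agrees with the Size column) and that the Location window is the comEC span extended by 5000 bp on each side (an inference from the numbers on this page, not a documented rule). The function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def gap_between(upstream_end: int, downstream_start: int) -> int:
    """Bases between two features; a negative value means they overlap."""
    return downstream_start - upstream_end - 1


# comEC (SY84_RS13895), coordinates taken from the table above
comec_start, comec_end = 2862595, 2864733

cds_bp = span_length(comec_start, comec_end)  # nucleotides in the CDS
codons = cds_bp // 3                          # includes the stop codon
protein_aa = codons - 1                       # residues in the product

# Assumed flank size: 5000 bp either side of the gene
FLANK = 5000
window = (comec_start - FLANK, comec_end + FLANK)

# Spacing between neighbouring genes
comec_comea_gap = gap_between(comec_end, 2864781)  # comEC end -> comEA start
nudix_gap = gap_between(2859629, 2859626)          # SY84_RS13875 -> SY84_RS13880

print(cds_bp, protein_aa, window, comec_comea_gap, nudix_gap)
```

Under these assumptions the values agree with the 2139 bp and 712 a.a. lengths and the Location line reported on this page; the negative gap indicates that the two NUDIX genes overlap by 4 bp.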
Sequence
Protein
Length: 712 a.a. Molecular weight: 74682.94 Da Isoelectric point: 10.8967
>NTDB_id=145416 SY84_RS13895 WP_081424610.1 2862595..2864733(-) (comEC) [Deinococcus soli (ex Cha et al. 2016) strain N5]
MVGVIGGILLALGQAWGVAALLAGAGLVLWHARPVLLALVALGGAAGWASAHAVLTRPNLLTPWFGAQVTVQGDWDGQFL
TLRDPPARVTLAPKPTQGPGRLRVSGRLLAPEGRRTPGGFDQAAWFRAQGGLLSVMPGGALVGARVREFQPQGGLRGWFR
RGLTTGLTERQGALMQAIELGDRGDISREQFSEGEAVRDAFARAGLAHLMALSGQNVAILTGALILLLTRLGAAPAWRFG
LPVLLLGPYLLLVQSSASITRAVLMGGAVLLALALGRGRPDPTGVIALAALACLLLFPLWLTDVGFQLSFLAVLALTQSD
RVAGRLPERWPRWLRLGLAATLLAEAGTLPVIAGTFGQVPLVGLPANLLAGAIMALLVPLGFLAGLLGPLALPLNVVNGV
LADALLLVARTFGQAPVLSWGQVGVGGLLAYGAAVLAGWLWLLGRVRAPAALGTWLACLLLTTLPGRLHPAREVVFLDVG
QGDSTLIRLPHLTVLVDAGGSVGSDYDVGAHTVVPTLRALGVRKLDVLVATHADTDHIEGAASVLRQLPVGEVWIGQRKT
DDPVLTAVLLAAQERRVPVREVRRGDQVTSDGASLTVLWPAGTAWSTEDNDNSVALRLESRGWRAAFLGDLADLTEERLG
AGPLDLLKAAHHGSRHSTGAALLGQTSPADTVISVGRNTYGHPHQDVLTRLAQVHSRAWRTDQLGTIRWPVP
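For scripted use, the FASTA header of this record can be split into its fields. The following is a minimal sketch that assumes every header follows the layout seen here (`NTDB_id`, locus tag, protein ID, coordinates with strand, gene name in parentheses, organism in square brackets); the regular expression and the `parse_header` name are illustrative and not part of any published parser.

```python
import re

# Pattern inferred from the single header shown on this page (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) "
    r"\[(?P<organism>.+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header layout: {line!r}")
    record = match.groupdict()
    # Coordinates are easier to work with as integers.
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (
    ">NTDB_id=145416 SY84_RS13895 WP_081424610.1 2862595..2864733(-) "
    "(comEC) [Deinococcus soli (ex Cha et al. 2016) strain N5]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```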
Nucleotide
Length: 2139 bp
>NTDB_id=145416 SY84_RS13895 WP_081424610.1 2862595..2864733(-) (comEC) [Deinococcus soli (ex Cha et al. 2016) strain N5]
GTGGTGGGCGTGATCGGCGGCATCCTGCTGGCGCTGGGGCAGGCGTGGGGCGTGGCGGCGCTGCTGGCCGGCGCGGGGCT
GGTGCTGTGGCACGCCCGGCCGGTCCTACTGGCGCTGGTCGCGCTGGGCGGCGCGGCGGGCTGGGCGTCCGCGCACGCGG
TGCTGACCCGCCCGAACCTCCTGACGCCGTGGTTCGGGGCGCAGGTGACGGTGCAGGGCGACTGGGACGGGCAGTTCCTG
ACCCTGCGGGACCCGCCCGCCCGCGTGACGCTGGCGCCGAAGCCCACGCAGGGGCCGGGGCGGCTGCGGGTGTCGGGCCG
CCTGCTCGCCCCGGAGGGCCGCCGCACGCCGGGCGGGTTCGATCAGGCCGCGTGGTTCCGCGCGCAGGGGGGCCTGCTGA
GCGTCATGCCGGGCGGCGCGCTCGTGGGCGCGCGGGTGCGGGAGTTCCAGCCGCAGGGCGGTCTGCGCGGCTGGTTCCGC
CGGGGCCTGACCACGGGCCTGACCGAGCGGCAGGGCGCGCTGATGCAGGCCATCGAACTGGGCGACCGTGGGGACATCAG
CCGCGAGCAGTTCAGCGAGGGTGAGGCCGTGCGGGACGCGTTCGCGCGGGCGGGGCTGGCGCACCTGATGGCGCTGTCCG
GGCAGAACGTGGCGATCCTGACCGGTGCGCTGATCCTGCTGCTGACGCGGCTGGGCGCGGCCCCCGCGTGGCGGTTCGGG
CTGCCGGTGCTGCTGCTCGGGCCGTACCTGCTGCTCGTGCAGTCCTCGGCGAGCATCACGCGGGCTGTACTGATGGGCGG
CGCGGTGCTGCTGGCCCTGGCGCTGGGGCGCGGGCGGCCCGACCCGACCGGCGTGATCGCGCTGGCGGCGCTGGCGTGCC
TGCTGCTGTTCCCGCTGTGGCTGACAGACGTAGGCTTCCAGCTGTCGTTCCTGGCGGTGCTGGCCCTGACGCAGTCCGAC
CGGGTGGCGGGACGGCTGCCGGAACGCTGGCCGCGCTGGCTGCGGCTGGGTCTCGCGGCGACCCTGCTGGCCGAGGCCGG
GACGCTGCCAGTCATCGCGGGAACGTTCGGGCAGGTGCCGCTCGTGGGCCTGCCCGCGAACCTGCTGGCCGGGGCGATCA
TGGCCCTACTGGTCCCGCTGGGGTTCCTGGCGGGACTGCTGGGGCCGCTGGCCCTGCCGCTGAACGTGGTCAATGGCGTG
CTGGCCGACGCGCTGCTGCTGGTCGCCCGGACCTTCGGGCAGGCCCCGGTCCTGAGCTGGGGGCAGGTGGGCGTGGGAGG
ACTGCTGGCGTACGGCGCGGCCGTGCTGGCCGGGTGGCTGTGGCTGCTGGGCCGCGTCCGCGCGCCCGCCGCGCTGGGCA
CGTGGCTGGCCTGCCTGCTGCTGACGACCCTGCCGGGACGACTGCACCCCGCGCGGGAGGTGGTCTTTCTGGATGTCGGG
CAGGGGGACAGCACCCTGATCCGCCTGCCCCACCTGACGGTCCTCGTGGACGCCGGGGGTTCGGTGGGCAGTGACTACGA
CGTGGGCGCGCACACGGTCGTGCCCACCCTGCGGGCCCTGGGCGTGCGGAAACTGGACGTGCTCGTCGCCACGCACGCGG
ACACCGACCATATCGAGGGCGCCGCGAGCGTGCTGCGGCAGCTGCCGGTCGGGGAGGTCTGGATCGGGCAGCGCAAGACG
GACGATCCGGTCCTGACCGCCGTGCTGCTGGCCGCGCAGGAGAGGCGCGTGCCGGTGCGCGAGGTTCGCCGGGGCGATCA
GGTCACCTCCGACGGCGCCTCCCTGACCGTCCTGTGGCCCGCCGGGACCGCGTGGTCCACCGAGGACAACGACAACAGCG
TCGCCCTGCGCCTGGAGTCGCGCGGGTGGCGGGCCGCGTTCCTGGGCGACCTCGCGGACCTCACCGAGGAGCGGCTGGGC
GCCGGACCGCTGGACCTCCTGAAGGCCGCGCACCACGGCAGCCGCCACAGCACCGGCGCGGCCCTCCTGGGGCAGACCAG
CCCCGCCGACACCGTGATCAGCGTGGGACGGAACACCTACGGGCACCCTCACCAGGACGTGCTGACCCGCCTGGCGCAGG
TCCACTCGCGGGCGTGGCGCACCGATCAGCTGGGCACGATCCGCTGGCCTGTGCCGTGA
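The nucleotide record starts with GTG rather than ATG, while the protein starts with M. A short sketch of how the two sequences relate is given below. It assumes the bacterial genetic code (NCBI translation table 11, whose amino acid assignments match the standard code) and that an alternative start codon at the first position is read as methionine. The `translate` function and the small set of start codons are illustrative simplifications; only the first and last few codons of the record are used so that the example stays short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplified subset of bacterial start codons (an assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str, is_cds_start: bool = True) -> str:
    """Translate a coding-strand sequence up to the first stop codon."""
    cds = cds.upper()
    residues = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


head = translate("GTGGTGGGCGTGATCGGCGGCATCCTGCTG")      # first 10 codons
tail = translate("CCTGTGCCGTGA", is_cds_start=False)    # last 4 codons
print(head, tail)
```

The sequence shown in the record is the coding strand, so no reverse complement is applied here even though the gene lies on the minus strand of the genome.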
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539 | 67.417 | 93.539 | 0.631 |