Detailed information
Overview
| Field | Value |
|---|---|
| Name | comEC |
| Type | Machinery gene |
| Locus tag | EKH72_RS05030 |
| Genome accession | NZ_CP034565 |
| Coordinates | 997733..999991 (+) |
| Length | 752 a.a. |
| NCBI ID | WP_129829923.1 |
| UniProt ID | - |
| Organism | Vibrio parahaemolyticus strain D3112 |
| Function | ssDNA transport through the inner membrane (predicted from homology); DNA binding and uptake |
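The coordinates and protein length in the overview can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `span_length` and `protein_length` are hypothetical, and it assumes 1-based inclusive coordinates and a coding sequence that ends with a single stop codon (the nucleotide sequence listed for this gene ends in TAA).

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of cds_bp nucleotides.

    Assumes the CDS is a whole number of codons; the terminal stop
    codon (if present) does not contribute a residue.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop else codons


# Coordinates of comEC (EKH72_RS05030) as listed in the overview.
cds_bp = span_length(997733, 999991)
print(cds_bp, protein_length(cds_bp))  # 2259 752
```

Both values agree with the record: 2259 bp of coding sequence and a 752 a.a. product.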
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicts in this genome, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 995056..1019656 | 997733..999991 | within | 0 |
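The "Relative position" and "Distance (bp)" columns can be reproduced with a small interval comparison. This is a minimal sketch, not the database's own method: the function name `relative_position` is hypothetical, and the labels other than "within" ("upstream", "downstream", "overlapping") as well as the gap-based distance convention are assumptions, since the record shows only the "within" case.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples in 1-based inclusive
    coordinates. Returns (label, distance_bp). Only the "within"
    label appears in the source record; the others are assumed here.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        # Gene lies entirely before the MGE on the coordinate axis.
        return "upstream", m_start - g_end
    if g_start > m_end:
        # Gene lies entirely after the MGE on the coordinate axis.
        return "downstream", g_start - m_end
    # Partial overlap with one MGE boundary.
    return "overlapping", 0


gene = (997733, 999991)      # comEC coordinates
prophage = (995056, 1019656)  # predicted prophage region
print(relative_position(gene, prophage))  # ('within', 0)
```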
Gene organization within MGE regions
Location: 995056..1019656
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EKH72_RS05015 | lolD | 995056..995763 (+) | 708 | WP_020840312.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| EKH72_RS05020 | lolE | 995766..997010 (+) | 1245 | WP_129829922.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| EKH72_RS05025 | - | 997215..997724 (-) | 510 | WP_005456245.1 | DUF2062 domain-containing protein | - |
| EKH72_RS05030 | comEC | 997733..999991 (+) | 2259 | WP_129829923.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| EKH72_RS05035 | msbA | 1000023..1001771 (+) | 1749 | WP_025542099.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| EKH72_RS05040 | lpxK | 1001777..1002784 (+) | 1008 | WP_129829924.1 | tetraacyldisaccharide 4'-kinase | - |
| EKH72_RS05045 | - | 1002765..1002944 (+) | 180 | WP_023585387.1 | Trm112 family protein | - |
| EKH72_RS05050 | kdsB | 1002944..1003699 (+) | 756 | WP_021449339.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| EKH72_RS05055 | - | 1003777..1005336 (-) | 1560 | WP_005456192.1 | SpoVR family protein | - |
| EKH72_RS05060 | - | 1005348..1006619 (-) | 1272 | WP_005481863.1 | YeaH/YhbH family protein | - |
| EKH72_RS05065 | - | 1006667..1008601 (-) | 1935 | WP_005456210.1 | PrkA family serine protein kinase | - |
| EKH72_RS05075 | - | 1009089..1009589 (-) | 501 | WP_005456271.1 | YfbU family protein | - |
| EKH72_RS05080 | - | 1009744..1010412 (-) | 669 | WP_025520632.1 | energy-coupling factor ABC transporter permease | - |
| EKH72_RS05085 | pflA | 1010573..1011313 (-) | 741 | WP_005456250.1 | pyruvate formate lyase 1-activating protein | - |
| EKH72_RS05090 | - | 1011460..1012437 (-) | 978 | WP_025499395.1 | lipid A deacylase LpxR family protein | - |
| EKH72_RS05095 | pflB | 1012590..1014866 (-) | 2277 | WP_005456189.1 | formate C-acetyltransferase | - |
| EKH72_RS05100 | - | 1015173..1016720 (-) | 1548 | WP_005456280.1 | DUF3360 family protein | - |
| EKH72_RS05105 | - | 1017177..1018643 (+) | 1467 | WP_005456213.1 | hypothetical protein | - |
| EKH72_RS05115 | - | 1018886..1019656 (+) | 771 | WP_005456207.1 | ABC transporter ATP-binding protein | - |
Sequence
Protein
Length: 752 a.a.; Molecular weight: 84699.17 Da; Isoelectric point: 9.4163
>NTDB_id=331937 EKH72_RS05030 WP_129829923.1 997733..999991(+) (comEC) [Vibrio parahaemolyticus strain D3112]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVVAIKQVNTHTLLPFLKPKVRFITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSYWVIRTSSSWREAIIQTVERDISRLEHFALIKALVFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLALGGLIRLAMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYLALKYWLVHWSPWRVVLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVEDSKEGRLGKLRILLLTQLILSLLIVPISGYFFSGFSWSSLVYNLVFISWF
GFVVVPIMFAALIASLLFPMLATVLWCLLDVFLVPLSWSVRYAIGTWQPISTEWTFVIAVVSAVLVLRHVMPRYVWMFVC
VIVVITGLFPKQYNQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWHNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLHYQPCIAGEKWKWQGLNMEVLWPPKPVVRAYNPHSCVISLEDPSTGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKSSSNLKFINAVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE
Nucleotide
Length: 2259 bp
>NTDB_id=331937 EKH72_RS05030 WP_129829923.1 997733..999991(+) (comEC) [Vibrio parahaemolyticus strain D3112]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTCCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCGTAGCGATAAAACA
AGTGAATACTCACACTCTGTTACCTTTTCTTAAACCTAAAGTTCGCTTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAGCCCATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAGTGGTGTTGTTGCAAGAGCGGTCGTAACTAAAGATTCATACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAAGC
TATTATTCAAACTGTTGAGCGTGATATTTCTCGCCTTGAGCATTTTGCTCTAATCAAAGCACTGGTTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGACTACTGCATCTTGTATCAATTTCTGGTTTACACATT
GGGATGGCATTGACGTTTGGGCTCGCCCTTGGTGGGCTTATTCGGCTTGCTATGCCGCGATATTGGTTTCTGCCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATCTTGCTTTGAAGTATTGGCTTGTTCATTGGAGTCCATGGCGCGTAGTGTTACTGGCCGTGGCGTTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTATTTGCGGTTAA
CACAGTGGAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATACTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTTCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATTGCGTCATTGCTCTTTCCTATGTTGGCGACGGTTCTATGGTG
TTTGCTTGACGTGTTCCTTGTACCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTACCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGCCGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGATTACAGGGTTGTTTCCTAAGCAATATAACCAAACTTGGCGTATTGATGTGCTTGATGTCGGGCATGG
GTTGGCGGTGCTGGTCGAAAAAGAGGGGAGAGTTTTACTCTATGATACGGGCAAGGCTTGGCACAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGCTACTCAAGTGTCGATACGATGATTTTAAGTCATGCTGATAATGAC
CATGCTGGTGGCCGAAAAGTGATCGAACAGTACTTTTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTACATTATCA
GCCTTGCATCGCTGGTGAAAAATGGAAGTGGCAAGGGTTGAACATGGAGGTACTTTGGCCTCCTAAACCTGTAGTACGTG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTACAGGTTTTAAAATGTTGTTTACTGGTGATATCGAA
GCCATCAGCGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGCTAGTGCCGCATCATGGAAGCAA
AAGCTCTTCTAACCTTAAGTTTATCAATGCTGTTGAGCCTAGCTTGGCTATTGCTTCAACGGCAAAACTAAACCAGTGGG
GAATGCCCGCACCTGAAGTCGTTCAAGCCTATACCGACAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio parahaemolyticus RIMD 2210633 | 97.872 | 100 | 0.979 |
| comEC | Vibrio campbellii strain DS40M4 | 66.223 | 100 | 0.662 |
| comEC | Vibrio cholerae strain A1552 | 40.903 | 100 | 0.41 |