Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | K5706_RS04840 | Genome accession | NZ_AP019849 |
| Coordinates | 1044181..1046457 (+) | Length | 758 a.a. |
| NCBI ID | WP_221071264.1 | Uniprot ID | - |
| Organism | Vibrio alfacsensis strain 04Ya249 | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 1044181..1052853 | 1044181..1046457 | within | 0 |
Gene organization within MGE regions
Location: 1044181..1052853
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| K5706_RS04840 | comEC | 1044181..1046457 (+) | 2277 | WP_221071264.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| K5706_RS04845 (VA249_08770) | msbA | 1046489..1048237 (+) | 1749 | WP_128810527.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| K5706_RS04850 (VA249_08780) | lpxK | 1048243..1049250 (+) | 1008 | WP_221071266.1 | tetraacyldisaccharide 4'-kinase | - |
| K5706_RS04855 (VA249_08790) | - | 1049231..1049410 (+) | 180 | WP_128810529.1 | Trm112 family protein | - |
| K5706_RS04860 (VA249_08800) | kdsB | 1049410..1050168 (+) | 759 | WP_221071269.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| K5706_RS04865 (VA249_08820) | - | 1050430..1051190 (+) | 761 | WP_221069472.1 | IS5 family transposase | - |
| K5706_RS04870 (VA249_08830) | - | 1051297..1052853 (-) | 1557 | WP_221067840.1 | SpoVR family protein | - |
Sequence
Protein
Download Length: 758 a.a. Molecular weight: 85345.48 Da Isoelectric Point: 9.4609
>NTDB_id=74397 K5706_RS04840 WP_221071264.1 1044181..1046457(+) (comEC) [Vibrio alfacsensis strain 04Ya249]
MTLSDKSWTLVLFVVTVISSAWWPTMPDWRWTLLGMITIGSIIKWRQGVICIGAILGAMVVVVHGNIMEHHKQALFQAGE
DLTIIGTVDSPFTQISHGYEGIVSVSHADSKSLLPFAKPKIRLITPFPLSVNSHFTTQVSMKAITGLRNEAGFDIEQQSL
GNAVLARAVVSSQARWIIRTHTSVRQSIIQSVNREIRHLHHFPLISALAFSDRTRLTHQHWQQLRDSGLLHLISISGLHI
GMAYAFGASIGVCARYMASNLLYLPAVIGLIAAFSYAWLADFSLPTTRAFSVCAIYVMLKYFLVHWSAWRVLLLAVAIQL
SMSPFAFFSMSFWLSYLSVAAVLLAVNEMQQGSKGRKQPLLALLKVQFLLTLLIMPISGYFFVGTSLASIFYNLVFIPWF
GFVVVPVIFLALAVSSSLMTSSITYWFPGVLWQLLDWLLWPMTWALEFAQGSWQPLSVEQSSLVLLVCVLFILRRFFDRS
AWMLLSGVVVMLTLPFSKERQGWRLDVLDVGHGLAVLIEKDSHIMLYDTGKAWAGGSIAEQVIRPVLHRRGYNSIDTLVL
SHTDADHAGGRLYLESQFEPSRKLSSQKLLNYQPCVANNNWTWQGLMVEVVWPPKLVNRAYNPHSCVLRITDPQTHFRVL
LTGDIEAISEWLLTRQAEKINSDVMLVPHHGSKSSSNPKFIHAVNPTLAIASLAKNNQWGMPAESVKAAYQSASVEWLDT
GHDGQISVFINQDNWYFDTKRRESFEPWYRQMLRKGVE
MTLSDKSWTLVLFVVTVISSAWWPTMPDWRWTLLGMITIGSIIKWRQGVICIGAILGAMVVVVHGNIMEHHKQALFQAGE
DLTIIGTVDSPFTQISHGYEGIVSVSHADSKSLLPFAKPKIRLITPFPLSVNSHFTTQVSMKAITGLRNEAGFDIEQQSL
GNAVLARAVVSSQARWIIRTHTSVRQSIIQSVNREIRHLHHFPLISALAFSDRTRLTHQHWQQLRDSGLLHLISISGLHI
GMAYAFGASIGVCARYMASNLLYLPAVIGLIAAFSYAWLADFSLPTTRAFSVCAIYVMLKYFLVHWSAWRVLLLAVAIQL
SMSPFAFFSMSFWLSYLSVAAVLLAVNEMQQGSKGRKQPLLALLKVQFLLTLLIMPISGYFFVGTSLASIFYNLVFIPWF
GFVVVPVIFLALAVSSSLMTSSITYWFPGVLWQLLDWLLWPMTWALEFAQGSWQPLSVEQSSLVLLVCVLFILRRFFDRS
AWMLLSGVVVMLTLPFSKERQGWRLDVLDVGHGLAVLIEKDSHIMLYDTGKAWAGGSIAEQVIRPVLHRRGYNSIDTLVL
SHTDADHAGGRLYLESQFEPSRKLSSQKLLNYQPCVANNNWTWQGLMVEVVWPPKLVNRAYNPHSCVLRITDPQTHFRVL
LTGDIEAISEWLLTRQAEKINSDVMLVPHHGSKSSSNPKFIHAVNPTLAIASLAKNNQWGMPAESVKAAYQSASVEWLDT
GHDGQISVFINQDNWYFDTKRRESFEPWYRQMLRKGVE
Nucleotide
Download Length: 2277 bp
>NTDB_id=74397 K5706_RS04840 WP_221071264.1 1044181..1046457(+) (comEC) [Vibrio alfacsensis strain 04Ya249]
ATGACTCTCTCAGATAAAAGTTGGACCTTGGTGTTATTTGTAGTAACCGTTATATCGTCAGCATGGTGGCCGACAATGCC
AGATTGGCGATGGACGCTATTGGGAATGATTACTATCGGTTCAATCATAAAATGGCGTCAGGGCGTAATCTGCATAGGAG
CAATTTTGGGCGCTATGGTTGTCGTCGTCCATGGCAATATCATGGAGCATCATAAACAAGCCCTTTTTCAAGCAGGTGAG
GATCTTACCATAATTGGCACAGTTGACAGCCCTTTTACGCAAATAAGTCACGGATATGAAGGAATTGTCTCTGTTAGTCA
TGCTGATTCAAAGAGTTTATTACCTTTTGCTAAACCTAAGATCAGGTTGATAACGCCATTTCCGTTATCTGTTAACAGTC
ACTTTACGACACAAGTGTCGATGAAGGCCATTACGGGGTTAAGGAATGAAGCTGGATTCGATATAGAACAACAATCGTTG
GGAAATGCTGTTCTTGCTCGAGCGGTGGTTTCTTCACAAGCGCGATGGATTATTCGGACTCATACTTCTGTTCGGCAAAG
TATTATTCAATCAGTTAATCGCGAGATTAGGCATCTGCATCATTTTCCGCTGATAAGTGCCTTGGCATTCAGTGATCGTA
CTAGGTTGACTCACCAGCATTGGCAGCAGTTACGTGACAGTGGTTTGCTCCATTTGATCTCTATATCCGGTTTGCATATT
GGGATGGCGTACGCTTTTGGCGCATCCATCGGTGTATGTGCTCGTTATATGGCATCAAATTTGTTGTATTTACCTGCGGT
TATAGGTTTGATAGCGGCTTTCTCTTATGCATGGCTAGCTGATTTCTCGTTACCAACAACGCGTGCTTTTTCTGTGTGTG
CGATATACGTGATGCTCAAGTATTTCCTTGTTCATTGGAGTGCTTGGCGTGTTTTATTGCTCGCTGTCGCTATTCAATTG
AGTATGAGTCCATTTGCTTTTTTTAGTATGAGCTTTTGGCTTTCATACCTTTCTGTTGCCGCTGTTTTGCTCGCGGTGAA
TGAGATGCAGCAGGGCAGCAAGGGTAGAAAACAACCGCTTCTAGCGTTACTTAAAGTTCAGTTTTTGCTGACACTTTTGA
TCATGCCTATTAGCGGCTACTTTTTTGTAGGAACAAGTCTAGCCTCTATTTTCTACAATCTTGTTTTTATCCCTTGGTTT
GGTTTTGTGGTTGTTCCAGTGATATTCCTCGCTCTTGCTGTGTCATCATCACTTATGACCTCATCAATCACATATTGGTT
TCCCGGGGTGCTATGGCAATTATTGGATTGGCTGCTTTGGCCTATGACGTGGGCGTTGGAGTTTGCACAAGGCAGTTGGC
AACCGCTTAGCGTTGAGCAATCATCGTTAGTGTTACTTGTTTGTGTACTTTTTATATTGAGGCGCTTTTTTGACCGGTCT
GCGTGGATGCTATTAAGTGGTGTAGTGGTCATGCTTACGTTGCCATTTTCAAAAGAGAGGCAAGGGTGGCGATTAGACGT
ACTCGATGTTGGGCATGGGTTAGCAGTACTCATAGAAAAAGACAGCCACATTATGTTGTATGACACAGGTAAAGCGTGGG
CAGGTGGCAGTATTGCTGAACAGGTGATTCGGCCTGTATTGCATCGCCGAGGCTATAACTCAATCGACACTTTAGTTTTA
AGCCACACTGATGCCGACCATGCTGGTGGCCGGCTTTATCTTGAGTCCCAGTTTGAACCGAGCAGGAAATTGAGTAGCCA
AAAATTGCTAAATTATCAGCCTTGTGTAGCGAATAATAATTGGACTTGGCAGGGCTTAATGGTAGAAGTGGTGTGGCCAC
CTAAGCTTGTCAATCGCGCATACAACCCTCATTCGTGCGTTCTGCGTATCACGGATCCTCAAACCCATTTTCGAGTCCTA
CTCACTGGAGATATTGAAGCGATAAGTGAGTGGTTGTTAACTCGGCAGGCTGAGAAGATAAACAGTGATGTGATGCTCGT
CCCTCATCATGGCAGTAAAAGCTCATCTAATCCCAAATTTATCCATGCTGTGAATCCGACATTGGCGATTGCGTCACTAG
CAAAAAACAATCAGTGGGGTATGCCAGCAGAAAGCGTAAAAGCGGCATATCAGTCCGCAAGCGTTGAATGGCTGGATACA
GGACATGATGGACAAATCAGCGTGTTTATTAATCAAGATAATTGGTATTTTGATACTAAACGTCGAGAGTCATTTGAGCC
CTGGTATAGGCAGATGCTGCGTAAGGGAGTAGAATAA
ATGACTCTCTCAGATAAAAGTTGGACCTTGGTGTTATTTGTAGTAACCGTTATATCGTCAGCATGGTGGCCGACAATGCC
AGATTGGCGATGGACGCTATTGGGAATGATTACTATCGGTTCAATCATAAAATGGCGTCAGGGCGTAATCTGCATAGGAG
CAATTTTGGGCGCTATGGTTGTCGTCGTCCATGGCAATATCATGGAGCATCATAAACAAGCCCTTTTTCAAGCAGGTGAG
GATCTTACCATAATTGGCACAGTTGACAGCCCTTTTACGCAAATAAGTCACGGATATGAAGGAATTGTCTCTGTTAGTCA
TGCTGATTCAAAGAGTTTATTACCTTTTGCTAAACCTAAGATCAGGTTGATAACGCCATTTCCGTTATCTGTTAACAGTC
ACTTTACGACACAAGTGTCGATGAAGGCCATTACGGGGTTAAGGAATGAAGCTGGATTCGATATAGAACAACAATCGTTG
GGAAATGCTGTTCTTGCTCGAGCGGTGGTTTCTTCACAAGCGCGATGGATTATTCGGACTCATACTTCTGTTCGGCAAAG
TATTATTCAATCAGTTAATCGCGAGATTAGGCATCTGCATCATTTTCCGCTGATAAGTGCCTTGGCATTCAGTGATCGTA
CTAGGTTGACTCACCAGCATTGGCAGCAGTTACGTGACAGTGGTTTGCTCCATTTGATCTCTATATCCGGTTTGCATATT
GGGATGGCGTACGCTTTTGGCGCATCCATCGGTGTATGTGCTCGTTATATGGCATCAAATTTGTTGTATTTACCTGCGGT
TATAGGTTTGATAGCGGCTTTCTCTTATGCATGGCTAGCTGATTTCTCGTTACCAACAACGCGTGCTTTTTCTGTGTGTG
CGATATACGTGATGCTCAAGTATTTCCTTGTTCATTGGAGTGCTTGGCGTGTTTTATTGCTCGCTGTCGCTATTCAATTG
AGTATGAGTCCATTTGCTTTTTTTAGTATGAGCTTTTGGCTTTCATACCTTTCTGTTGCCGCTGTTTTGCTCGCGGTGAA
TGAGATGCAGCAGGGCAGCAAGGGTAGAAAACAACCGCTTCTAGCGTTACTTAAAGTTCAGTTTTTGCTGACACTTTTGA
TCATGCCTATTAGCGGCTACTTTTTTGTAGGAACAAGTCTAGCCTCTATTTTCTACAATCTTGTTTTTATCCCTTGGTTT
GGTTTTGTGGTTGTTCCAGTGATATTCCTCGCTCTTGCTGTGTCATCATCACTTATGACCTCATCAATCACATATTGGTT
TCCCGGGGTGCTATGGCAATTATTGGATTGGCTGCTTTGGCCTATGACGTGGGCGTTGGAGTTTGCACAAGGCAGTTGGC
AACCGCTTAGCGTTGAGCAATCATCGTTAGTGTTACTTGTTTGTGTACTTTTTATATTGAGGCGCTTTTTTGACCGGTCT
GCGTGGATGCTATTAAGTGGTGTAGTGGTCATGCTTACGTTGCCATTTTCAAAAGAGAGGCAAGGGTGGCGATTAGACGT
ACTCGATGTTGGGCATGGGTTAGCAGTACTCATAGAAAAAGACAGCCACATTATGTTGTATGACACAGGTAAAGCGTGGG
CAGGTGGCAGTATTGCTGAACAGGTGATTCGGCCTGTATTGCATCGCCGAGGCTATAACTCAATCGACACTTTAGTTTTA
AGCCACACTGATGCCGACCATGCTGGTGGCCGGCTTTATCTTGAGTCCCAGTTTGAACCGAGCAGGAAATTGAGTAGCCA
AAAATTGCTAAATTATCAGCCTTGTGTAGCGAATAATAATTGGACTTGGCAGGGCTTAATGGTAGAAGTGGTGTGGCCAC
CTAAGCTTGTCAATCGCGCATACAACCCTCATTCGTGCGTTCTGCGTATCACGGATCCTCAAACCCATTTTCGAGTCCTA
CTCACTGGAGATATTGAAGCGATAAGTGAGTGGTTGTTAACTCGGCAGGCTGAGAAGATAAACAGTGATGTGATGCTCGT
CCCTCATCATGGCAGTAAAAGCTCATCTAATCCCAAATTTATCCATGCTGTGAATCCGACATTGGCGATTGCGTCACTAG
CAAAAAACAATCAGTGGGGTATGCCAGCAGAAAGCGTAAAAGCGGCATATCAGTCCGCAAGCGTTGAATGGCTGGATACA
GGACATGATGGACAAATCAGCGTGTTTATTAATCAAGATAATTGGTATTTTGATACTAAACGTCGAGAGTCATTTGAGCC
CTGGTATAGGCAGATGCTGCGTAAGGGAGTAGAATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio campbellii strain DS40M4 |
63.456 |
100 |
0.635 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 |
63.325 |
100 |
0.633 |
| comEC | Vibrio cholerae strain A1552 |
41.953 |
100 |
0.42 |