Detailed information
Overview
| Field | Value | Field | Value |
|---|---|---|---|
| Name | comEC | Type | Machinery gene |
| Locus tag | VAA049_RS05020 | Genome accession | NZ_CP010811 |
| Coordinates | 1065370..1067613 (+) | Length | 747 a.a. |
| NCBI ID | WP_162484598.1 | Uniprot ID | - |
| Organism | Vibrio cholerae strain 1154-74 | | |
| Function | ssDNA transport through the inner membrane (predicted from homology); DNA binding and uptake | | |
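The coordinates and the protein length listed in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the source record: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon in the coding sequence.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates of comEC (VAA049_RS05020) as listed in the record
cds_bp = span_length(1065370, 1067613)
print(cds_bp, protein_length(cds_bp))  # 2244 747
```

Under these assumptions the 2244 bp span corresponds to 748 codons, that is, 747 residues plus the stop codon, which agrees with the stated length of 747 a.a.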
Genomic Context
Location: 1060370..1072613
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| VAA049_RS04995 (VAA049_988) | - | 1060833..1061408 (-) | 576 | WP_000999601.1 | PilZ domain-containing protein | - |
| VAA049_RS05000 (VAA049_989) | lolC | 1061584..1062792 (+) | 1209 | WP_000468896.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| VAA049_RS05005 (VAA049_990) | lolD | 1062785..1063471 (+) | 687 | WP_001061289.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| VAA049_RS05010 (VAA049_991) | lolE | 1063472..1064716 (+) | 1245 | WP_000493013.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| VAA049_RS05015 (VAA049_992) | - | 1064831..1065361 (-) | 531 | WP_001881633.1 | DUF2062 domain-containing protein | - |
| VAA049_RS05020 (VAA049_993) | comEC | 1065370..1067613 (+) | 2244 | WP_162484598.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| VAA049_RS05025 (VAA049_994) | msbA | 1067644..1069392 (+) | 1749 | WP_000052155.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| VAA049_RS05030 (VAA049_995) | lpxK | 1069395..1070402 (+) | 1008 | WP_046122381.1 | tetraacyldisaccharide 4'-kinase | - |
| VAA049_RS05035 (VAA049_996) | - | 1070383..1070562 (+) | 180 | WP_000350068.1 | Trm112 family protein | - |
| VAA049_RS05040 (VAA049_997) | kdsB | 1070562..1071320 (+) | 759 | WP_046121491.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
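The Location range appears to be the comEC coordinates extended by 5000 bp on each side, and each Size (bp) value follows from its coordinates. The sketch below checks both in Python using values copied from the table; the 5000 bp flank is inferred from the numbers rather than stated by the source, and the variable names are illustrative.

```python
# (locus tag, start, end, size in bp) copied from the Genomic Context table
genes = [
    ("VAA049_RS04995", 1060833, 1061408, 576),
    ("VAA049_RS05000", 1061584, 1062792, 1209),
    ("VAA049_RS05005", 1062785, 1063471, 687),
    ("VAA049_RS05010", 1063472, 1064716, 1245),
    ("VAA049_RS05015", 1064831, 1065361, 531),
    ("VAA049_RS05020", 1065370, 1067613, 2244),
    ("VAA049_RS05025", 1067644, 1069392, 1749),
    ("VAA049_RS05030", 1069395, 1070402, 1008),
    ("VAA049_RS05035", 1070383, 1070562, 180),
    ("VAA049_RS05040", 1070562, 1071320, 759),
]

FLANK = 5000  # assumption: inferred from the Location range, not documented
target_start, target_end = 1065370, 1067613  # comEC coordinates

# Window shown as "Location" if the flank assumption holds
window = (target_start - FLANK, target_end + FLANK)
print(window)  # (1060370, 1072613)

# Locus tags whose listed size disagrees with end - start + 1
mismatches = [tag for tag, start, end, size in genes if end - start + 1 != size]
print(mismatches)  # []
```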
Sequence
Protein
Length: 747 a.a.; Molecular weight: 84373.43 Da; Isoelectric point: 7.5362
>NTDB_id=139942 VAA049_RS05020 WP_162484598.1 1065370..1067613(+) (comEC) [Vibrio cholerae strain 1154-74]
MTLLSNYWSLISFSITVLSAPYWPWMPSWGWAWLCLIVMVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGP
DIIIKGRVDSFFTQTRYAYEGFVLIHEVNGQTLNKMTRPRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYM
AQSVVARAVVKPDTAYQIVQESGIRSSLFFELEQLTHTSPYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHI
GIAFSVGYFLGLGMMRLHAQLLWYPFVCGALLAVLYAWLAGFTLPTQRALIMCLLNVALIMLAFPLSALKRILLTLGAVL
LWSPFASLSNSFWMSFLAVAIVLYQLASQSQSQVWWKALLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVI
VPALFLGLLLMVVWPSVAAAYWPWVDWTFLPLDWALQFADVGWWVVPSKVQGVVAASVAILLLYRFMSLKACSLLLGMIG
LWWWFPSLTPLWRMDVLDVGHGLAIVIEQDERAIVYDTGSSWPGGSYVQSVIEPILQQRGLRQVDGVILSHLDNDHAGDW
QGLAERWQPNWIRASQLGTEFMPCIRGESWQWQSLHFTVLWPPQAVSRAYNQHSCVIRMTDTQSNHSVLLSGDVTAMGEW
LLARDGAQLQSEVMSVPHHGSKTSSTAEFIAQVNPKLAIASVAKDNRWNLPNPQVVARYQAQQVEWLDTGQAGQISLFFY
PDQLDWFTQRSLGWQPWYRQMLRKGVE
Nucleotide
Length: 2244 bp
>NTDB_id=139942 VAA049_RS05020 WP_162484598.1 1065370..1067613(+) (comEC) [Vibrio cholerae strain 1154-74]
ATGACTCTCTTGTCGAATTACTGGTCTCTCATTTCTTTTTCGATCACCGTGCTGTCTGCCCCTTATTGGCCTTGGATGCC
GAGTTGGGGTTGGGCTTGGTTATGCCTTATTGTTATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCTTGGCT
TCGTGGCTGCCATACTAACCATTGTGCTACAGGGCAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAGGGCCG
GATATTATCATAAAAGGCCGTGTTGACAGCTTTTTTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATTCATGA
AGTGAATGGACAAACCTTAAACAAAATGACTCGCCCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAATGACC
GCGTCGAATTTTCGGTAACTCTCAAGCCGATAGTGGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATTACATG
GCGCAATCTGTCGTCGCACGGGCGGTCGTAAAACCTGACACTGCTTATCAAATTGTGCAAGAGAGTGGCATAAGGTCAAG
TTTGTTTTTTGAGCTAGAACAATTAACGCATACCAGCCCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGAGCGAA
AAGGTATTGATGAGCAAGAGTGGCAAGCCTTACGCAATAGTGGCTTAATTCATTTAGTGGCGATTTCGGGGCTGCACATT
GGTATCGCTTTTAGCGTGGGTTATTTTCTCGGGCTCGGAATGATGCGTCTTCATGCTCAGTTATTGTGGTACCCTTTTGT
GTGTGGGGCTTTACTGGCGGTGCTCTACGCTTGGCTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATCATGTGCT
TACTCAATGTGGCGTTGATCATGTTGGCTTTTCCTCTTTCCGCGCTCAAGCGGATTCTACTCACCTTGGGCGCGGTCTTG
CTTTGGTCGCCATTCGCCTCACTTTCAAACAGTTTCTGGATGTCGTTTTTGGCGGTCGCGATTGTTCTCTACCAATTAGC
CAGTCAAAGCCAGAGTCAGGTGTGGTGGAAAGCTCTGCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCACCGGTCA
CGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGCAGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGGTGATT
GTCCCAGCTTTGTTTTTGGGCCTACTACTCATGGTGGTATGGCCTAGTGTGGCCGCCGCTTACTGGCCTTGGGTGGATTG
GACGTTTTTACCGCTCGATTGGGCTTTGCAGTTTGCCGATGTAGGCTGGTGGGTGGTACCCAGCAAAGTACAAGGTGTAG
TCGCAGCGAGTGTGGCCATCCTCTTGCTTTATCGATTTATGAGCCTAAAAGCCTGCAGCTTATTATTGGGTATGATTGGC
TTATGGTGGTGGTTTCCCTCTCTCACTCCACTTTGGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATTGTGAT
TGAGCAAGATGAGCGAGCAATTGTCTACGATACAGGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGATTGAGC
CTATTCTCCAACAGCGAGGGCTACGCCAAGTCGATGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTGATTGG
CAAGGTTTAGCTGAGCGCTGGCAACCTAATTGGATTCGTGCCAGCCAACTTGGGACAGAGTTTATGCCTTGTATCCGTGG
TGAAAGCTGGCAGTGGCAATCTCTCCATTTTACGGTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCAGCATT
CGTGTGTGATTCGTATGACTGATACTCAGTCTAACCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGGAGTGG
CTGCTTGCTCGAGACGGAGCGCAACTGCAAAGTGAGGTGATGAGCGTGCCGCACCACGGCAGTAAAACGTCGTCCACCGC
AGAGTTTATTGCCCAAGTGAATCCCAAACTTGCGATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAATCCGC
AAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGAGTGGCTAGATACTGGACAAGCTGGGCAAATTAGCCTCTTTTTCTAT
CCAGATCAGCTGGATTGGTTTACCCAGCGTAGCCTTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGAGTAGA
ATGA
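As a spot check that the nucleotide record encodes the listed protein, the first and last codons can be translated with the standard genetic code. This is a minimal Python sketch: `translate` is a hypothetical helper, the record does not state which translation table was used, and the two fragments are copied from the start and end of the sequence above (the tail spans the final line break).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...)
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


head = "ATGACTCTCTTGTCGAATTACTGGTCTCTC"  # first 30 nt of the record
tail = "CGTAAAGGAGTAGAATGA"              # last 18 nt of the record

print(translate(head))  # MTLLSNYWSL
print(translate(tail))  # RKGVE*
```

Both fragments agree with the protein record, which begins MTLLSNYWSL and ends RKGVE; the final TGA is the stop codon.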
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio cholerae strain A1552 | 98.929 | 100 | 0.989 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 | 41.215 | 100 | 0.418 |
| comEC | Vibrio campbellii strain DS40M4 | 40.957 | 100 | 0.412 |