Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | VFH86_RS04945 | Genome accession | NZ_CP142014 |
| Coordinates | 1057201..1059444 (+) | Length | 747 a.a. |
| NCBI ID | WP_345659016.1 | Uniprot ID | - |
| Organism | Vibrio cholerae strain LK-18 | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1052201..1064444
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| VFH86_RS04920 (VFH86_04920) | - | 1052664..1053239 (-) | 576 | WP_111673968.1 | PilZ domain-containing protein | - |
| VFH86_RS04925 (VFH86_04925) | lolC | 1053415..1054623 (+) | 1209 | WP_000468891.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| VFH86_RS04930 (VFH86_04930) | lolD | 1054616..1055302 (+) | 687 | WP_001061290.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| VFH86_RS04935 (VFH86_04935) | lolE | 1055303..1056547 (+) | 1245 | WP_345658741.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| VFH86_RS04940 (VFH86_04940) | - | 1056662..1057192 (-) | 531 | WP_001881633.1 | DUF2062 domain-containing protein | - |
| VFH86_RS04945 (VFH86_04945) | comEC | 1057201..1059444 (+) | 2244 | WP_345659016.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| VFH86_RS04950 (VFH86_04950) | msbA | 1059475..1061223 (+) | 1749 | WP_336915668.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| VFH86_RS04955 (VFH86_04955) | lpxK | 1061226..1062233 (+) | 1008 | WP_187371900.1 | tetraacyldisaccharide 4'-kinase | - |
| VFH86_RS04960 (VFH86_04960) | - | 1062214..1062393 (+) | 180 | WP_000350068.1 | Trm112 family protein | - |
| VFH86_RS04965 (VFH86_04965) | kdsB | 1062393..1063151 (+) | 759 | WP_000011327.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
Sequence
Protein
Download Length: 747 a.a. Molecular weight: 84443.61 Da Isoelectric Point: 7.7612
>NTDB_id=920934 VFH86_RS04945 WP_345659016.1 1057201..1059444(+) (comEC) [Vibrio cholerae strain LK-18]
MTLLSNYWSLISFSITVLSAPYWPWMPSWGWAWLCLIVMVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGP
DIIIKGRVDSFFTQTRYAYEGFVLIHEVNGQTLNKMTRPRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYM
AQSVVARAVVKPDTAYQIVQESGIRSSLFFELEQLTHTSPYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHI
GIAFSVGYFLGLGMMRLHAQLLWSPFVCGALLAVLYAWLAGFTLPTQRALIMCLLNVALIMLAFPLSALKRILLTLVAVL
LWSPFASLSNSFWMSFLAVAIVLYQLASQSQRQVWWKALLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVI
VPALFLGLLLMVVWPSVAAAYWPWVDWTFLPLDWALQFADVGWWVVPSKVQGVVAASVAILLLYRFMSLKACSLLLGMIG
LWWWFPSLTPLWRMDVLDVGHGLAIVIEQDERAIVYDTGSSWPGGSYVQSVIEPILQQRGLRQVDGVILSHLDNDHAGDW
QGLAERWQPNWIRASQLGTEFMPCIRGESWQWQSLHFTVLWPPQAVSRAYNQHSCVIRMTDTQSNHSVLLSGDVTAMGEW
LLARDGAQLQSEVMIVPHHGSKTSSTAEFIAQVNPKLAIASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFY
PDQLDWFTQRSLGWQPWYRQMLRKGVE
MTLLSNYWSLISFSITVLSAPYWPWMPSWGWAWLCLIVMVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGP
DIIIKGRVDSFFTQTRYAYEGFVLIHEVNGQTLNKMTRPRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYM
AQSVVARAVVKPDTAYQIVQESGIRSSLFFELEQLTHTSPYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHI
GIAFSVGYFLGLGMMRLHAQLLWSPFVCGALLAVLYAWLAGFTLPTQRALIMCLLNVALIMLAFPLSALKRILLTLVAVL
LWSPFASLSNSFWMSFLAVAIVLYQLASQSQRQVWWKALLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVI
VPALFLGLLLMVVWPSVAAAYWPWVDWTFLPLDWALQFADVGWWVVPSKVQGVVAASVAILLLYRFMSLKACSLLLGMIG
LWWWFPSLTPLWRMDVLDVGHGLAIVIEQDERAIVYDTGSSWPGGSYVQSVIEPILQQRGLRQVDGVILSHLDNDHAGDW
QGLAERWQPNWIRASQLGTEFMPCIRGESWQWQSLHFTVLWPPQAVSRAYNQHSCVIRMTDTQSNHSVLLSGDVTAMGEW
LLARDGAQLQSEVMIVPHHGSKTSSTAEFIAQVNPKLAIASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFY
PDQLDWFTQRSLGWQPWYRQMLRKGVE
Nucleotide
Download Length: 2244 bp
>NTDB_id=920934 VFH86_RS04945 WP_345659016.1 1057201..1059444(+) (comEC) [Vibrio cholerae strain LK-18]
ATGACTCTCTTGTCGAATTACTGGTCTCTCATTTCTTTTTCGATCACCGTGCTGTCTGCCCCTTATTGGCCTTGGATGCC
GAGTTGGGGTTGGGCTTGGTTATGCCTTATTGTTATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCTTGGCT
TCGTGGCTGCCATACTAACCATTGTGCTACAGGGCAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAGGGCCG
GATATTATCATAAAAGGCCGTGTTGACAGCTTTTTTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATTCATGA
AGTGAATGGACAAACCTTAAACAAAATGACTCGCCCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAATGACC
GCGTCGAATTTTCGGTAACTCTCAAGCCGATAGTGGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATTACATG
GCGCAATCTGTCGTCGCACGGGCGGTCGTAAAACCTGACACCGCTTATCAAATTGTGCAAGAGAGTGGCATAAGGTCAAG
TTTGTTTTTTGAGCTGGAACAATTAACGCATACCAGCCCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGAGCGAA
AAGGTATTGATGAGCAAGAGTGGCAAGCCTTACGCAATAGTGGCTTAATTCATTTAGTGGCGATTTCGGGGCTGCACATT
GGTATCGCTTTTAGCGTGGGGTATTTTCTCGGGCTCGGCATGATGCGTCTTCATGCTCAGTTATTGTGGTCCCCTTTTGT
GTGTGGGGCTTTACTGGCGGTGCTCTACGCTTGGCTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATCATGTGCT
TACTCAATGTGGCGTTGATCATGTTGGCTTTTCCTCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGTCGCGGTCTTG
CTTTGGTCGCCATTCGCCTCACTTTCAAACAGTTTCTGGATGTCGTTTTTGGCGGTCGCGATTGTTCTCTACCAATTAGC
CAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCTCTGCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCACCGGTCA
CGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGCAGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGGTGATT
GTCCCAGCTTTGTTTTTGGGCCTACTACTCATGGTGGTATGGCCTAGTGTGGCCGCCGCTTACTGGCCTTGGGTGGATTG
GACGTTTTTACCGCTCGATTGGGCTTTGCAGTTTGCCGATGTAGGCTGGTGGGTGGTACCCAGCAAAGTACAAGGTGTAG
TCGCAGCGAGTGTGGCCATCCTCTTGCTTTATCGATTTATGAGCCTAAAAGCCTGCAGCTTATTATTGGGTATGATTGGC
TTATGGTGGTGGTTTCCCTCTCTCACTCCACTTTGGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATTGTGAT
TGAGCAAGATGAGCGAGCAATTGTCTACGATACAGGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGATTGAGC
CTATTCTCCAACAGCGAGGGCTACGCCAAGTCGATGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTGATTGG
CAAGGTTTAGCTGAGCGCTGGCAACCCAATTGGATTCGTGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATCCGTGG
TGAAAGCTGGCAGTGGCAATCTCTCCATTTTACGGTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCAGCATT
CGTGTGTGATTCGTATGACCGATACTCAGTCTAACCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGGAGTGG
CTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTGAGGTGATGATCGTGCCGCATCACGGCAGTAAAACGTCGTCCACCGC
AGAGTTTATAGCACAAGTGAATCCCAAACTTGCGATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAATCCGC
AAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGAGTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTTTCTAT
CCAGATCAGCTGGATTGGTTTACCCAGCGTAGTCTTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGAGTAGA
ATGA
ATGACTCTCTTGTCGAATTACTGGTCTCTCATTTCTTTTTCGATCACCGTGCTGTCTGCCCCTTATTGGCCTTGGATGCC
GAGTTGGGGTTGGGCTTGGTTATGCCTTATTGTTATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCTTGGCT
TCGTGGCTGCCATACTAACCATTGTGCTACAGGGCAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAGGGCCG
GATATTATCATAAAAGGCCGTGTTGACAGCTTTTTTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATTCATGA
AGTGAATGGACAAACCTTAAACAAAATGACTCGCCCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAATGACC
GCGTCGAATTTTCGGTAACTCTCAAGCCGATAGTGGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATTACATG
GCGCAATCTGTCGTCGCACGGGCGGTCGTAAAACCTGACACCGCTTATCAAATTGTGCAAGAGAGTGGCATAAGGTCAAG
TTTGTTTTTTGAGCTGGAACAATTAACGCATACCAGCCCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGAGCGAA
AAGGTATTGATGAGCAAGAGTGGCAAGCCTTACGCAATAGTGGCTTAATTCATTTAGTGGCGATTTCGGGGCTGCACATT
GGTATCGCTTTTAGCGTGGGGTATTTTCTCGGGCTCGGCATGATGCGTCTTCATGCTCAGTTATTGTGGTCCCCTTTTGT
GTGTGGGGCTTTACTGGCGGTGCTCTACGCTTGGCTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATCATGTGCT
TACTCAATGTGGCGTTGATCATGTTGGCTTTTCCTCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGTCGCGGTCTTG
CTTTGGTCGCCATTCGCCTCACTTTCAAACAGTTTCTGGATGTCGTTTTTGGCGGTCGCGATTGTTCTCTACCAATTAGC
CAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCTCTGCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCACCGGTCA
CGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGCAGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGGTGATT
GTCCCAGCTTTGTTTTTGGGCCTACTACTCATGGTGGTATGGCCTAGTGTGGCCGCCGCTTACTGGCCTTGGGTGGATTG
GACGTTTTTACCGCTCGATTGGGCTTTGCAGTTTGCCGATGTAGGCTGGTGGGTGGTACCCAGCAAAGTACAAGGTGTAG
TCGCAGCGAGTGTGGCCATCCTCTTGCTTTATCGATTTATGAGCCTAAAAGCCTGCAGCTTATTATTGGGTATGATTGGC
TTATGGTGGTGGTTTCCCTCTCTCACTCCACTTTGGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATTGTGAT
TGAGCAAGATGAGCGAGCAATTGTCTACGATACAGGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGATTGAGC
CTATTCTCCAACAGCGAGGGCTACGCCAAGTCGATGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTGATTGG
CAAGGTTTAGCTGAGCGCTGGCAACCCAATTGGATTCGTGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATCCGTGG
TGAAAGCTGGCAGTGGCAATCTCTCCATTTTACGGTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCAGCATT
CGTGTGTGATTCGTATGACCGATACTCAGTCTAACCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGGAGTGG
CTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTGAGGTGATGATCGTGCCGCATCACGGCAGTAAAACGTCGTCCACCGC
AGAGTTTATAGCACAAGTGAATCCCAAACTTGCGATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAATCCGC
AAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGAGTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTTTCTAT
CCAGATCAGCTGGATTGGTTTACCCAGCGTAGTCTTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGAGTAGA
ATGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio cholerae strain A1552 |
99.598 |
100 |
0.996 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 |
41.347 |
100 |
0.419 |
| comEC | Vibrio campbellii strain DS40M4 |
41.09 |
100 |
0.414 |