Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | VCM66_RS08975 | Genome accession | NC_012578 |
| Coordinates | 1960677..1962920 (-) | Length | 747 a.a. |
| NCBI ID | WP_000173776.1 | Uniprot ID | - |
| Organism | Vibrio cholerae M66-2 | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1955677..1967920
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| VCM66_RS08950 (VCM66_1798) | tnpA | 1956364..1956801 (-) | 438 | WP_000503164.1 | IS200/IS605-like element IS1004 family transposase | - |
| VCM66_RS08955 (VCM66_1799) | kdsB | 1956970..1957728 (-) | 759 | WP_000011329.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| VCM66_RS08960 (VCM66_1800) | - | 1957728..1957907 (-) | 180 | WP_000350068.1 | Trm112 family protein | - |
| VCM66_RS08965 (VCM66_1801) | lpxK | 1957888..1958895 (-) | 1008 | WP_001994134.1 | tetraacyldisaccharide 4'-kinase | - |
| VCM66_RS08970 (VCM66_1802) | msbA | 1958898..1960646 (-) | 1749 | WP_000052153.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| VCM66_RS08975 (VCM66_1803) | comEC | 1960677..1962920 (-) | 2244 | WP_000173776.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| VCM66_RS08980 (VCM66_1804) | - | 1962929..1963459 (+) | 531 | WP_001881633.1 | DUF2062 domain-containing protein | - |
| VCM66_RS08985 (VCM66_1806) | lolE | 1963574..1964818 (-) | 1245 | WP_000493010.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| VCM66_RS08990 (VCM66_1807) | lolD | 1964819..1965505 (-) | 687 | WP_001061290.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| VCM66_RS08995 (VCM66_1808) | lolC | 1965498..1966706 (-) | 1209 | WP_000468900.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| VCM66_RS09000 (VCM66_1809) | - | 1966882..1967457 (+) | 576 | WP_000999601.1 | PilZ domain-containing protein | - |
Sequence
Protein
Download Length: 747 a.a. Molecular weight: 84511.71 Da Isoelectric Point: 7.7612
>NTDB_id=33829 VCM66_RS08975 WP_000173776.1 1960677..1962920(-) (comEC) [Vibrio cholerae M66-2]
MTLLSNYWSLISFSITVLSAPYWPWMPSWGWAWLCLIVMVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGP
DIIIKGRVDSFFTQTRYAYEGFVLIHEVNGQTLNKMTRPRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYM
AQSVVARAVVKPDTAYQIVQESGIRSSLFFELEQLTHTSPYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHI
GIAFSVGYFLGLGMMRFHAQLLWSPFVCGALLAVLYAWLAGFTLPTQRALIMCLLNVALIMLAFPLSALKRILLTLVAVL
LWSPFASLSNSFWMSFLAVAIVLYQLASQSQRQVWWKALLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVI
VPALFLGLLLMVVWPSVAAAYWPWVDWTFLPLDWALQFADVGWWVVPSKVQGVVAASVAILLLYRFMSLKACSLLLGMIG
LWWWFPSLTPLWRMDVLDVGHGLAIVIEQDERAIVYDTGSSWPGGSYVQSVIEPMLQQRGLRQVDGVILSHLDNDHAGDW
QGLAERWQPNWIRASQLGTEFMPCIRGESWQWQSLHFTVLWPPQAVSRAYNQHSCVIRMTDTQSNHSVLLSGDVTAMGEW
LLARDGAQLQSEVMIVPHHGSKTSSTAEFIAQVNPKLAIASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFY
LDQLDWFTQRSLGWQPWYRQMLRKGVE
MTLLSNYWSLISFSITVLSAPYWPWMPSWGWAWLCLIVMVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGP
DIIIKGRVDSFFTQTRYAYEGFVLIHEVNGQTLNKMTRPRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYM
AQSVVARAVVKPDTAYQIVQESGIRSSLFFELEQLTHTSPYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHI
GIAFSVGYFLGLGMMRFHAQLLWSPFVCGALLAVLYAWLAGFTLPTQRALIMCLLNVALIMLAFPLSALKRILLTLVAVL
LWSPFASLSNSFWMSFLAVAIVLYQLASQSQRQVWWKALLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVI
VPALFLGLLLMVVWPSVAAAYWPWVDWTFLPLDWALQFADVGWWVVPSKVQGVVAASVAILLLYRFMSLKACSLLLGMIG
LWWWFPSLTPLWRMDVLDVGHGLAIVIEQDERAIVYDTGSSWPGGSYVQSVIEPMLQQRGLRQVDGVILSHLDNDHAGDW
QGLAERWQPNWIRASQLGTEFMPCIRGESWQWQSLHFTVLWPPQAVSRAYNQHSCVIRMTDTQSNHSVLLSGDVTAMGEW
LLARDGAQLQSEVMIVPHHGSKTSSTAEFIAQVNPKLAIASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFY
LDQLDWFTQRSLGWQPWYRQMLRKGVE
Nucleotide
Download Length: 2244 bp
>NTDB_id=33829 VCM66_RS08975 WP_000173776.1 1960677..1962920(-) (comEC) [Vibrio cholerae M66-2]
ATGACTCTCTTGTCGAATTACTGGTCTCTCATTTCTTTTTCGATCACCGTGCTGTCTGCCCCTTATTGGCCTTGGATGCC
GAGTTGGGGTTGGGCTTGGTTATGCCTTATTGTTATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCTTGGCT
TCGTGGCTGCCATACTAACCATTGTGCTACAGGGCAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAGGGCCG
GATATTATCATAAAAGGCCGTGTTGACAGCTTTTTTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATTCATGA
AGTGAATGGACAAACCTTAAACAAAATGACTCGCCCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAATGATC
GCGTCGAATTTTCGGTAACTCTCAAGCCGATAGTGGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATTACATG
GCGCAATCTGTCGTCGCACGAGCGGTCGTAAAACCTGACACTGCTTATCAAATTGTGCAAGAGAGTGGCATAAGGTCAAG
TTTGTTTTTTGAGCTAGAGCAATTAACGCATACCAGCCCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGAGCGAA
AAGGTATTGATGAGCAAGAGTGGCAAGCCTTACGCAATAGTGGCTTAATTCATTTAGTGGCCATTTCGGGGCTGCACATT
GGTATCGCTTTTAGCGTGGGGTATTTTCTCGGGCTCGGCATGATGCGTTTTCATGCTCAGTTATTGTGGTCCCCTTTTGT
GTGTGGGGCTTTACTGGCGGTGCTCTACGCTTGGCTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATTATGTGCT
TACTCAATGTGGCGTTGATCATGTTGGCTTTTCCTCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGTCGCGGTCTTG
CTTTGGTCGCCATTCGCCTCACTTTCAAACAGTTTCTGGATGTCGTTTTTGGCGGTCGCGATTGTTCTCTACCAATTAGC
CAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCTCTTCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCACCGGTCA
CGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGCAGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGGTGATT
GTCCCAGCTTTGTTTTTGGGTCTATTACTCATGGTGGTATGGCCTAGTGTGGCCGCCGCTTACTGGCCTTGGGTGGATTG
GACGTTTTTACCGCTCGATTGGGCTTTGCAGTTTGCCGATGTAGGCTGGTGGGTGGTCCCCAGCAAAGTACAAGGTGTGG
TCGCAGCGAGTGTGGCCATCCTCTTGCTTTATCGATTTATGAGCCTAAAAGCCTGCAGCTTATTATTGGGTATGATTGGC
TTATGGTGGTGGTTTCCCTCTCTCACTCCACTTTGGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATTGTGAT
TGAGCAAGATGAGCGAGCAATTGTCTACGATACAGGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGATTGAGC
CTATGCTCCAACAGCGGGGGCTACGCCAAGTGGATGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTGATTGG
CAAGGTTTAGCTGAGCGCTGGCAACCCAATTGGATTCGTGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATCCGTGG
TGAAAGCTGGCAGTGGCAATCTCTCCATTTTACGGTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCAGCATT
CGTGTGTGATTCGTATGACCGATACTCAGTCTAACCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGGAGTGG
CTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTGAGGTGATGATCGTGCCGCACCACGGCAGTAAAACATCGTCCACCGC
AGAGTTTATTGCCCAAGTGAATCCCAAACTTGCGATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAATCCGC
AAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGAGTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTTTCTAT
CTAGATCAGCTGGATTGGTTTACCCAGCGTAGCCTTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGAGTAGA
ATGA
ATGACTCTCTTGTCGAATTACTGGTCTCTCATTTCTTTTTCGATCACCGTGCTGTCTGCCCCTTATTGGCCTTGGATGCC
GAGTTGGGGTTGGGCTTGGTTATGCCTTATTGTTATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCTTGGCT
TCGTGGCTGCCATACTAACCATTGTGCTACAGGGCAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAGGGCCG
GATATTATCATAAAAGGCCGTGTTGACAGCTTTTTTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATTCATGA
AGTGAATGGACAAACCTTAAACAAAATGACTCGCCCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAATGATC
GCGTCGAATTTTCGGTAACTCTCAAGCCGATAGTGGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATTACATG
GCGCAATCTGTCGTCGCACGAGCGGTCGTAAAACCTGACACTGCTTATCAAATTGTGCAAGAGAGTGGCATAAGGTCAAG
TTTGTTTTTTGAGCTAGAGCAATTAACGCATACCAGCCCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGAGCGAA
AAGGTATTGATGAGCAAGAGTGGCAAGCCTTACGCAATAGTGGCTTAATTCATTTAGTGGCCATTTCGGGGCTGCACATT
GGTATCGCTTTTAGCGTGGGGTATTTTCTCGGGCTCGGCATGATGCGTTTTCATGCTCAGTTATTGTGGTCCCCTTTTGT
GTGTGGGGCTTTACTGGCGGTGCTCTACGCTTGGCTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATTATGTGCT
TACTCAATGTGGCGTTGATCATGTTGGCTTTTCCTCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGTCGCGGTCTTG
CTTTGGTCGCCATTCGCCTCACTTTCAAACAGTTTCTGGATGTCGTTTTTGGCGGTCGCGATTGTTCTCTACCAATTAGC
CAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCTCTTCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCACCGGTCA
CGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGCAGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGGTGATT
GTCCCAGCTTTGTTTTTGGGTCTATTACTCATGGTGGTATGGCCTAGTGTGGCCGCCGCTTACTGGCCTTGGGTGGATTG
GACGTTTTTACCGCTCGATTGGGCTTTGCAGTTTGCCGATGTAGGCTGGTGGGTGGTCCCCAGCAAAGTACAAGGTGTGG
TCGCAGCGAGTGTGGCCATCCTCTTGCTTTATCGATTTATGAGCCTAAAAGCCTGCAGCTTATTATTGGGTATGATTGGC
TTATGGTGGTGGTTTCCCTCTCTCACTCCACTTTGGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATTGTGAT
TGAGCAAGATGAGCGAGCAATTGTCTACGATACAGGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGATTGAGC
CTATGCTCCAACAGCGGGGGCTACGCCAAGTGGATGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTGATTGG
CAAGGTTTAGCTGAGCGCTGGCAACCCAATTGGATTCGTGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATCCGTGG
TGAAAGCTGGCAGTGGCAATCTCTCCATTTTACGGTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCAGCATT
CGTGTGTGATTCGTATGACCGATACTCAGTCTAACCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGGAGTGG
CTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTGAGGTGATGATCGTGCCGCACCACGGCAGTAAAACATCGTCCACCGC
AGAGTTTATTGCCCAAGTGAATCCCAAACTTGCGATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAATCCGC
AAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGAGTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTTTCTAT
CTAGATCAGCTGGATTGGTTTACCCAGCGTAGCCTTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGAGTAGA
ATGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio cholerae strain A1552 |
100 |
100 |
1 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 |
41.215 |
100 |
0.418 |
| comEC | Vibrio campbellii strain DS40M4 |
40.957 |
100 |
0.412 |