Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | OC536_RS05460 | Genome accession | NZ_OW443150 |
| Coordinates | 1165160..1167328 (+) | Length | 722 a.a. |
| NCBI ID | WP_172779501.1 | Uniprot ID | - |
| Organism | Vibrio cholerae strain CNRVC190247 isolate YE-NCPHL-18035-PI | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1160160..1172328
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| OC536_RS05435 (CNRVC190247_01095) | - | 1160548..1161123 (-) | 576 | WP_000999601.1 | PilZ domain-containing protein | - |
| OC536_RS05440 (CNRVC190247_01096) | lolC | 1161299..1162507 (+) | 1209 | WP_032476337.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| OC536_RS05445 (CNRVC190247_01097) | lolD | 1162500..1163186 (+) | 687 | WP_032476336.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| OC536_RS05450 (CNRVC190247_01098) | lolE | 1163187..1164431 (+) | 1245 | WP_000493013.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| OC536_RS05455 (CNRVC190247_01099) | - | 1164546..1165076 (-) | 531 | WP_001881633.1 | DUF2062 domain-containing protein | - |
| OC536_RS05460 (CNRVC190247_01100) | comEC | 1165160..1167328 (+) | 2169 | WP_172779501.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| OC536_RS05465 (CNRVC190247_01101) | msbA | 1167359..1169107 (+) | 1749 | WP_000052155.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| OC536_RS05470 (CNRVC190247_01102) | lpxK | 1169110..1170117 (+) | 1008 | WP_032476388.1 | tetraacyldisaccharide 4'-kinase | - |
| OC536_RS05475 (CNRVC190247_01103) | - | 1170098..1170277 (+) | 180 | WP_000350068.1 | Trm112 family protein | - |
| OC536_RS05480 (CNRVC190247_01104) | kdsB | 1170277..1171050 (+) | 774 | WP_032476335.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
Sequence
Protein
Download Length: 722 a.a. Molecular weight: 81548.26 Da Isoelectric Point: 7.9738
>NTDB_id=1152343 OC536_RS05460 WP_172779501.1 1165160..1167328(+) (comEC) [Vibrio cholerae strain CNRVC190247 isolate YE-NCPHL-18035-PI]
MPSWGWAWLCLIVMVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGPDIIIKGRVDSFFTQTRYAYEGFVLI
HEVNGQTLNKMTRPRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYMAQSVVARAVVKPDTAYQIVQESGIR
SSLFFELEQLTHTSSYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHIGIAFSVGYFLGLGMMRLHAQLLWSP
FVCGALLAVLYAWLAGFTLPTQRALIMCLLNVALLMLAFPLSALKRILLTLVAVLLWSPFASLSNSFWMSFLAVAIVLYQ
LASQSQRQVWWKALLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVIVPALFLGLLLMVVWPSMAAAYWPWV
DWTFLPLDWALQFADVGWWVVPSKVQGVVAASVAILLLYRFMNLKACSLLLGMIGLWWWFPSLTPLWRMDVLDVGHGLAI
VIEQDERAIVYDTGSSWPGGSYVQSVIEPILQQRGLRQVDGVILSHLDNDHAGDWQGLAERWQPNWIRASQLGTEFMPCI
RGESWQWQSLHFTVLWPPQAVSRAYNQHSCVIRMTDTQSKHSVLLSGDVTAMGEWLLARDGAQLQSEVMIVPHHGSKTSS
TAEFIAQVNPKLAIASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFYPDQLDWFTQRSLGWQPWYRQMLRKG
VE
MPSWGWAWLCLIVMVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGPDIIIKGRVDSFFTQTRYAYEGFVLI
HEVNGQTLNKMTRPRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYMAQSVVARAVVKPDTAYQIVQESGIR
SSLFFELEQLTHTSSYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHIGIAFSVGYFLGLGMMRLHAQLLWSP
FVCGALLAVLYAWLAGFTLPTQRALIMCLLNVALLMLAFPLSALKRILLTLVAVLLWSPFASLSNSFWMSFLAVAIVLYQ
LASQSQRQVWWKALLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVIVPALFLGLLLMVVWPSMAAAYWPWV
DWTFLPLDWALQFADVGWWVVPSKVQGVVAASVAILLLYRFMNLKACSLLLGMIGLWWWFPSLTPLWRMDVLDVGHGLAI
VIEQDERAIVYDTGSSWPGGSYVQSVIEPILQQRGLRQVDGVILSHLDNDHAGDWQGLAERWQPNWIRASQLGTEFMPCI
RGESWQWQSLHFTVLWPPQAVSRAYNQHSCVIRMTDTQSKHSVLLSGDVTAMGEWLLARDGAQLQSEVMIVPHHGSKTSS
TAEFIAQVNPKLAIASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFYPDQLDWFTQRSLGWQPWYRQMLRKG
VE
Nucleotide
Download Length: 2169 bp
>NTDB_id=1152343 OC536_RS05460 WP_172779501.1 1165160..1167328(+) (comEC) [Vibrio cholerae strain CNRVC190247 isolate YE-NCPHL-18035-PI]
ATGCCGAGTTGGGGTTGGGCTTGGTTATGCCTTATTGTTATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCT
TGGCTTCGTGGCTGCCATACTAACCATTGTGCTACAGGGCAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAG
GGCCGGATATTATCATAAAAGGCCGTGTTGACAGCTTTTTTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATT
CATGAAGTGAATGGACAAACCTTAAACAAAATGACTCGCCCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAA
TGACCGCGTCGAATTTTCGGTAACTCTCAAGCCGATAGTGGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATT
ACATGGCGCAATCTGTCGTCGCACGAGCGGTCGTAAAACCTGACACTGCTTATCAAATTGTGCAAGAGAGTGGCATAAGG
TCAAGTTTGTTTTTTGAGCTAGAGCAATTAACGCATACCAGCTCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGA
GCGAAAAGGTATTGATGAGCAAGAGTGGCAAGCCTTACGCAATAGTGGCTTAATTCATTTAGTGGCGATTTCGGGGCTGC
ACATTGGTATCGCTTTTAGCGTGGGGTATTTTCTCGGGCTCGGCATGATGCGTCTTCATGCTCAGTTATTGTGGTCCCCT
TTTGTGTGTGGGGCTTTACTGGCGGTGCTCTATGCTTGGCTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATTAT
GTGCTTACTCAATGTGGCGTTGCTCATGTTGGCTTTTCCTCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGTCGCGG
TCTTGCTTTGGTCGCCATTCGCCTCACTTTCAAACAGTTTCTGGATGTCGTTTTTGGCGGTTGCGATTGTTCTCTACCAA
TTAGCCAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCTCTGCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCGCC
GGTCACGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGCAGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGG
TGATTGTCCCAGCTTTGTTTTTGGGTCTATTACTCATGGTGGTATGGCCTAGTATGGCCGCCGCTTACTGGCCTTGGGTG
GATTGGACGTTTTTACCGCTCGATTGGGCTTTGCAGTTTGCCGATGTAGGCTGGTGGGTGGTCCCCAGCAAAGTACAAGG
TGTGGTCGCAGCGAGTGTGGCCATCCTCTTACTTTATCGATTTATGAACTTAAAAGCCTGCAGCTTATTATTGGGTATGA
TTGGCTTATGGTGGTGGTTTCCCTCTCTCACTCCACTTTGGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATT
GTGATTGAGCAAGATGAGCGAGCAATTGTCTACGATACAGGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGAT
TGAGCCTATTCTCCAACAGCGAGGGCTACGCCAAGTCGATGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTG
ATTGGCAAGGTTTAGCTGAGCGCTGGCAACCTAATTGGATTCGTGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATC
CGTGGTGAAAGCTGGCAGTGGCAATCTCTCCATTTTACGGTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCA
GCATTCGTGTGTGATTCGTATGACCGATACTCAGTCTAAGCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGG
AGTGGCTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTGAGGTGATGATCGTGCCGCATCACGGCAGTAAAACGTCGTCC
ACCGCAGAGTTTATTGCCCAAGTGAATCCCAAACTTGCGATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAA
TCCGCAAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGAGTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTT
TCTATCCAGATCAGCTGGATTGGTTTACCCAGCGTAGCCTTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGA
GTAGAATGA
ATGCCGAGTTGGGGTTGGGCTTGGTTATGCCTTATTGTTATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCT
TGGCTTCGTGGCTGCCATACTAACCATTGTGCTACAGGGCAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAG
GGCCGGATATTATCATAAAAGGCCGTGTTGACAGCTTTTTTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATT
CATGAAGTGAATGGACAAACCTTAAACAAAATGACTCGCCCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAA
TGACCGCGTCGAATTTTCGGTAACTCTCAAGCCGATAGTGGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATT
ACATGGCGCAATCTGTCGTCGCACGAGCGGTCGTAAAACCTGACACTGCTTATCAAATTGTGCAAGAGAGTGGCATAAGG
TCAAGTTTGTTTTTTGAGCTAGAGCAATTAACGCATACCAGCTCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGA
GCGAAAAGGTATTGATGAGCAAGAGTGGCAAGCCTTACGCAATAGTGGCTTAATTCATTTAGTGGCGATTTCGGGGCTGC
ACATTGGTATCGCTTTTAGCGTGGGGTATTTTCTCGGGCTCGGCATGATGCGTCTTCATGCTCAGTTATTGTGGTCCCCT
TTTGTGTGTGGGGCTTTACTGGCGGTGCTCTATGCTTGGCTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATTAT
GTGCTTACTCAATGTGGCGTTGCTCATGTTGGCTTTTCCTCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGTCGCGG
TCTTGCTTTGGTCGCCATTCGCCTCACTTTCAAACAGTTTCTGGATGTCGTTTTTGGCGGTTGCGATTGTTCTCTACCAA
TTAGCCAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCTCTGCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCGCC
GGTCACGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGCAGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGG
TGATTGTCCCAGCTTTGTTTTTGGGTCTATTACTCATGGTGGTATGGCCTAGTATGGCCGCCGCTTACTGGCCTTGGGTG
GATTGGACGTTTTTACCGCTCGATTGGGCTTTGCAGTTTGCCGATGTAGGCTGGTGGGTGGTCCCCAGCAAAGTACAAGG
TGTGGTCGCAGCGAGTGTGGCCATCCTCTTACTTTATCGATTTATGAACTTAAAAGCCTGCAGCTTATTATTGGGTATGA
TTGGCTTATGGTGGTGGTTTCCCTCTCTCACTCCACTTTGGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATT
GTGATTGAGCAAGATGAGCGAGCAATTGTCTACGATACAGGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGAT
TGAGCCTATTCTCCAACAGCGAGGGCTACGCCAAGTCGATGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTG
ATTGGCAAGGTTTAGCTGAGCGCTGGCAACCTAATTGGATTCGTGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATC
CGTGGTGAAAGCTGGCAGTGGCAATCTCTCCATTTTACGGTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCA
GCATTCGTGTGTGATTCGTATGACCGATACTCAGTCTAAGCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGG
AGTGGCTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTGAGGTGATGATCGTGCCGCATCACGGCAGTAAAACGTCGTCC
ACCGCAGAGTTTATTGCCCAAGTGAATCCCAAACTTGCGATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAA
TCCGCAAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGAGTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTT
TCTATCCAGATCAGCTGGATTGGTTTACCCAGCGTAGCCTTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGA
GTAGAATGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio cholerae strain A1552 |
98.892 |
100 |
0.989 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 |
41.257 |
100 |
0.418 |
| comEC | Vibrio campbellii strain DS40M4 |
41.265 |
100 |
0.416 |