Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | KHN79_RS09215 | Genome accession | NZ_HG992749 |
| Coordinates | 2034887..2037145 (-) | Length | 752 a.a. |
| NCBI ID | WP_182008509.1 | Uniprot ID | - |
| Organism | Vibrio sp. B1FLJ16 | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 2029887..2042145
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| KHN79_RS09195 (ACOMICROBIO_LOCUS1336) | kdsB | 2031159..2031917 (-) | 759 | WP_182008506.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| KHN79_RS09200 (ACOMICROBIO_FLGHMIGD_01863) | - | 2031917..2032096 (-) | 180 | WP_005378451.1 | Trm112 family protein | - |
| KHN79_RS09205 (ACOMICROBIO_LOCUS1337) | lpxK | 2032077..2033084 (-) | 1008 | WP_182008507.1 | tetraacyldisaccharide 4'-kinase | - |
| KHN79_RS09210 (ACOMICROBIO_LOCUS1338) | msbA | 2033107..2034855 (-) | 1749 | WP_182008508.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| KHN79_RS09215 (ACOMICROBIO_FLGHMIGD_01866) | comEC | 2034887..2037145 (-) | 2259 | WP_182008509.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| KHN79_RS09220 (ACOMICROBIO_FLGHMIGD_01867) | - | 2037154..2037663 (+) | 510 | WP_182008510.1 | DUF2062 domain-containing protein | - |
| KHN79_RS09225 (ACOMICROBIO_LOCUS1339) | lolE | 2038018..2039262 (-) | 1245 | WP_182008511.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| KHN79_RS09230 (ACOMICROBIO_LOCUS1340) | lolD | 2039265..2039972 (-) | 708 | WP_182008512.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| KHN79_RS09235 (ACOMICROBIO_LOCUS1341) | lolC | 2039965..2041173 (-) | 1209 | WP_182008513.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| KHN79_RS09240 (ACOMICROBIO_FLGHMIGD_01871) | - | 2041456..2042025 (+) | 570 | WP_182008514.1 | PilZ domain-containing protein | - |
Sequence
Protein
Download Length: 752 a.a. Molecular weight: 83814.16 Da Isoelectric Point: 8.2399
>NTDB_id=1112433 KHN79_RS09215 WP_182008509.1 2034887..2037145(-) (comEC) [Vibrio sp. B1FLJ16]
MTLSEKSWTLVLFVASVISSAWWPIMPDWPWLLLGIITTGLLIKLRRGLISIGIIWGFMVVIIHGNVVEYQRQALLKSGV
NSTITGEVDSSFKQISHGYVGIVTINQVNDHNLLPFLKPKVRLVTPFPIPVNSEFTTTAQVKPIFGLRNEAGFDAEKGAM
GQGISARVIVPADASWMIRTTSSFRQTFIDRVLMDISHLDHLPLISALAFADRSSLDDDDWIELRDSGLLHLISISGLHI
GMAFSFGLVLGVGVRIVFPRWALIPSITGLSVALGYAWMADFSLPTVRAILVCVIYVLLKHALIYWSAWRVVLLAVAIQL
FIQPFASFSMSFWLSYLSVGLVLFAVHLVQMQKGSLIAKIRAAFSMQLCLSLLVVPLGAYFFSGFSLSAILYNLVFIPWF
GFVVVPLMFLALFLSLLLPDLANPVWQLVDFSLWPLSESLQYALGTWLTLSLETTWLFALLSVCLVLKRFLSSQAWVLLV
AVVVCVSSMRDRYKEGWRVDVLDVGHGLAVIIEKDGQVLLYDTGKAWQGGSIAEQIITPILQRRGYTSIDTLVLSHVDND
HAGGQHIIEETFGPVVKRSSQYFQGYQACVRGESWIWQGLDIQVLWPPKLVRRAYNPHSCVLRVSDPESGINVLLTGDIE
AISEWILLREPNKLRSEIMLVPHHGSKSSSNPRFIEAVSPVIAIASLAKDNQWGMPAKSVVSSYQNAGALWLDTGESGQV
TFKVSEGKWYFSTKRSDTFGPWYRQMLRKGVE
MTLSEKSWTLVLFVASVISSAWWPIMPDWPWLLLGIITTGLLIKLRRGLISIGIIWGFMVVIIHGNVVEYQRQALLKSGV
NSTITGEVDSSFKQISHGYVGIVTINQVNDHNLLPFLKPKVRLVTPFPIPVNSEFTTTAQVKPIFGLRNEAGFDAEKGAM
GQGISARVIVPADASWMIRTTSSFRQTFIDRVLMDISHLDHLPLISALAFADRSSLDDDDWIELRDSGLLHLISISGLHI
GMAFSFGLVLGVGVRIVFPRWALIPSITGLSVALGYAWMADFSLPTVRAILVCVIYVLLKHALIYWSAWRVVLLAVAIQL
FIQPFASFSMSFWLSYLSVGLVLFAVHLVQMQKGSLIAKIRAAFSMQLCLSLLVVPLGAYFFSGFSLSAILYNLVFIPWF
GFVVVPLMFLALFLSLLLPDLANPVWQLVDFSLWPLSESLQYALGTWLTLSLETTWLFALLSVCLVLKRFLSSQAWVLLV
AVVVCVSSMRDRYKEGWRVDVLDVGHGLAVIIEKDGQVLLYDTGKAWQGGSIAEQIITPILQRRGYTSIDTLVLSHVDND
HAGGQHIIEETFGPVVKRSSQYFQGYQACVRGESWIWQGLDIQVLWPPKLVRRAYNPHSCVLRVSDPESGINVLLTGDIE
AISEWILLREPNKLRSEIMLVPHHGSKSSSNPRFIEAVSPVIAIASLAKDNQWGMPAKSVVSSYQNAGALWLDTGESGQV
TFKVSEGKWYFSTKRSDTFGPWYRQMLRKGVE
Nucleotide
Download Length: 2259 bp
>NTDB_id=1112433 KHN79_RS09215 WP_182008509.1 2034887..2037145(-) (comEC) [Vibrio sp. B1FLJ16]
ATGACTCTCTCAGAAAAAAGTTGGACCTTGGTGTTATTTGTAGCAAGTGTCATTTCATCAGCCTGGTGGCCGATTATGCC
GGACTGGCCTTGGTTACTGCTGGGAATAATTACCACTGGCTTACTTATCAAATTACGTCGTGGCTTAATCAGCATAGGCA
TAATCTGGGGCTTTATGGTCGTCATTATCCACGGCAATGTGGTTGAGTATCAGCGACAAGCCCTTTTAAAATCAGGGGTG
AATAGTACCATAACTGGCGAAGTTGACAGCTCTTTTAAGCAAATAAGTCATGGATATGTAGGTATCGTGACTATAAATCA
GGTCAACGATCACAACTTATTACCTTTTCTTAAACCTAAAGTGCGTTTAGTCACCCCTTTTCCCATCCCTGTTAACAGCG
AATTTACGACGACGGCGCAGGTTAAGCCTATATTCGGACTGCGTAATGAAGCCGGGTTCGATGCAGAAAAAGGGGCAATG
GGGCAAGGCATTTCAGCCAGAGTTATTGTGCCCGCCGATGCGAGTTGGATGATACGAACCACCTCCAGCTTTCGCCAGAC
TTTTATTGATCGGGTATTGATGGATATATCTCATCTCGACCATCTGCCGTTAATCAGTGCATTGGCTTTTGCGGACCGGT
CAAGCCTTGACGATGATGACTGGATCGAACTCAGAGACAGTGGCTTACTGCATCTGATCTCTATTTCAGGTCTGCACATT
GGGATGGCGTTTAGTTTCGGTTTGGTACTTGGCGTTGGAGTTCGTATTGTTTTCCCCCGTTGGGCGCTTATACCTTCAAT
CACCGGATTATCTGTAGCTCTTGGTTATGCATGGATGGCGGACTTTTCCTTGCCAACGGTTCGGGCAATATTGGTCTGCG
TTATTTATGTATTGCTTAAACATGCTCTAATTTACTGGAGTGCATGGAGGGTGGTATTGCTGGCGGTGGCTATCCAGTTG
TTCATACAGCCTTTTGCTTCATTTAGTATGAGTTTCTGGCTGTCCTATTTGTCGGTTGGCTTGGTACTGTTTGCGGTTCA
TTTGGTCCAAATGCAAAAGGGCTCCCTGATTGCTAAAATCCGTGCAGCCTTTAGCATGCAACTTTGTCTGAGTTTGTTGG
TTGTCCCTCTTGGTGCTTACTTTTTCAGCGGCTTTAGCTTGTCAGCAATCCTCTACAATCTGGTTTTTATTCCTTGGTTT
GGGTTTGTCGTTGTTCCACTCATGTTTTTGGCATTATTTCTTTCGTTACTGCTGCCGGATTTGGCCAACCCAGTATGGCA
ATTAGTGGACTTTTCGCTTTGGCCGCTAAGCGAATCACTTCAGTATGCACTTGGAACCTGGTTAACGCTTTCGCTTGAAA
CGACATGGCTTTTTGCCTTGTTAAGTGTTTGTCTTGTTCTGAAGCGCTTTTTATCAAGTCAAGCGTGGGTGCTTTTAGTT
GCCGTTGTAGTCTGCGTCAGTTCAATGCGTGATCGCTATAAAGAAGGTTGGCGAGTGGATGTTCTCGATGTCGGGCATGG
CTTGGCTGTCATTATCGAAAAAGACGGTCAAGTTTTACTGTACGATACGGGGAAAGCCTGGCAGGGAGGCAGTATTGCGG
AGCAGATTATCACTCCGATACTCCAACGCAGGGGTTACACGTCGATTGATACTCTGGTGCTAAGCCATGTCGATAATGAT
CATGCCGGAGGGCAGCACATCATTGAAGAGACATTTGGGCCGGTAGTAAAACGTAGTAGCCAGTATTTTCAAGGTTATCA
AGCCTGTGTTAGGGGAGAAAGCTGGATTTGGCAGGGGCTCGATATCCAGGTTCTCTGGCCACCCAAACTGGTCAGGCGCG
CTTATAACCCTCACTCATGTGTACTAAGAGTCAGCGACCCGGAATCCGGGATTAATGTGCTTCTTACCGGAGATATAGAG
GCTATTAGCGAGTGGATATTACTCAGGGAACCAAACAAACTACGGAGTGAGATCATGCTTGTGCCGCATCATGGGAGTAA
AAGCTCGTCTAATCCGCGATTTATTGAGGCGGTTTCACCAGTGATAGCCATTGCATCTTTAGCAAAAGATAATCAGTGGG
GGATGCCTGCTAAGTCGGTCGTTTCCTCTTACCAAAATGCTGGCGCTCTTTGGCTCGATACCGGCGAGAGTGGTCAAGTA
ACGTTCAAAGTAAGTGAAGGCAAGTGGTACTTTAGCACCAAACGCAGCGATACATTTGGGCCTTGGTATAGGCAGATGCT
GCGTAAGGGGGTAGAATAA
ATGACTCTCTCAGAAAAAAGTTGGACCTTGGTGTTATTTGTAGCAAGTGTCATTTCATCAGCCTGGTGGCCGATTATGCC
GGACTGGCCTTGGTTACTGCTGGGAATAATTACCACTGGCTTACTTATCAAATTACGTCGTGGCTTAATCAGCATAGGCA
TAATCTGGGGCTTTATGGTCGTCATTATCCACGGCAATGTGGTTGAGTATCAGCGACAAGCCCTTTTAAAATCAGGGGTG
AATAGTACCATAACTGGCGAAGTTGACAGCTCTTTTAAGCAAATAAGTCATGGATATGTAGGTATCGTGACTATAAATCA
GGTCAACGATCACAACTTATTACCTTTTCTTAAACCTAAAGTGCGTTTAGTCACCCCTTTTCCCATCCCTGTTAACAGCG
AATTTACGACGACGGCGCAGGTTAAGCCTATATTCGGACTGCGTAATGAAGCCGGGTTCGATGCAGAAAAAGGGGCAATG
GGGCAAGGCATTTCAGCCAGAGTTATTGTGCCCGCCGATGCGAGTTGGATGATACGAACCACCTCCAGCTTTCGCCAGAC
TTTTATTGATCGGGTATTGATGGATATATCTCATCTCGACCATCTGCCGTTAATCAGTGCATTGGCTTTTGCGGACCGGT
CAAGCCTTGACGATGATGACTGGATCGAACTCAGAGACAGTGGCTTACTGCATCTGATCTCTATTTCAGGTCTGCACATT
GGGATGGCGTTTAGTTTCGGTTTGGTACTTGGCGTTGGAGTTCGTATTGTTTTCCCCCGTTGGGCGCTTATACCTTCAAT
CACCGGATTATCTGTAGCTCTTGGTTATGCATGGATGGCGGACTTTTCCTTGCCAACGGTTCGGGCAATATTGGTCTGCG
TTATTTATGTATTGCTTAAACATGCTCTAATTTACTGGAGTGCATGGAGGGTGGTATTGCTGGCGGTGGCTATCCAGTTG
TTCATACAGCCTTTTGCTTCATTTAGTATGAGTTTCTGGCTGTCCTATTTGTCGGTTGGCTTGGTACTGTTTGCGGTTCA
TTTGGTCCAAATGCAAAAGGGCTCCCTGATTGCTAAAATCCGTGCAGCCTTTAGCATGCAACTTTGTCTGAGTTTGTTGG
TTGTCCCTCTTGGTGCTTACTTTTTCAGCGGCTTTAGCTTGTCAGCAATCCTCTACAATCTGGTTTTTATTCCTTGGTTT
GGGTTTGTCGTTGTTCCACTCATGTTTTTGGCATTATTTCTTTCGTTACTGCTGCCGGATTTGGCCAACCCAGTATGGCA
ATTAGTGGACTTTTCGCTTTGGCCGCTAAGCGAATCACTTCAGTATGCACTTGGAACCTGGTTAACGCTTTCGCTTGAAA
CGACATGGCTTTTTGCCTTGTTAAGTGTTTGTCTTGTTCTGAAGCGCTTTTTATCAAGTCAAGCGTGGGTGCTTTTAGTT
GCCGTTGTAGTCTGCGTCAGTTCAATGCGTGATCGCTATAAAGAAGGTTGGCGAGTGGATGTTCTCGATGTCGGGCATGG
CTTGGCTGTCATTATCGAAAAAGACGGTCAAGTTTTACTGTACGATACGGGGAAAGCCTGGCAGGGAGGCAGTATTGCGG
AGCAGATTATCACTCCGATACTCCAACGCAGGGGTTACACGTCGATTGATACTCTGGTGCTAAGCCATGTCGATAATGAT
CATGCCGGAGGGCAGCACATCATTGAAGAGACATTTGGGCCGGTAGTAAAACGTAGTAGCCAGTATTTTCAAGGTTATCA
AGCCTGTGTTAGGGGAGAAAGCTGGATTTGGCAGGGGCTCGATATCCAGGTTCTCTGGCCACCCAAACTGGTCAGGCGCG
CTTATAACCCTCACTCATGTGTACTAAGAGTCAGCGACCCGGAATCCGGGATTAATGTGCTTCTTACCGGAGATATAGAG
GCTATTAGCGAGTGGATATTACTCAGGGAACCAAACAAACTACGGAGTGAGATCATGCTTGTGCCGCATCATGGGAGTAA
AAGCTCGTCTAATCCGCGATTTATTGAGGCGGTTTCACCAGTGATAGCCATTGCATCTTTAGCAAAAGATAATCAGTGGG
GGATGCCTGCTAAGTCGGTCGTTTCCTCTTACCAAAATGCTGGCGCTCTTTGGCTCGATACCGGCGAGAGTGGTCAAGTA
ACGTTCAAAGTAAGTGAAGGCAAGTGGTACTTTAGCACCAAACGCAGCGATACATTTGGGCCTTGGTATAGGCAGATGCT
GCGTAAGGGGGTAGAATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio campbellii strain DS40M4 |
64.495 |
100 |
0.645 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 |
64.362 |
100 |
0.644 |
| comEC | Vibrio cholerae strain A1552 |
41.425 |
100 |
0.418 |