Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | HPK20_RS05080 | Genome accession | NZ_CP053664 |
| Coordinates | 1102633..1104870 (+) | Length | 745 a.a. |
| NCBI ID | WP_171933493.1 | Uniprot ID | - |
| Organism | Vibrio fluvialis strain A8 |
| Function | ssDNA transport through the inner membrane (predicted from homology); DNA binding and uptake |
Genomic Context
Location: 1097633..1109870
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HPK20_RS05055 (HPK20_05055) | - | 1098084..1098653 (-) | 570 | WP_020327793.1 | hypothetical protein | - |
| HPK20_RS05060 (HPK20_05060) | lolC | 1098829..1100037 (+) | 1209 | WP_055453396.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| HPK20_RS05065 (HPK20_05065) | lolD | 1100030..1100716 (+) | 687 | WP_020327790.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| HPK20_RS05070 (HPK20_05070) | lolE | 1100717..1101961 (+) | 1245 | WP_158124606.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| HPK20_RS05075 (HPK20_05075) | - | 1102094..1102624 (-) | 531 | WP_020327788.1 | DUF2062 domain-containing protein | - |
| HPK20_RS05080 (HPK20_05080) | comEC | 1102633..1104870 (+) | 2238 | WP_171933493.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| HPK20_RS05085 (HPK20_05085) | msbA | 1104903..1106651 (+) | 1749 | WP_020327786.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| HPK20_RS05090 (HPK20_05090) | lpxK | 1106655..1107662 (+) | 1008 | WP_171933494.1 | tetraacyldisaccharide 4'-kinase | - |
| HPK20_RS05095 (HPK20_05095) | - | 1107643..1107822 (+) | 180 | WP_020327784.1 | Trm112 family protein | - |
| HPK20_RS05100 (HPK20_05100) | kdsB | 1107822..1108565 (+) | 744 | WP_154184588.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
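The coordinates reported for comEC are mutually consistent if they are read as 1-based, inclusive positions. The following is a minimal sketch of that arithmetic, not part of the database record: the function names are hypothetical, and the 5000 bp flank is an inference from the numbers shown (the record itself does not state how the Location window is chosen).

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


def flanking_window(start: int, end: int, flank: int) -> tuple[int, int]:
    """Extend a range by `flank` bp on each side, clamped at position 1."""
    return max(1, start - flank), end + flank


# Values taken from the record above.
COMEC_START, COMEC_END = 1102633, 1104870

cds_bp = span_length(COMEC_START, COMEC_END)      # 2238 bp, matches "Size (bp)"
aa = protein_length(cds_bp)                       # 745 a.a., matches "Length"
# Assumption: the Location window is the gene plus 5000 bp on each side.
window = flanking_window(COMEC_START, COMEC_END, 5000)

print(cds_bp, aa, window)
```

Under these assumptions the computed window equals the listed Location, 1097633..1109870, which is why HPK20_RS05055 (starting at 1098084) is the first gene shown in full.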
Sequence
Protein
Length: 745 a.a. | Molecular weight: 83560.93 Da | Isoelectric point: 8.7466
>NTDB_id=446125 HPK20_RS05080 WP_171933493.1 1102633..1104870(+) (comEC) [Vibrio fluvialis strain A8]
MTLYSNYWMLASFSLTVLSAPLWPWMPEWDFAFICLAALMTTLVVSRFRVFGGIALALLVIVTHGNVVRSQSNTIFQAGQ
DITIKGEVDSFFKQISYGYEGTVVVRSINGQQLHTFWQPKVRLIAPVDLQIGDQFEFSVMVKPVYGRRNEAGFDLEAYYF
SQGWVARVNVKPQSKFEVVSTPNFRSMLYRQIKTWTQNSPSQGMILALTFGDRNGIHEAEWRSLRNSGLIHLVAISGLHI
GMAFAIGYLIGTAMMRLHVSLLWMPFVCGMCIAAIYAWLAGFTLPTQRALLMCGLNVALTMSGMRVTAVQRILITLAAVL
IVTPFAPLSNSFWLSFLAVALVLYQLADRTTRHGWRKWLTVQCSLVTMMIPVSAYFFSGFSVSSALYNLIFIPYFSFVIV
PLLFLALFLTTFVGDITWLWRGIDLSFWPLQRALEWAGSSWIAVSQTATVVFSVGLLLWVCRPVLSWRAQLYAGIALLGF
GFFTPHDERWRVDILDVGHGLAVLIERNHNALLYDTGSSWPEGSYVRSLIVPILNQRGMDSLDGLILSHTDNDHAGGLKD
AETLLSPKWVVASQSGPNWQACHAGEQWEWQGLTMTALWPPQTVNRAYNLQSCVIRLSDPEYGHSLLLAGDVTAVGEWLL
SRQPMDIQSDVIIVPHHGSKTSSTRQFIERVSPQVAIASLAKGNQWQLPHRSVVARYVDAGSHWLDTGEAGQITLTYQAQ
SRQLSTLRHIGNPSWYRQMLRKGVE
Nucleotide
Length: 2238 bp
>NTDB_id=446125 HPK20_RS05080 WP_171933493.1 1102633..1104870(+) (comEC) [Vibrio fluvialis strain A8]
ATGACTCTCTATTCGAATTACTGGATGCTCGCTTCGTTCTCGCTCACCGTATTGTCTGCGCCCCTTTGGCCGTGGATGCC
AGAGTGGGATTTTGCATTCATTTGCCTTGCTGCTCTGATGACCACGCTCGTGGTGAGCCGATTCAGAGTTTTTGGCGGGA
TCGCGCTTGCCTTGTTAGTGATAGTAACACACGGCAACGTTGTGAGATCTCAATCCAACACTATTTTTCAAGCAGGGCAG
GATATTACCATAAAAGGTGAAGTTGACAGCTTTTTTAAGCAAATTAGTTATGGTTATGAAGGGACTGTAGTAGTCAGATC
AATCAATGGACAACAATTGCACACTTTTTGGCAGCCGAAGGTGCGGTTAATTGCTCCGGTCGATCTGCAAATTGGCGATC
AGTTTGAGTTTTCTGTAATGGTTAAACCCGTTTATGGCCGCCGTAATGAGGCCGGTTTTGACTTAGAAGCTTACTACTTC
AGTCAGGGTTGGGTTGCGAGAGTGAACGTTAAACCTCAGTCAAAATTTGAGGTCGTTTCAACGCCGAATTTTCGCAGTAT
GTTGTACCGCCAAATCAAAACATGGACGCAAAACAGCCCTTCTCAAGGCATGATCTTGGCGCTGACATTTGGTGACCGCA
ACGGCATTCATGAAGCTGAATGGCGCTCACTGAGAAATAGCGGACTCATCCATCTGGTGGCGATATCCGGTTTGCATATT
GGCATGGCGTTCGCGATTGGTTATCTGATTGGAACTGCGATGATGCGGCTGCATGTCAGTTTGCTGTGGATGCCTTTCGT
TTGCGGGATGTGTATTGCCGCGATTTATGCATGGCTGGCAGGATTTACCTTGCCGACCCAGCGCGCTTTGTTGATGTGCG
GTCTCAACGTCGCTCTCACGATGAGTGGCATGCGTGTTACGGCGGTGCAGCGAATCCTTATCACCCTTGCTGCGGTATTA
ATCGTGACGCCGTTTGCGCCGCTCTCCAACAGTTTCTGGCTCTCATTCTTGGCGGTCGCGTTGGTTTTGTATCAATTGGC
AGACCGAACAACACGTCATGGCTGGCGAAAATGGCTGACGGTGCAATGCTCGTTAGTCACTATGATGATTCCAGTTTCCG
CGTATTTTTTCTCCGGGTTCAGTGTTTCGTCGGCACTCTATAACCTGATTTTTATTCCTTACTTCAGCTTTGTCATCGTC
CCGCTGCTGTTTCTCGCACTGTTTCTCACTACCTTTGTCGGAGATATCACCTGGCTGTGGCGTGGGATTGACCTGAGCTT
CTGGCCACTGCAGCGGGCACTGGAATGGGCGGGATCGAGCTGGATTGCTGTAAGTCAGACGGCGACGGTCGTATTTAGTG
TTGGCCTTCTCCTTTGGGTTTGCCGCCCGGTATTAAGCTGGCGAGCTCAACTCTATGCGGGAATTGCATTACTTGGCTTT
GGCTTTTTTACCCCACACGATGAGCGTTGGCGAGTTGACATCCTTGATGTCGGGCACGGTCTGGCAGTGTTGATTGAACG
GAATCACAACGCCCTGTTGTACGACACCGGCAGTAGCTGGCCTGAAGGCAGTTACGTTCGCTCTTTGATTGTTCCAATAC
TCAATCAGCGCGGTATGGACTCTCTGGATGGTTTAATTCTCAGTCATACAGACAATGACCACGCTGGTGGTTTAAAAGAT
GCCGAAACCTTATTGTCCCCGAAATGGGTTGTGGCGAGTCAGTCTGGTCCAAATTGGCAGGCTTGCCACGCGGGAGAACA
GTGGGAATGGCAAGGACTGACGATGACTGCACTGTGGCCGCCGCAAACCGTCAATCGTGCTTATAATCTGCAGTCGTGCG
TGATTCGATTAAGCGATCCTGAATACGGTCATTCACTACTTCTGGCTGGCGATGTTACGGCAGTCGGTGAGTGGTTACTC
AGTCGGCAACCCATGGATATTCAGAGCGATGTCATCATTGTGCCTCATCACGGCAGTAAAACGTCCTCAACGCGTCAGTT
TATCGAGCGGGTTTCCCCGCAAGTGGCCATTGCGTCTCTGGCGAAAGGGAATCAATGGCAGTTGCCGCATCGCAGTGTGG
TGGCGCGTTATGTGGATGCGGGGTCCCACTGGCTTGATACTGGCGAGGCGGGGCAAATTACTCTGACTTATCAAGCTCAA
TCCCGGCAGCTATCGACTCTGCGCCACATAGGAAATCCTTCATGGTATAGGCAGATGCTACGTAAAGGGGTAGAATGA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio cholerae strain A1552 | 53.867 | 100 | 0.542 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 | 42.231 | 100 | 0.427 |
| comEC | Vibrio campbellii strain DS40M4 | 41.402 | 100 | 0.42 |