Detailed information
Overview
| Field | Value | Field | Value |
|---|---|---|---|
| Name | comEC | Type | Machinery gene |
| Locus tag | CEQ50_RS09220 | Genome accession | NZ_CP022103 |
| Coordinates | 2027041..2029284 (-) | Length | 747 a.a. |
| NCBI ID | WP_088732650.1 | Uniprot ID | - |
| Organism | Vibrio anguillarum strain CNEVA NB11008 | | |
| Function | ssDNA transport through the inner membrane (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 2022041..2034284
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| CEQ50_RS09195 (CEQ50_09230) | - | 2022776..2023294 (+) | 519 | WP_017045183.1 | cytochrome b | - |
| CEQ50_RS09200 (CEQ50_09235) | kdsB | 2023337..2024089 (-) | 753 | WP_088732648.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| CEQ50_RS09205 (CEQ50_09240) | - | 2024089..2024268 (-) | 180 | WP_013856535.1 | Trm112 family protein | - |
| CEQ50_RS09210 (CEQ50_09245) | lpxK | 2024249..2025256 (-) | 1008 | WP_088732649.1 | tetraacyldisaccharide 4'-kinase | - |
| CEQ50_RS09215 (CEQ50_09250) | msbA | 2025260..2027008 (-) | 1749 | WP_017045180.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| CEQ50_RS09220 (CEQ50_09255) | comEC | 2027041..2029284 (-) | 2244 | WP_088732650.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| CEQ50_RS09225 (CEQ50_09260) | - | 2029291..2029800 (+) | 510 | WP_010319231.1 | DUF2062 domain-containing protein | - |
| CEQ50_RS09230 (CEQ50_09265) | lolE | 2029939..2031183 (-) | 1245 | WP_013856531.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| CEQ50_RS09235 (CEQ50_09270) | lolD | 2031184..2031870 (-) | 687 | WP_047687976.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| CEQ50_RS09240 (CEQ50_09275) | lolC | 2031863..2033071 (-) | 1209 | WP_088732651.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| CEQ50_RS09245 (CEQ50_09280) | - | 2033237..2033806 (+) | 570 | WP_017048079.1 | hypothetical protein | - |
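
The coordinates in this record are consistent with 1-based, inclusive ranges, so the size column, the protein length, and the context window can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: the function names are hypothetical, and the 5,000 bp flank is inferred from the Location line rather than stated by the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Window extending `flank` bp on each side (flank size is an assumption)."""
    return max(1, start - flank), end + flank


# comEC coordinates taken from the table above
COMEC_START, COMEC_END = 2027041, 2029284

cds_bp = span_length(COMEC_START, COMEC_END)
print(cds_bp)                                  # 2244
print(protein_length(cds_bp))                  # 747
print(context_window(COMEC_START, COMEC_END))  # (2022041, 2034284)
```

The same `span_length` check reproduces the Size (bp) value of the other rows, for example 2022776..2023294 gives 519.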
Sequence
Protein
Length: 747 a.a.; Molecular weight: 83405.23 Da; Isoelectric point: 9.4397
>NTDB_id=236832 CEQ50_RS09220 WP_088732650.1 2027041..2029284(-) (comEC) [Vibrio anguillarum strain CNEVA NB11008]
MTLLSNYWTLASFSLTAISASYWPWMPDWKWSMPLFAILILSVGYRKLRFLSGVTMALMVIVISGNLLREQSNTIFQTGS
DITINGQVNSFFRQITHGYEGTISIRSINGQTLGIFLRPKVRLIAPLALKYGDVAEFSITVKPIFGRLNETGFDAEAYSL
SQGIVARATVNNGQSYRIESLPEWRAQWYQKIKTWLADDPNLGILMALTFGERSDISTAQWQALRDSGLIHLVAISGLHI
GIAFGFGYSLGLVLMRLHHRMGWSPFVVGSLCALGYAWLAGFTLPTQRALIMCLLNVAMIVLNVRINTLQRLLMTLSAVL
LIDPFAALSSSFWMSFIAVSVVFYQLSLPQTQRFFLWRLLTMQIGLVLCMLPVTAYFFGGVSTSAALYNLVFIPWFSFVV
VPAIFVALVCTVLTFDGAPMVWHWVAKLLIPVDWSIHYSSLSWFPISGAVLALLSSALLLFVFSPLISRFALAVYSIIVM
VSWLTIPIKTGWRIDVLDVGHGLAVLIEKEGRYVLYDTGASWQGGDHIQATVAPVLNKRGATQLDGLILSHLDNDHAGGR
AEVERVWQPKWKRASQTIAGYQPCIAGEQWQWQQLSFDVIWPPKTVARAYNPHSCVIRIFDQNTGFSVLLPGDVDAMSEW
LLARTEQSLQSHILLVPHHGSRTSSTVALISKVNPEVAIASLAKGGRWQLPSAQVVQRYQQHGANWLDTGEAGQISVIFG
MQNYQVSSLRISRSQAWYRQMLRNEVE
Nucleotide
Length: 2244 bp
>NTDB_id=236832 CEQ50_RS09220 WP_088732650.1 2027041..2029284(-) (comEC) [Vibrio anguillarum strain CNEVA NB11008]
ATGACTCTCTTATCTAATTACTGGACCCTAGCTTCGTTTTCGCTAACCGCCATTTCTGCTTCTTATTGGCCTTGGATGCC
AGATTGGAAATGGAGTATGCCTCTATTCGCCATTCTCATACTCTCTGTCGGTTATAGAAAACTGCGCTTTCTCTCAGGAG
TAACAATGGCTTTAATGGTCATTGTGATCAGCGGGAACTTATTACGTGAGCAGTCCAACACTATTTTTCAGACAGGTTCG
GATATTACCATAAACGGACAGGTTAACAGCTTTTTTAGACAAATTACTCATGGTTATGAAGGAACAATTTCGATTCGATC
AATCAATGGTCAAACCTTAGGCATTTTTTTGCGGCCCAAGGTGCGATTAATAGCTCCTTTAGCGTTGAAGTATGGTGACG
TGGCTGAATTCTCCATCACAGTAAAACCTATCTTTGGTCGGTTAAATGAGACCGGGTTTGATGCAGAGGCGTACTCTTTG
AGCCAAGGTATTGTGGCTCGTGCAACGGTTAATAATGGACAGTCTTATCGCATTGAATCATTACCAGAATGGCGAGCACA
ATGGTACCAGAAGATCAAAACCTGGCTCGCTGATGATCCCAATCTTGGTATTTTAATGGCGCTTACGTTTGGTGAGCGCA
GTGATATTTCAACGGCACAGTGGCAAGCATTACGAGACAGTGGGCTGATTCATTTGGTTGCCATTTCTGGTCTGCACATT
GGCATCGCGTTTGGTTTTGGTTATAGCCTTGGGCTCGTATTGATGCGTTTGCATCATCGCATGGGGTGGTCGCCATTCGT
CGTAGGAAGCTTGTGCGCACTAGGCTATGCTTGGTTGGCTGGCTTTACCTTGCCCACTCAACGCGCGCTCATCATGTGTT
TGCTCAATGTGGCGATGATTGTTCTCAATGTCCGTATCAATACTTTGCAGCGACTGCTTATGACGTTATCGGCGGTGCTG
CTTATCGATCCTTTTGCTGCGTTATCCAGTAGCTTTTGGATGTCCTTTATTGCTGTCTCAGTGGTGTTCTACCAACTTTC
TCTACCTCAAACTCAGCGATTTTTTTTATGGCGTTTATTGACGATGCAGATTGGGTTAGTGCTGTGTATGCTTCCGGTAA
CTGCTTATTTTTTCGGTGGAGTCAGTACCAGTGCGGCGCTCTATAACTTAGTGTTTATTCCTTGGTTCAGTTTTGTGGTG
GTTCCGGCGATCTTTGTGGCTTTAGTGTGTACGGTTTTGACTTTTGATGGCGCACCTATGGTGTGGCATTGGGTGGCCAA
ATTACTCATTCCAGTTGATTGGTCAATACATTATTCAAGCCTCAGTTGGTTTCCTATTTCTGGGGCTGTTTTGGCTCTGT
TGAGCAGTGCTCTGTTGCTGTTTGTGTTTTCACCGTTGATCTCGCGTTTTGCACTGGCGGTTTACAGCATCATTGTGATG
GTTTCTTGGCTCACCATTCCGATAAAAACAGGTTGGCGGATTGATGTGTTAGATGTTGGGCATGGGTTGGCGGTGTTGAT
CGAAAAAGAAGGACGCTACGTGCTTTATGATACTGGTGCAAGTTGGCAAGGCGGTGATCACATCCAGGCAACAGTTGCCC
CCGTGTTAAACAAACGGGGAGCTACGCAATTAGACGGGTTGATTTTGAGTCACTTGGATAACGACCATGCAGGTGGAAGA
GCGGAAGTTGAGCGTGTTTGGCAGCCCAAATGGAAACGTGCTAGTCAAACGATAGCGGGTTACCAACCTTGTATTGCTGG
TGAACAATGGCAGTGGCAACAACTCTCTTTTGATGTTATCTGGCCACCAAAAACCGTAGCGCGTGCTTACAATCCTCACT
CTTGTGTGATTCGAATCTTTGACCAAAATACCGGTTTTTCAGTGCTTTTACCCGGGGATGTGGATGCCATGAGTGAGTGG
TTACTTGCGCGAACAGAGCAATCCTTACAAAGTCATATCCTGCTTGTACCACACCACGGTAGTAGGACATCTTCTACTGT
TGCTTTGATTAGTAAGGTGAATCCTGAGGTCGCTATTGCTTCGCTCGCCAAAGGCGGACGCTGGCAATTACCTTCCGCAC
AGGTTGTTCAACGCTACCAACAACATGGAGCAAACTGGTTGGATACGGGTGAAGCAGGGCAAATTAGCGTTATTTTTGGC
ATGCAGAATTATCAAGTTAGCAGCTTGCGTATCTCTCGCTCTCAGGCTTGGTATAGGCAGATGCTCCGTAACGAGGTAGA
ATGA
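
The protein and nucleotide FASTA records share the same header layout: an NTDB identifier, the locus tag, the protein accession, the coordinates with strand, the gene name in parentheses, and the organism in square brackets. A minimal Python sketch for splitting such a header into fields follows. The regular expression and the `parse_header` name are assumptions based only on the header shown in this record, not a documented format of the database.

```python
import re

# Pattern inferred from the single header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a FASTA header of the form used above into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (
    ">NTDB_id=236832 CEQ50_RS09220 WP_088732650.1 "
    "2027041..2029284(-) (comEC) [Vibrio anguillarum strain CNEVA NB11008]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```

Note that the gene lies on the minus strand, yet the nucleotide sequence above begins with ATG and ends with TGA, so it is listed in coding orientation rather than as the forward genomic strand.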
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio cholerae strain A1552 | 52.139 | 100 | 0.522 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 | 44.079 | 100 | 0.448 |
| comEC | Vibrio campbellii strain DS40M4 | 43.874 | 100 | 0.446 |