Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | CEG15_RS08075 | Genome accession | NZ_CP022099 |
| Coordinates | 1800045..1802288 (-) | Length | 747 a.a. |
| NCBI ID | WP_088728958.1 | Uniprot ID | - |
| Organism | Vibrio anguillarum strain S3 4/9 | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1795045..1807288
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| CEG15_RS08055 (CEG15_08055) | kdsB | 1796335..1797093 (-) | 759 | WP_088728955.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| CEG15_RS08060 (CEG15_08060) | - | 1797093..1797272 (-) | 180 | WP_013856535.1 | Trm112 family protein | - |
| CEG15_RS08065 (CEG15_08065) | lpxK | 1797253..1798260 (-) | 1008 | WP_088728956.1 | tetraacyldisaccharide 4'-kinase | - |
| CEG15_RS08070 (CEG15_08070) | msbA | 1798264..1800012 (-) | 1749 | WP_088728957.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| CEG15_RS08075 (CEG15_08075) | comEC | 1800045..1802288 (-) | 2244 | WP_088728958.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| CEG15_RS08080 (CEG15_08080) | - | 1802295..1802804 (+) | 510 | WP_010319231.1 | DUF2062 domain-containing protein | - |
| CEG15_RS08085 (CEG15_08085) | lolE | 1802943..1804187 (-) | 1245 | WP_088728959.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| CEG15_RS08090 (CEG15_08090) | lolD | 1804188..1804874 (-) | 687 | WP_088728960.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| CEG15_RS08095 (CEG15_08095) | lolC | 1804867..1806075 (-) | 1209 | WP_088728961.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| CEG15_RS08100 (CEG15_08100) | - | 1806241..1806810 (+) | 570 | WP_017048079.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 747 a.a. Molecular weight: 83441.31 Da Isoelectric Point: 9.5312
>NTDB_id=236736 CEG15_RS08075 WP_088728958.1 1800045..1802288(-) (comEC) [Vibrio anguillarum strain S3 4/9]
MTLLSNYWTLASFSLTAISASYWPWMPDWKWSMPLFAILILSVGYRKLRFLSGVTMALMVIVISGNLLREQSNTIFQTGS
DITINGQVNSFFRQITHGYEGTISIRSINGQTLGIFLRPKVRLIAPLPLKYGDVAEFSITVKPIFGRLNETGFDAEAYSL
SQGIVARATVNNGQSYRIESLPEWRAQWYQKIKTWLADDPNLGILMALTFGERSDISTAQWQALRDSGLIHLVAISGLHI
GIAFGFGYGLGLVLMRLHHRMGWSPFVVGSLCALGYAWLAGFTLPTQRALIMCLLNVAMIVLNVRINTLQRLLMTLSAVL
LIDPFAALSSSFWMSFIAVSVVFYQLSLPQTNRFFLWRLLTMQIGLVLCMLPVTAYFFGGVSTSAALYNLVFIPWFSFVV
VPAIFVALVCTVLTFDGAPMVWHWVAKLLIPVDWSIHYSSLSWFPISGSVLALLSSALLLFVFSPLISRFALAVYSLIVM
VSWLTIPIKTGWRIDVLDVGHGLAVLIEKEGRYVLYDTGASWQGGDHIQATVAPVLNKRGATQLDGLILSHLDNDHAGGR
AEVERVWQPKWKRASQTIAGYQPCIAGEQWQWQQLSFDVIWPPKTVARAYNPHSCVIRIFDPNTGFSVLLPGDVDAMSEW
LLARTEQSLQSHILLVPHHGSRTSSTVALIRKVNPEVAIASLAKGGRWQLPSAQVVQRYQQHGANWLDTGEAGQISVIFG
MQNYQVSSLRISRSQAWYRQMLRNEVE
MTLLSNYWTLASFSLTAISASYWPWMPDWKWSMPLFAILILSVGYRKLRFLSGVTMALMVIVISGNLLREQSNTIFQTGS
DITINGQVNSFFRQITHGYEGTISIRSINGQTLGIFLRPKVRLIAPLPLKYGDVAEFSITVKPIFGRLNETGFDAEAYSL
SQGIVARATVNNGQSYRIESLPEWRAQWYQKIKTWLADDPNLGILMALTFGERSDISTAQWQALRDSGLIHLVAISGLHI
GIAFGFGYGLGLVLMRLHHRMGWSPFVVGSLCALGYAWLAGFTLPTQRALIMCLLNVAMIVLNVRINTLQRLLMTLSAVL
LIDPFAALSSSFWMSFIAVSVVFYQLSLPQTNRFFLWRLLTMQIGLVLCMLPVTAYFFGGVSTSAALYNLVFIPWFSFVV
VPAIFVALVCTVLTFDGAPMVWHWVAKLLIPVDWSIHYSSLSWFPISGSVLALLSSALLLFVFSPLISRFALAVYSLIVM
VSWLTIPIKTGWRIDVLDVGHGLAVLIEKEGRYVLYDTGASWQGGDHIQATVAPVLNKRGATQLDGLILSHLDNDHAGGR
AEVERVWQPKWKRASQTIAGYQPCIAGEQWQWQQLSFDVIWPPKTVARAYNPHSCVIRIFDPNTGFSVLLPGDVDAMSEW
LLARTEQSLQSHILLVPHHGSRTSSTVALIRKVNPEVAIASLAKGGRWQLPSAQVVQRYQQHGANWLDTGEAGQISVIFG
MQNYQVSSLRISRSQAWYRQMLRNEVE
Nucleotide
Download Length: 2244 bp
>NTDB_id=236736 CEG15_RS08075 WP_088728958.1 1800045..1802288(-) (comEC) [Vibrio anguillarum strain S3 4/9]
ATGACTCTCTTATCTAATTACTGGACCCTAGCTTCGTTTTCGCTAACCGCCATTTCTGCTTCTTATTGGCCTTGGATGCC
AGATTGGAAATGGAGTATGCCTCTATTCGCCATTCTTATACTCTCTGTCGGTTATAGAAAACTGCGCTTTCTCTCAGGAG
TAACAATGGCTTTAATGGTCATTGTGATCAGCGGGAACTTATTACGTGAGCAGTCCAACACTATTTTTCAGACAGGTTCG
GATATTACCATAAACGGACAGGTTAACAGCTTTTTTAGACAAATTACTCATGGTTATGAAGGAACAATTTCGATTCGATC
AATCAATGGTCAAACCTTAGGCATTTTTTTGCGGCCCAAGGTGCGATTAATTGCTCCTTTACCGTTGAAGTATGGTGACG
TGGCTGAATTCTCCATCACAGTAAAACCTATCTTTGGTCGGTTAAATGAGACCGGGTTTGATGCAGAGGCGTACTCTTTG
AGCCAAGGTATTGTGGCTCGTGCAACGGTTAATAATGGACAGTCTTATCGCATTGAATCATTACCAGAATGGCGAGCACA
ATGGTATCAGAAGATCAAAACCTGGCTCGCTGATGATCCCAATCTTGGTATTTTAATGGCGCTTACGTTTGGTGAGCGCA
GTGATATTTCAACGGCACAGTGGCAAGCATTACGAGACAGTGGGCTGATTCATTTGGTTGCCATTTCTGGTCTGCACATT
GGCATCGCGTTTGGTTTTGGTTATGGCCTTGGGCTCGTATTGATGCGTTTGCATCATCGCATGGGGTGGTCGCCATTCGT
CGTAGGAAGCTTGTGCGCACTAGGCTATGCTTGGTTGGCTGGCTTTACCTTGCCCACTCAACGCGCACTGATCATGTGTT
TGCTTAATGTGGCGATGATTGTCCTCAATGTCCGTATCAATACGTTGCAGCGACTGCTTATGACGTTATCGGCGGTGCTG
CTCATCGATCCTTTTGCTGCGTTATCCAGTAGCTTTTGGATGTCCTTTATTGCTGTCTCAGTGGTGTTCTACCAACTTTC
TCTACCTCAAACTAATCGATTTTTTTTATGGCGTTTATTGACGATGCAGATTGGGTTAGTGCTGTGTATGCTTCCGGTAA
CCGCTTATTTTTTCGGTGGGGTCAGTACCAGTGCGGCGCTCTATAACTTAGTGTTTATTCCTTGGTTCAGTTTTGTGGTG
GTTCCGGCGATCTTTGTGGCTTTAGTGTGTACGGTTTTGACTTTTGATGGCGCACCTATGGTGTGGCATTGGGTGGCCAA
ATTACTCATTCCAGTTGATTGGTCAATACATTATTCAAGCCTCAGTTGGTTTCCTATTTCTGGGTCTGTTTTGGCTCTGT
TGAGCAGTGCTCTGTTGCTGTTTGTGTTTTCACCGTTGATCTCGCGTTTTGCACTTGCGGTTTACAGCCTCATTGTGATG
GTTTCTTGGCTCACCATTCCGATAAAAACAGGTTGGCGGATTGATGTGTTAGATGTTGGGCATGGGTTGGCAGTGTTGAT
CGAAAAAGAAGGACGCTACGTGCTTTATGATACTGGTGCAAGTTGGCAGGGTGGTGATCACATCCAAGCAACAGTTGCCC
CCGTGTTAAACAAACGGGGAGCTACGCAATTAGACGGGTTGATTTTGAGTCACTTGGATAACGACCATGCAGGTGGAAGA
GCGGAAGTTGAGCGTGTTTGGCAGCCCAAATGGAAACGTGCTAGTCAAACGATAGCGGGTTACCAACCTTGTATTGCTGG
CGAACAATGGCAGTGGCAACAACTCTCTTTTGATGTTATCTGGCCACCAAAAACTGTAGCGCGGGCTTACAATCCTCACT
CTTGTGTGATTCGAATCTTTGACCCAAATACCGGTTTTTCAGTGCTTTTACCCGGGGATGTGGATGCCATGAGTGAGTGG
TTACTTGCGCGAACAGAGCAATCCTTACAAAGTCATATCCTGCTTGTACCGCACCACGGTAGTAGGACATCTTCTACTGT
TGCTTTGATTCGTAAGGTGAATCCTGAGGTCGCTATTGCCTCGCTCGCCAAAGGCGGACGCTGGCAATTACCTTCCGCAC
AGGTTGTTCAACGCTACCAGCAACATGGAGCAAACTGGTTGGATACGGGTGAAGCAGGGCAAATTAGCGTTATTTTTGGC
ATGCAGAATTATCAAGTTAGCAGCTTGCGTATCTCTCGCTCTCAGGCTTGGTATAGGCAGATGCTCCGTAACGAGGTAGA
ATGA
ATGACTCTCTTATCTAATTACTGGACCCTAGCTTCGTTTTCGCTAACCGCCATTTCTGCTTCTTATTGGCCTTGGATGCC
AGATTGGAAATGGAGTATGCCTCTATTCGCCATTCTTATACTCTCTGTCGGTTATAGAAAACTGCGCTTTCTCTCAGGAG
TAACAATGGCTTTAATGGTCATTGTGATCAGCGGGAACTTATTACGTGAGCAGTCCAACACTATTTTTCAGACAGGTTCG
GATATTACCATAAACGGACAGGTTAACAGCTTTTTTAGACAAATTACTCATGGTTATGAAGGAACAATTTCGATTCGATC
AATCAATGGTCAAACCTTAGGCATTTTTTTGCGGCCCAAGGTGCGATTAATTGCTCCTTTACCGTTGAAGTATGGTGACG
TGGCTGAATTCTCCATCACAGTAAAACCTATCTTTGGTCGGTTAAATGAGACCGGGTTTGATGCAGAGGCGTACTCTTTG
AGCCAAGGTATTGTGGCTCGTGCAACGGTTAATAATGGACAGTCTTATCGCATTGAATCATTACCAGAATGGCGAGCACA
ATGGTATCAGAAGATCAAAACCTGGCTCGCTGATGATCCCAATCTTGGTATTTTAATGGCGCTTACGTTTGGTGAGCGCA
GTGATATTTCAACGGCACAGTGGCAAGCATTACGAGACAGTGGGCTGATTCATTTGGTTGCCATTTCTGGTCTGCACATT
GGCATCGCGTTTGGTTTTGGTTATGGCCTTGGGCTCGTATTGATGCGTTTGCATCATCGCATGGGGTGGTCGCCATTCGT
CGTAGGAAGCTTGTGCGCACTAGGCTATGCTTGGTTGGCTGGCTTTACCTTGCCCACTCAACGCGCACTGATCATGTGTT
TGCTTAATGTGGCGATGATTGTCCTCAATGTCCGTATCAATACGTTGCAGCGACTGCTTATGACGTTATCGGCGGTGCTG
CTCATCGATCCTTTTGCTGCGTTATCCAGTAGCTTTTGGATGTCCTTTATTGCTGTCTCAGTGGTGTTCTACCAACTTTC
TCTACCTCAAACTAATCGATTTTTTTTATGGCGTTTATTGACGATGCAGATTGGGTTAGTGCTGTGTATGCTTCCGGTAA
CCGCTTATTTTTTCGGTGGGGTCAGTACCAGTGCGGCGCTCTATAACTTAGTGTTTATTCCTTGGTTCAGTTTTGTGGTG
GTTCCGGCGATCTTTGTGGCTTTAGTGTGTACGGTTTTGACTTTTGATGGCGCACCTATGGTGTGGCATTGGGTGGCCAA
ATTACTCATTCCAGTTGATTGGTCAATACATTATTCAAGCCTCAGTTGGTTTCCTATTTCTGGGTCTGTTTTGGCTCTGT
TGAGCAGTGCTCTGTTGCTGTTTGTGTTTTCACCGTTGATCTCGCGTTTTGCACTTGCGGTTTACAGCCTCATTGTGATG
GTTTCTTGGCTCACCATTCCGATAAAAACAGGTTGGCGGATTGATGTGTTAGATGTTGGGCATGGGTTGGCAGTGTTGAT
CGAAAAAGAAGGACGCTACGTGCTTTATGATACTGGTGCAAGTTGGCAGGGTGGTGATCACATCCAAGCAACAGTTGCCC
CCGTGTTAAACAAACGGGGAGCTACGCAATTAGACGGGTTGATTTTGAGTCACTTGGATAACGACCATGCAGGTGGAAGA
GCGGAAGTTGAGCGTGTTTGGCAGCCCAAATGGAAACGTGCTAGTCAAACGATAGCGGGTTACCAACCTTGTATTGCTGG
CGAACAATGGCAGTGGCAACAACTCTCTTTTGATGTTATCTGGCCACCAAAAACTGTAGCGCGGGCTTACAATCCTCACT
CTTGTGTGATTCGAATCTTTGACCCAAATACCGGTTTTTCAGTGCTTTTACCCGGGGATGTGGATGCCATGAGTGAGTGG
TTACTTGCGCGAACAGAGCAATCCTTACAAAGTCATATCCTGCTTGTACCGCACCACGGTAGTAGGACATCTTCTACTGT
TGCTTTGATTCGTAAGGTGAATCCTGAGGTCGCTATTGCCTCGCTCGCCAAAGGCGGACGCTGGCAATTACCTTCCGCAC
AGGTTGTTCAACGCTACCAGCAACATGGAGCAAACTGGTTGGATACGGGTGAAGCAGGGCAAATTAGCGTTATTTTTGGC
ATGCAGAATTATCAAGTTAGCAGCTTGCGTATCTCTCGCTCTCAGGCTTGGTATAGGCAGATGCTCCGTAACGAGGTAGA
ATGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio cholerae strain A1552 |
52.005 |
100 |
0.521 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 |
44.211 |
100 |
0.45 |
| comEC | Vibrio campbellii strain DS40M4 |
43.874 |
100 |
0.446 |