Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | CEA93_RS09375 | Genome accession | NZ_CP021980 |
| Coordinates | 2003925..2006168 (-) | Length | 747 a.a. |
| NCBI ID | WP_029189791.1 | Uniprot ID | - |
| Organism | Vibrio anguillarum strain 87-9-116 | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1998925..2011168
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| CEA93_RS09350 (CEA93_09350) | - | 1999660..2000178 (+) | 519 | WP_013856537.1 | cytochrome b | - |
| CEA93_RS09355 (CEA93_09355) | kdsB | 2000221..2000973 (-) | 753 | WP_013856536.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| CEA93_RS09360 (CEA93_09360) | - | 2000973..2001152 (-) | 180 | WP_013856535.1 | Trm112 family protein | - |
| CEA93_RS09365 (CEA93_09365) | lpxK | 2001133..2002140 (-) | 1008 | WP_013856534.1 | tetraacyldisaccharide 4'-kinase | - |
| CEA93_RS09370 (CEA93_09370) | msbA | 2002144..2003892 (-) | 1749 | WP_013856533.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| CEA93_RS09375 (CEA93_09375) | comEC | 2003925..2006168 (-) | 2244 | WP_029189791.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| CEA93_RS09380 (CEA93_09380) | - | 2006175..2006684 (+) | 510 | WP_010319231.1 | DUF2062 domain-containing protein | - |
| CEA93_RS09385 (CEA93_09385) | lolE | 2006823..2008067 (-) | 1245 | WP_013856531.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| CEA93_RS09390 (CEA93_09390) | lolD | 2008068..2008754 (-) | 687 | WP_013856530.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| CEA93_RS09395 (CEA93_09395) | lolC | 2008747..2009955 (-) | 1209 | WP_013856529.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| CEA93_RS09400 (CEA93_09400) | - | 2010121..2010690 (+) | 570 | WP_013856528.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 747 a.a. Molecular weight: 83533.44 Da Isoelectric Point: 9.4397
>NTDB_id=235576 CEA93_RS09375 WP_029189791.1 2003925..2006168(-) (comEC) [Vibrio anguillarum strain 87-9-116]
MTLLSNYWTLASFSLTAISASYWPWMPDWKWIMPLFAILILSVVYRKLRFLSGVTMALMVIVISGNLLREQSNTIFQTGS
DITINGQVNSFFRQITHGYEGTISIRSINGQTLGIFLRPKVRLIAPLPLKYGDVAEFSITVKPIFGRLNETGFDAEAYSL
SQGIVARATVNNGQSYRIESLPEWRAQWYQKIKTWLADDPNLGILMALTFGERSDISTAQWQALRDSGLIHLVAISGLHI
GIAFGFGYSLGLVLMRLHHRMGWSPFVVGSLCALGYAWLAGFTLPTQRALIMCLLNVAMIVLNVRINTLQRLLMTLSAVL
LIDPFAALSSSFWMSFIAVSVVFYQLSLPQTNRFFLWRLLTMQIGLVLCMLPVTAYFFGGVSTSAALYNLVFIPWFSFVV
VPAIFVALVCTFLTFDGAPMVWHWVAKLLIPVDWSIHYSSLSWFPISGAVLALLSSALLLFVFSPLISRFALAVYSLIVM
VSWLTIPIKTGWRIDVLDVGHGLAVLIEKEGRYVLYDTGASWQGGDHIQATVAPVLNKRGATQLDGLILSHLDNDHAGGR
AEVERVWQPKWKRASQTIAGYQPCIAGEQWQWQQLSFDVIWPPKTVARAYNPHSCVIRIFDQNTGFSVLLPGDVDAMSEW
LLARTEQSLQSHILLVPHHGSRTSSTVALISKVNPEVAIASLAKGGRWQLPSAQVVQRYQQHGANWLDTGEAGQISVIFG
MQNYQVSSLRISRSQAWYRQMLRNEVE
MTLLSNYWTLASFSLTAISASYWPWMPDWKWIMPLFAILILSVVYRKLRFLSGVTMALMVIVISGNLLREQSNTIFQTGS
DITINGQVNSFFRQITHGYEGTISIRSINGQTLGIFLRPKVRLIAPLPLKYGDVAEFSITVKPIFGRLNETGFDAEAYSL
SQGIVARATVNNGQSYRIESLPEWRAQWYQKIKTWLADDPNLGILMALTFGERSDISTAQWQALRDSGLIHLVAISGLHI
GIAFGFGYSLGLVLMRLHHRMGWSPFVVGSLCALGYAWLAGFTLPTQRALIMCLLNVAMIVLNVRINTLQRLLMTLSAVL
LIDPFAALSSSFWMSFIAVSVVFYQLSLPQTNRFFLWRLLTMQIGLVLCMLPVTAYFFGGVSTSAALYNLVFIPWFSFVV
VPAIFVALVCTFLTFDGAPMVWHWVAKLLIPVDWSIHYSSLSWFPISGAVLALLSSALLLFVFSPLISRFALAVYSLIVM
VSWLTIPIKTGWRIDVLDVGHGLAVLIEKEGRYVLYDTGASWQGGDHIQATVAPVLNKRGATQLDGLILSHLDNDHAGGR
AEVERVWQPKWKRASQTIAGYQPCIAGEQWQWQQLSFDVIWPPKTVARAYNPHSCVIRIFDQNTGFSVLLPGDVDAMSEW
LLARTEQSLQSHILLVPHHGSRTSSTVALISKVNPEVAIASLAKGGRWQLPSAQVVQRYQQHGANWLDTGEAGQISVIFG
MQNYQVSSLRISRSQAWYRQMLRNEVE
Nucleotide
Download Length: 2244 bp
>NTDB_id=235576 CEA93_RS09375 WP_029189791.1 2003925..2006168(-) (comEC) [Vibrio anguillarum strain 87-9-116]
ATGACTCTCTTATCTAATTACTGGACCCTAGCTTCGTTTTCGCTAACCGCCATTTCTGCTTCTTATTGGCCTTGGATGCC
AGATTGGAAATGGATTATGCCTCTATTCGCCATTCTTATACTCTCTGTCGTTTATAGAAAACTGCGCTTTCTCTCAGGAG
TAACAATGGCTTTAATGGTCATTGTGATCAGCGGGAACTTATTACGTGAGCAGTCCAACACTATTTTTCAGACAGGTTCG
GATATTACCATAAACGGACAGGTTAACAGCTTTTTTAGACAAATTACTCATGGTTATGAAGGAACAATTTCGATTCGATC
AATCAATGGTCAAACCTTAGGCATTTTTTTGCGGCCCAAGGTGCGATTAATAGCTCCTTTACCGTTGAAGTATGGTGACG
TGGCTGAATTCTCCATCACAGTAAAACCTATCTTTGGTCGGTTAAATGAGACCGGGTTTGATGCAGAGGCGTACTCTTTG
AGCCAAGGTATTGTGGCTCGTGCAACGGTTAATAATGGACAGTCTTATCGCATTGAATCATTACCAGAATGGCGAGCGCA
ATGGTACCAGAAGATCAAAACCTGGCTCGCTGATGATCCCAATCTTGGTATTTTAATGGCGCTTACGTTTGGTGAGCGCA
GTGATATTTCAACGGCACAGTGGCAAGCATTACGAGACAGTGGGCTGATTCATTTGGTTGCCATTTCTGGTCTGCACATT
GGCATCGCGTTTGGTTTTGGTTATAGCCTTGGGCTCGTATTGATGCGTTTGCATCATCGCATGGGGTGGTCGCCATTCGT
CGTAGGAAGCTTGTGCGCACTAGGCTATGCTTGGTTGGCTGGCTTTACCTTGCCCACTCAACGCGCGCTCATCATGTGTT
TGCTCAATGTGGCGATGATTGTTCTCAATGTCCGTATCAATACTTTGCAGCGACTGCTTATGACGTTATCGGCGGTGCTG
CTCATCGATCCTTTTGCTGCGTTATCCAGTAGCTTTTGGATGTCCTTTATTGCTGTCTCAGTGGTGTTCTACCAACTTTC
TCTACCTCAAACTAATCGATTTTTTTTATGGCGTTTATTGACGATGCAGATTGGGTTAGTGCTGTGTATGCTTCCGGTAA
CCGCTTATTTTTTCGGTGGGGTCAGTACCAGTGCGGCGCTCTATAACTTAGTGTTTATTCCTTGGTTCAGTTTTGTGGTG
GTTCCGGCGATCTTTGTGGCTTTAGTCTGTACGTTTTTGACTTTTGATGGCGCACCTATGGTGTGGCATTGGGTGGCCAA
ATTGCTCATTCCAGTTGATTGGTCAATACATTATTCAAGCCTCAGTTGGTTTCCTATTTCTGGGGCTGTTTTGGCTCTGT
TGAGCAGTGCTCTGTTGCTGTTTGTGTTTTCACCGTTGATCTCGCGTTTTGCACTTGCGGTTTACAGCCTCATTGTGATG
GTTTCTTGGCTCACCATTCCGATAAAAACAGGTTGGCGGATTGATGTGTTAGATGTTGGGCATGGGTTAGCGGTGTTGAT
CGAAAAAGAAGGACGCTACGTGCTTTATGATACTGGTGCAAGTTGGCAAGGTGGTGATCACATCCAAGCAACAGTTGCCC
CCGTGTTAAACAAACGGGGAGCTACGCAATTAGACGGGTTGATTTTGAGTCACTTGGATAACGACCATGCAGGTGGAAGA
GCGGAAGTTGAGCGTGTTTGGCAGCCCAAATGGAAACGTGCTAGTCAAACGATAGCGGGTTACCAACCTTGTATTGCTGG
CGAACAATGGCAGTGGCAACAACTCTCTTTTGATGTTATCTGGCCACCAAAAACTGTAGCGCGGGCTTACAATCCTCACT
CTTGTGTGATTCGAATCTTTGACCAAAATACCGGTTTTTCAGTGCTTTTACCCGGGGATGTGGATGCTATGAGTGAGTGG
TTACTTGCGCGAACAGAGCAATCCTTACAAAGTCATATCCTGCTTGTACCACACCACGGTAGTAGGACATCTTCTACTGT
TGCTTTGATTAGTAAGGTGAACCCTGAAGTCGCTATTGCCTCGCTCGCTAAAGGCGGACGCTGGCAATTACCTTCCGCAC
AGGTTGTTCAACGCTACCAACAACATGGAGCAAACTGGTTGGATACGGGTGAAGCAGGGCAAATTAGCGTTATTTTTGGC
ATGCAGAATTATCAAGTTAGCAGCTTGCGTATCTCTCGCTCTCAGGCTTGGTATAGGCAGATGCTCCGTAACGAGGTAGA
ATGA
ATGACTCTCTTATCTAATTACTGGACCCTAGCTTCGTTTTCGCTAACCGCCATTTCTGCTTCTTATTGGCCTTGGATGCC
AGATTGGAAATGGATTATGCCTCTATTCGCCATTCTTATACTCTCTGTCGTTTATAGAAAACTGCGCTTTCTCTCAGGAG
TAACAATGGCTTTAATGGTCATTGTGATCAGCGGGAACTTATTACGTGAGCAGTCCAACACTATTTTTCAGACAGGTTCG
GATATTACCATAAACGGACAGGTTAACAGCTTTTTTAGACAAATTACTCATGGTTATGAAGGAACAATTTCGATTCGATC
AATCAATGGTCAAACCTTAGGCATTTTTTTGCGGCCCAAGGTGCGATTAATAGCTCCTTTACCGTTGAAGTATGGTGACG
TGGCTGAATTCTCCATCACAGTAAAACCTATCTTTGGTCGGTTAAATGAGACCGGGTTTGATGCAGAGGCGTACTCTTTG
AGCCAAGGTATTGTGGCTCGTGCAACGGTTAATAATGGACAGTCTTATCGCATTGAATCATTACCAGAATGGCGAGCGCA
ATGGTACCAGAAGATCAAAACCTGGCTCGCTGATGATCCCAATCTTGGTATTTTAATGGCGCTTACGTTTGGTGAGCGCA
GTGATATTTCAACGGCACAGTGGCAAGCATTACGAGACAGTGGGCTGATTCATTTGGTTGCCATTTCTGGTCTGCACATT
GGCATCGCGTTTGGTTTTGGTTATAGCCTTGGGCTCGTATTGATGCGTTTGCATCATCGCATGGGGTGGTCGCCATTCGT
CGTAGGAAGCTTGTGCGCACTAGGCTATGCTTGGTTGGCTGGCTTTACCTTGCCCACTCAACGCGCGCTCATCATGTGTT
TGCTCAATGTGGCGATGATTGTTCTCAATGTCCGTATCAATACTTTGCAGCGACTGCTTATGACGTTATCGGCGGTGCTG
CTCATCGATCCTTTTGCTGCGTTATCCAGTAGCTTTTGGATGTCCTTTATTGCTGTCTCAGTGGTGTTCTACCAACTTTC
TCTACCTCAAACTAATCGATTTTTTTTATGGCGTTTATTGACGATGCAGATTGGGTTAGTGCTGTGTATGCTTCCGGTAA
CCGCTTATTTTTTCGGTGGGGTCAGTACCAGTGCGGCGCTCTATAACTTAGTGTTTATTCCTTGGTTCAGTTTTGTGGTG
GTTCCGGCGATCTTTGTGGCTTTAGTCTGTACGTTTTTGACTTTTGATGGCGCACCTATGGTGTGGCATTGGGTGGCCAA
ATTGCTCATTCCAGTTGATTGGTCAATACATTATTCAAGCCTCAGTTGGTTTCCTATTTCTGGGGCTGTTTTGGCTCTGT
TGAGCAGTGCTCTGTTGCTGTTTGTGTTTTCACCGTTGATCTCGCGTTTTGCACTTGCGGTTTACAGCCTCATTGTGATG
GTTTCTTGGCTCACCATTCCGATAAAAACAGGTTGGCGGATTGATGTGTTAGATGTTGGGCATGGGTTAGCGGTGTTGAT
CGAAAAAGAAGGACGCTACGTGCTTTATGATACTGGTGCAAGTTGGCAAGGTGGTGATCACATCCAAGCAACAGTTGCCC
CCGTGTTAAACAAACGGGGAGCTACGCAATTAGACGGGTTGATTTTGAGTCACTTGGATAACGACCATGCAGGTGGAAGA
GCGGAAGTTGAGCGTGTTTGGCAGCCCAAATGGAAACGTGCTAGTCAAACGATAGCGGGTTACCAACCTTGTATTGCTGG
CGAACAATGGCAGTGGCAACAACTCTCTTTTGATGTTATCTGGCCACCAAAAACTGTAGCGCGGGCTTACAATCCTCACT
CTTGTGTGATTCGAATCTTTGACCAAAATACCGGTTTTTCAGTGCTTTTACCCGGGGATGTGGATGCTATGAGTGAGTGG
TTACTTGCGCGAACAGAGCAATCCTTACAAAGTCATATCCTGCTTGTACCACACCACGGTAGTAGGACATCTTCTACTGT
TGCTTTGATTAGTAAGGTGAACCCTGAAGTCGCTATTGCCTCGCTCGCTAAAGGCGGACGCTGGCAATTACCTTCCGCAC
AGGTTGTTCAACGCTACCAACAACATGGAGCAAACTGGTTGGATACGGGTGAAGCAGGGCAAATTAGCGTTATTTTTGGC
ATGCAGAATTATCAAGTTAGCAGCTTGCGTATCTCTCGCTCTCAGGCTTGGTATAGGCAGATGCTCCGTAACGAGGTAGA
ATGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio cholerae strain A1552 |
51.872 |
100 |
0.519 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 |
43.931 |
100 |
0.446 |
| comEC | Vibrio campbellii strain DS40M4 |
43.857 |
100 |
0.444 |