Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | M2042_RS05040 | Genome accession | NZ_AP023187 |
| Coordinates | 1077876..1080161 (+) | Length | 761 a.a. |
| NCBI ID | WP_104978931.1 | Uniprot ID | - |
| Organism | Vibrio alginolyticus strain HLBS-07 | | |
| Function | ssDNA transport through the inner membrane (predicted from homology); DNA binding and uptake | | |
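The coordinates, nucleotide length, and protein length listed for this entry are mutually consistent. A minimal Python sketch of the arithmetic follows, assuming 1-based inclusive coordinates and a single terminal stop codon; the function name `cds_lengths` is illustrative and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (length in bp, length in amino acids) for a coding sequence.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    length_bp = end - start + 1
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp, length_bp // 3 - 1


# Coordinates of comEC (M2042_RS05040) as listed in the Overview table.
bp, aa = cds_lengths(1077876, 1080161)
print(bp, aa)  # 2286 761
```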
Genomic Context
Location: 1072876..1085161
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| M2042_RS05015 (HLBS07_09830) | - | 1073235..1073804 (-) | 570 | WP_065274290.1 | hypothetical protein | - |
| M2042_RS05020 (HLBS07_09840) | lolC | 1074054..1075262 (+) | 1209 | WP_031778997.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| M2042_RS05025 (HLBS07_09850) | lolD | 1075255..1075962 (+) | 708 | WP_005378457.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| M2042_RS05030 (HLBS07_09860) | lolE | 1075965..1077209 (+) | 1245 | WP_065274291.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| M2042_RS05035 | - | 1077358..1077867 (-) | 510 | WP_005387834.1 | DUF2062 domain-containing protein | - |
| M2042_RS05040 (HLBS07_09870) | comEC | 1077876..1080161 (+) | 2286 | WP_104978931.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| M2042_RS05045 (HLBS07_09880) | - | 1080165..1081154 (-) | 990 | WP_104978932.1 | glycosyltransferase | - |
| M2042_RS05050 (HLBS07_09890) | msbA | 1081499..1083247 (+) | 1749 | WP_104978934.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| M2042_RS05055 (HLBS07_09900) | lpxK | 1083253..1084260 (+) | 1008 | WP_104978935.1 | tetraacyldisaccharide 4'-kinase | - |
| M2042_RS05060 (HLBS07_09910) | - | 1084241..1084420 (+) | 180 | WP_104978937.1 | Trm112 family protein | - |
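The Location window matches the comEC coordinates extended by 5,000 bp on each side. That flank size is inferred from the numbers in this entry and is not stated by the source, so the sketch below treats it as an assumption; `context_window` and `FLANK_BP` are illustrative names.

```python
FLANK_BP = 5000  # assumed flank size, inferred from the listed coordinates


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a 1-based inclusive interval by `flank` bp on both sides.

    The left edge is clamped at position 1; the right edge is not clamped
    because the replicon length is not given in this entry.
    """
    return max(1, start - flank), end + flank


print(context_window(1077876, 1080161))  # (1072876, 1085161)
```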
Sequence
Protein
Length: 761 a.a.; Molecular weight: 85456.54 Da; Isoelectric point: 8.1037
>NTDB_id=81530 M2042_RS05040 WP_104978931.1 1077876..1080161(+) (comEC) [Vibrio alginolyticus strain HLBS-07]
MTLSEKSWTLALFVASVISSAWWPTIPDWRWLLLGIITTGSIIKLRRGLISIGAIAGFMVVIVHGNVLESQRQALFQAGE
NITIIGKVDSPFTQISHGYEGIVAIEAVNSQTLLPFLKPKIRLITPFPLDVNSEFTTSISLKPITGLRNEAGFDAEKHAM
GKNIIARAVVNKDAKWIVRSNVSLRQSIIADVADDVSSLHHFSLISALVFADRSWLSKEDWQALRDSGLLHLVSISGLHI
GMAFTFGLVVGGTVRYLLPQYQLLPSLYGLIFAVAYAWLADFSLPTTRAVSVCVVYVLLKYTLVHWTSWRVLLLAVSLQL
LVQPFASYSMSFWLSYLSVCAVLFAINLIQHQRGDWKTKLKAVLLTQLMLGALIVPVSGHFFSGFSLSSIAYNLVFIPWF
GFIVVPLMFLALFSSLFLEMIAKPLWHLVDWSLQPLSTSIQYAIGSWQPISMEMTMLVMAIVVCLIFQRFMSQYAWYLLL
GIVISIKLASEYEPHWRIDVLDVGHGLAVVVEKDDKVLLYDTGKAWQSGSIAEQVVTPVLHQRGYESIDTLILSHSDNDH
AGGRFFIEDAFSPQRKLSSQFFIEYEPCTRGDQWKWQGLNMEVLWPPALVQRAYNPHSCVLRLTDPISNFSMLFTGDIEA
ISEWILLREPGKLQSDVMLVPHHGSKSSSNPRFIEAVSPSVAVASLAKNNRWGMPAENVVASYRKLGITWLDTGEGGQVS
FFIHRDNWRSETKRSDTFEPWYRQMLRSGTNIKRPNVELGL
Nucleotide
Length: 2286 bp
>NTDB_id=81530 M2042_RS05040 WP_104978931.1 1077876..1080161(+) (comEC) [Vibrio alginolyticus strain HLBS-07]
ATGACTCTCTCAGAAAAAAGTTGGACCTTGGCGTTATTTGTAGCGAGCGTAATCTCGTCAGCTTGGTGGCCGACAATACC
AGATTGGCGTTGGTTGCTGCTGGGAATAATAACCACTGGCTCAATAATCAAATTACGTCGTGGCTTAATTAGCATAGGCG
CAATTGCGGGCTTTATGGTTGTCATCGTCCACGGCAATGTATTGGAGTCTCAGAGACAAGCCCTTTTTCAAGCAGGGGAG
AATATTACCATAATTGGTAAAGTTGACAGCCCTTTTACGCAAATAAGTCACGGTTATGAAGGTATTGTCGCTATAGAAGC
GGTAAATTCTCAAACTCTGTTACCTTTTCTTAAACCTAAAATCCGTCTTATTACGCCTTTTCCACTCGATGTTAACAGTG
AGTTTACGACATCCATCTCGTTAAAGCCGATTACAGGTCTACGCAACGAAGCTGGCTTTGATGCTGAAAAGCACGCGATG
GGTAAAAATATTATCGCCCGAGCGGTTGTTAACAAAGATGCTAAGTGGATTGTTCGTTCTAATGTATCTCTGCGTCAATC
CATTATTGCCGATGTCGCTGATGATGTGTCTTCCCTTCATCATTTCTCGTTAATAAGTGCGTTAGTTTTTGCTGACCGTT
CTTGGTTGTCTAAGGAAGATTGGCAAGCGCTCAGGGACAGTGGTTTACTGCATTTGGTCTCCATTTCTGGTTTACATATT
GGTATGGCGTTTACTTTTGGGCTAGTCGTGGGCGGTACCGTTCGTTATCTATTACCCCAATACCAGTTGTTACCTAGCTT
ATATGGGCTGATTTTCGCTGTTGCCTACGCTTGGTTAGCGGATTTCTCTCTACCTACTACGCGAGCAGTGTCGGTATGCG
TTGTTTACGTTCTCCTGAAGTATACTCTGGTGCACTGGACTTCTTGGCGAGTTCTTTTGCTTGCAGTATCTCTACAACTG
CTTGTCCAACCTTTTGCATCCTACAGTATGAGCTTTTGGTTATCATACCTCTCCGTTTGTGCGGTATTGTTCGCAATTAA
TTTGATTCAACATCAAAGGGGAGATTGGAAAACGAAATTAAAAGCGGTATTGCTTACTCAATTGATGTTGGGCGCTTTAA
TCGTACCTGTTAGTGGCCATTTCTTTTCCGGTTTTAGTTTGTCCTCAATAGCCTACAACTTGGTATTTATTCCTTGGTTC
GGATTTATCGTTGTACCTTTGATGTTTCTAGCCCTGTTCTCATCTTTGTTTCTTGAAATGATAGCGAAGCCTCTGTGGCA
TTTGGTGGACTGGTCATTACAGCCATTGAGCACGTCAATACAGTACGCAATAGGTTCTTGGCAACCTATTAGCATGGAAA
TGACAATGCTGGTTATGGCGATAGTGGTTTGCTTAATATTTCAACGGTTCATGTCTCAGTACGCGTGGTATTTGTTACTC
GGCATCGTTATTAGTATCAAGCTTGCATCGGAGTATGAACCGCATTGGCGTATCGATGTATTAGATGTAGGTCATGGCCT
TGCAGTGGTTGTAGAAAAAGACGACAAGGTGTTGCTGTATGACACTGGTAAAGCATGGCAAAGCGGCAGTATCGCAGAGC
AGGTCGTCACTCCGGTTTTGCATCAACGAGGTTATGAAAGCATAGATACACTGATTCTAAGCCATTCCGATAATGATCAC
GCAGGCGGCCGCTTTTTCATTGAAGATGCTTTTTCTCCACAACGTAAACTTAGTAGTCAATTCTTTATCGAATATGAGCC
TTGCACTAGAGGCGACCAATGGAAATGGCAAGGATTAAACATGGAAGTGCTCTGGCCTCCTGCACTTGTTCAACGTGCGT
ATAATCCGCACTCCTGTGTATTACGACTTACTGACCCTATTTCTAACTTTAGTATGCTTTTTACTGGTGATATTGAGGCC
ATTAGCGAGTGGATTTTGCTTAGAGAACCCGGAAAGTTACAAAGCGATGTGATGCTAGTCCCTCACCACGGGAGTAAAAG
TTCATCGAATCCTCGATTTATCGAGGCGGTGAGCCCGAGTGTGGCAGTTGCTTCTTTAGCAAAAAACAATCGATGGGGAA
TGCCAGCGGAAAATGTCGTTGCTTCATACCGAAAGCTTGGCATTACTTGGCTTGATACTGGTGAGGGTGGTCAGGTAAGC
TTCTTTATTCACCGTGATAATTGGCGCTCAGAAACCAAACGTAGCGATACATTTGAGCCTTGGTATAGGCAGATGCTGCG
TAGCGGAACTAATATAAAAAGACCTAATGTCGAATTAGGTCTGTAA
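The nucleotide record can be checked against the protein record by translating it. The sketch below is a minimal pure-Python translator, assuming the standard codon assignments (which bacterial translation table 11 shares for elongation codons) and an input that contains only A, C, G, and T. It is shown on the first 24 and last 18 nucleotides of the sequence above rather than on the full record; `translate` is an illustrative helper, not a database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGACTCTCTCAGAAAAAAGTTGG"  # start of the comEC CDS
last_codons = "GTCGAATTAGGTCTGTAA"         # end of the comEC CDS, including TAA
print(translate(first_codons))  # MTLSEKSW
print(translate(last_codons))   # VELGL
```

Both outputs agree with the start (`MTLSEKSW`) and end (`VELGL`) of the protein sequence listed in this entry.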
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio parahaemolyticus RIMD 2210633 | 69.292 | 98.423 | 0.682 |
| comEC | Vibrio campbellii strain DS40M4 | 66.176 | 98.292 | 0.65 |
| comEC | Vibrio cholerae strain A1552 | 43.369 | 99.08 | 0.43 |
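This entry does not define the Ha-value. For the three proteins listed, it is numerically consistent with the product of identity and coverage expressed as fractions. The sketch below checks that observation; it is an inference from the listed numbers, not a documented definition, and `ha_value` is an illustrative name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relation: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (organism, identities %, coverage %, listed Ha-value) from the table above.
rows = [
    ("Vibrio parahaemolyticus RIMD 2210633", 69.292, 98.423, 0.682),
    ("Vibrio campbellii strain DS40M4", 66.176, 98.292, 0.65),
    ("Vibrio cholerae strain A1552", 43.369, 99.08, 0.43),
]

for organism, identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    print(f"{organism}: computed {computed:.3f}, listed {listed}")
```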