Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | CU052_RS00040 | Genome accession | NZ_CP025537 |
| Coordinates | 5938..8193 (-) | Length | 751 a.a. |
| NCBI ID | WP_101904055.1 | Uniprot ID | - |
| Organism | Vibrio harveyi strain 345 | | |
| Function | ssDNA transport through the inner membrane (predicted from homology); DNA binding and uptake | | |
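
The coordinates, nucleotide length, and protein length in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch; the helper name `span_length` is illustrative and not part of the database, and it assumes the coordinates are 1-based and inclusive, as is conventional for GenBank-style ranges.

```python
# Values copied from the Overview table; the helper name is illustrative only.
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1

gene_bp = span_length(5938, 8193)   # nucleotide length of the comEC locus
codons = gene_bp // 3               # codon count, including the stop codon
protein_aa = codons - 1             # residues after dropping the stop codon

print(gene_bp, gene_bp % 3, protein_aa)
```

Under these assumptions the range gives 2256 bp, a multiple of three, and 751 residues once the stop codon is excluded, which agrees with the reported lengths.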
Genomic Context
Location: 938..13193
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| CU052_RS00015 (CU052_00415) | - | 1663..1842 (-) | 180 | WP_005378451.1 | Trm112 family protein | - |
| CU052_RS00020 (CU052_00420) | lpxK | 1823..2830 (-) | 1008 | WP_101904052.1 | tetraacyldisaccharide 4'-kinase | - |
| CU052_RS00025 (CU052_00425) | msbA | 2836..4584 (-) | 1749 | WP_050936084.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| CU052_RS29695 (CU052_00430) | - | 4910..5194 (+) | 285 | WP_101904053.1 | glycosyltransferase | - |
| CU052_RS00035 (CU052_00435) | - | 5214..5900 (+) | 687 | WP_265093699.1 | glycosyltransferase | - |
| CU052_RS00040 (CU052_00440) | comEC | 5938..8193 (-) | 2256 | WP_101904055.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| CU052_RS00045 (CU052_00445) | - | 8202..8711 (+) | 510 | WP_005446692.1 | DUF2062 domain-containing protein | - |
| CU052_RS00050 (CU052_00450) | lolE | 8909..10153 (-) | 1245 | WP_009698315.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| CU052_RS00055 (CU052_00455) | lolD | 10156..10863 (-) | 708 | WP_005446689.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| CU052_RS00060 (CU052_00460) | lolC | 10856..12064 (-) | 1209 | WP_026000130.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| CU052_RS00065 (CU052_00465) | - | 12329..12898 (+) | 570 | WP_009698318.1 | PilZ domain-containing protein | - |
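
The Location range shown for the genomic context is consistent with the comEC coordinates extended by 5000 bp on each side. That flank size is an inference from the numbers on this page, not something the page states, so the sketch below treats it as an assumption. The function name `context_window` is hypothetical.

```python
FLANK = 5000  # assumed flank size, inferred from the numbers on this page

def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a gene's coordinates by `flank` bp on both sides, clamped at position 1."""
    return max(1, start - flank), end + flank

window = context_window(5938, 8193)

# First and last rows of the table above (Trm112 family protein, PilZ domain protein).
outer_neighbors = [(1663, 1842), (12329, 12898)]
all_inside = all(window[0] <= s and e <= window[1] for s, e in outer_neighbors)

print(window, all_inside)
```

With these inputs the window is 938..13193, and both outermost listed genes fall entirely inside it.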
Sequence
Protein
Length: 751 a.a.; Molecular weight: 84396.89 Da; Isoelectric point: 9.6792
>NTDB_id=262378 CU052_RS00040 WP_101904055.1 5938..8193(-) (comEC) [Vibrio harveyi strain 345]
MTLLEKSLTLALFVASVISSAWWPTIPDWRWLLLGIIATGSIIKLRRGLLSIGVISGFMVVIIHGNVMEHQRQALFQAGV
NITINGKVDSPFTQISHGYEGIARVHQVNSQNLLPFFKPKIRLITPFPLAVNSEFTTEVTIKPIFGLLNEAGHDAEKQAV
GKGIVARATVSKDSAWLIRERSSLRTQIIAVVHKHIVQLEHFALIRALAFSDRTLLSRYDWQLLRDSGLLHLVSISGLHI
GMAFAFGMSFGVVVRYALPKFVFLPSLFGLATAFLYSWLADFSLPTTRAFSVCLIYLLLKSALIYWSAWRVLLLAVAIQL
CIEPFSALTMSFWLSYLSVIAVLFAVNCVQHSRGNWIRKLGTLFKIQLVLTVLIIPISGLFFAGTSLSSILYNLIFIPWF
GFVVVPLMFVALIITPFSVHLANMLWQWLDWMLVPLTWSLPFALGSWQSLSSQATLWVLALGVCVLSMRFLNRETSGVLF
LVITSLALWYERKSDGWRIDVLDVGHGLAVLLEKEGEVLLYDTGKTWAYGSIAEQVIAPILYRRGFGSIDMFVVSHADSD
HAGGRAYIERHFAPVRKFSSQNYANYQPCIAGERWKWQALEFEVLWPPKLVKRAYNPHSCVIRVVDTKTDFKLLLTGDIE
AVSEWILVRNPDQLKSDVVIVPHHGSKSSSNPKFVEAIAPKLAIASLAKGNQWGMPANNVVLAYENANAKWLDTGNGGQI
SVLIEQENWYFETKRSETFDPWYRQMLRNGN
Nucleotide
Length: 2256 bp
>NTDB_id=262378 CU052_RS00040 WP_101904055.1 5938..8193(-) (comEC) [Vibrio harveyi strain 345]
ATGACTCTCTTAGAAAAAAGTTTGACCTTGGCTTTATTTGTAGCGAGCGTTATTTCGTCTGCATGGTGGCCGACGATACC
AGATTGGCGTTGGTTGCTGCTGGGAATAATTGCCACTGGCTCAATAATAAAATTACGACGTGGCTTATTGAGCATAGGCG
TAATTTCGGGCTTTATGGTTGTCATTATCCACGGCAATGTTATGGAGCATCAAAGACAAGCCCTGTTTCAAGCAGGGGTG
AATATTACCATAAATGGCAAAGTTGACAGCCCTTTTACGCAAATAAGTCACGGATATGAAGGAATTGCGCGTGTCCATCA
GGTGAATTCTCAAAACTTGTTACCTTTTTTTAAACCGAAAATCCGGTTGATAACGCCTTTCCCACTCGCCGTTAACAGTG
AGTTCACTACCGAAGTGACAATTAAACCCATCTTTGGGCTTTTAAACGAAGCAGGTCATGACGCCGAAAAGCAGGCAGTA
GGAAAGGGCATTGTCGCAAGGGCGACGGTTTCAAAGGATTCTGCGTGGCTTATTCGTGAGCGATCATCATTAAGAACGCA
AATTATCGCCGTCGTTCATAAGCACATTGTTCAGCTTGAACATTTCGCTCTTATTCGTGCTTTGGCGTTTAGTGACCGTA
CGCTTCTGTCTCGATATGACTGGCAACTCCTACGTGATAGTGGCTTACTGCATTTGGTTTCGATTTCGGGTTTACATATA
GGAATGGCGTTTGCTTTTGGTATGAGCTTTGGGGTTGTTGTTAGGTACGCTTTGCCAAAATTTGTCTTTTTACCTTCCTT
GTTTGGGCTTGCTACTGCTTTTTTGTATTCATGGTTAGCAGACTTCTCTTTGCCTACGACTCGAGCTTTTTCAGTATGTT
TGATTTACTTGTTATTAAAGTCTGCTTTGATTTATTGGAGCGCTTGGCGCGTGCTGCTCCTTGCTGTAGCAATACAGTTG
TGCATCGAGCCGTTTTCTGCACTCACTATGAGCTTTTGGCTGTCTTATCTCTCTGTTATCGCCGTTTTATTTGCAGTGAA
TTGTGTACAACATAGCCGTGGCAATTGGATTAGGAAACTGGGTACCTTATTCAAAATTCAGTTAGTACTCACTGTGTTGA
TCATTCCGATTAGTGGGTTGTTTTTTGCCGGGACGAGCCTCTCCTCTATTTTATATAACCTCATTTTTATCCCTTGGTTC
GGTTTTGTTGTCGTTCCACTGATGTTTGTTGCACTTATCATTACCCCGTTTTCAGTGCACTTGGCGAACATGCTGTGGCA
GTGGTTAGATTGGATGCTTGTACCCCTTACTTGGTCTTTGCCATTTGCTTTGGGGAGTTGGCAATCTCTTAGCTCACAGG
CAACGTTGTGGGTGCTGGCATTGGGCGTTTGTGTGCTTTCGATGCGATTTTTAAATCGAGAAACCTCAGGCGTTTTGTTC
TTGGTCATCACAAGTCTGGCGTTGTGGTATGAACGAAAATCCGATGGTTGGCGTATTGATGTGTTAGATGTTGGGCATGG
ACTTGCAGTGCTCCTAGAAAAAGAAGGTGAAGTGCTTTTATATGACACGGGTAAAACATGGGCTTATGGAAGTATTGCTG
AACAAGTTATTGCTCCTATCTTGTATCGCAGAGGGTTCGGTTCTATCGACATGTTTGTCGTCAGTCACGCCGACTCAGAT
CATGCTGGAGGACGTGCATATATCGAAAGGCACTTCGCCCCAGTTCGTAAGTTTAGTAGCCAAAACTACGCTAACTACCA
ACCTTGTATTGCAGGGGAACGATGGAAATGGCAAGCGCTAGAATTTGAAGTTCTTTGGCCTCCGAAGTTGGTTAAACGTG
CATATAACCCACATTCATGCGTGATTCGTGTCGTCGATACCAAAACGGATTTTAAGTTGCTATTAACGGGCGATATTGAA
GCCGTTAGTGAATGGATCTTGGTGAGAAACCCCGATCAGTTAAAAAGTGATGTTGTGATTGTCCCGCATCACGGAAGTAA
AAGCTCTTCTAACCCCAAGTTTGTTGAAGCGATTGCCCCAAAACTGGCGATCGCATCTCTGGCAAAAGGGAATCAGTGGG
GAATGCCTGCAAATAACGTGGTTCTAGCGTACGAGAACGCCAATGCGAAATGGCTGGATACGGGGAATGGCGGTCAAATA
AGCGTCCTTATTGAGCAAGAGAATTGGTATTTTGAAACGAAACGAAGTGAGACATTTGACCCTTGGTATAGGCAGATGCT
GCGTAACGGAAATTAA
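
Although the gene lies on the minus strand, the nucleotide record above is already given in coding orientation: it starts with ATG and ends with TAA. As a quick consistency check against the protein record, the sketch below translates the first nine codons and the last five codons of that sequence. It assumes the standard genetic code (the amino-acid assignments are the same in NCBI tables 1 and 11); the function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI table 1 layout).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

first_codons = "ATGACTCTCTTAGAAAAAAGTTTGACC"  # first 27 nt of the record
last_codons = "CGTAACGGAAATTAA"               # last 15 nt of the record

print(translate(first_codons))  # MTLLEKSLT
print(translate(last_codons))   # RNGN*
```

Both fragments agree with the start (MTLLEKSLT) and end (RNGN) of the protein sequence listed above.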
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio campbellii strain DS40M4 | 75.434 | 99.734 | 0.752 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 | 65.154 | 99.734 | 0.65 |
| comEC | Vibrio cholerae strain A1552 | 40.559 | 100 | 0.406 |
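
The page does not define the Ha-value, but in all three rows it matches the identity fraction multiplied by the coverage fraction, rounded to three decimals. The sketch below reproduces the column under that assumption. The formula is an inference from this table only, and the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed formula: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round(identity_pct * coverage_pct / 10_000, 3)

# (organism, identities %, coverage %) copied from the table above.
hits = [
    ("Vibrio campbellii strain DS40M4", 75.434, 99.734),
    ("Vibrio parahaemolyticus RIMD 2210633", 65.154, 99.734),
    ("Vibrio cholerae strain A1552", 40.559, 100.0),
]

for organism, identity, coverage in hits:
    print(f"{organism}: {ha_value(identity, coverage)}")
```

Under this assumed formula the three hits give 0.752, 0.65, and 0.406, which match the listed values.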