Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | HJ37_RS06640 | Genome accession | NC_016445 |
| Coordinates | 1495062..1497191 (-) | Length | 709 a.a. |
| NCBI ID | WP_001911453.1 | Uniprot ID | - |
| Organism | Vibrio cholerae O1 str. 2010EL-1786 | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1490062..1502191
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HJ37_RS06620 (Vch1786_I1367) | kdsB | 1491355..1492113 (-) | 759 | WP_000011329.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| HJ37_RS06625 (Vch1786_I1368) | - | 1492113..1492292 (-) | 180 | WP_000350068.1 | Trm112 family protein | - |
| HJ37_RS06630 (Vch1786_I1369) | lpxK | 1492273..1493280 (-) | 1008 | WP_001918694.1 | tetraacyldisaccharide 4'-kinase | - |
| HJ37_RS06635 (Vch1786_I1370) | msbA | 1493283..1495031 (-) | 1749 | WP_000052153.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| HJ37_RS06640 (Vch1786_I1371) | comEC | 1495062..1497191 (-) | 2130 | WP_001911453.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| HJ37_RS06645 (Vch1786_I1372) | - | 1497314..1497844 (+) | 531 | WP_001881633.1 | DUF2062 domain-containing protein | - |
| HJ37_RS06650 (Vch1786_I1373) | lolE | 1497959..1499203 (-) | 1245 | WP_000493010.1 | lipoprotein-releasing ABC transporter permease subunit LolE | - |
| HJ37_RS06655 (Vch1786_I1374) | lolD | 1499204..1499890 (-) | 687 | WP_001061290.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| HJ37_RS06660 (Vch1786_I1375) | lolC | 1499883..1501091 (-) | 1209 | WP_000468900.1 | lipoprotein-releasing ABC transporter permease subunit LolC | - |
| HJ37_RS06665 (Vch1786_I1376) | - | 1501267..1501842 (+) | 576 | WP_000999601.1 | PilZ domain-containing protein | - |
Sequence
Protein
Download Length: 709 a.a. Molecular weight: 80009.33 Da Isoelectric Point: 7.7984
>NTDB_id=42751 HJ37_RS06640 WP_001911453.1 1495062..1497191(-) (comEC) [Vibrio cholerae O1 str. 2010EL-1786]
MVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGPDIIIKGRVDSFFTQTRYAYEGFVLIHEVNGQTLNKMTR
PRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYMAQSVVARAVVKPDTAYQIVQESGIRSSLFFELEQLTHT
SPYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHIGIAFSVGYFLGLGMMRFHAQLLWSPFVCGALLAVLYAW
LAGFTLPTQRALIMCLLNVALIMLAFPLSALKRILLTLVAVLLWSPFASLSNSFWMSFLAVAIVLYQLASQSQRQVWWKA
LLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVIVPALFLGLLLMVVWPSVAAAYWPWVDWTFLPLDWALQF
ADVGWWVVPSKVQGVVAASVAILLLYRFMSLKACSLLLGMIGLWWWFPSLTPLWRMDVLDVGHGLAIVIEQDERAIVYDT
GSSWPGGSYVQSVIEPMLQQRGLRQVDGVILSHLDNDHAGDWQGLAERWQPNWIRASQLGTEFMPCIRGESWQWQSLHFT
VLWPPQAVSRAYNQHSCVIRMTDTQSNHSVLLSGDVTAMGEWLLARDGAQLQSEVMIVPHHGSKTSSTAEFIAQVNPKLA
IASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFYLDQLDWFTQRSLGWQPWYRQMLRKGVE
MVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGPDIIIKGRVDSFFTQTRYAYEGFVLIHEVNGQTLNKMTR
PRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYMAQSVVARAVVKPDTAYQIVQESGIRSSLFFELEQLTHT
SPYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHIGIAFSVGYFLGLGMMRFHAQLLWSPFVCGALLAVLYAW
LAGFTLPTQRALIMCLLNVALIMLAFPLSALKRILLTLVAVLLWSPFASLSNSFWMSFLAVAIVLYQLASQSQRQVWWKA
LLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVIVPALFLGLLLMVVWPSVAAAYWPWVDWTFLPLDWALQF
ADVGWWVVPSKVQGVVAASVAILLLYRFMSLKACSLLLGMIGLWWWFPSLTPLWRMDVLDVGHGLAIVIEQDERAIVYDT
GSSWPGGSYVQSVIEPMLQQRGLRQVDGVILSHLDNDHAGDWQGLAERWQPNWIRASQLGTEFMPCIRGESWQWQSLHFT
VLWPPQAVSRAYNQHSCVIRMTDTQSNHSVLLSGDVTAMGEWLLARDGAQLQSEVMIVPHHGSKTSSTAEFIAQVNPKLA
IASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFYLDQLDWFTQRSLGWQPWYRQMLRKGVE
Nucleotide
Download Length: 2130 bp
>NTDB_id=42751 HJ37_RS06640 WP_001911453.1 1495062..1497191(-) (comEC) [Vibrio cholerae O1 str. 2010EL-1786]
ATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCTTGGCTTCGTGGCTGCCATACTAACCATTGTGCTACAGGG
CAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAGGGCCGGATATTATCATAAAAGGCCGTGTTGACAGCTTTT
TTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATTCATGAAGTGAATGGACAAACCTTAAACAAAATGACTCGC
CCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAATGATCGCGTCGAATTTTCGGTAACTCTCAAGCCGATAGT
GGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATTACATGGCGCAATCTGTCGTCGCACGAGCGGTCGTAAAAC
CTGACACTGCTTATCAAATTGTGCAAGAGAGTGGCATAAGGTCAAGTTTGTTTTTTGAGCTAGAGCAATTAACGCATACC
AGCCCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGAGCGAAAAGGTATTGATGAGCAAGAGTGGCAAGCCTTACG
CAATAGTGGCTTAATTCATTTAGTGGCCATTTCGGGGCTGCACATTGGTATCGCTTTTAGCGTGGGGTATTTTCTCGGGC
TCGGCATGATGCGTTTTCATGCTCAGTTATTGTGGTCCCCTTTTGTGTGTGGGGCTTTACTGGCGGTGCTCTACGCTTGG
CTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATTATGTGCTTACTCAATGTGGCGTTGATCATGTTGGCTTTTCC
TCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGTCGCGGTCTTGCTTTGGTCGCCATTCGCCTCACTTTCAAACAGTT
TCTGGATGTCGTTTTTGGCGGTCGCGATTGTTCTCTACCAATTAGCCAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCT
CTTCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCACCGGTCACGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGC
AGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGGTGATTGTCCCAGCTTTGTTTTTGGGTCTATTACTCATGG
TGGTATGGCCTAGTGTGGCCGCCGCTTACTGGCCTTGGGTGGATTGGACGTTTTTACCGCTCGATTGGGCTTTGCAGTTT
GCCGATGTAGGCTGGTGGGTGGTCCCCAGCAAAGTACAAGGTGTGGTCGCAGCGAGTGTGGCCATCCTCTTGCTTTATCG
ATTTATGAGCCTAAAAGCCTGCAGCTTATTATTGGGTATGATTGGCTTATGGTGGTGGTTTCCCTCTCTCACTCCACTTT
GGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATTGTGATTGAGCAAGATGAGCGAGCAATTGTCTACGATACA
GGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGATTGAGCCTATGCTCCAACAGCGGGGGCTACGCCAAGTGGA
TGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTGATTGGCAAGGTTTAGCTGAGCGCTGGCAACCCAATTGGA
TTCGTGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATCCGTGGTGAAAGCTGGCAGTGGCAATCTCTCCATTTTACG
GTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCAGCATTCGTGTGTGATTCGTATGACCGATACTCAGTCTAA
CCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGGAGTGGCTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTG
AGGTGATGATCGTGCCGCACCACGGCAGTAAAACATCGTCCACCGCAGAGTTTATTGCCCAAGTGAATCCCAAACTTGCG
ATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAATCCGCAAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGA
GTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTTTCTATCTAGATCAGCTGGATTGGTTTACCCAGCGTAGCC
TTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGAGTAGAATGA
ATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCTTGGCTTCGTGGCTGCCATACTAACCATTGTGCTACAGGG
CAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAGGGCCGGATATTATCATAAAAGGCCGTGTTGACAGCTTTT
TTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATTCATGAAGTGAATGGACAAACCTTAAACAAAATGACTCGC
CCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAATGATCGCGTCGAATTTTCGGTAACTCTCAAGCCGATAGT
GGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATTACATGGCGCAATCTGTCGTCGCACGAGCGGTCGTAAAAC
CTGACACTGCTTATCAAATTGTGCAAGAGAGTGGCATAAGGTCAAGTTTGTTTTTTGAGCTAGAGCAATTAACGCATACC
AGCCCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGAGCGAAAAGGTATTGATGAGCAAGAGTGGCAAGCCTTACG
CAATAGTGGCTTAATTCATTTAGTGGCCATTTCGGGGCTGCACATTGGTATCGCTTTTAGCGTGGGGTATTTTCTCGGGC
TCGGCATGATGCGTTTTCATGCTCAGTTATTGTGGTCCCCTTTTGTGTGTGGGGCTTTACTGGCGGTGCTCTACGCTTGG
CTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATTATGTGCTTACTCAATGTGGCGTTGATCATGTTGGCTTTTCC
TCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGTCGCGGTCTTGCTTTGGTCGCCATTCGCCTCACTTTCAAACAGTT
TCTGGATGTCGTTTTTGGCGGTCGCGATTGTTCTCTACCAATTAGCCAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCT
CTTCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCACCGGTCACGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGC
AGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGGTGATTGTCCCAGCTTTGTTTTTGGGTCTATTACTCATGG
TGGTATGGCCTAGTGTGGCCGCCGCTTACTGGCCTTGGGTGGATTGGACGTTTTTACCGCTCGATTGGGCTTTGCAGTTT
GCCGATGTAGGCTGGTGGGTGGTCCCCAGCAAAGTACAAGGTGTGGTCGCAGCGAGTGTGGCCATCCTCTTGCTTTATCG
ATTTATGAGCCTAAAAGCCTGCAGCTTATTATTGGGTATGATTGGCTTATGGTGGTGGTTTCCCTCTCTCACTCCACTTT
GGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATTGTGATTGAGCAAGATGAGCGAGCAATTGTCTACGATACA
GGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGATTGAGCCTATGCTCCAACAGCGGGGGCTACGCCAAGTGGA
TGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTGATTGGCAAGGTTTAGCTGAGCGCTGGCAACCCAATTGGA
TTCGTGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATCCGTGGTGAAAGCTGGCAGTGGCAATCTCTCCATTTTACG
GTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCAGCATTCGTGTGTGATTCGTATGACCGATACTCAGTCTAA
CCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGGAGTGGCTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTG
AGGTGATGATCGTGCCGCACCACGGCAGTAAAACATCGTCCACCGCAGAGTTTATTGCCCAAGTGAATCCCAAACTTGCG
ATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAATCCGCAAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGA
GTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTTTCTATCTAGATCAGCTGGATTGGTTTACCCAGCGTAGCC
TTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGAGTAGAATGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Vibrio cholerae strain A1552 |
100 |
100 |
1 |
| comEC | Vibrio parahaemolyticus RIMD 2210633 |
41.433 |
100 |
0.416 |
| comEC | Vibrio campbellii strain DS40M4 |
41.301 |
99.718 |
0.412 |