Detailed information
Overview
| Name | comE | Type | Machinery gene |
| Locus tag | LK420_RS08220 | Genome accession | NZ_CP085949 |
| Coordinates | 1696315..1697658 (-) | Length | 447 a.a. |
| NCBI ID | WP_021034382.1 | Uniprot ID | - |
| Organism | Haemophilus influenzae strain FDAARGOS_1562 | ||
| Function | type IV pilus biogenesis and function (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1691315..1702658
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| LK420_RS08190 (LK420_08190) | - | 1692360..1692632 (-) | 273 | WP_005630954.1 | HU family DNA-binding protein | - |
| LK420_RS08195 (LK420_08195) | - | 1692771..1693361 (-) | 591 | WP_021034387.1 | YjaG family protein | - |
| LK420_RS08200 (LK420_08200) | nudC | 1693397..1694191 (-) | 795 | WP_021034386.1 | NAD(+) diphosphatase | - |
| LK420_RS08205 (LK420_08205) | nfuA | 1694259..1694855 (-) | 597 | WP_021034385.1 | Fe-S biogenesis protein NfuA | - |
| LK420_RS08210 (LK420_08210) | - | 1694931..1695617 (-) | 687 | WP_021034384.1 | ComF family protein | - |
| LK420_RS08215 (LK420_08215) | tnpA | 1695865..1696318 (-) | 454 | Protein_1574 | IS200/IS605 family transposase | - |
| LK420_RS08220 (LK420_08220) | comE | 1696315..1697658 (-) | 1344 | WP_021034382.1 | type IV pilus secretin PilQ | Machinery gene |
| LK420_RS08225 (LK420_08225) | comD | 1697668..1698081 (-) | 414 | WP_021034381.1 | pilus assembly protein PilP | Machinery gene |
| LK420_RS08230 (LK420_08230) | comC | 1698078..1698599 (-) | 522 | WP_021034380.1 | hypothetical protein | Machinery gene |
| LK420_RS08235 (LK420_08235) | comB | 1698596..1699102 (-) | 507 | WP_021034379.1 | hypothetical protein | Machinery gene |
| LK420_RS08240 (LK420_08240) | comA | 1699103..1699900 (-) | 798 | WP_021034378.1 | pilus assembly protein PilM | Machinery gene |
| LK420_RS08245 (LK420_08245) | - | 1699999..1702599 (+) | 2601 | WP_021034377.1 | penicillin-binding protein 1A | - |
Sequence
Protein
Download Length: 447 a.a. Molecular weight: 49516.68 Da Isoelectric Point: 6.7046
>NTDB_id=621520 LK420_RS08220 WP_021034382.1 1696315..1697658(-) (comE) [Haemophilus influenzae strain FDAARGOS_1562]
MKKYFLKCGYFLVCFCLPLIVFANPKTDNERFFIRLSQAPLAQTLEQLAFQQDVNLVIDEALESNISLRLDNIDMPRLLQ
IIAKSKKLTLNKDEGIYYLNGGQSGKWQVAGNLTTNEPHLESHTVKLHFAKASELMKSLTTGSGSLLSSAGSITFDDRSN
LLVIQDEPHSVQNIKKLISEMDKPIEQIAIEARIVTITDESLKELGVRWGIFNPTENARRVAGSLAGNSFENIADNLNVN
FATTTTPAGSIALQVAKINGRLLDLELSALERENNVEIIASPRLLTTNKKSASIKQGTEIPYIVSNTRNDTQSVEFREAV
LGLEVTPHISKDNNILLDLLVSQNSPGSRVSYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKSEDKVPLLGDI
PVIKRLFSKESERHQKRELVIFVTPHILKAGETLEALKQKSAGKKLQ
MKKYFLKCGYFLVCFCLPLIVFANPKTDNERFFIRLSQAPLAQTLEQLAFQQDVNLVIDEALESNISLRLDNIDMPRLLQ
IIAKSKKLTLNKDEGIYYLNGGQSGKWQVAGNLTTNEPHLESHTVKLHFAKASELMKSLTTGSGSLLSSAGSITFDDRSN
LLVIQDEPHSVQNIKKLISEMDKPIEQIAIEARIVTITDESLKELGVRWGIFNPTENARRVAGSLAGNSFENIADNLNVN
FATTTTPAGSIALQVAKINGRLLDLELSALERENNVEIIASPRLLTTNKKSASIKQGTEIPYIVSNTRNDTQSVEFREAV
LGLEVTPHISKDNNILLDLLVSQNSPGSRVSYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKSEDKVPLLGDI
PVIKRLFSKESERHQKRELVIFVTPHILKAGETLEALKQKSAGKKLQ
Nucleotide
Download Length: 1344 bp
>NTDB_id=621520 LK420_RS08220 WP_021034382.1 1696315..1697658(-) (comE) [Haemophilus influenzae strain FDAARGOS_1562]
ATGAAGAAATATTTTTTAAAGTGCGGTTATTTTTTAGTCTGTTTTTGTTTACCCTTAATTGTTTTTGCTAATCCCAAAAC
TGATAACGAACGTTTTTTTATTCGTTTATCGCAAGCACCTTTAGCTCAAACATTAGAGCAGTTGGCTTTTCAACAAGATG
TGAATTTAGTGATAGATGAAGCATTAGAAAGTAATATTTCATTGAGATTAGATAATATTGATATGCCACGTTTACTACAA
ATAATCGCTAAAAGTAAGAAGCTTACTTTGAATAAAGATGAGGGTATTTACTATCTGAATGGAGGGCAATCAGGCAAATG
GCAAGTTGCAGGAAATCTTACTACAAATGAACCGCACTTAGAGAGCCATACAGTAAAACTCCATTTTGCTAAAGCCTCTG
AATTAATGAAGTCCTTAACGACAGGAAGTGGCTCTTTGCTTTCATCAGCTGGGAGCATTACCTTTGATGATCGTAGTAAT
TTGCTCGTGATTCAAGATGAACCTCATTCTGTGCAAAATATCAAAAAACTAATTTCTGAAATGGATAAACCTATTGAGCA
GATTGCTATTGAAGCGCGTATTGTGACGATAACAGATGAGAGTTTGAAAGAACTTGGCGTTCGGTGGGGGATTTTTAATC
CAACGGAAAATGCAAGACGAGTTGCGGGCAGTCTTGCAGGCAATAGCTTTGAAAATATTGCGGATAATCTTAATGTAAAT
TTTGCGACAACGACAACACCTGCTGGCTCGATAGCATTACAAGTCGCCAAAATTAATGGGCGATTGCTCGATTTAGAATT
GAGTGCGTTAGAGCGTGAAAATAATGTAGAAATTATCGCAAGCCCTCGCTTACTAACCACAAATAAGAAAAGTGCAAGCA
TTAAACAGGGGACAGAAATTCCTTATATTGTGAGCAATACTCGTAACGATACGCAATCTGTGGAATTTCGCGAGGCGGTG
CTTGGTTTGGAAGTCACACCACATATTTCTAAAGATAACAATATTTTACTTGATTTATTAGTGAGCCAAAACTCACCAGG
TTCTCGTGTTTCTTATGGACAAAATGAGGTGGTTTCTATTGATAAGCAAGAAATTAATACTCAGGTTTTTGCCAAAGATG
GGGAAACCATTGTGCTTGGCGGCGTGTTTCACGACACAATTACGAAAAGCGAAGATAAGGTGCCATTGCTTGGCGATATA
CCCGTTATTAAACGATTATTCAGCAAAGAAAGCGAACGACATCAAAAACGTGAGCTCGTGATTTTCGTGACGCCGCATAT
TTTAAAAGCAGGAGAAACGTTAGAGGCGTTGAAGCAAAAAAGTGCGGGGAAAAAGTTACAATGA
ATGAAGAAATATTTTTTAAAGTGCGGTTATTTTTTAGTCTGTTTTTGTTTACCCTTAATTGTTTTTGCTAATCCCAAAAC
TGATAACGAACGTTTTTTTATTCGTTTATCGCAAGCACCTTTAGCTCAAACATTAGAGCAGTTGGCTTTTCAACAAGATG
TGAATTTAGTGATAGATGAAGCATTAGAAAGTAATATTTCATTGAGATTAGATAATATTGATATGCCACGTTTACTACAA
ATAATCGCTAAAAGTAAGAAGCTTACTTTGAATAAAGATGAGGGTATTTACTATCTGAATGGAGGGCAATCAGGCAAATG
GCAAGTTGCAGGAAATCTTACTACAAATGAACCGCACTTAGAGAGCCATACAGTAAAACTCCATTTTGCTAAAGCCTCTG
AATTAATGAAGTCCTTAACGACAGGAAGTGGCTCTTTGCTTTCATCAGCTGGGAGCATTACCTTTGATGATCGTAGTAAT
TTGCTCGTGATTCAAGATGAACCTCATTCTGTGCAAAATATCAAAAAACTAATTTCTGAAATGGATAAACCTATTGAGCA
GATTGCTATTGAAGCGCGTATTGTGACGATAACAGATGAGAGTTTGAAAGAACTTGGCGTTCGGTGGGGGATTTTTAATC
CAACGGAAAATGCAAGACGAGTTGCGGGCAGTCTTGCAGGCAATAGCTTTGAAAATATTGCGGATAATCTTAATGTAAAT
TTTGCGACAACGACAACACCTGCTGGCTCGATAGCATTACAAGTCGCCAAAATTAATGGGCGATTGCTCGATTTAGAATT
GAGTGCGTTAGAGCGTGAAAATAATGTAGAAATTATCGCAAGCCCTCGCTTACTAACCACAAATAAGAAAAGTGCAAGCA
TTAAACAGGGGACAGAAATTCCTTATATTGTGAGCAATACTCGTAACGATACGCAATCTGTGGAATTTCGCGAGGCGGTG
CTTGGTTTGGAAGTCACACCACATATTTCTAAAGATAACAATATTTTACTTGATTTATTAGTGAGCCAAAACTCACCAGG
TTCTCGTGTTTCTTATGGACAAAATGAGGTGGTTTCTATTGATAAGCAAGAAATTAATACTCAGGTTTTTGCCAAAGATG
GGGAAACCATTGTGCTTGGCGGCGTGTTTCACGACACAATTACGAAAAGCGAAGATAAGGTGCCATTGCTTGGCGATATA
CCCGTTATTAAACGATTATTCAGCAAAGAAAGCGAACGACATCAAAAACGTGAGCTCGTGATTTTCGTGACGCCGCATAT
TTTAAAAGCAGGAGAAACGTTAGAGGCGTTGAAGCAAAAAAGTGCGGGGAAAAAGTTACAATGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comE | Haemophilus influenzae Rd KW20 |
95.955 |
99.553 |
0.955 |
| comE | Haemophilus influenzae 86-028NP |
95.73 |
99.553 |
0.953 |
| comE | Glaesserella parasuis strain SC1401 |
52.941 |
95.078 |
0.503 |
| pilQ | Vibrio campbellii strain DS40M4 |
42.959 |
93.736 |
0.403 |
| pilQ | Vibrio cholerae O1 biovar El Tor strain E7946 |
42.233 |
92.17 |
0.389 |
| pilQ | Vibrio cholerae strain A1552 |
42.233 |
92.17 |
0.389 |