Detailed information
Overview
| Name | comE | Type | Machinery gene |
| Locus tag | DQN24_RS02925 | Genome accession | NZ_LS483411 |
| Coordinates | 580902..582245 (-) | Length | 447 a.a. |
| NCBI ID | WP_021034382.1 | Uniprot ID | - |
| Organism | Haemophilus influenzae strain NCTC11426 | | |
| Function | type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 575902..587245
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DQN24_RS02895 (NCTC11426_00574) | - | 576947..577219 (-) | 273 | WP_005630954.1 | HU family DNA-binding protein | - |
| DQN24_RS02900 (NCTC11426_00575) | - | 577358..577948 (-) | 591 | WP_021034387.1 | YjaG family protein | - |
| DQN24_RS02905 (NCTC11426_00576) | nudC | 577984..578778 (-) | 795 | WP_021034386.1 | NAD(+) diphosphatase | - |
| DQN24_RS02910 (NCTC11426_00577) | nfuA | 578846..579442 (-) | 597 | WP_021034385.1 | Fe-S biogenesis protein NfuA | - |
| DQN24_RS02915 (NCTC11426_00578) | - | 579518..580204 (-) | 687 | WP_021034384.1 | ComF family protein | - |
| DQN24_RS02920 (NCTC11426_00579) | tnpA | 580452..580905 (-) | 454 | Protein_556 | IS200/IS605 family transposase | - |
| DQN24_RS02925 (NCTC11426_00580) | comE | 580902..582245 (-) | 1344 | WP_021034382.1 | type IV pilus secretin PilQ | Machinery gene |
| DQN24_RS02930 (NCTC11426_00581) | comD | 582255..582668 (-) | 414 | WP_021034381.1 | pilus assembly protein PilP | Machinery gene |
| DQN24_RS02935 (NCTC11426_00582) | comC | 582665..583186 (-) | 522 | WP_111695365.1 | competence protein C | Machinery gene |
| DQN24_RS02940 (NCTC11426_00583) | comB | 583183..583692 (-) | 510 | WP_111695366.1 | competence protein B | Machinery gene |
| DQN24_RS02945 (NCTC11426_00584) | comA | 583693..584490 (-) | 798 | WP_110430714.1 | pilus assembly protein PilM | Machinery gene |
| DQN24_RS02950 (NCTC11426_00585) | - | 584589..587183 (+) | 2595 | WP_110430713.1 | penicillin-binding protein 1A | - |
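The coordinates in this record appear to be 1-based and inclusive, since `end - start + 1` reproduces the listed sizes. Under that assumption, the short sketch below recomputes the comE gene length, the protein length, and the distance between the gene and the edges of the genomic context window. The helper name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


# Values copied from the record above.
gene_start, gene_end = 580902, 582245      # comE coordinates
window_start, window_end = 575902, 587245  # "Location" of the genomic context

gene_bp = span_length(gene_start, gene_end)

# The final codon is the stop codon, so it contributes no residue.
protein_aa = gene_bp // 3 - 1

# Distance from each edge of the context window to the gene.
left_flank = gene_start - window_start
right_flank = window_end - gene_end

print(gene_bp, protein_aa, left_flank, right_flank)
```

With these inputs the gene length matches the 1344 bp and 447 a.a. listed in the record, and the window extends the same distance on either side of the gene.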
Sequence
Protein
Length: 447 a.a.; Molecular weight: 49516.68 Da; Isoelectric point: 6.7046
>NTDB_id=1140655 DQN24_RS02925 WP_021034382.1 580902..582245(-) (comE) [Haemophilus influenzae strain NCTC11426]
MKKYFLKCGYFLVCFCLPLIVFANPKTDNERFFIRLSQAPLAQTLEQLAFQQDVNLVIDEALESNISLRLDNIDMPRLLQ
IIAKSKKLTLNKDEGIYYLNGGQSGKWQVAGNLTTNEPHLESHTVKLHFAKASELMKSLTTGSGSLLSSAGSITFDDRSN
LLVIQDEPHSVQNIKKLISEMDKPIEQIAIEARIVTITDESLKELGVRWGIFNPTENARRVAGSLAGNSFENIADNLNVN
FATTTTPAGSIALQVAKINGRLLDLELSALERENNVEIIASPRLLTTNKKSASIKQGTEIPYIVSNTRNDTQSVEFREAV
LGLEVTPHISKDNNILLDLLVSQNSPGSRVSYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKSEDKVPLLGDI
PVIKRLFSKESERHQKRELVIFVTPHILKAGETLEALKQKSAGKKLQ
Nucleotide
Length: 1344 bp
>NTDB_id=1140655 DQN24_RS02925 WP_021034382.1 580902..582245(-) (comE) [Haemophilus influenzae strain NCTC11426]
ATGAAGAAATATTTTTTAAAGTGCGGTTATTTTTTAGTCTGTTTTTGTTTACCCTTAATTGTTTTTGCTAATCCCAAAAC
TGATAACGAACGTTTTTTTATTCGTTTATCGCAAGCACCTTTAGCTCAAACATTAGAGCAGTTGGCTTTTCAACAAGATG
TGAATTTAGTGATAGATGAAGCATTAGAAAGTAATATTTCATTGAGATTAGATAATATTGATATGCCACGTTTACTACAA
ATAATCGCTAAAAGTAAGAAGCTTACTTTGAATAAAGATGAGGGTATTTACTATCTGAATGGAGGGCAATCAGGCAAATG
GCAAGTTGCAGGAAATCTTACTACAAATGAACCGCACTTAGAGAGCCATACAGTAAAACTCCATTTTGCTAAAGCCTCTG
AATTAATGAAGTCCTTAACGACAGGAAGTGGCTCTTTGCTTTCATCAGCTGGGAGCATTACCTTTGATGATCGTAGTAAT
TTGCTCGTGATTCAAGATGAACCTCATTCTGTGCAAAATATCAAAAAACTAATTTCTGAAATGGATAAACCTATTGAGCA
GATTGCTATTGAAGCGCGTATTGTGACGATAACAGATGAGAGTTTGAAAGAACTTGGCGTTCGGTGGGGGATTTTTAATC
CAACGGAAAATGCAAGACGAGTTGCGGGCAGTCTTGCAGGCAATAGCTTTGAAAATATTGCGGATAATCTTAATGTAAAT
TTTGCGACAACGACAACACCTGCTGGCTCGATAGCATTACAAGTCGCCAAAATTAATGGGCGATTGCTCGATTTAGAATT
GAGTGCGTTAGAGCGTGAAAATAATGTAGAAATTATCGCAAGCCCTCGCTTACTAACCACAAATAAGAAAAGTGCAAGCA
TTAAACAGGGGACAGAAATTCCTTATATTGTGAGCAATACTCGTAACGATACGCAATCTGTGGAATTTCGCGAGGCGGTG
CTTGGTTTGGAAGTCACACCACATATTTCTAAAGATAACAATATTTTACTTGATTTATTAGTGAGCCAAAACTCACCAGG
TTCTCGTGTTTCTTATGGACAAAATGAGGTGGTTTCTATTGATAAGCAAGAAATTAATACTCAGGTTTTTGCCAAAGATG
GGGAAACCATTGTGCTTGGCGGCGTGTTTCACGACACAATTACGAAAAGCGAAGATAAGGTGCCATTGCTTGGCGATATA
CCCGTTATTAAACGATTATTCAGCAAAGAAAGCGAACGACATCAAAAACGTGAGCTCGTGATTTTCGTGACGCCGCATAT
TTTAAAAGCAGGAGAAACGTTAGAGGCGTTGAAGCAAAAAAGTGCGGGGAAAAAGTTACAATGA
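The nucleotide record begins with ATG and ends with TGA, and its stated length of 1344 bp corresponds to 448 codons, that is 447 residues plus a stop codon. The following is a minimal sketch of how such a consistency check could be run on a FASTA body pasted as a string. The function name `summarize_cds` is hypothetical, the stop codon set is the standard one, and the input shown is a toy sequence rather than comE.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard stop codons


def summarize_cds(seq: str) -> dict:
    """Report basic properties of a coding sequence written 5'->3'."""
    # Drop line breaks and spaces left over from the FASTA layout.
    seq = "".join(seq.split()).upper()
    in_frame = len(seq) % 3 == 0
    # Split into complete codons only; a trailing partial codon is ignored.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    ends_with_stop = in_frame and bool(codons) and codons[-1] in STOP_CODONS
    return {
        "length_bp": len(seq),
        "in_frame": in_frame,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": ends_with_stop,
        # The stop codon, when present, does not encode a residue.
        "residues": len(codons) - (1 if ends_with_stop else 0),
    }


# Toy input for illustration only, not the comE sequence.
summary = summarize_cds("ATGAAA\nTTTTGA")
print(summary)
```

Passing the full sequence from this record in place of the toy string would be expected to report a length of 1344 and 447 residues if the record is internally consistent.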
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comE | Haemophilus influenzae Rd KW20 | 95.955 | 99.553 | 0.955 |
| comE | Haemophilus influenzae 86-028NP | 95.73 | 99.553 | 0.953 |
| comE | Glaesserella parasuis strain SC1401 | 52.941 | 95.078 | 0.503 |
| pilQ | Vibrio campbellii strain DS40M4 | 42.959 | 93.736 | 0.403 |
| pilQ | Vibrio cholerae O1 biovar El Tor strain E7946 | 42.233 | 92.17 | 0.389 |
| pilQ | Vibrio cholerae strain A1552 | 42.233 | 92.17 | 0.389 |
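This record does not define the Ha-value. The listed numbers are, however, consistent with the product of identity and coverage when both are expressed as fractions. The sketch below checks that reading against the six rows; it is an observation about these values and not a statement of how the database computes the column. The function name `identity_times_coverage` is illustrative.

```python
# (protein, organism, identity %, coverage %, listed Ha-value), copied from the table.
rows = [
    ("comE", "Haemophilus influenzae Rd KW20", 95.955, 99.553, 0.955),
    ("comE", "Haemophilus influenzae 86-028NP", 95.73, 99.553, 0.953),
    ("comE", "Glaesserella parasuis strain SC1401", 52.941, 95.078, 0.503),
    ("pilQ", "Vibrio campbellii strain DS40M4", 42.959, 93.736, 0.403),
    ("pilQ", "Vibrio cholerae O1 biovar El Tor strain E7946", 42.233, 92.17, 0.389),
    ("pilQ", "Vibrio cholerae strain A1552", 42.233, 92.17, 0.389),
]


def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Assumed interpretation: identity and coverage multiplied as fractions."""
    return (identity_pct / 100) * (coverage_pct / 100)


products = []
for protein, organism, identity, coverage, listed in rows:
    product = identity_times_coverage(identity, coverage)
    products.append(product)
    print(f"{protein:5s} {organism:46s} computed {product:.3f}  listed {listed}")
```

Each computed product agrees with the listed value to three decimal places.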