Detailed information
Overview
| Name | comE | Type | Machinery gene |
| Locus tag | INP92_RS03565 | Genome accession | NZ_CP063122 |
| Coordinates | 726731..728113 (+) | Length | 460 a.a. |
| NCBI ID | WP_111387967.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C125_4 | ||
| Function | type IV pilus biogenesis and function (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 721731..733113
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP92_RS09410 | - | 721742..722434 (-) | 693 | Protein_685 | penicillin-binding protein 1A | - |
| INP92_RS03540 (INP92_03540) | - | 722510..724360 (-) | 1851 | Protein_686 | penicillin-binding protein 1A | - |
| INP92_RS03545 (INP92_03545) | - | 724460..725308 (+) | 849 | WP_111387975.1 | competence protein ComA | - |
| INP92_RS03550 (INP92_03550) | - | 725292..725807 (+) | 516 | WP_232088003.1 | PilN domain-containing protein | - |
| INP92_RS03555 (INP92_03555) | - | 725804..726346 (+) | 543 | WP_111387971.1 | competence protein ComC | - |
| INP92_RS03560 (INP92_03560) | - | 726346..726729 (+) | 384 | WP_111387969.1 | pilus assembly protein PilP | - |
| INP92_RS03565 (INP92_03565) | comE | 726731..728113 (+) | 1383 | WP_111387967.1 | type IV pilus secretin PilQ | Machinery gene |
| INP92_RS03570 (INP92_03570) | - | 728133..728822 (+) | 690 | WP_111387965.1 | ComF family protein | - |
| INP92_RS03575 (INP92_03575) | nfuA | 728938..729522 (+) | 585 | WP_005694707.1 | Fe-S biogenesis protein NfuA | - |
| INP92_RS03580 (INP92_03580) | comM | 729566..731095 (-) | 1530 | WP_111387963.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP92_RS03585 (INP92_03585) | yihA | 731225..731839 (-) | 615 | WP_197554855.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP92_RS03590 (INP92_03590) | - | 731953..732822 (+) | 870 | WP_197554857.1 | VirK/YbjX family protein | - |
Sequence
Protein
Download Length: 460 a.a. Molecular weight: 51131.75 Da Isoelectric Point: 7.9498
>NTDB_id=493141 INP92_RS03565 WP_111387967.1 726731..728113(+) (comE) [Haemophilus parainfluenzae strain M1C125_4]
MLNQKIKTKCGQFLMCFLILWTTYSAAENRVFSLRLKQAPMVATLQQLALEQNANLMIDDELEGKLSLQLDNVDFDRLLR
SVAKIKGFSFYQENNIYYLGKPSQHEQYAEKMTEPMAISGESLPSETPLVSTTVKLHFAKASDVMKSLTTGSGSLLSPSG
TITFDDRSNVLLIQDDARSVKNIKKLIAELDKPIEQIVIEARIVTITDESLKELGVRWGIFNPTEAAHRVSGSLDANGFS
NISNNLNVNFATTVTPAGSLALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTEIPYVVTNGKNDT
QSVEFREAVLGLEVTPHISKNNNILLDLLVSQNSPGNRVAYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKGV
DKVPLLGDIPGIKRLFSKESERHQKRELVIFVTPHILKQGERMEMAKKEKHFKQVEKVKK
MLNQKIKTKCGQFLMCFLILWTTYSAAENRVFSLRLKQAPMVATLQQLALEQNANLMIDDELEGKLSLQLDNVDFDRLLR
SVAKIKGFSFYQENNIYYLGKPSQHEQYAEKMTEPMAISGESLPSETPLVSTTVKLHFAKASDVMKSLTTGSGSLLSPSG
TITFDDRSNVLLIQDDARSVKNIKKLIAELDKPIEQIVIEARIVTITDESLKELGVRWGIFNPTEAAHRVSGSLDANGFS
NISNNLNVNFATTVTPAGSLALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTEIPYVVTNGKNDT
QSVEFREAVLGLEVTPHISKNNNILLDLLVSQNSPGNRVAYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKGV
DKVPLLGDIPGIKRLFSKESERHQKRELVIFVTPHILKQGERMEMAKKEKHFKQVEKVKK
Nucleotide
Download Length: 1383 bp
>NTDB_id=493141 INP92_RS03565 WP_111387967.1 726731..728113(+) (comE) [Haemophilus parainfluenzae strain M1C125_4]
ATGCTAAACCAGAAAATAAAAACAAAGTGCGGTCAGTTTTTAATGTGTTTTTTGATCCTGTGGACAACTTACTCAGCGGC
AGAAAATCGCGTCTTTTCACTTCGGTTAAAACAGGCGCCAATGGTAGCAACTCTCCAGCAACTTGCTCTTGAGCAAAATG
CTAATTTAATGATTGATGATGAGCTAGAAGGAAAACTTTCATTGCAATTAGATAACGTAGATTTTGATCGCTTATTACGT
TCCGTGGCAAAAATCAAAGGGTTCTCTTTTTATCAAGAAAATAATATTTATTATCTGGGTAAGCCTTCTCAACATGAACA
ATATGCAGAGAAAATGACAGAACCTATGGCGATTAGCGGAGAAAGTTTGCCTAGCGAAACACCACTGGTGAGTACAACGG
TTAAACTTCATTTTGCCAAGGCCTCTGATGTGATGAAATCGTTAACAACTGGTAGTGGTTCTTTGCTTTCACCTAGCGGC
ACGATTACCTTTGATGACCGAAGCAATGTATTACTGATTCAGGATGATGCACGTTCTGTCAAAAATATCAAAAAGTTAAT
CGCAGAGTTGGATAAACCCATTGAGCAAATCGTGATTGAAGCACGTATTGTGACGATTACTGATGAAAGCCTAAAAGAAT
TAGGTGTGCGTTGGGGCATTTTTAATCCTACTGAGGCAGCCCATCGAGTGAGTGGCAGTTTAGATGCGAATGGATTTAGT
AATATCAGTAATAATTTAAATGTGAATTTTGCGACAACCGTCACGCCAGCTGGCTCATTAGCTCTTCAAGTAGCTAAAAT
TAATGGTCGATTATTAGACCTAGAATTGACCGCACTTGAACGTGAAAATAACGTAGAAATTATTGCAAGCCCTCGCTTAC
TCACAACCAATAAGAAAAGCGCAAGCATTAAACAAGGGACAGAAATTCCTTATGTGGTGACGAATGGGAAAAATGACACC
CAATCAGTGGAATTTAGAGAGGCGGTGTTGGGATTAGAAGTGACACCGCATATTTCGAAGAATAATAATATTTTATTGGA
TTTATTAGTGAGTCAAAATTCCCCGGGAAATCGCGTGGCTTACGGGCAAAATGAAGTGGTGTCCATTGATAAACAAGAAA
TTAACACGCAAGTTTTTGCCAAAGATGGGGAAACAATTGTATTGGGTGGTGTATTCCACGATACGATCACGAAAGGTGTC
GATAAAGTACCATTATTGGGCGATATTCCTGGTATTAAGCGCTTATTTAGTAAGGAAAGTGAACGACATCAAAAGCGAGA
ACTCGTCATTTTTGTGACCCCTCATATTTTGAAACAAGGTGAAAGAATGGAGATGGCTAAGAAAGAAAAGCATTTTAAGC
AAGTTGAAAAAGTGAAAAAATAA
ATGCTAAACCAGAAAATAAAAACAAAGTGCGGTCAGTTTTTAATGTGTTTTTTGATCCTGTGGACAACTTACTCAGCGGC
AGAAAATCGCGTCTTTTCACTTCGGTTAAAACAGGCGCCAATGGTAGCAACTCTCCAGCAACTTGCTCTTGAGCAAAATG
CTAATTTAATGATTGATGATGAGCTAGAAGGAAAACTTTCATTGCAATTAGATAACGTAGATTTTGATCGCTTATTACGT
TCCGTGGCAAAAATCAAAGGGTTCTCTTTTTATCAAGAAAATAATATTTATTATCTGGGTAAGCCTTCTCAACATGAACA
ATATGCAGAGAAAATGACAGAACCTATGGCGATTAGCGGAGAAAGTTTGCCTAGCGAAACACCACTGGTGAGTACAACGG
TTAAACTTCATTTTGCCAAGGCCTCTGATGTGATGAAATCGTTAACAACTGGTAGTGGTTCTTTGCTTTCACCTAGCGGC
ACGATTACCTTTGATGACCGAAGCAATGTATTACTGATTCAGGATGATGCACGTTCTGTCAAAAATATCAAAAAGTTAAT
CGCAGAGTTGGATAAACCCATTGAGCAAATCGTGATTGAAGCACGTATTGTGACGATTACTGATGAAAGCCTAAAAGAAT
TAGGTGTGCGTTGGGGCATTTTTAATCCTACTGAGGCAGCCCATCGAGTGAGTGGCAGTTTAGATGCGAATGGATTTAGT
AATATCAGTAATAATTTAAATGTGAATTTTGCGACAACCGTCACGCCAGCTGGCTCATTAGCTCTTCAAGTAGCTAAAAT
TAATGGTCGATTATTAGACCTAGAATTGACCGCACTTGAACGTGAAAATAACGTAGAAATTATTGCAAGCCCTCGCTTAC
TCACAACCAATAAGAAAAGCGCAAGCATTAAACAAGGGACAGAAATTCCTTATGTGGTGACGAATGGGAAAAATGACACC
CAATCAGTGGAATTTAGAGAGGCGGTGTTGGGATTAGAAGTGACACCGCATATTTCGAAGAATAATAATATTTTATTGGA
TTTATTAGTGAGTCAAAATTCCCCGGGAAATCGCGTGGCTTACGGGCAAAATGAAGTGGTGTCCATTGATAAACAAGAAA
TTAACACGCAAGTTTTTGCCAAAGATGGGGAAACAATTGTATTGGGTGGTGTATTCCACGATACGATCACGAAAGGTGTC
GATAAAGTACCATTATTGGGCGATATTCCTGGTATTAAGCGCTTATTTAGTAAGGAAAGTGAACGACATCAAAAGCGAGA
ACTCGTCATTTTTGTGACCCCTCATATTTTGAAACAAGGTGAAAGAATGGAGATGGCTAAGAAAGAAAAGCATTTTAAGC
AAGTTGAAAAAGTGAAAAAATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comE | Haemophilus influenzae Rd KW20 |
73.497 |
97.609 |
0.717 |
| comE | Haemophilus influenzae 86-028NP |
72.383 |
97.609 |
0.707 |
| comE | Glaesserella parasuis strain SC1401 |
53.901 |
91.957 |
0.496 |
| pilQ | Vibrio campbellii strain DS40M4 |
41.204 |
93.913 |
0.387 |
| pilQ | Vibrio cholerae O1 biovar El Tor strain E7946 |
41.509 |
92.174 |
0.383 |
| pilQ | Vibrio cholerae strain A1552 |
41.509 |
92.174 |
0.383 |