Detailed information
Overview
| Name | comE | Type | Machinery gene |
| Locus tag | INP94_RS03970 | Genome accession | NZ_CP063120 |
| Coordinates | 801983..803365 (+) | Length | 460 a.a. |
| NCBI ID | WP_197544094.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C137_2 | | |
| Function | type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 796983..808365
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP94_RS10930 | - | 796994..797686 (-) | 693 | Protein_768 | penicillin-binding protein 1A | - |
| INP94_RS03945 (INP94_03945) | - | 797762..799612 (-) | 1851 | Protein_769 | penicillin-binding protein 1A | - |
| INP94_RS03950 (INP94_03950) | - | 799712..800560 (+) | 849 | WP_197544090.1 | pilus assembly protein PilM | - |
| INP94_RS03955 (INP94_03955) | - | 800547..801059 (+) | 513 | WP_197544091.1 | PilN domain-containing protein | - |
| INP94_RS03960 (INP94_03960) | - | 801056..801598 (+) | 543 | WP_197544092.1 | competence protein ComC | - |
| INP94_RS03965 (INP94_03965) | - | 801598..801981 (+) | 384 | WP_197544093.1 | pilus assembly protein PilP | - |
| INP94_RS03970 (INP94_03970) | comE | 801983..803365 (+) | 1383 | WP_197544094.1 | type IV pilus secretin PilQ | Machinery gene |
| INP94_RS03975 (INP94_03975) | - | 803385..804074 (+) | 690 | WP_197544095.1 | ComF family protein | - |
| INP94_RS03980 (INP94_03980) | nfuA | 804188..804772 (+) | 585 | WP_005694707.1 | Fe-S biogenesis protein NfuA | - |
| INP94_RS03985 (INP94_03985) | comM | 804817..806346 (-) | 1530 | WP_197544096.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP94_RS03990 (INP94_03990) | yihA | 806475..807089 (-) | 615 | WP_032822378.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP94_RS03995 (INP94_03995) | - | 807203..808072 (+) | 870 | WP_197544097.1 | VirK/YbjX family protein | - |
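
The coordinates in this record are consistent with 1-based, inclusive intervals, and the Genomic Context window appears to extend 5000 bp on either side of the gene. The short sketch below checks that arithmetic using only the numbers shown above. The 5000 bp flank is an inference from the coordinates, not something the record states, and the names `span_length`, `gene_bp`, and `protein_aa` are illustrative.

```python
# Values copied from the record above.
GENE_START, GENE_END = 801983, 803365        # comE coordinates (+ strand)
WINDOW_START, WINDOW_END = 796983, 808365    # "Location" of the genomic context


def span_length(start: int, end: int) -> int:
    """Length in bp of a closed, 1-based interval start..end."""
    return end - start + 1


gene_bp = span_length(GENE_START, GENE_END)

# A complete CDS includes one stop codon that is not translated,
# so the protein length is (bp / 3) - 1.
protein_aa = gene_bp // 3 - 1

# Flank sizes on each side of the gene (assumed symmetric; inferred, not documented).
flank_left = GENE_START - WINDOW_START
flank_right = WINDOW_END - GENE_END

print(gene_bp, protein_aa, flank_left, flank_right)
```

The same `span_length` check applies to every row of the table, for example 799712..800560 gives the listed 849 bp.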
Sequence
Protein
Length: 460 a.a.; Molecular weight: 51059.63 Da; Isoelectric point: 7.1773
>NTDB_id=493078 INP94_RS03970 WP_197544094.1 801983..803365(+) (comE) [Haemophilus parainfluenzae strain M1C137_2]
MVKQKIKTKCGQFLMCFLILWSTYSAAENRVFSLRLKQAPMVATLQQLALEQNANLMIDDELEGTLSLQLDNVDFDRLLR
SVAKIKGLSFYQENDIYYLGKPSQHEQYAEKMAEPMAISGESLSSEIPLVSTTVKLHFAKASDVMKSLTTGSGSLLSPSG
NITFDDRSNVLLIQDDSRSVKNIKKLIAELDKPIEQIVIEARIVTITDESLKELGVRWGIFNPTEAAHRVSGSLDANGFS
NISNNLNVNFATTVTPAGSLALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTEIPYVVTNGKNDT
QSVEFREAVLGLEVTPHISKDNNILLDLLVSQNSPGNRVAYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKGV
DKVPLLGDIPGIKRLFSKESERHQKRELVIFVTPHILKQGERMEMAKKEKHFKQVEKVKK
Nucleotide
Length: 1383 bp
>NTDB_id=493078 INP94_RS03970 WP_197544094.1 801983..803365(+) (comE) [Haemophilus parainfluenzae strain M1C137_2]
ATGGTAAAGCAGAAAATAAAAACAAAGTGCGGTCAGTTTTTAATGTGTTTTTTGATCCTATGGTCAACTTACTCAGCGGC
AGAAAATCGTGTTTTTTCACTTCGCTTAAAACAGGCGCCCATGGTGGCGACACTCCAGCAACTTGCTCTTGAGCAAAATG
CCAATTTAATGATTGATGATGAGCTAGAAGGAACACTTTCATTGCAATTAGATAACGTGGACTTTGATCGTTTATTGCGT
TCTGTGGCAAAAATCAAAGGGCTCTCTTTTTATCAAGAAAATGATATTTATTATTTGGGTAAGCCTTCTCAACATGAACA
ATATGCAGAGAAAATGGCAGAACCTATGGCAATTAGCGGAGAAAGTTTGTCTAGTGAAATACCTCTTGTGAGTACAACGG
TTAAACTGCATTTTGCCAAGGCCTCTGATGTGATGAAATCGTTAACAACAGGGAGCGGTTCTTTGCTTTCACCTAGTGGC
AACATTACCTTTGATGACCGAAGCAATGTATTACTGATTCAGGATGATTCACGTTCTGTCAAAAATATCAAAAAGTTAAT
CGCAGAGTTGGATAAACCCATTGAGCAAATCGTGATTGAAGCACGTATTGTGACGATTACTGATGAAAGCCTAAAAGAAT
TAGGTGTGCGTTGGGGCATTTTTAATCCTACTGAAGCAGCACATCGAGTGAGTGGTAGTCTAGATGCAAATGGCTTTAGT
AATATCAGTAATAATTTAAACGTGAATTTTGCGACAACGGTCACGCCAGCTGGCTCATTAGCACTTCAAGTGGCTAAAAT
TAATGGTCGATTATTAGATTTAGAATTGACCGCACTTGAACGTGAAAATAACGTAGAAATTATTGCAAGCCCTCGTTTAC
TCACAACTAATAAGAAAAGCGCAAGCATTAAACAAGGGACAGAAATTCCTTATGTAGTGACAAATGGTAAAAATGATACA
CAGTCAGTAGAGTTTCGCGAGGCAGTCTTAGGATTGGAAGTCACACCACATATTTCGAAGGATAATAATATTTTATTGGA
TTTATTAGTCAGTCAAAATTCCCCGGGAAATCGCGTGGCTTACGGGCAAAATGAAGTTGTATCCATTGATAAACAAGAAA
TTAATACACAAGTTTTTGCCAAAGATGGCGAAACAATTGTATTGGGAGGGGTATTCCACGACACAATCACGAAAGGTGTC
GATAAAGTACCATTATTGGGCGATATTCCAGGTATTAAGCGTCTATTTAGTAAGGAAAGTGAACGACATCAAAAACGCGA
ACTCGTTATTTTTGTGACCCCTCATATTTTGAAACAAGGTGAAAGAATGGAAATGGCTAAGAAAGAAAAGCATTTTAAGC
AAGTTGAAAAAGTGAAAAAATAA
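
The FASTA headers in this record pack several fields into one line: an NTDB identifier, locus tag, protein ID, coordinates with strand, gene name, and organism. As a minimal sketch, assuming every header follows the exact layout seen here, the fields can be pulled apart with a regular expression. The pattern and the function name `parse_header` are hypothetical helpers written for this illustration, not part of any database tooling.

```python
import re

# Header copied from the record above.
HEADER = (">NTDB_id=493078 INP94_RS03970 WP_197544094.1 "
          "801983..803365(+) (comE) "
          "[Haemophilus parainfluenzae strain M1C137_2]")

# Assumed layout: >NTDB_id=<id> <locus> <protein> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header of the assumed layout into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"header does not match the assumed layout: {line!r}")
    fields = match.groupdict()
    # Coordinates are more useful as integers than as strings.
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


record = parse_header(HEADER)
print(record["locus_tag"], record["gene"], record["start"], record["end"], record["strand"])
```

Headers that deviate from this layout raise a `ValueError` rather than returning partial fields, which makes format drift easy to notice.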
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comE | Haemophilus influenzae Rd KW20 | 73.497 | 97.609 | 0.717 |
| comE | Haemophilus influenzae 86-028NP | 72.606 | 97.609 | 0.709 |
| comE | Glaesserella parasuis strain SC1401 | 53.774 | 92.174 | 0.496 |
| pilQ | Vibrio campbellii strain DS40M4 | 41.801 | 94.13 | 0.393 |
| pilQ | Vibrio cholerae O1 biovar El Tor strain E7946 | 41.981 | 92.174 | 0.387 |
| pilQ | Vibrio cholerae strain A1552 | 41.981 | 92.174 | 0.387 |
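
This record does not define the Ha-value, but the listed values are numerically consistent with the product of the identity and coverage fractions, rounded to three decimals. The sketch below reproduces the column under that assumption. The formula is inferred from the numbers in this table only, and the name `ha_value` is illustrative.

```python
# (protein, organism, identity %, coverage %, listed Ha-value), copied from the table above.
HITS = [
    ("comE", "Haemophilus influenzae Rd KW20", 73.497, 97.609, 0.717),
    ("comE", "Haemophilus influenzae 86-028NP", 72.606, 97.609, 0.709),
    ("comE", "Glaesserella parasuis strain SC1401", 53.774, 92.174, 0.496),
    ("pilQ", "Vibrio campbellii strain DS40M4", 41.801, 94.13, 0.393),
    ("pilQ", "Vibrio cholerae O1 biovar El Tor strain E7946", 41.981, 92.174, 0.387),
    ("pilQ", "Vibrio cholerae strain A1552", 41.981, 92.174, 0.387),
]


def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round((identity_pct / 100.0) * (coverage_pct / 100.0), 3)


computed = [ha_value(identity, coverage) for _, _, identity, coverage, _ in HITS]
listed = [value for *_, value in HITS]

for (name, organism, *_), calc in zip(HITS, computed):
    print(f"{name:5s} {organism:48s} {calc:.3f}")
```

Under this reading, a hit needs both high identity and high coverage to score well, which is why the Vibrio pilQ hits rank lowest despite covering more than 92% of the query.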