Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INP93_RS03650 | Genome accession | NZ_CP063121 |
| Coordinates | 739094..740623 (-) | Length | 509 a.a. |
| NCBI ID | WP_197545167.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C130_2 | ||
| Function | required for natural transformation (predicted from homology) Unclear |
Genomic Context
Location: 734094..745623
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP93_RS03620 (INP93_03620) | - | 734824..735336 (+) | 513 | WP_197545164.1 | PilN domain-containing protein | - |
| INP93_RS03625 (INP93_03625) | - | 735333..735875 (+) | 543 | WP_049369637.1 | hypothetical protein | - |
| INP93_RS03630 (INP93_03630) | - | 735875..736258 (+) | 384 | WP_197545165.1 | pilus assembly protein PilP | - |
| INP93_RS03635 (INP93_03635) | comE | 736260..737642 (+) | 1383 | WP_049369641.1 | type IV pilus secretin PilQ | Machinery gene |
| INP93_RS03640 (INP93_03640) | - | 737662..738351 (+) | 690 | WP_049369643.1 | ComF family protein | - |
| INP93_RS03645 (INP93_03645) | nfuA | 738466..739050 (+) | 585 | WP_197545166.1 | Fe-S biogenesis protein NfuA | - |
| INP93_RS03650 (INP93_03650) | comM | 739094..740623 (-) | 1530 | WP_197545167.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP93_RS03655 (INP93_03655) | yihA | 740753..741367 (-) | 615 | WP_065243638.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP93_RS03660 (INP93_03660) | - | 741481..742350 (+) | 870 | WP_197545168.1 | VirK/YbjX family protein | - |
| INP93_RS03665 (INP93_03665) | oppF | 742389..743387 (-) | 999 | WP_111327710.1 | murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF | - |
| INP93_RS03670 (INP93_03670) | - | 743384..744355 (-) | 972 | WP_111327711.1 | ABC transporter ATP-binding protein | - |
| INP93_RS03675 (INP93_03675) | oppC | 744365..745300 (-) | 936 | WP_111327712.1 | oligopeptide ABC transporter permease OppC | - |
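The numbers in the tables above are mutually consistent if coordinates are read as 1-based inclusive ranges: the comM span 739094..740623 gives 1530 bp, which corresponds to 509 amino acids plus one stop codon, and the Genomic Context location matches the gene span extended by 5000 bp on each side. The following is a minimal Python sketch of that arithmetic. The helper names are hypothetical, and the 5000 bp flank is an assumption inferred from the figures shown here, not a documented setting of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS that ends in exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Gene span widened by `flank` bp on each side (flank size is assumed)."""
    return max(1, start - flank), end + flank


# comM coordinates taken from the Overview table
COMM_START, COMM_END = 739094, 740623

cds_bp = span_length(COMM_START, COMM_END)
print(cds_bp)                                # 1530
print(protein_length(cds_bp))                # 509
print(context_window(COMM_START, COMM_END))  # (734094, 745623)
```

The same `span_length` check can be applied to any row of the Genomic Context table, for example 736260..737642 for comE, which gives the listed 1383 bp.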
Sequence
Protein
Length: 509 a.a.; Molecular weight: 55753.17 Da; Isoelectric point: 9.7439
>NTDB_id=493111 INP93_RS03650 WP_197545167.1 739094..740623(-) (comM) [Haemophilus parainfluenzae strain M1C130_2]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLFAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLARAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
Nucleotide
Length: 1530 bp
>NTDB_id=493111 INP93_RS03650 WP_197545167.1 739094..740623(-) (comM) [Haemophilus parainfluenzae strain M1C130_2]
ATGTCTCTTGCCATTGTTTATAGCCGTGCCTCAATGGGTGTACAAGCCCCTCTTGTTACCATTGAGGTACATTTAAGTAA
TGGGAAGCCTGGATTTACCTTAGTTGGTCTCCCCGAAAAAACCGTAAAAGAAGCGCAAGATCGGGTCAGAAGTGCATTAA
TGAATGCCCAGTTCAAATATCCTGCTAAACGCATCACCGTCAACCTTGCACCAGCCGATTTACCCAAAGAAGGGGGACGA
TTTGATTTACCCATTGCGATAGGTATCTTGGCTGCATCAGATCAACTCGACGGTAGCCGTTTAAAACAATTTGAATTCGT
TGGCGAACTCGCATTAACAGGTGAATTACGTGGCGTACACGGTGTCATTCCTGCCATTTTAGCGGCACAAAAAGCGAAAC
GAGCCCCCATTATCGCTTATCAAAATGCCAATGAGGCTTCGCTTGTTTCAGAACAAGAGACTTATTTTGCTAAAAACCTT
TTGGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCCTCATTATTCGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTAGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCACTAACAATAGCTGCCGCAG
GTCAGCATAATCTCCTCTTTTTAGGTCCACCCGGTACGGGTAAAACCATGTTGGCAAGTCGTCTCACGGCATTGCTCCCT
GAAATGACGGACTTAGAAGCGATTGAAACAGCTTCAGTTACGAGCTTAGTCCAAAATGAGTTAAATTTTCACAATTGGAA
ACAACGTCCATTTCGAGCGCCACATCATAGCGCCTCTTTGCCTGCCTTAGTTGGCGGAGGAACCATTCCAAAACCAGGCG
AAATATCGCTCGCACATAACGGCGTACTTTTTCTTGATGAATTGCCTGAATTTGAGCGTAAAGTGCTTGATGCACTTCGC
CAACCATTAGAAAGTGGAGAAATTATTATTTCGCGTGCCAATGCTAAAATCCAATTTCCTGCTCGCTTTCAATTAGTTGC
GGCAATGAATCCTAGTCCAACGGGGCATTATACTGGAACACATAATCGTACCTCGCCACAACAAGTGATGCGTTATTTAA
ACCGCCTTTCAGGGCCGTTTTTAGATCGTTTTGATTTATCTATTGAAGTACCTCTTCTACCACAAGGTAGTTTACAAAAT
ACGGGAGATAGAGGCGAAACAAGCCAACAAGTACGAGAAAAAGTCTTAAAAGTAAGGGAAATCCAGCTTGCTCGAGCCGG
TAAAATCAATGCGTATCTTTCAAGCAAAGAAATTGAACGGGACTGTAAATTACAGGATAAAGATGCGCTATTTCTAGAGA
ATGCTTTAAATAAACTCGGGCTTTCAGTGAGAGCCTACCATCGGATTTTAAAAGTCTCACGTACAATTGCCGATTTAAAT
GGCGAAAAAGAAATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGAGCGATGGATCGGTTGTTACAGAAATTATC
TGCTGCTTAA
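The FASTA headers in this record pack several fields into one line: a database ID, locus tag, protein ID, coordinates with strand, gene name, and organism. A minimal Python sketch for pulling those fields apart is shown below. The regular expression and the `parse_header` name are hypothetical, and the field layout is inferred only from the two headers shown on this page, so it may not hold for every record.

```python
import re

# Field layout assumed from the headers in this record, not from a format spec.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split one FASTA header of the form used in this record into fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    record = match.groupdict()
    # Coordinates are converted to integers; everything else stays a string.
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (
    ">NTDB_id=493111 INP93_RS03650 WP_197545167.1 739094..740623(-) "
    "(comM) [Haemophilus parainfluenzae strain M1C130_2]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```

Keeping the coordinates as integers makes it easy to cross-check the parsed span against the stated nucleotide length of 1530 bp.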
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 | 92.505 | 99.607 | 0.921 |
| comM | Glaesserella parasuis strain SC1401 | 79.684 | 99.607 | 0.794 |
| comM | Vibrio cholerae strain A1552 | 67.323 | 99.804 | 0.672 |
| comM | Vibrio campbellii strain DS40M4 | 65.945 | 99.804 | 0.658 |
| comM | Legionella pneumophila str. Paris | 51.984 | 99.018 | 0.515 |
| comM | Legionella pneumophila strain ERS1305867 | 51.984 | 99.018 | 0.515 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 47.379 | 100 | 0.479 |