Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INP96_RS03530 | Genome accession | NZ_CP063116 |
| Coordinates | 720490..722019 (-) | Length | 509 a.a. |
| NCBI ID | WP_197556220.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C146_1 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 715490..727019
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP96_RS03500 (INP96_03500) | - | 716220..716732 (+) | 513 | WP_070582957.1 | competence protein ComB | - |
| INP96_RS03505 (INP96_03505) | - | 716729..717271 (+) | 543 | WP_070582958.1 | competence protein ComC | - |
| INP96_RS03510 (INP96_03510) | - | 717271..717654 (+) | 384 | WP_070582959.1 | pilus assembly protein PilP | - |
| INP96_RS03515 (INP96_03515) | comE | 717656..719038 (+) | 1383 | WP_197556217.1 | type IV pilus secretin PilQ | Machinery gene |
| INP96_RS03520 (INP96_03520) | - | 719059..719748 (+) | 690 | WP_197556218.1 | ComF family protein | - |
| INP96_RS03525 (INP96_03525) | nfuA | 719862..720446 (+) | 585 | WP_197556219.1 | Fe-S biogenesis protein NfuA | - |
| INP96_RS03530 (INP96_03530) | comM | 720490..722019 (-) | 1530 | WP_197556220.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP96_RS03535 (INP96_03535) | yihA | 722149..722763 (-) | 615 | WP_197556221.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP96_RS03540 (INP96_03540) | - | 722877..723746 (+) | 870 | WP_197556222.1 | VirK/YbjX family protein | - |
| INP96_RS03545 (INP96_03545) | oppF | 723786..724784 (-) | 999 | WP_070592507.1 | murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF | - |
| INP96_RS03550 (INP96_03550) | - | 724781..725752 (-) | 972 | WP_197556223.1 | ABC transporter ATP-binding protein | - |
| INP96_RS03555 (INP96_03555) | oppC | 725762..726697 (-) | 936 | WP_005651799.1 | oligopeptide ABC transporter permease OppC | - |
Sequence
Protein
Download Length: 509 a.a. Molecular weight: 55755.20 Da Isoelectric Point: 9.7439
>NTDB_id=493016 INP96_RS03530 WP_197556220.1 720490..722019(-) (comM) [Haemophilus parainfluenzae strain M1C146_1]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALMIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGESSQQVREKVLKVREIQLARAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQSHLAEALGYRAMDRLLQKLSAT
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALMIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGESSQQVREKVLKVREIQLARAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQSHLAEALGYRAMDRLLQKLSAT
Nucleotide
Download Length: 1530 bp
>NTDB_id=493016 INP96_RS03530 WP_197556220.1 720490..722019(-) (comM) [Haemophilus parainfluenzae strain M1C146_1]
ATGTCTCTTGCCATTGTTTATAGCCGTGCATCAATGGGGGTACAAGCTCCTCTTGTTACCATTGAGGTACACTTAAGTAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACAGTAAAAGAAGCGCAAGATCGTGTCAGAAGTGCATTAA
TGAATGCCCAGTTCAAATATCCTGCTAAACGTATCACCGTCAACCTTGCACCAGCCGATTTACCCAAAGAAGGGGGACGA
TTTGATTTACCCATTGCAATCGGTATTTTAGCTGCATCAGATCAACTTGATGGCAGTCGTTTAAAACAGTTTGAATTCGT
TGGCGAGCTCGCATTAACGGGTGAATTACGTGGTGTACACGGCGTGATTCCCGCCATTTTAGCGGCACAAAAAGCGAAAC
GAGCCCCCATTATCGCTTATCAAAATGCCAATGAAGCTTCACTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAATCTT
TTGGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCCTCATTACTCGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTAGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCACTAATGATAGCTGCCGCAG
GCCAGCACAACCTGCTCTTTTTAGGGCCACCTGGGACAGGTAAAACCATGTTAGCAAGTCGTCTCACTGCATTGCTCCCT
GAAATGACCGATTTAGAAGCGATTGAAACCGCTTCGGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGTCCGTTTCGAGCGCCACATCATAGTGCCTCATTACCTGCCTTAGTAGGAGGAGGAACCATTCCCAAACCAGGCG
AAATATCGCTCGCACATAATGGTGTACTTTTCCTTGATGAGCTACCTGAGTTTGAGCGTAAAGTGCTTGATGCACTTCGC
CAGCCATTAGAAAGTGGAGAAATTATTATTTCGCGTGCCAATGCGAAAATCCAATTTCCTGCTCGCTTTCAATTAGTTGC
AGCGATGAATCCAAGTCCAACAGGGCATTATACCGGAACACATAATCGCACCTCGCCTCAGCAAGTGATGCGTTATTTAA
ATCGCCTTTCTGGGCCGTTTTTAGATCGTTTTGATTTATCCATTGAAGTCCCTCTTCTACCGCAAGGTAGCCTACAAAAT
ACGGGGGATAGAGGCGAAAGCAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGGGAAATTCAACTTGCTCGAGCCGG
TAAAATCAATGCGTATCTATCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTCCTTGAGA
ATGCACTGAATAAGCTAGGACTTTCAGTGAGAGCCTATCATCGGATTTTAAAAGTCTCACGTACAATTGCTGATTTAAAT
GGCGAAAAAGAGATACAACAATCTCATTTAGCAGAAGCACTAGGATATCGGGCGATGGATCGGTTGTTACAGAAATTGTC
TGCTACTTAA
ATGTCTCTTGCCATTGTTTATAGCCGTGCATCAATGGGGGTACAAGCTCCTCTTGTTACCATTGAGGTACACTTAAGTAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACAGTAAAAGAAGCGCAAGATCGTGTCAGAAGTGCATTAA
TGAATGCCCAGTTCAAATATCCTGCTAAACGTATCACCGTCAACCTTGCACCAGCCGATTTACCCAAAGAAGGGGGACGA
TTTGATTTACCCATTGCAATCGGTATTTTAGCTGCATCAGATCAACTTGATGGCAGTCGTTTAAAACAGTTTGAATTCGT
TGGCGAGCTCGCATTAACGGGTGAATTACGTGGTGTACACGGCGTGATTCCCGCCATTTTAGCGGCACAAAAAGCGAAAC
GAGCCCCCATTATCGCTTATCAAAATGCCAATGAAGCTTCACTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAATCTT
TTGGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCCTCATTACTCGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTAGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCACTAATGATAGCTGCCGCAG
GCCAGCACAACCTGCTCTTTTTAGGGCCACCTGGGACAGGTAAAACCATGTTAGCAAGTCGTCTCACTGCATTGCTCCCT
GAAATGACCGATTTAGAAGCGATTGAAACCGCTTCGGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGTCCGTTTCGAGCGCCACATCATAGTGCCTCATTACCTGCCTTAGTAGGAGGAGGAACCATTCCCAAACCAGGCG
AAATATCGCTCGCACATAATGGTGTACTTTTCCTTGATGAGCTACCTGAGTTTGAGCGTAAAGTGCTTGATGCACTTCGC
CAGCCATTAGAAAGTGGAGAAATTATTATTTCGCGTGCCAATGCGAAAATCCAATTTCCTGCTCGCTTTCAATTAGTTGC
AGCGATGAATCCAAGTCCAACAGGGCATTATACCGGAACACATAATCGCACCTCGCCTCAGCAAGTGATGCGTTATTTAA
ATCGCCTTTCTGGGCCGTTTTTAGATCGTTTTGATTTATCCATTGAAGTCCCTCTTCTACCGCAAGGTAGCCTACAAAAT
ACGGGGGATAGAGGCGAAAGCAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGGGAAATTCAACTTGCTCGAGCCGG
TAAAATCAATGCGTATCTATCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTCCTTGAGA
ATGCACTGAATAAGCTAGGACTTTCAGTGAGAGCCTATCATCGGATTTTAAAAGTCTCACGTACAATTGCTGATTTAAAT
GGCGAAAAAGAGATACAACAATCTCATTTAGCAGAAGCACTAGGATATCGGGCGATGGATCGGTTGTTACAGAAATTGTC
TGCTACTTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
91.913 |
99.607 |
0.916 |
| comM | Glaesserella parasuis strain SC1401 |
79.487 |
99.607 |
0.792 |
| comM | Vibrio cholerae strain A1552 |
67.126 |
99.804 |
0.67 |
| comM | Vibrio campbellii strain DS40M4 |
66.012 |
100 |
0.66 |
| comM | Legionella pneumophila str. Paris |
52.183 |
99.018 |
0.517 |
| comM | Legionella pneumophila strain ERS1305867 |
52.183 |
99.018 |
0.517 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
47.573 |
100 |
0.481 |