Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INP90_RS03535 | Genome accession | NZ_CP063124 |
| Coordinates | 722207..723736 (-) | Length | 509 a.a. |
| NCBI ID | WP_197542727.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C113_1 |
| Function | required for natural transformation (predicted from homology); unclear |
Genomic Context
Location: 717207..728736
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP90_RS03505 (INP90_03505) | - | 717936..718448 (+) | 513 | WP_197559947.1 | competence protein ComB | - |
| INP90_RS03510 (INP90_03510) | - | 718445..718987 (+) | 543 | WP_197542724.1 | competence protein ComC | - |
| INP90_RS03515 (INP90_03515) | - | 718987..719370 (+) | 384 | WP_070582959.1 | pilus assembly protein PilP | - |
| INP90_RS03520 (INP90_03520) | comE | 719372..720754 (+) | 1383 | WP_197542725.1 | type IV pilus secretin PilQ | Machinery gene |
| INP90_RS03525 (INP90_03525) | - | 720774..721463 (+) | 690 | WP_197542726.1 | ComF family protein | - |
| INP90_RS03530 (INP90_03530) | nfuA | 721579..722163 (+) | 585 | WP_014064659.1 | Fe-S biogenesis protein NfuA | - |
| INP90_RS03535 (INP90_03535) | comM | 722207..723736 (-) | 1530 | WP_197542727.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP90_RS03540 (INP90_03540) | yihA | 723866..724480 (-) | 615 | WP_197559948.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP90_RS03545 (INP90_03545) | - | 724594..725463 (+) | 870 | WP_070582963.1 | VirK/YbjX family protein | - |
| INP90_RS03550 (INP90_03550) | oppF | 725503..726501 (-) | 999 | WP_105895958.1 | murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF | - |
| INP90_RS03555 (INP90_03555) | - | 726498..727469 (-) | 972 | WP_070582965.1 | ABC transporter ATP-binding protein | - |
| INP90_RS03560 (INP90_03560) | oppC | 727479..728414 (-) | 936 | WP_049373849.1 | oligopeptide ABC transporter permease OppC | - |
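The coordinates reported above are mutually consistent, which a little arithmetic confirms. The sketch below is illustrative only: the helper name `span_length` is hypothetical, and reading the Location window as the comM gene extended by 5,000 bp on each side is inferred from the numbers, not stated by the source.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a closed, 1-based interval start..end."""
    return end - start + 1


gene_start, gene_end = 722207, 723736      # comM coordinates (minus strand)
window_start, window_end = 717207, 728736  # "Location" of the genomic context

gene_bp = span_length(gene_start, gene_end)

# A coding sequence of N bp holds N/3 codons; the last one is the stop codon,
# so the protein is one residue shorter than the codon count.
protein_aa = gene_bp // 3 - 1

# Distance from each window edge to the nearest gene boundary
# (assumed to be a symmetric flank around the gene).
left_flank = gene_start - window_start
right_flank = window_end - gene_end

print(gene_bp, protein_aa, left_flank, right_flank)
```

The computed gene length and protein length agree with the 1530 bp and 509 a.a. values listed in the tables.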
Sequence
Protein
Length: 509 a.a. Molecular weight: 55753.13 Da Isoelectric point: 9.6293
>NTDB_id=493204 INP90_RS03535 WP_197542727.1 722207..723736(-) (comM) [Haemophilus parainfluenzae strain M1C113_1]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLDRAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQSHLAEALGYRAMDRLLQKLSAA
Nucleotide
Length: 1530 bp
>NTDB_id=493204 INP90_RS03535 WP_197542727.1 722207..723736(-) (comM) [Haemophilus parainfluenzae strain M1C113_1]
ATGTCTCTTGCTATTGTTTACAGCCGTGCATCAATGGGTGTACAAGCCCCTCTTGTTACCATTGAGGTACACTTAAGCAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACCGTAAAAGAAGCACAAGATCGTGTCAGAAGTGCATTAA
TGAATGCCCAGTTCAAATATCCTGCTAAACGCATCACCGTCAACCTTGCACCGGCCGATTTACCCAAAGAAGGGGGTCGA
TTTGATTTACCTATTGCAATCGGTATTTTAGCCGCATCAGATCAACTTGATGGCAGCCGTTTAAAACAGTTTGAATTCGT
TGGTGAACTTGCATTAACAGGTGAATTACGTGGTGTACACGGCGTGATTCCCGCCATTTTAGCAGCGCAAAAAGCAAAAC
GAGCCCCCATTATTGCTTATCAAAATGCTAATGAAGCTTCACTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAACCTT
TTGGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCCTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTAGATTTGACGGATATTATTGGTCAGCAACACGCCAAACGAGCTCTGACGATTGCAGCCGCAG
GCCAGCACAACCTACTCTTTTTAGGTCCTCCCGGTACAGGTAAAACCATGTTGGCAAGTCGTCTCACTGCATTGCTCCCT
GAAATGACAGATTTAGAAGCCATTGAAACCGCTTCAGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGTCCGTTTCGAGCGCCACATCATAGTGCCTCATTACCTGCCTTAGTAGGTGGAGGAACCATTCCAAAACCAGGCG
AAATATCGCTCGCACATAACGGCGTACTTTTTCTTGATGAATTGCCTGAGTTTGAGCGTAAAGTACTAGATGCACTTCGC
CAGCCATTAGAAAGTGGAGAAATTATTATTTCGCGTGCCAATGCGAAAATCCAATTCCCCGCTCGCTTTCAACTCGTTGC
TGCGATGAATCCAAGTCCAACGGGTCATTATACTGGAACACATAACCGAACCTCACCACAACAAGTGATGCGTTATTTAA
ATCGCCTTTCGGGGCCGTTTTTAGATCGCTTTGATTTATCCATTGAAGTACCTCTTCTGCCACAAGGTAGCTTACAGAAT
ACAGGAGATAGAGGTGAGACAAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGGGAAATTCAACTTGATCGAGCCGG
TAAAATCAATGCATATCTTTCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTCCTTGAGA
ATGCACTCAATAAGCTTGGACTTTCAGTAAGAGCCTATCATCGGATTTTAAAAGTTTCACGCACAATTGCTGATTTAAAT
GGCGAAAAAGAAATACAACAATCTCATTTAGCCGAAGCGCTAGGATATCGGGCCATGGATCGATTATTACAGAAATTATC
TGCTGCTTAA
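The FASTA header lines on this page pack several fields into one string. As a minimal sketch, the fields can be pulled apart with a regular expression. The pattern and the function name `parse_header` are assumptions based solely on the single header layout shown here; other records may use a different layout.

```python
import re

# Header copied from this page.
HEADER = (">NTDB_id=493204 INP90_RS03535 WP_197542727.1 "
          "722207..723736(-) (comM) "
          "[Haemophilus parainfluenzae strain M1C113_1]")

# Assumed layout: id, locus tag, protein ID, start..end(strand), (gene), [organism]
PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header of the assumed layout into named fields."""
    match = PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header layout: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


record = parse_header(HEADER)
# Closed 1-based interval, so the span length is end - start + 1.
length_bp = record["end"] - record["start"] + 1
print(record["gene"], record["strand"], length_bp)
```

The span length derived from the header coordinates can then be compared with the stated nucleotide length as a quick sanity check.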
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 | 92.308 | 99.607 | 0.919 |
| comM | Glaesserella parasuis strain SC1401 | 79.684 | 99.607 | 0.794 |
| comM | Vibrio cholerae strain A1552 | 66.732 | 99.804 | 0.666 |
| comM | Vibrio campbellii strain DS40M4 | 65.815 | 100 | 0.658 |
| comM | Legionella pneumophila str. Paris | 51.984 | 99.018 | 0.515 |
| comM | Legionella pneumophila strain ERS1305867 | 51.984 | 99.018 | 0.515 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 47.379 | 100 | 0.479 |