Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INP95_RS03575 | Genome accession | NZ_CP063117 |
| Coordinates | 739054..740583 (-) | Length | 509 a.a. |
| NCBI ID | WP_197560914.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C142_1 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 734054..745583
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP95_RS03545 (INP95_03545) | - | 734787..735299 (+) | 513 | WP_070592528.1 | PilN domain-containing protein | - |
| INP95_RS03550 (INP95_03550) | - | 735296..735838 (+) | 543 | WP_070592526.1 | competence protein ComC | - |
| INP95_RS03555 (INP95_03555) | - | 735838..736221 (+) | 384 | WP_049385075.1 | pilus assembly protein PilP | - |
| INP95_RS03560 (INP95_03560) | comE | 736223..737605 (+) | 1383 | WP_049385074.1 | type IV pilus secretin PilQ | Machinery gene |
| INP95_RS03565 (INP95_03565) | - | 737625..738314 (+) | 690 | WP_070592520.1 | ComF family protein | - |
| INP95_RS03570 (INP95_03570) | nfuA | 738430..739014 (+) | 585 | WP_005694707.1 | Fe-S biogenesis protein NfuA | - |
| INP95_RS03575 (INP95_03575) | comM | 739054..740583 (-) | 1530 | WP_197560914.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP95_RS03580 (INP95_03580) | yihA | 740713..741327 (-) | 615 | WP_049385069.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP95_RS03585 (INP95_03585) | - | 741440..742309 (+) | 870 | WP_197560915.1 | VirK/YbjX family protein | - |
| INP95_RS03590 (INP95_03590) | oppF | 742349..743347 (-) | 999 | WP_005700062.1 | murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF | - |
| INP95_RS03595 (INP95_03595) | - | 743344..744315 (-) | 972 | WP_049385065.1 | ABC transporter ATP-binding protein | - |
| INP95_RS03600 (INP95_03600) | oppC | 744325..745260 (-) | 936 | WP_049385064.1 | oligopeptide ABC transporter permease OppC | - |
Sequence
Protein
Download Length: 509 a.a. Molecular weight: 55745.24 Da Isoelectric Point: 9.7439
>NTDB_id=493047 INP95_RS03575 WP_197560914.1 739054..740583(-) (comM) [Haemophilus parainfluenzae strain M1C142_1]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIITYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
AGDRGETSQQVREKVLKVREIQLARAGKINAYLLSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIITYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
AGDRGETSQQVREKVLKVREIQLARAGKINAYLLSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
Nucleotide
Download Length: 1530 bp
>NTDB_id=493047 INP95_RS03575 WP_197560914.1 739054..740583(-) (comM) [Haemophilus parainfluenzae strain M1C142_1]
ATGTCTCTTGCCATTGTTTATAGCCGTGCATCAATGGGGGTACAAGCCCCTCTTGTTACCATTGAGGTACACTTAAGTAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACCGTAAAAGAAGCGCAAGATCGTGTCAGAAGTGCATTAA
TGAATGCGCAGTTCAAATATCCTGCCAAACGCATCACCGTAAATCTTGCACCGGCTGATTTACCCAAAGAAGGAGGGAGA
TTTGATTTACCTATTGCAATCGGTATTTTAGCTGCATCAGATCAATTAGACGGTAGCCGTTTAAAACAATTTGAATTCGT
TGGCGAGCTCGCCTTGACGGGGGAATTACGTGGCGTACACGGTGTCATTCCTGCCATTTTAGCGGCACAAAAAGCCAAAC
GTGCCCCCATTATCACTTATCAAAATGCCAATGAGGCGTCGCTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAATCTG
CTAGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCTTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTGGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCTCTAACGATAGCGGCAGCTG
GTCAGCACAACCTACTCTTTTTAGGGCCACCTGGGACGGGTAAAACCATGTTGGCAAGTCGCCTCACGGCATTGCTCCCT
GAAATGACGGATTTAGAAGCCATTGAAACCGCTTCGGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGACCATTCCGAGCACCACATCATAGTGCCTCATTACCAGCCTTAGTGGGTGGAGGAACCATTCCCAAACCAGGCG
AAATATCGCTTGCACACAATGGTGTACTTTTCCTTGATGAACTACCTGAATTTGAGCGTAAAGTACTGGACGCTCTACGA
CAACCCTTAGAAAGTGGAGAAATTATTATTTCTCGTGCCAATGCGAAAATCCAATTCCCTGCTCGCTTTCAATTAGTTGC
GGCGATGAACCCAAGTCCAACGGGGCATTATACCGGAACACATAACCGCACCTCACCGCAGCAAGTGATGCGTTATTTAA
ATCGCCTTTCAGGGCCGTTTTTAGATCGTTTTGATTTATCTATTGAAGTCCCTCTTCTGCCACAAGGTAGCTTACAAAAT
GCGGGAGATAGAGGTGAGACAAGCCAACAAGTGCGAGAAAAAGTGTTAAAAGTGAGAGAAATTCAACTTGCTCGAGCCGG
TAAAATAAATGCGTATCTTTTAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTTCTGGAGA
ATGCTTTAAATAAACTCGGGCTTTCTGTTAGAGCCTACCATCGGATTTTAAAAGTCTCACGCACAATTGCCGATTTAAAT
GGCGAAAAAGAAATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGAGCGATGGATCGGTTGTTACAGAAATTATC
TGCTGCTTAA
ATGTCTCTTGCCATTGTTTATAGCCGTGCATCAATGGGGGTACAAGCCCCTCTTGTTACCATTGAGGTACACTTAAGTAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACCGTAAAAGAAGCGCAAGATCGTGTCAGAAGTGCATTAA
TGAATGCGCAGTTCAAATATCCTGCCAAACGCATCACCGTAAATCTTGCACCGGCTGATTTACCCAAAGAAGGAGGGAGA
TTTGATTTACCTATTGCAATCGGTATTTTAGCTGCATCAGATCAATTAGACGGTAGCCGTTTAAAACAATTTGAATTCGT
TGGCGAGCTCGCCTTGACGGGGGAATTACGTGGCGTACACGGTGTCATTCCTGCCATTTTAGCGGCACAAAAAGCCAAAC
GTGCCCCCATTATCACTTATCAAAATGCCAATGAGGCGTCGCTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAATCTG
CTAGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCTTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTGGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCTCTAACGATAGCGGCAGCTG
GTCAGCACAACCTACTCTTTTTAGGGCCACCTGGGACGGGTAAAACCATGTTGGCAAGTCGCCTCACGGCATTGCTCCCT
GAAATGACGGATTTAGAAGCCATTGAAACCGCTTCGGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGACCATTCCGAGCACCACATCATAGTGCCTCATTACCAGCCTTAGTGGGTGGAGGAACCATTCCCAAACCAGGCG
AAATATCGCTTGCACACAATGGTGTACTTTTCCTTGATGAACTACCTGAATTTGAGCGTAAAGTACTGGACGCTCTACGA
CAACCCTTAGAAAGTGGAGAAATTATTATTTCTCGTGCCAATGCGAAAATCCAATTCCCTGCTCGCTTTCAATTAGTTGC
GGCGATGAACCCAAGTCCAACGGGGCATTATACCGGAACACATAACCGCACCTCACCGCAGCAAGTGATGCGTTATTTAA
ATCGCCTTTCAGGGCCGTTTTTAGATCGTTTTGATTTATCTATTGAAGTCCCTCTTCTGCCACAAGGTAGCTTACAAAAT
GCGGGAGATAGAGGTGAGACAAGCCAACAAGTGCGAGAAAAAGTGTTAAAAGTGAGAGAAATTCAACTTGCTCGAGCCGG
TAAAATAAATGCGTATCTTTTAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTTCTGGAGA
ATGCTTTAAATAAACTCGGGCTTTCTGTTAGAGCCTACCATCGGATTTTAAAAGTCTCACGCACAATTGCCGATTTAAAT
GGCGAAAAAGAAATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGAGCGATGGATCGGTTGTTACAGAAATTATC
TGCTGCTTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
92.11 |
99.607 |
0.917 |
| comM | Glaesserella parasuis strain SC1401 |
79.882 |
99.607 |
0.796 |
| comM | Vibrio cholerae strain A1552 |
67.126 |
99.804 |
0.67 |
| comM | Vibrio campbellii strain DS40M4 |
66.012 |
100 |
0.66 |
| comM | Legionella pneumophila str. Paris |
51.984 |
99.018 |
0.515 |
| comM | Legionella pneumophila strain ERS1305867 |
51.984 |
99.018 |
0.515 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
47.379 |
100 |
0.479 |