Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INP91_RS02430 | Genome accession | NZ_CP063123 |
| Coordinates | 500025..501554 (-) | Length | 509 a.a. |
| NCBI ID | WP_049375812.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C120_2 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 495025..506554
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP91_RS02400 (INP91_02400) | - | 495753..496265 (+) | 513 | WP_049365758.1 | hypothetical protein | - |
| INP91_RS02405 (INP91_02405) | - | 496337..496807 (+) | 471 | WP_232086149.1 | YdbH domain-containing protein | - |
| INP91_RS02410 (INP91_02410) | - | 496804..497187 (+) | 384 | WP_049365760.1 | pilus assembly protein PilP | - |
| INP91_RS02415 (INP91_02415) | comE | 497189..498571 (+) | 1383 | WP_197546138.1 | type IV pilus secretin PilQ | Machinery gene |
| INP91_RS02420 (INP91_02420) | - | 498591..499280 (+) | 690 | WP_197546139.1 | ComF family protein | - |
| INP91_RS02425 (INP91_02425) | nfuA | 499396..499980 (+) | 585 | WP_005694707.1 | Fe-S biogenesis protein NfuA | - |
| INP91_RS02430 (INP91_02430) | comM | 500025..501554 (-) | 1530 | WP_049375812.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP91_RS02435 (INP91_02435) | yihA | 501683..502297 (-) | 615 | WP_032822378.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP91_RS02440 (INP91_02440) | - | 502411..503280 (+) | 870 | WP_197546140.1 | VirK/YbjX family protein | - |
| INP91_RS02445 (INP91_02445) | oppF | 503320..504318 (-) | 999 | WP_005700062.1 | murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF | - |
| INP91_RS02450 (INP91_02450) | - | 504315..505286 (-) | 972 | WP_070868468.1 | ABC transporter ATP-binding protein | - |
| INP91_RS02455 (INP91_02455) | oppC | 505296..506231 (-) | 936 | WP_005700072.1 | oligopeptide ABC transporter permease OppC | - |
Sequence
Protein
Download Length: 509 a.a. Molecular weight: 55719.15 Da Isoelectric Point: 9.7439
>NTDB_id=493167 INP91_RS02430 WP_049375812.1 500025..501554(-) (comM) [Haemophilus parainfluenzae strain M1C120_2]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLARAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLARAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
Nucleotide
Download Length: 1530 bp
>NTDB_id=493167 INP91_RS02430 WP_049375812.1 500025..501554(-) (comM) [Haemophilus parainfluenzae strain M1C120_2]
ATGTCTCTTGCCATTGTTTATAGCCGTGCATCAATGGGGGTACAAGCCCCTCTTGTTACCATTGAGGTACACTTAAGTAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACCGTAAAAGAAGCGCAAGATCGCGTTAGAAGTGCATTAA
TGAATGCGCAGTTCAAATATCCTGCCAAACGCATCACCGTAAATCTTGCACCGGCTGATTTACCCAAAGAAGGGGGACGA
TTTGATTTACCCATTGCGATAGGTATCTTGGCTGCATCAGATCAATTAGATGGTAGTCGTTTAAAACAATTTGAATTCGT
TGGTGAACTCGCATTAACGGGTGAATTACGTGGCGTACACGGTGTCATTCCTGCTATTTTAGCGGCACAAAAAGCCAAAC
GTGCCCCCATTATCGCTTATCAAAATGCCAATGAGGCATCGCTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAATCTG
CTAGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCGTTGGCTTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTGGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCACTAACAATTGCGGCTGCAG
GTCAGCATAATCTGCTCTTTTTAGGACCTCCTGGTACAGGTAAAACCATGTTGGCAAGTCGTCTCACTGCATTGCTCCCT
GAAATGACCGATTTAGAAGCCATTGAAACCGCTTCGGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGACCATTCCGAGCACCACATCATAGTGCCTCATTACCAGCCTTAGTGGGCGGAGGAACCATTCCAAAACCAGGCG
AAATATCGCTCGCACATAACGGCGTACTTTTTCTTGATGAACTACCTGAGTTTGAGCGAAAAGTACTGGATGCGCTTCGT
CAGCCCTTAGAAAGTGGAGAAATTATTATTTCTCGTGCCAATGCGAAAATCCAATTCCCTGCTCGCTTTCAATTAGTTGC
TGCGATGAACCCAAGTCCAACGGGGCATTATACCGGAACACATAATCGTACCTCACCGCAGCAAGTGATGCGTTATTTAA
ATCGCCTTTCAGGGCCATTTTTAGATCGCTTTGATTTGTCCATTGAAGTGCCTCTTCTGCCACAAGGTAGTTTACAAAAT
ACGGGAGATAGAGGCGAAACAAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGGGAAATTCAACTTGCTCGAGCCGG
TAAAATCAATGCGTATCTTTCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTTCTGGAGA
ATGCTTTAAATAAACTCGGGCTTTCTGTTAGAGCCTACCATCGGATTTTAAAAGTCTCACGTACAATTGCCGATTTAAAT
GGTGAAAAAGAAATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGGGCGATGGATCGGTTGTTACAAAAATTATC
TGCTGCTTAA
ATGTCTCTTGCCATTGTTTATAGCCGTGCATCAATGGGGGTACAAGCCCCTCTTGTTACCATTGAGGTACACTTAAGTAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACCGTAAAAGAAGCGCAAGATCGCGTTAGAAGTGCATTAA
TGAATGCGCAGTTCAAATATCCTGCCAAACGCATCACCGTAAATCTTGCACCGGCTGATTTACCCAAAGAAGGGGGACGA
TTTGATTTACCCATTGCGATAGGTATCTTGGCTGCATCAGATCAATTAGATGGTAGTCGTTTAAAACAATTTGAATTCGT
TGGTGAACTCGCATTAACGGGTGAATTACGTGGCGTACACGGTGTCATTCCTGCTATTTTAGCGGCACAAAAAGCCAAAC
GTGCCCCCATTATCGCTTATCAAAATGCCAATGAGGCATCGCTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAATCTG
CTAGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCGTTGGCTTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTGGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCACTAACAATTGCGGCTGCAG
GTCAGCATAATCTGCTCTTTTTAGGACCTCCTGGTACAGGTAAAACCATGTTGGCAAGTCGTCTCACTGCATTGCTCCCT
GAAATGACCGATTTAGAAGCCATTGAAACCGCTTCGGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGACCATTCCGAGCACCACATCATAGTGCCTCATTACCAGCCTTAGTGGGCGGAGGAACCATTCCAAAACCAGGCG
AAATATCGCTCGCACATAACGGCGTACTTTTTCTTGATGAACTACCTGAGTTTGAGCGAAAAGTACTGGATGCGCTTCGT
CAGCCCTTAGAAAGTGGAGAAATTATTATTTCTCGTGCCAATGCGAAAATCCAATTCCCTGCTCGCTTTCAATTAGTTGC
TGCGATGAACCCAAGTCCAACGGGGCATTATACCGGAACACATAATCGTACCTCACCGCAGCAAGTGATGCGTTATTTAA
ATCGCCTTTCAGGGCCATTTTTAGATCGCTTTGATTTGTCCATTGAAGTGCCTCTTCTGCCACAAGGTAGTTTACAAAAT
ACGGGAGATAGAGGCGAAACAAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGGGAAATTCAACTTGCTCGAGCCGG
TAAAATCAATGCGTATCTTTCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTTCTGGAGA
ATGCTTTAAATAAACTCGGGCTTTCTGTTAGAGCCTACCATCGGATTTTAAAAGTCTCACGTACAATTGCCGATTTAAAT
GGTGAAAAAGAAATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGGGCGATGGATCGGTTGTTACAAAAATTATC
TGCTGCTTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
92.505 |
99.607 |
0.921 |
| comM | Glaesserella parasuis strain SC1401 |
79.882 |
99.607 |
0.796 |
| comM | Vibrio cholerae strain A1552 |
67.126 |
99.804 |
0.67 |
| comM | Vibrio campbellii strain DS40M4 |
66.012 |
100 |
0.66 |
| comM | Legionella pneumophila str. Paris |
52.183 |
99.018 |
0.517 |
| comM | Legionella pneumophila strain ERS1305867 |
52.183 |
99.018 |
0.517 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
47.379 |
100 |
0.479 |