Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INP97_RS03495 | Genome accession | NZ_CP063113 |
| Coordinates | 708429..709958 (-) | Length | 509 a.a. |
| NCBI ID | WP_197563234.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C147_1 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 703429..714958
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP97_RS03465 (INP97_03460) | - | 704159..704671 (+) | 513 | WP_197563225.1 | competence protein ComB | - |
| INP97_RS03470 (INP97_03465) | - | 704668..705210 (+) | 543 | WP_197563227.1 | competence protein ComC | - |
| INP97_RS03475 (INP97_03470) | - | 705210..705593 (+) | 384 | WP_197563228.1 | pilus assembly protein PilP | - |
| INP97_RS03480 (INP97_03475) | comE | 705595..706977 (+) | 1383 | WP_197563230.1 | type IV pilus secretin PilQ | Machinery gene |
| INP97_RS03485 (INP97_03480) | - | 706997..707686 (+) | 690 | WP_197563232.1 | ComF family protein | - |
| INP97_RS03490 (INP97_03485) | nfuA | 707801..708385 (+) | 585 | WP_049365763.1 | Fe-S biogenesis protein NfuA | - |
| INP97_RS03495 (INP97_03490) | comM | 708429..709958 (-) | 1530 | WP_197563234.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP97_RS03500 (INP97_03495) | yihA | 710088..710702 (-) | 615 | WP_197563236.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP97_RS03505 (INP97_03500) | - | 710816..711685 (+) | 870 | WP_197563238.1 | VirK/YbjX family protein | - |
| INP97_RS03510 (INP97_03505) | oppF | 711725..712723 (-) | 999 | WP_197563240.1 | murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF | - |
| INP97_RS03515 (INP97_03510) | - | 712720..713691 (-) | 972 | WP_070776027.1 | ABC transporter ATP-binding protein | - |
| INP97_RS03520 (INP97_03515) | oppC | 713701..714636 (-) | 936 | WP_005700072.1 | oligopeptide ABC transporter permease OppC | - |
Sequence
Protein
Download Length: 509 a.a. Molecular weight: 55672.14 Da Isoelectric Point: 9.7439
>NTDB_id=492984 INP97_RS03495 WP_197563234.1 708429..709958(-) (comM) [Haemophilus parainfluenzae strain M1C147_1]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVPEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSATNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
AGDRGETSQQVRDKVLKVREIQLARAGKINAYLSSKEIERDCKLQDKDALFLEKALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVPEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSATNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
AGDRGETSQQVRDKVLKVREIQLARAGKINAYLSSKEIERDCKLQDKDALFLEKALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
Nucleotide
Download Length: 1530 bp
>NTDB_id=492984 INP97_RS03495 WP_197563234.1 708429..709958(-) (comM) [Haemophilus parainfluenzae strain M1C147_1]
ATGTCTCTTGCTATTGTTTACAGCCGTGCATCAATGGGTGTACAAGCCCCTCTTGTTACGATTGAAGTACATTTAAGTAA
CGGGAAGCCTGGATTTACCTTAGTTGGTCTACCTGAAAAAACCGTAAAAGAAGCGCAAGATCGTGTCAGAAGTGCATTAA
TGAATGCGCAGTTCAAATATCCTGCCAAACGCATCACCGTAAATCTTGCACCGGCTGATTTACCCAAAGAAGGAGGGAGA
TTTGATTTACCTATTGCAATCGGTATTTTAGCAGCATCGGATCAACTCGACGGTAGCCGTTTAAAACAATTTGAATTCGT
TGGCGAGCTCGCCTTGACGGGGGAATTACGTGGCGTACACGGTGTCATTCCTGCCATTTTAGCGGCACAAAAAGCCAAAC
GCGCCCCCATTATCGCTTATCAAAATGCCAATGAGGCATCGCTTGTCCCAGAGCAAGAGACTTATTTCGCTAAAAATCTG
CTAGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCTTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTACAAATCACTTGGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCACTAACAATTGCGGCTGCAG
GTCAGCACAACCTACTCTTTTTAGGACCACCTGGGACGGGTAAAACCATGTTGGCTAGCCGTCTCACTGCATTGCTCCCT
GAAATGACGGATTTAGAAGCCATTGAAACCGCTTCGGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGTCCGTTTCGAGCGCCACATCATAGTGCTTCTTTGCCGGCCTTAGTTGGCGGAGGAACCATTCCAAAACCAGGCG
AAATATCGCTCGCCCATAACGGCGTACTTTTTCTTGATGAATTGCCTGAGTTTGAGCGAAAAGTACTGGATGCGCTTCGT
CAGCCCTTAGAAAGTGGAGAAATTATTATTTCTCGTGCCAATGCGAAAATCCAATTCCCTGCTCGCTTTCAATTAGTTGC
TGCGATGAACCCAAGTCCAACGGGGCATTATACCGGAACACATAACCGCACCTCGCCGCAGCAAGTGATGCGTTATTTAA
ATCGCCTTTCAGGGCCATTTTTAGATCGTTTTGATTTATCTATTGAAGTCCCTCTTCTGCCACAAGGTAGCTTACAAAAT
GCGGGAGATAGAGGTGAGACAAGCCAACAAGTACGAGATAAAGTGTTAAAAGTGAGAGAGATTCAACTTGCTCGAGCCGG
TAAAATCAATGCGTATCTTTCAAGCAAAGAAATTGAACGGGATTGTAAATTACAAGATAAAGATGCCCTGTTTCTTGAAA
AGGCGCTGAATAAGCTAGGACTTTCAGTGAGAGCCTATCATCGGATTTTAAAAGTCTCACGTACAATTGCCGATTTGAAT
GGCGAAAAAGAGATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGAGCCATGGATAGATTGTTACAGAAATTATC
TGCTGCTTAA
ATGTCTCTTGCTATTGTTTACAGCCGTGCATCAATGGGTGTACAAGCCCCTCTTGTTACGATTGAAGTACATTTAAGTAA
CGGGAAGCCTGGATTTACCTTAGTTGGTCTACCTGAAAAAACCGTAAAAGAAGCGCAAGATCGTGTCAGAAGTGCATTAA
TGAATGCGCAGTTCAAATATCCTGCCAAACGCATCACCGTAAATCTTGCACCGGCTGATTTACCCAAAGAAGGAGGGAGA
TTTGATTTACCTATTGCAATCGGTATTTTAGCAGCATCGGATCAACTCGACGGTAGCCGTTTAAAACAATTTGAATTCGT
TGGCGAGCTCGCCTTGACGGGGGAATTACGTGGCGTACACGGTGTCATTCCTGCCATTTTAGCGGCACAAAAAGCCAAAC
GCGCCCCCATTATCGCTTATCAAAATGCCAATGAGGCATCGCTTGTCCCAGAGCAAGAGACTTATTTCGCTAAAAATCTG
CTAGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCTTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTACAAATCACTTGGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCACTAACAATTGCGGCTGCAG
GTCAGCACAACCTACTCTTTTTAGGACCACCTGGGACGGGTAAAACCATGTTGGCTAGCCGTCTCACTGCATTGCTCCCT
GAAATGACGGATTTAGAAGCCATTGAAACCGCTTCGGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGTCCGTTTCGAGCGCCACATCATAGTGCTTCTTTGCCGGCCTTAGTTGGCGGAGGAACCATTCCAAAACCAGGCG
AAATATCGCTCGCCCATAACGGCGTACTTTTTCTTGATGAATTGCCTGAGTTTGAGCGAAAAGTACTGGATGCGCTTCGT
CAGCCCTTAGAAAGTGGAGAAATTATTATTTCTCGTGCCAATGCGAAAATCCAATTCCCTGCTCGCTTTCAATTAGTTGC
TGCGATGAACCCAAGTCCAACGGGGCATTATACCGGAACACATAACCGCACCTCGCCGCAGCAAGTGATGCGTTATTTAA
ATCGCCTTTCAGGGCCATTTTTAGATCGTTTTGATTTATCTATTGAAGTCCCTCTTCTGCCACAAGGTAGCTTACAAAAT
GCGGGAGATAGAGGTGAGACAAGCCAACAAGTACGAGATAAAGTGTTAAAAGTGAGAGAGATTCAACTTGCTCGAGCCGG
TAAAATCAATGCGTATCTTTCAAGCAAAGAAATTGAACGGGATTGTAAATTACAAGATAAAGATGCCCTGTTTCTTGAAA
AGGCGCTGAATAAGCTAGGACTTTCAGTGAGAGCCTATCATCGGATTTTAAAAGTCTCACGTACAATTGCCGATTTGAAT
GGCGAAAAAGAGATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGAGCCATGGATAGATTGTTACAGAAATTATC
TGCTGCTTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
91.913 |
99.607 |
0.916 |
| comM | Glaesserella parasuis strain SC1401 |
79.487 |
99.607 |
0.792 |
| comM | Vibrio cholerae strain A1552 |
67.126 |
99.804 |
0.67 |
| comM | Vibrio campbellii strain DS40M4 |
65.354 |
99.804 |
0.652 |
| comM | Legionella pneumophila str. Paris |
51.984 |
99.018 |
0.515 |
| comM | Legionella pneumophila strain ERS1305867 |
51.984 |
99.018 |
0.515 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
47.07 |
100 |
0.473 |