Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INP94_RS03985 | Genome accession | NZ_CP063120 |
| Coordinates | 804817..806346 (-) | Length | 509 a.a. |
| NCBI ID | WP_197544096.1 | Uniprot ID | A0A7M1NYL4 |
| Organism | Haemophilus parainfluenzae strain M1C137_2 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 799817..811346
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP94_RS03955 (INP94_03955) | - | 800547..801059 (+) | 513 | WP_197544091.1 | PilN domain-containing protein | - |
| INP94_RS03960 (INP94_03960) | - | 801056..801598 (+) | 543 | WP_197544092.1 | competence protein ComC | - |
| INP94_RS03965 (INP94_03965) | - | 801598..801981 (+) | 384 | WP_197544093.1 | pilus assembly protein PilP | - |
| INP94_RS03970 (INP94_03970) | comE | 801983..803365 (+) | 1383 | WP_197544094.1 | type IV pilus secretin PilQ | Machinery gene |
| INP94_RS03975 (INP94_03975) | - | 803385..804074 (+) | 690 | WP_197544095.1 | ComF family protein | - |
| INP94_RS03980 (INP94_03980) | nfuA | 804188..804772 (+) | 585 | WP_005694707.1 | Fe-S biogenesis protein NfuA | - |
| INP94_RS03985 (INP94_03985) | comM | 804817..806346 (-) | 1530 | WP_197544096.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP94_RS03990 (INP94_03990) | yihA | 806475..807089 (-) | 615 | WP_032822378.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP94_RS03995 (INP94_03995) | - | 807203..808072 (+) | 870 | WP_197544097.1 | VirK/YbjX family protein | - |
| INP94_RS04000 (INP94_04000) | oppF | 808112..809110 (-) | 999 | WP_197544098.1 | murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF | - |
| INP94_RS04005 (INP94_04005) | - | 809107..810078 (-) | 972 | WP_197544099.1 | ABC transporter ATP-binding protein | - |
| INP94_RS04010 (INP94_04010) | oppC | 810088..811023 (-) | 936 | WP_070868471.1 | oligopeptide ABC transporter permease OppC | - |
Sequence
Protein
Download Length: 509 a.a. Molecular weight: 55665.06 Da Isoelectric Point: 9.6621
>NTDB_id=493080 INP94_RS03985 WP_197544096.1 804817..806346(-) (comM) [Haemophilus parainfluenzae strain M1C137_2]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSHLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFQNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLARAGKINAHLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSHLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFQNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLARAGKINAHLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
Nucleotide
Download Length: 1530 bp
>NTDB_id=493080 INP94_RS03985 WP_197544096.1 804817..806346(-) (comM) [Haemophilus parainfluenzae strain M1C137_2]
ATGTCTCTTGCCATTGTTTATAGCCGTGCATCAATGGGGGTACAAGCCCCTCTTGTTACCATTGAGGTACACTTAAGTAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACCGTAAAAGAAGCGCAAGATCGCGTCAGAAGTGCATTAA
TGAATGCGCAGTTCAAATATCCTGCCAAACGCATCACCGTAAATCTTGCACCGGCTGATTTACCCAAAGAAGGGGGACGA
TTTGATTTACCCATCGCGATAGGTATCTTGGCTGCATCAGATCAATTAGATGGTAGTCATTTAAAACAATTTGAATTCGT
TGGTGAACTCGCATTAACGGGTGAATTACGTGGCGTACACGGTGTCATTCCTGCCATTTTAGCGGCACAAAAAGCCAAAC
GTGCCCCCATTATCGCTTATCAAAATGCCAATGAGGCATCGCTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAATCTG
CTAGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCTTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTGGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCACTAACAATTGCGGCTGCAG
GTCAGCATAATCTGCTCTTTTTAGGACCTCCTGGTACAGGTAAAACCATGTTGGCAAGTCGTCTCACTGCATTACTCCCA
GAAATGACTGATTTAGAAGCCATTGAAACCGCTTCGGTCACAAGCTTAGTACAAAATGAATTAAATTTTCAGAACTGGAA
ACAACGCCCATTTCGAGCGCCACATCATAGTGCATCTTTGCCTGCATTAGTGGGTGGAGGAACCATTCCAAAACCTGGCG
AAATATCGCTCGCACATAACGGTGTACTTTTTCTTGATGAGCTACCTGAGTTTGAGCGAAAAGTACTGGATGCACTTCGC
CAGCCATTAGAAAGTGGAGAAATTATTATTTCGCGTGCTAATGCGAAAATCCAATTTCCTGCTCGCTTTCAACTCGTTGC
GGCGATGAATCCAAGTCCAACGGGGCATTATACCGGAACACATAATCGTACCTCGCCACAACAAGTGATGCGTTATTTAA
ATCGCCTTTCAGGACCATTTTTAGATCGCTTTGATCTATCCATTGAAGTGCCTCTTCTGCCACAAGGTAGTTTACAAAAT
ACGGGAGATAGAGGCGAAACAAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGGGAGATTCAACTTGCTCGAGCCGG
TAAAATCAATGCGCATCTTTCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTTCTGGAGA
ATGCTTTAAATAAACTCGGGCTTTCTGTTAGAGCCTACCATCGGATTTTAAAAGTCTCACGTACAATTGCCGATTTAAAT
GGTGAAAAAGAAATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGGGCGATGGATCGGTTGTTACAAAAATTATC
TGCTGCTTAA
ATGTCTCTTGCCATTGTTTATAGCCGTGCATCAATGGGGGTACAAGCCCCTCTTGTTACCATTGAGGTACACTTAAGTAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACCGTAAAAGAAGCGCAAGATCGCGTCAGAAGTGCATTAA
TGAATGCGCAGTTCAAATATCCTGCCAAACGCATCACCGTAAATCTTGCACCGGCTGATTTACCCAAAGAAGGGGGACGA
TTTGATTTACCCATCGCGATAGGTATCTTGGCTGCATCAGATCAATTAGATGGTAGTCATTTAAAACAATTTGAATTCGT
TGGTGAACTCGCATTAACGGGTGAATTACGTGGCGTACACGGTGTCATTCCTGCCATTTTAGCGGCACAAAAAGCCAAAC
GTGCCCCCATTATCGCTTATCAAAATGCCAATGAGGCATCGCTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAATCTG
CTAGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCTTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTGGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCACTAACAATTGCGGCTGCAG
GTCAGCATAATCTGCTCTTTTTAGGACCTCCTGGTACAGGTAAAACCATGTTGGCAAGTCGTCTCACTGCATTACTCCCA
GAAATGACTGATTTAGAAGCCATTGAAACCGCTTCGGTCACAAGCTTAGTACAAAATGAATTAAATTTTCAGAACTGGAA
ACAACGCCCATTTCGAGCGCCACATCATAGTGCATCTTTGCCTGCATTAGTGGGTGGAGGAACCATTCCAAAACCTGGCG
AAATATCGCTCGCACATAACGGTGTACTTTTTCTTGATGAGCTACCTGAGTTTGAGCGAAAAGTACTGGATGCACTTCGC
CAGCCATTAGAAAGTGGAGAAATTATTATTTCGCGTGCTAATGCGAAAATCCAATTTCCTGCTCGCTTTCAACTCGTTGC
GGCGATGAATCCAAGTCCAACGGGGCATTATACCGGAACACATAATCGTACCTCGCCACAACAAGTGATGCGTTATTTAA
ATCGCCTTTCAGGACCATTTTTAGATCGCTTTGATCTATCCATTGAAGTGCCTCTTCTGCCACAAGGTAGTTTACAAAAT
ACGGGAGATAGAGGCGAAACAAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGGGAGATTCAACTTGCTCGAGCCGG
TAAAATCAATGCGCATCTTTCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTTCTGGAGA
ATGCTTTAAATAAACTCGGGCTTTCTGTTAGAGCCTACCATCGGATTTTAAAAGTCTCACGTACAATTGCCGATTTAAAT
GGTGAAAAAGAAATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGGGCGATGGATCGGTTGTTACAAAAATTATC
TGCTGCTTAA
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
92.308 |
99.607 |
0.919 |
| comM | Glaesserella parasuis strain SC1401 |
80.079 |
99.607 |
0.798 |
| comM | Vibrio cholerae strain A1552 |
66.929 |
99.804 |
0.668 |
| comM | Vibrio campbellii strain DS40M4 |
65.815 |
100 |
0.658 |
| comM | Legionella pneumophila str. Paris |
52.183 |
99.018 |
0.517 |
| comM | Legionella pneumophila strain ERS1305867 |
52.183 |
99.018 |
0.517 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
47.379 |
100 |
0.479 |