Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INP92_RS03580 | Genome accession | NZ_CP063122 |
| Coordinates | 729566..731095 (-) | Length | 509 a.a. |
| NCBI ID | WP_111387963.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C125_4 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 724566..736095
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP92_RS03550 (INP92_03550) | - | 725292..725807 (+) | 516 | WP_232088003.1 | PilN domain-containing protein | - |
| INP92_RS03555 (INP92_03555) | - | 725804..726346 (+) | 543 | WP_111387971.1 | competence protein ComC | - |
| INP92_RS03560 (INP92_03560) | - | 726346..726729 (+) | 384 | WP_111387969.1 | pilus assembly protein PilP | - |
| INP92_RS03565 (INP92_03565) | comE | 726731..728113 (+) | 1383 | WP_111387967.1 | type IV pilus secretin PilQ | Machinery gene |
| INP92_RS03570 (INP92_03570) | - | 728133..728822 (+) | 690 | WP_111387965.1 | ComF family protein | - |
| INP92_RS03575 (INP92_03575) | nfuA | 728938..729522 (+) | 585 | WP_005694707.1 | Fe-S biogenesis protein NfuA | - |
| INP92_RS03580 (INP92_03580) | comM | 729566..731095 (-) | 1530 | WP_111387963.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP92_RS03585 (INP92_03585) | yihA | 731225..731839 (-) | 615 | WP_197554855.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP92_RS03590 (INP92_03590) | - | 731953..732822 (+) | 870 | WP_197554857.1 | VirK/YbjX family protein | - |
| INP92_RS03595 (INP92_03595) | oppF | 732862..733860 (-) | 999 | WP_197554859.1 | murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF | - |
| INP92_RS03600 (INP92_03600) | - | 733857..734828 (-) | 972 | WP_197554861.1 | ABC transporter ATP-binding protein | - |
| INP92_RS03605 (INP92_03605) | oppC | 734838..735773 (-) | 936 | WP_197554863.1 | oligopeptide ABC transporter permease OppC | - |
Sequence
Protein
Download Length: 509 a.a. Molecular weight: 55777.23 Da Isoelectric Point: 9.7439
>NTDB_id=493143 INP92_RS03580 WP_111387963.1 729566..731095(-) (comM) [Haemophilus parainfluenzae strain M1C125_4]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAIGFSTKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASITSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLARAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAIGFSTKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASITSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLARAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
Nucleotide
Download Length: 1530 bp
>NTDB_id=493143 INP92_RS03580 WP_111387963.1 729566..731095(-) (comM) [Haemophilus parainfluenzae strain M1C125_4]
ATGTCTCTTGCTATTGTTTACAGCCGAGCCTCAATGGGCGTACAAGCCCCTCTTGTTACTATTGAGGTACACTTAAGTAA
CGGAAAGCCTGGATTTACCTTAGTAGGCCTCCCCGAAAAAACCGTAAAAGAAGCGCAAGATCGTGTCAGAAGTGCATTAA
TGAATGCGCAGTTCAAATATCCTGCCAAACGCATCACCGTAAATCTTGCACCGGCTGATTTACCCAAAGAAGGGGGACGA
TTTGATTTACCCATTGCGATAGGTATCTTGGCTGCATCAGATCAATTAGACGGTAGCCGTTTAAAACAATTTGAATTCGT
TGGCGAGCTCGCCTTGACGGGGGAATTACGTGGCGTACACGGTGTCATTCCTGCCATTTTAGCGGCACAAAAAGCCAAAC
GTGCCCCCATTATCGCTTATCAAAATGCCAATGAAGCATCGCTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAATCTG
CTAGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTAGCTTCATTACTTGCACAAGAAAGTGCGATCGGATT
TTCAACTAAAAATCACTTGGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCCCTAACAATAGCTGCTGCAG
GCCAACACAACCTACTCTTTTTAGGGCCACCGGGGACGGGTAAAACCATGTTGGCAAGTCGTCTCACAGCATTGCTCCCT
GAAATGACCGATTTAGAAGCGATTGAAACGGCTTCAATTACGAGCTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGACCATTCCGAGCACCACATCATAGTGCCTCATTACCAGCCTTAGTGGGTGGAGGAACCATTCCCAAACCAGGCG
AAATATCGCTCGCACATAACGGCGTACTTTTTCTTGATGAATTGCCTGAGTTTGAGCGTAAAGTGCTTGATGCACTTCGC
CAGCCATTAGAAAGTGGAGAAATTATTATTTCACGTGCCAATGCAAAAATACAGTTCCCCGCTCGCTTTCAACTCGTTGC
GGCAATGAATCCAAGTCCAACCGGACATTATACCGGAACACATAACCGCACCTCACCACAACAAGTAATGCGTTATTTAA
ATCGCCTTTCAGGGCCATTTTTAGATCGTTTTGATTTATCTATTGAAGTCCCTCTTCTGCCACAAGGTAGCTTACAGAAT
ACGGGGGATAGAGGCGAAACCAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGAGAAATTCAACTTGCTCGAGCCGG
TAAAATCAATGCGTATCTTTCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTCCTTGAGA
ATGCACTGAATAAGCTAGGACTTTCGGTGAGAGCCTATCATCGGATTTTAAAAGTCTCACGTACAATTGCCGATTTGAAT
GGCGAAAAAGAGATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGAGCGATGGATCGGTTGTTACAGAAATTATC
TGCTGCTTAA
ATGTCTCTTGCTATTGTTTACAGCCGAGCCTCAATGGGCGTACAAGCCCCTCTTGTTACTATTGAGGTACACTTAAGTAA
CGGAAAGCCTGGATTTACCTTAGTAGGCCTCCCCGAAAAAACCGTAAAAGAAGCGCAAGATCGTGTCAGAAGTGCATTAA
TGAATGCGCAGTTCAAATATCCTGCCAAACGCATCACCGTAAATCTTGCACCGGCTGATTTACCCAAAGAAGGGGGACGA
TTTGATTTACCCATTGCGATAGGTATCTTGGCTGCATCAGATCAATTAGACGGTAGCCGTTTAAAACAATTTGAATTCGT
TGGCGAGCTCGCCTTGACGGGGGAATTACGTGGCGTACACGGTGTCATTCCTGCCATTTTAGCGGCACAAAAAGCCAAAC
GTGCCCCCATTATCGCTTATCAAAATGCCAATGAAGCATCGCTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAATCTG
CTAGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTAGCTTCATTACTTGCACAAGAAAGTGCGATCGGATT
TTCAACTAAAAATCACTTGGATTTGACCGATATTATTGGTCAGCAACATGCTAAACGAGCCCTAACAATAGCTGCTGCAG
GCCAACACAACCTACTCTTTTTAGGGCCACCGGGGACGGGTAAAACCATGTTGGCAAGTCGTCTCACAGCATTGCTCCCT
GAAATGACCGATTTAGAAGCGATTGAAACGGCTTCAATTACGAGCTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGACCATTCCGAGCACCACATCATAGTGCCTCATTACCAGCCTTAGTGGGTGGAGGAACCATTCCCAAACCAGGCG
AAATATCGCTCGCACATAACGGCGTACTTTTTCTTGATGAATTGCCTGAGTTTGAGCGTAAAGTGCTTGATGCACTTCGC
CAGCCATTAGAAAGTGGAGAAATTATTATTTCACGTGCCAATGCAAAAATACAGTTCCCCGCTCGCTTTCAACTCGTTGC
GGCAATGAATCCAAGTCCAACCGGACATTATACCGGAACACATAACCGCACCTCACCACAACAAGTAATGCGTTATTTAA
ATCGCCTTTCAGGGCCATTTTTAGATCGTTTTGATTTATCTATTGAAGTCCCTCTTCTGCCACAAGGTAGCTTACAGAAT
ACGGGGGATAGAGGCGAAACCAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGAGAAATTCAACTTGCTCGAGCCGG
TAAAATCAATGCGTATCTTTCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTCCTTGAGA
ATGCACTGAATAAGCTAGGACTTTCGGTGAGAGCCTATCATCGGATTTTAAAAGTCTCACGTACAATTGCCGATTTGAAT
GGCGAAAAAGAGATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGAGCGATGGATCGGTTGTTACAGAAATTATC
TGCTGCTTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
92.11 |
99.607 |
0.917 |
| comM | Glaesserella parasuis strain SC1401 |
79.684 |
99.607 |
0.794 |
| comM | Vibrio cholerae strain A1552 |
67.323 |
99.804 |
0.672 |
| comM | Vibrio campbellii strain DS40M4 |
65.815 |
100 |
0.658 |
| comM | Legionella pneumophila str. Paris |
52.381 |
99.018 |
0.519 |
| comM | Legionella pneumophila strain ERS1305867 |
52.381 |
99.018 |
0.519 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
47.573 |
100 |
0.481 |