Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INP99_RS03735 | Genome accession | NZ_CP063111 |
| Coordinates | 763877..765406 (-) | Length | 509 a.a. |
| NCBI ID | WP_197542727.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C152_1 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 758877..770406
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP99_RS03705 (INP99_03700) | - | 759606..760118 (+) | 513 | WP_197542723.1 | competence protein ComB | - |
| INP99_RS03710 (INP99_03705) | - | 760115..760657 (+) | 543 | WP_197542724.1 | competence protein ComC | - |
| INP99_RS03715 (INP99_03710) | - | 760657..761040 (+) | 384 | WP_070582959.1 | pilus assembly protein PilP | - |
| INP99_RS03720 (INP99_03715) | comE | 761042..762424 (+) | 1383 | WP_197542725.1 | type IV pilus secretin PilQ | Machinery gene |
| INP99_RS03725 (INP99_03720) | - | 762444..763133 (+) | 690 | WP_197542726.1 | ComF family protein | - |
| INP99_RS03730 (INP99_03725) | nfuA | 763249..763833 (+) | 585 | WP_014064659.1 | Fe-S biogenesis protein NfuA | - |
| INP99_RS03735 (INP99_03730) | comM | 763877..765406 (-) | 1530 | WP_197542727.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP99_RS03740 (INP99_03735) | yihA | 765536..766150 (-) | 615 | WP_065243638.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP99_RS03745 (INP99_03740) | - | 766264..767133 (+) | 870 | WP_197542728.1 | VirK/YbjX family protein | - |
| INP99_RS03750 (INP99_03745) | oppF | 767173..768171 (-) | 999 | WP_197542729.1 | murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF | - |
| INP99_RS03755 (INP99_03750) | - | 768168..769139 (-) | 972 | WP_049373848.1 | ABC transporter ATP-binding protein | - |
| INP99_RS03760 (INP99_03755) | oppC | 769149..770084 (-) | 936 | WP_049373849.1 | oligopeptide ABC transporter permease OppC | - |
Sequence
Protein
Download Length: 509 a.a. Molecular weight: 55753.13 Da Isoelectric Point: 9.6293
>NTDB_id=492922 INP99_RS03735 WP_197542727.1 763877..765406(-) (comM) [Haemophilus parainfluenzae strain M1C152_1]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLDRAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQSHLAEALGYRAMDRLLQKLSAA
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQKAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLDRAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQSHLAEALGYRAMDRLLQKLSAA
Nucleotide
Download Length: 1530 bp
>NTDB_id=492922 INP99_RS03735 WP_197542727.1 763877..765406(-) (comM) [Haemophilus parainfluenzae strain M1C152_1]
ATGTCTCTTGCTATTGTTTACAGCCGTGCATCAATGGGTGTACAAGCCCCTCTTGTTACCATTGAGGTACACTTAAGCAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACCGTAAAAGAAGCACAAGATCGTGTCAGAAGTGCATTAA
TGAATGCCCAGTTCAAATATCCTGCTAAACGCATCACCGTCAACCTTGCACCGGCCGATTTACCCAAAGAAGGGGGTCGA
TTTGATTTACCTATTGCAATCGGTATTTTAGCCGCATCAGATCAACTTGATGGCAGCCGTTTAAAACAGTTTGAATTCGT
TGGTGAACTTGCATTAACAGGTGAATTACGTGGTGTACACGGCGTGATTCCCGCCATTTTAGCAGCGCAAAAAGCAAAAC
GAGCCCCCATTATTGCTTATCAAAATGCTAATGAAGCTTCACTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAACCTT
TTGGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCCTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTAGATTTGACGGATATTATTGGTCAGCAACACGCCAAACGAGCTCTGACGATTGCAGCCGCAG
GCCAGCACAACCTACTCTTTTTAGGTCCTCCCGGTACAGGTAAAACCATGTTGGCAAGTCGTCTCACTGCATTGCTCCCT
GAAATGACAGATTTAGAAGCCATTGAAACCGCTTCAGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGTCCGTTTCGAGCGCCACATCATAGTGCCTCATTACCTGCCTTAGTAGGTGGAGGAACCATTCCAAAACCAGGCG
AAATATCGCTCGCACATAACGGCGTACTTTTTCTTGATGAATTGCCTGAGTTTGAGCGTAAAGTACTAGATGCACTTCGC
CAGCCATTAGAAAGTGGAGAAATTATTATTTCGCGTGCCAATGCGAAAATCCAATTCCCCGCTCGCTTTCAACTCGTTGC
TGCGATGAATCCAAGTCCAACGGGTCATTATACTGGAACACATAACCGAACCTCACCACAACAAGTGATGCGTTATTTAA
ATCGCCTTTCGGGGCCGTTTTTAGATCGCTTTGATTTATCCATTGAAGTACCTCTTCTGCCACAAGGTAGCTTACAGAAT
ACAGGAGATAGAGGTGAGACAAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGGGAAATTCAACTTGATCGAGCCGG
TAAAATCAATGCATATCTTTCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTCCTTGAGA
ATGCACTCAATAAGCTTGGACTTTCAGTAAGAGCCTATCATCGGATTTTAAAAGTTTCACGCACAATTGCTGATTTAAAT
GGCGAAAAAGAAATACAACAATCTCATTTAGCCGAAGCGCTAGGATATCGGGCCATGGATCGATTATTACAGAAATTATC
TGCTGCTTAA
ATGTCTCTTGCTATTGTTTACAGCCGTGCATCAATGGGTGTACAAGCCCCTCTTGTTACCATTGAGGTACACTTAAGCAA
CGGAAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACCGTAAAAGAAGCACAAGATCGTGTCAGAAGTGCATTAA
TGAATGCCCAGTTCAAATATCCTGCTAAACGCATCACCGTCAACCTTGCACCGGCCGATTTACCCAAAGAAGGGGGTCGA
TTTGATTTACCTATTGCAATCGGTATTTTAGCCGCATCAGATCAACTTGATGGCAGCCGTTTAAAACAGTTTGAATTCGT
TGGTGAACTTGCATTAACAGGTGAATTACGTGGTGTACACGGCGTGATTCCCGCCATTTTAGCAGCGCAAAAAGCAAAAC
GAGCCCCCATTATTGCTTATCAAAATGCTAATGAAGCTTCACTTGTCTCAGAGCAAGAGACTTATTTCGCTAAAAACCTT
TTGGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCCTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTAGATTTGACGGATATTATTGGTCAGCAACACGCCAAACGAGCTCTGACGATTGCAGCCGCAG
GCCAGCACAACCTACTCTTTTTAGGTCCTCCCGGTACAGGTAAAACCATGTTGGCAAGTCGTCTCACTGCATTGCTCCCT
GAAATGACAGATTTAGAAGCCATTGAAACCGCTTCAGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGTCCGTTTCGAGCGCCACATCATAGTGCCTCATTACCTGCCTTAGTAGGTGGAGGAACCATTCCAAAACCAGGCG
AAATATCGCTCGCACATAACGGCGTACTTTTTCTTGATGAATTGCCTGAGTTTGAGCGTAAAGTACTAGATGCACTTCGC
CAGCCATTAGAAAGTGGAGAAATTATTATTTCGCGTGCCAATGCGAAAATCCAATTCCCCGCTCGCTTTCAACTCGTTGC
TGCGATGAATCCAAGTCCAACGGGTCATTATACTGGAACACATAACCGAACCTCACCACAACAAGTGATGCGTTATTTAA
ATCGCCTTTCGGGGCCGTTTTTAGATCGCTTTGATTTATCCATTGAAGTACCTCTTCTGCCACAAGGTAGCTTACAGAAT
ACAGGAGATAGAGGTGAGACAAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGGGAAATTCAACTTGATCGAGCCGG
TAAAATCAATGCATATCTTTCAAGCAAAGAAATTGAACGAGACTGTAAATTACAAGATAAAGATGCATTATTCCTTGAGA
ATGCACTCAATAAGCTTGGACTTTCAGTAAGAGCCTATCATCGGATTTTAAAAGTTTCACGCACAATTGCTGATTTAAAT
GGCGAAAAAGAAATACAACAATCTCATTTAGCCGAAGCGCTAGGATATCGGGCCATGGATCGATTATTACAGAAATTATC
TGCTGCTTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
92.308 |
99.607 |
0.919 |
| comM | Glaesserella parasuis strain SC1401 |
79.684 |
99.607 |
0.794 |
| comM | Vibrio cholerae strain A1552 |
66.732 |
99.804 |
0.666 |
| comM | Vibrio campbellii strain DS40M4 |
65.815 |
100 |
0.658 |
| comM | Legionella pneumophila str. Paris |
51.984 |
99.018 |
0.515 |
| comM | Legionella pneumophila strain ERS1305867 |
51.984 |
99.018 |
0.515 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
47.379 |
100 |
0.479 |