Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INP98_RS01465 | Genome accession | NZ_CP063112 |
| Coordinates | 302070..303599 (+) | Length | 509 a.a. |
| NCBI ID | WP_197570154.1 | Uniprot ID | - |
| Organism | Haemophilus parainfluenzae strain M1C149_1 | ||
| Function | required for natural transformation (predicted from homology); unclear |
Genomic Context
Location: 297070..308599
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INP98_RS01440 (INP98_01440) | oppC | 297392..298327 (+) | 936 | WP_197570151.1 | oligopeptide ABC transporter permease OppC | - |
| INP98_RS01445 (INP98_01445) | - | 298337..299308 (+) | 972 | WP_070868468.1 | ABC transporter ATP-binding protein | - |
| INP98_RS01450 (INP98_01450) | oppF | 299305..300303 (+) | 999 | WP_197570152.1 | murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF | - |
| INP98_RS01455 (INP98_01455) | - | 300343..301212 (-) | 870 | WP_197570153.1 | VirK/YbjX family protein | - |
| INP98_RS01460 (INP98_01460) | yihA | 301326..301940 (+) | 615 | WP_032822378.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| INP98_RS01465 (INP98_01465) | comM | 302070..303599 (+) | 1530 | WP_197570154.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INP98_RS01470 (INP98_01470) | nfuA | 303643..304227 (-) | 585 | WP_197545166.1 | Fe-S biogenesis protein NfuA | - |
| INP98_RS01475 (INP98_01475) | - | 304342..305031 (-) | 690 | WP_197570155.1 | ComF family protein | - |
| INP98_RS01480 (INP98_01480) | comE | 305051..306433 (-) | 1383 | WP_049369641.1 | type IV pilus secretin PilQ | Machinery gene |
| INP98_RS01485 (INP98_01485) | - | 306435..306818 (-) | 384 | WP_049369639.1 | pilus assembly protein PilP | - |
| INP98_RS01490 (INP98_01490) | - | 306818..307360 (-) | 543 | WP_049369637.1 | hypothetical protein | - |
| INP98_RS01495 (INP98_01495) | - | 307357..307869 (-) | 513 | WP_197570156.1 | PilN domain-containing protein | - |
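The coordinate arithmetic behind the tables above can be checked with a short sketch. It assumes coordinates are 1-based and inclusive, which is consistent with the listed sizes (for example, 302070..303599 gives 1530 bp, and 1530 bp less one stop codon gives 509 a.a.). The `flank` default of 5,000 bp is an inference from the numbers on this page (the Location range equals the comM coordinates extended by 5,000 bp on each side), not a documented database setting, and all function names are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based interval start..end."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS of cds_bp bases, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Interval extended by `flank` bp on each side (assumed, see lead-in)."""
    return max(1, start - flank), end + flank


COMM = (302070, 303599)  # comM coordinates from the Overview table

print(span_length(*COMM))                   # 1530
print(protein_length(span_length(*COMM)))   # 509
print(context_window(*COMM))                # (297070, 308599)
```

The same `span_length` call reproduces the Size (bp) column for the neighbouring genes, for example 297392..298327 for oppC gives 936.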
Sequence
Protein
Length: 509 a.a.; Molecular weight: 55733.14 Da; Isoelectric point: 9.7524
>NTDB_id=492940 INP98_RS01465 WP_197570154.1 302070..303599(+) (comM) [Haemophilus parainfluenzae strain M1C149_1]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRVTVNLAPADLPKEGGR
FDLPIAIGILAASDQLDGSRLKQFEFVGELALTGELRGVHGVIPAILAAQRAKRAPIIAYQNANEASLVSEQETYFAKNL
LEVVQFLNNQEKLPLASLLAQESAVGFSAKNHLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTALLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASLPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSQQVREKVLKVREIQLARAGKINAYLSSKEIERDCKLQDKDALFLENALNKLGLSVRAYHRILKVSRTIADLN
GEKEIQQPHLAEALGYRAMDRLLQKLSAA
Nucleotide
Length: 1530 bp
>NTDB_id=492940 INP98_RS01465 WP_197570154.1 302070..303599(+) (comM) [Haemophilus parainfluenzae strain M1C149_1]
ATGTCTCTTGCTATTGTTTACAGCCGTGCATCAATGGGTGTACAAGCCCCTCTTGTTACGATTGAAGTACATTTAAGTAA
CGGGAAGCCTGGATTTACCTTAGTTGGTCTACCCGAAAAAACCGTAAAAGAAGCGCAAGATCGTGTCAGAAGTGCGTTAA
TGAATGCCCAGTTCAAATATCCTGCTAAACGCGTCACCGTCAACCTTGCGCCAGCTGATTTACCCAAAGAAGGAGGGAGA
TTTGATTTACCTATTGCAATCGGTATTTTAGCTGCATCGGATCAACTCGACGGTAGCCGTTTAAAACAATTTGAATTCGT
TGGCGAACTCGCATTAACAGGTGAATTACGTGGTGTACATGGCGTGATCCCCGCCATTTTAGCGGCACAAAGAGCGAAAC
GAGCCCCCATTATCGCTTATCAAAATGCTAATGAGGCGTCACTTGTCTCAGAGCAAGAGACTTATTTTGCTAAAAACCTT
TTGGAAGTCGTGCAATTCTTAAATAATCAAGAAAAATTACCTTTGGCTTCATTACTTGCACAAGAAAGTGCGGTCGGATT
TTCAGCTAAAAATCACTTAGATTTGACGGATATTATTGGTCAGCAACATGCTAAACGAGCCCTAACAATTGCGGCTGCAG
GTCAGCATAATCTGCTCTTTTTAGGACCTCCTGGTACAGGTAAAACCATGTTGGCAAGTCGTCTCACTGCATTACTCCCA
GAAATGACTGATTTAGAAGCCATTGAAACCGCTTCGGTTACGAGTTTAGTACAAAATGAATTAAATTTTCATAACTGGAA
ACAACGACCATTCCGAGCACCACATCATAGTGCCTCATTACCTGCCTTAGTGGGGGGAGGAACCATTCCCAAACCAGGCG
AAATATCGCTCGCACATAACGGCGTACTTTTTCTTGATGAACTACCTGAGTTTGAGCGAAAAGTACTGGATGCACTTCGT
CAGCCCTTAGAAAGTGGAGAAATTATTATTTCGCGTGCCAATGCGAAAATCCAATTCCCCGCTCGCTTTCAACTCGTTGC
CGCGATGAATCCAAGTCCAACCGGACATTATACAGGAACACATAATCGCACCTCGCCGCAGCAAGTGATGCGTTATTTAA
ATCGCCTTTCGGGGCCATTTTTAGATCGCTTTGATTTATCCATTGAAGTCCCCCTTCTGCCGCAAGGTAGCCTACAGAAT
ACGGGTGATAGAGGGGAAACAAGCCAACAAGTACGAGAAAAAGTGTTAAAAGTGAGAGAAATTCAACTTGCTCGAGCCGG
TAAAATCAATGCTTATCTTTCAAGCAAAGAAATTGAACGAGATTGTAAATTACAAGATAAAGATGCATTATTTCTGGAGA
ATGCTTTAAATAAACTCGGGCTTTCTGTTAGAGCCTACCATCGGATTTTAAAAGTCTCACGCACAATTGCCGATTTAAAT
GGCGAAAAAGAAATACAACAACCTCATTTAGCAGAAGCGCTAGGATATCGGGCGATGGATCGGTTGTTACAGAAATTATC
TGCTGCTTAA
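As a consistency check between the nucleotide and protein records, the coding sequence can be translated and compared with the protein FASTA. The following is a minimal sketch that builds the standard codon table by hand (the bacterial table 11 assigns the same amino acids to codons) and ignores ambiguous bases; `translate` and `CODON_TABLE` are hypothetical names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace and line breaks allowed) to protein.

    A single trailing stop codon is dropped from the result.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 24 and last 15 bases of the comM nucleotide record above.
print(translate("ATGTCTCTTGCTATTGTTTACAGC"))  # MSLAIVYS
print(translate("TTATCTGCTGCTTAA"))           # LSAA
```

Both fragments agree with the start (MSLAIVYS...) and end (...LSAA) of the protein sequence listed in this record. Passing the full 1530 bp sequence should return a 509-residue string if the two records are consistent.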
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 | 92.11 | 99.607 | 0.917 |
| comM | Glaesserella parasuis strain SC1401 | 79.487 | 99.607 | 0.792 |
| comM | Vibrio cholerae strain A1552 | 66.929 | 99.804 | 0.668 |
| comM | Vibrio campbellii strain DS40M4 | 65.619 | 100 | 0.656 |
| comM | Legionella pneumophila str. Paris | 51.786 | 99.018 | 0.513 |
| comM | Legionella pneumophila strain ERS1305867 | 51.786 | 99.018 | 0.513 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 47.184 | 100 | 0.477 |