Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | A4214_RS00175 | Genome accession | NZ_CP015564 |
| Coordinates | 35080..36609 (-) | Length | 509 a.a. |
| NCBI ID | WP_015702557.1 | Uniprot ID | - |
| Organism | Pasteurella multocida strain USDA-ARS-USMARC-60713 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 30080..41609
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| A4214_RS00160 (A4214_00160) | - | 30360..31202 (+) | 843 | WP_015702556.1 | divergent polysaccharide deacetylase family protein | - |
| A4214_RS00165 (A4214_00165) | - | 31223..31909 (+) | 687 | WP_005757690.1 | hypothetical protein | - |
| A4214_RS11060 (A4214_00170) | - | 32016..33581 (-) | 1566 | WP_016532840.1 | TonB-dependent receptor domain-containing protein | - |
| A4214_RS11065 (A4214_00175) | - | 33642..34577 (-) | 936 | WP_041423264.1 | TonB-dependent receptor plug domain-containing protein | - |
| A4214_RS00175 (A4214_00180) | comM | 35080..36609 (-) | 1530 | WP_015702557.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| A4214_RS00180 (A4214_00185) | yihA | 36717..37334 (-) | 618 | WP_005718080.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| A4214_RS00185 (A4214_00190) | - | 37445..38314 (+) | 870 | WP_014668443.1 | VirK/YbjX family protein | - |
| A4214_RS00190 (A4214_00195) | trmL | 38388..38864 (-) | 477 | WP_005755093.1 | tRNA (uridine(34)/cytosine(34)/5- carboxymethylaminomethyluridine(34)-2'-O)- methyltransferase TrmL | - |
| A4214_RS11070 | - | 38958..39215 (-) | 258 | WP_230294056.1 | hypothetical protein | - |
| A4214_RS11075 | - | 39196..39390 (-) | 195 | WP_233791405.1 | hypothetical protein | - |
| A4214_RS11080 | - | 39483..39962 (-) | 480 | WP_233791406.1 | hypothetical protein | - |
| A4214_RS00200 (A4214_00205) | - | 39987..41507 (-) | 1521 | WP_016532830.1 | surface lipoprotein assembly modifier | - |
Sequence
Protein
Download Length: 509 a.a. Molecular weight: 55907.32 Da Isoelectric Point: 9.4036
>NTDB_id=180839 A4214_RS00175 WP_015702557.1 35080..36609(-) (comM) [Pasteurella multocida strain USDA-ARS-USMARC-60713]
MSLAIVYSRASMGVQAPLVTIEVHFSNGKPQFNLVGLPEKTVKEAQDRVRSALLNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGMLAASGQIDPQKLQQFEFVGELALTGDLRAVHGVIPAILAAQKAKRKLIIASQNANEASLVSQQDTYFAQHL
LEVVNFLNDHSTLPLASDLPTQHDPALLATTQKDLTDIIGQSHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTTLLP
EMNDQETIETAAVTSLIHHELNFHNWKQRPFRAPHHSASTPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLIAAMNPSPTGHYQGTHNRTSPQQILRYLNRLSGPFLDRFDLSIEVPLLPKGALQQ
QTDRGESSAQVREKVLKVRDIQLARAGKINAYLTSKEIERDCKISDQDALFLENALAKLGLSVRAYHRILKVARTIADLN
NECQIQQCHLAEALGYRAMDRLLQKLSAI
MSLAIVYSRASMGVQAPLVTIEVHFSNGKPQFNLVGLPEKTVKEAQDRVRSALLNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGMLAASGQIDPQKLQQFEFVGELALTGDLRAVHGVIPAILAAQKAKRKLIIASQNANEASLVSQQDTYFAQHL
LEVVNFLNDHSTLPLASDLPTQHDPALLATTQKDLTDIIGQSHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTTLLP
EMNDQETIETAAVTSLIHHELNFHNWKQRPFRAPHHSASTPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLIAAMNPSPTGHYQGTHNRTSPQQILRYLNRLSGPFLDRFDLSIEVPLLPKGALQQ
QTDRGESSAQVREKVLKVRDIQLARAGKINAYLTSKEIERDCKISDQDALFLENALAKLGLSVRAYHRILKVARTIADLN
NECQIQQCHLAEALGYRAMDRLLQKLSAI
Nucleotide
Download Length: 1530 bp
>NTDB_id=180839 A4214_RS00175 WP_015702557.1 35080..36609(-) (comM) [Pasteurella multocida strain USDA-ARS-USMARC-60713]
ATGTCACTTGCAATTGTTTATAGTCGCGCCTCAATGGGCGTACAAGCCCCTTTAGTCACCATTGAGGTGCATTTCAGTAA
TGGTAAACCGCAATTCAACTTAGTCGGCTTACCCGAAAAAACCGTCAAAGAAGCACAAGATCGCGTACGCAGCGCCTTGC
TCAATGCTCAATTTAAATACCCCGCTAAACGGATCACGGTCAATCTCGCCCCCGCCGATTTACCCAAAGAAGGCGGACGT
TTTGATTTACCGATCGCCATTGGTATGCTCGCCGCTTCTGGTCAAATTGATCCACAAAAGTTACAACAATTTGAATTTGT
GGGTGAGTTAGCGCTAACCGGTGATTTACGTGCAGTACATGGCGTCATCCCCGCGATCCTTGCTGCACAAAAAGCCAAGC
GGAAATTGATTATCGCCAGTCAAAATGCCAATGAAGCCTCGTTAGTTTCCCAGCAAGACACCTATTTTGCGCAACATTTA
TTAGAAGTCGTCAATTTCCTCAATGATCATTCCACGTTACCTTTAGCCTCAGACCTGCCTACACAACATGATCCAGCCCT
TTTAGCAACAACACAAAAAGATTTAACCGATATTATCGGACAATCCCATGCTAAACGCGCTTTAACCATTGCTGCCGCAG
GACAACATAATCTGCTTTTTTTGGGGCCGCCCGGTACAGGAAAGACCATGCTCGCCAGCCGATTAACTACCTTGTTACCA
GAAATGAACGATCAAGAAACCATTGAAACCGCCGCAGTGACAAGCCTTATTCATCATGAATTGAATTTTCATAATTGGAA
ACAACGCCCATTTCGTGCCCCTCATCACAGCGCTTCAACACCAGCTTTAGTAGGGGGCGGCACCATACCCAAACCCGGTG
AAATTTCGCTCGCACACAATGGGGTGTTATTTCTCGATGAGTTACCAGAATTTGAACGTAAAGTATTAGATGCCTTACGT
CAACCTTTGGAAAGTGGTGAAATTATCATTTCGCGTGCCAATGCAAAAATTCAATTCCCCGCTCGTTTTCAATTAATTGC
CGCCATGAATCCTAGCCCAACGGGACATTATCAAGGTACTCACAATCGCACCTCACCACAACAAATCTTGCGTTATCTCA
ATCGCTTATCCGGTCCGTTTTTAGATCGCTTTGATTTGTCAATCGAAGTCCCTTTACTACCCAAAGGCGCGTTACAACAG
CAAACAGATCGGGGCGAAAGTAGCGCACAAGTGAGAGAAAAAGTATTAAAAGTACGCGACATACAACTGGCGCGTGCGGG
GAAAATTAATGCTTATTTGACCAGTAAAGAAATTGAGCGAGACTGTAAGATCAGTGATCAAGATGCGTTATTTTTAGAAA
ATGCCTTAGCTAAATTGGGGTTATCCGTACGCGCCTATCATCGTATTTTAAAAGTAGCACGCACCATTGCAGACTTAAAT
AATGAATGCCAAATTCAACAATGCCACCTCGCCGAAGCACTGGGTTATCGGGCGATGGATAGGTTGTTACAGAAGTTATC
TGCAATATAA
ATGTCACTTGCAATTGTTTATAGTCGCGCCTCAATGGGCGTACAAGCCCCTTTAGTCACCATTGAGGTGCATTTCAGTAA
TGGTAAACCGCAATTCAACTTAGTCGGCTTACCCGAAAAAACCGTCAAAGAAGCACAAGATCGCGTACGCAGCGCCTTGC
TCAATGCTCAATTTAAATACCCCGCTAAACGGATCACGGTCAATCTCGCCCCCGCCGATTTACCCAAAGAAGGCGGACGT
TTTGATTTACCGATCGCCATTGGTATGCTCGCCGCTTCTGGTCAAATTGATCCACAAAAGTTACAACAATTTGAATTTGT
GGGTGAGTTAGCGCTAACCGGTGATTTACGTGCAGTACATGGCGTCATCCCCGCGATCCTTGCTGCACAAAAAGCCAAGC
GGAAATTGATTATCGCCAGTCAAAATGCCAATGAAGCCTCGTTAGTTTCCCAGCAAGACACCTATTTTGCGCAACATTTA
TTAGAAGTCGTCAATTTCCTCAATGATCATTCCACGTTACCTTTAGCCTCAGACCTGCCTACACAACATGATCCAGCCCT
TTTAGCAACAACACAAAAAGATTTAACCGATATTATCGGACAATCCCATGCTAAACGCGCTTTAACCATTGCTGCCGCAG
GACAACATAATCTGCTTTTTTTGGGGCCGCCCGGTACAGGAAAGACCATGCTCGCCAGCCGATTAACTACCTTGTTACCA
GAAATGAACGATCAAGAAACCATTGAAACCGCCGCAGTGACAAGCCTTATTCATCATGAATTGAATTTTCATAATTGGAA
ACAACGCCCATTTCGTGCCCCTCATCACAGCGCTTCAACACCAGCTTTAGTAGGGGGCGGCACCATACCCAAACCCGGTG
AAATTTCGCTCGCACACAATGGGGTGTTATTTCTCGATGAGTTACCAGAATTTGAACGTAAAGTATTAGATGCCTTACGT
CAACCTTTGGAAAGTGGTGAAATTATCATTTCGCGTGCCAATGCAAAAATTCAATTCCCCGCTCGTTTTCAATTAATTGC
CGCCATGAATCCTAGCCCAACGGGACATTATCAAGGTACTCACAATCGCACCTCACCACAACAAATCTTGCGTTATCTCA
ATCGCTTATCCGGTCCGTTTTTAGATCGCTTTGATTTGTCAATCGAAGTCCCTTTACTACCCAAAGGCGCGTTACAACAG
CAAACAGATCGGGGCGAAAGTAGCGCACAAGTGAGAGAAAAAGTATTAAAAGTACGCGACATACAACTGGCGCGTGCGGG
GAAAATTAATGCTTATTTGACCAGTAAAGAAATTGAGCGAGACTGTAAGATCAGTGATCAAGATGCGTTATTTTTAGAAA
ATGCCTTAGCTAAATTGGGGTTATCCGTACGCGCCTATCATCGTATTTTAAAAGTAGCACGCACCATTGCAGACTTAAAT
AATGAATGCCAAATTCAACAATGCCACCTCGCCGAAGCACTGGGTTATCGGGCGATGGATAGGTTGTTACAGAAGTTATC
TGCAATATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
84.221 |
99.607 |
0.839 |
| comM | Glaesserella parasuis strain SC1401 |
77.515 |
99.607 |
0.772 |
| comM | Vibrio cholerae strain A1552 |
66.012 |
100 |
0.66 |
| comM | Vibrio campbellii strain DS40M4 |
65.551 |
99.804 |
0.654 |
| comM | Legionella pneumophila str. Paris |
51.4 |
98.232 |
0.505 |
| comM | Legionella pneumophila strain ERS1305867 |
51.4 |
98.232 |
0.505 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
45.333 |
100 |
0.468 |