Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | A4212_RS00185 | Genome accession | NZ_CP015567 |
| Coordinates | 36796..38325 (-) | Length | 509 a.a. |
| NCBI ID | WP_015702557.1 | Uniprot ID | - |
| Organism | Pasteurella multocida strain USDA-ARS-USMARC-60675 | ||
| Function | required for natural transformation (predicted from homology) Unclear |
Genomic Context
Location: 31796..43325
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| A4212_RS00170 (A4212_00170) | - | 32076..32918 (+) | 843 | WP_015702556.1 | divergent polysaccharide deacetylase family protein | - |
| A4212_RS00175 (A4212_00175) | - | 32939..33625 (+) | 687 | WP_005757690.1 | hypothetical protein | - |
| A4212_RS12080 (A4212_00180) | - | 33732..35327 (-) | 1596 | WP_230294055.1 | TonB-dependent receptor domain-containing protein | - |
| A4212_RS12085 (A4212_00185) | - | 35358..36293 (-) | 936 | WP_041423264.1 | TonB-dependent receptor plug domain-containing protein | - |
| A4212_RS00185 (A4212_00190) | comM | 36796..38325 (-) | 1530 | WP_015702557.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| A4212_RS00190 (A4212_00195) | yihA | 38433..39050 (-) | 618 | WP_005718080.1 | ribosome biogenesis GTP-binding protein YihA/YsxC | - |
| A4212_RS00195 (A4212_00200) | - | 39161..40030 (+) | 870 | WP_014668443.1 | VirK/YbjX family protein | - |
| A4212_RS00200 (A4212_00205) | trmL | 40104..40580 (-) | 477 | WP_005755093.1 | tRNA (uridine(34)/cytosine(34)/5- carboxymethylaminomethyluridine(34)-2'-O)- methyltransferase TrmL | - |
| A4212_RS12090 | - | 40675..40932 (-) | 258 | WP_230294056.1 | hypothetical protein | - |
| A4212_RS00205 (A4212_00210) | - | 40913..41680 (-) | 768 | WP_230294057.1 | hypothetical protein | - |
| A4212_RS00210 (A4212_00215) | - | 41705..43225 (-) | 1521 | WP_016532830.1 | surface lipoprotein assembly modifier | - |
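The Size (bp) column above is consistent with reading each coordinate pair as an inclusive, 1-based range, and the comM entry (1530 bp) matches the 509 a.a. protein length once a terminal stop codon is set aside. The following is a minimal sketch of that arithmetic; the helper name `span_length` is hypothetical, and the numbers are copied from three rows of the table.

```python
def span_length(start, end):
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1

# (start, end, listed size in bp) copied from three rows of the table above
rows = [
    (32076, 32918, 843),
    (36796, 38325, 1530),  # comM
    (41705, 43225, 1521),
]
for start, end, listed in rows:
    assert span_length(start, end) == listed

comm_bp = span_length(36796, 38325)
# Assumption: the final codon is a stop codon and is not translated.
comm_aa = comm_bp // 3 - 1
print(comm_bp, comm_aa)
```

Under these assumptions the comM span gives 1530 bp and 509 residues, in agreement with the Overview table.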
Sequence
Protein
Length: 509 a.a. Molecular weight: 55907.32 Da Isoelectric Point: 9.4036
>NTDB_id=180904 A4212_RS00185 WP_015702557.1 36796..38325(-) (comM) [Pasteurella multocida strain USDA-ARS-USMARC-60675]
MSLAIVYSRASMGVQAPLVTIEVHFSNGKPQFNLVGLPEKTVKEAQDRVRSALLNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGMLAASGQIDPQKLQQFEFVGELALTGDLRAVHGVIPAILAAQKAKRKLIIASQNANEASLVSQQDTYFAQHL
LEVVNFLNDHSTLPLASDLPTQHDPALLATTQKDLTDIIGQSHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTTLLP
EMNDQETIETAAVTSLIHHELNFHNWKQRPFRAPHHSASTPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLIAAMNPSPTGHYQGTHNRTSPQQILRYLNRLSGPFLDRFDLSIEVPLLPKGALQQ
QTDRGESSAQVREKVLKVRDIQLARAGKINAYLTSKEIERDCKISDQDALFLENALAKLGLSVRAYHRILKVARTIADLN
NECQIQQCHLAEALGYRAMDRLLQKLSAI
Nucleotide
Length: 1530 bp
>NTDB_id=180904 A4212_RS00185 WP_015702557.1 36796..38325(-) (comM) [Pasteurella multocida strain USDA-ARS-USMARC-60675]
ATGTCACTTGCAATTGTTTATAGTCGCGCCTCAATGGGCGTACAAGCCCCTTTAGTCACCATTGAGGTGCATTTCAGTAA
TGGTAAACCGCAATTCAACTTAGTCGGCTTACCCGAAAAAACCGTCAAAGAAGCACAAGATCGCGTACGCAGCGCCTTGC
TCAATGCTCAATTTAAATACCCCGCTAAACGGATCACGGTCAATCTCGCCCCCGCCGATTTACCCAAAGAAGGCGGACGT
TTTGATTTACCGATCGCCATTGGTATGCTCGCCGCTTCTGGTCAAATTGATCCACAAAAGTTACAACAATTTGAATTTGT
GGGTGAGTTAGCGCTAACCGGTGATTTACGTGCAGTACATGGCGTCATCCCCGCGATCCTTGCTGCACAAAAAGCCAAGC
GGAAATTGATTATCGCCAGTCAAAATGCCAATGAAGCCTCGTTAGTTTCCCAGCAAGACACCTATTTTGCGCAACATTTA
TTAGAAGTCGTCAATTTCCTCAATGATCATTCCACGTTACCTTTAGCCTCAGACCTGCCTACACAACATGATCCAGCCCT
TTTAGCAACAACACAAAAAGATTTAACCGATATTATCGGACAATCCCATGCTAAACGCGCTTTAACCATTGCTGCCGCAG
GACAACATAATCTGCTTTTTTTGGGGCCGCCCGGTACAGGAAAGACCATGCTCGCCAGCCGATTAACTACCTTGTTACCA
GAAATGAACGATCAAGAAACCATTGAAACCGCCGCAGTGACAAGCCTTATTCATCATGAATTGAATTTTCATAATTGGAA
ACAACGCCCATTTCGTGCCCCTCATCACAGCGCTTCAACACCAGCTTTAGTAGGGGGCGGCACCATACCCAAACCCGGTG
AAATTTCGCTCGCACACAATGGGGTGTTATTTCTCGATGAGTTACCAGAATTTGAACGTAAAGTATTAGATGCCTTACGT
CAACCTTTGGAAAGTGGTGAAATTATCATTTCGCGTGCCAATGCAAAAATTCAATTCCCCGCTCGTTTTCAATTAATTGC
CGCCATGAATCCTAGCCCAACGGGACATTATCAAGGTACTCACAATCGCACCTCACCACAACAAATCTTGCGTTATCTCA
ATCGCTTATCCGGTCCGTTTTTAGATCGCTTTGATTTGTCAATCGAAGTCCCTTTACTACCCAAAGGCGCGTTACAACAG
CAAACAGATCGGGGCGAAAGTAGCGCACAAGTGAGAGAAAAAGTATTAAAAGTACGCGACATACAACTGGCGCGTGCGGG
GAAAATTAATGCTTATTTGACCAGTAAAGAAATTGAGCGAGACTGTAAGATCAGTGATCAAGATGCGTTATTTTTAGAAA
ATGCCTTAGCTAAATTGGGGTTATCCGTACGCGCCTATCATCGTATTTTAAAAGTAGCACGCACCATTGCAGACTTAAAT
AATGAATGCCAAATTCAACAATGCCACCTCGCCGAAGCACTGGGTTATCGGGCGATGGATAGGTTGTTACAGAAGTTATC
TGCAATATAA
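The FASTA header used for the sequence records packs the database ID, locus tag, protein accession, coordinates, strand, gene name, and organism into one line. The following is a minimal sketch of pulling those fields apart with a regular expression. The field names (`ntdb_id`, `locus_tag`, and so on) and the function `parse_header` are illustrative assumptions, not part of any official parser, and the pattern is only known to fit the header shown in this record.

```python
import re

# Header copied from the record above.
HEADER = (
    ">NTDB_id=180904 A4212_RS00185 WP_015702557.1 36796..38325(-) (comM) "
    "[Pasteurella multocida strain USDA-ARS-USMARC-60675]"
)

# Assumed layout: id, locus tag, protein ID, start..end(strand), (gene), [organism]
PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)

def parse_header(line):
    """Split one header line into named fields; raise if the layout differs."""
    match = PATTERN.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields

record = parse_header(HEADER)
print(record["locus_tag"], record["gene"], record["strand"])
```

The parsed coordinates can then be checked against the stated length, since `record["end"] - record["start"] + 1` should equal 1530 for this entry.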
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 | 84.221 | 99.607 | 0.839 |
| comM | Glaesserella parasuis strain SC1401 | 77.515 | 99.607 | 0.772 |
| comM | Vibrio cholerae strain A1552 | 66.012 | 100 | 0.66 |
| comM | Vibrio campbellii strain DS40M4 | 65.551 | 99.804 | 0.654 |
| comM | Legionella pneumophila str. Paris | 51.4 | 98.232 | 0.505 |
| comM | Legionella pneumophila strain ERS1305867 | 51.4 | 98.232 | 0.505 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 45.333 | 100 | 0.468 |