Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | GXP68_RS20485 | Genome accession | NZ_CP048243 |
| Coordinates | 4478991..4480532 (+) | Length | 513 a.a. |
| NCBI ID | WP_185688867.1 | Uniprot ID | - |
| Organism | Ewingella americana strain B6-1 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 4473991..4485532
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GXP68_RS20460 (GXP68_20440) | - | 4474164..4475093 (-) | 930 | WP_034794715.1 | branched-chain amino acid transaminase | - |
| GXP68_RS20465 (GXP68_20445) | ilvM | 4475110..4475373 (-) | 264 | WP_034794713.1 | acetolactate synthase 2 small subunit | - |
| GXP68_RS20470 (GXP68_20450) | ilvG | 4475370..4477016 (-) | 1647 | WP_185688865.1 | acetolactate synthase 2 catalytic subunit | - |
| GXP68_RS20475 (GXP68_20455) | ilvL | 4477154..4477255 (-) | 102 | WP_015699153.1 | ilv operon leader peptide | - |
| GXP68_RS20480 (GXP68_20460) | - | 4477387..4478625 (-) | 1239 | WP_185688866.1 | MFS transporter | - |
| GXP68_RS20485 (GXP68_20465) | comM | 4478991..4480532 (+) | 1542 | WP_185688867.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| GXP68_RS20490 (GXP68_20470) | - | 4480558..4480896 (-) | 339 | WP_034794706.1 | DUF413 domain-containing protein | - |
| GXP68_RS20495 (GXP68_20475) | hdfR | 4481015..4481842 (+) | 828 | WP_034794703.1 | HTH-type transcriptional regulator HdfR | - |
| GXP68_RS24055 | - | 4481861..4481992 (-) | 132 | WP_272901188.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 513 a.a. Molecular weight: 55871.23 Da Isoelectric Point: 7.8964
>NTDB_id=420116 GXP68_RS20485 WP_185688867.1 4478991..4480532(+) (comM) [Ewingella americana strain B6-1]
MALAVVHTRASLGVQAPGVAVEVHISNGLPALALVGLPETTVKEARDRVRSAILNCGFTFPAKRITVNLAPADLPKEGGR
YDLSIALAILVASEQLSGDKLGDYEFLGELGLSGALRGVNGAIPAALEAQKEGRRLILPQDNRQEMSLLETGIAKVADHL
LQVCAFLQGEEDLHDVERVDPPDIYVNQPDIADIIGQEQSKRALEVAAAGGHNLLLIGPPGTGKTMLASRLPALLPPLTD
QETLETAAIASLVYNPEENGNFSRSRPFRAPHHSTSMSALVGGGSLPRPGEISLAHNGVLFLDELPEFERKVLDALRQPM
ESGEITISRARAKVRYPARTQLIAAMNPSPTGHYQGVHNRTPPQQVLRYLSRLSGPFLDRFDLSIEVPLLPPGVLSLQSQ
KLGSRARESSAQVRERVMRARERQLARSGKINAHMSSSEVEKFCELKKEDAEFLEGVLHKLGLSVRAWHRILKVARTIAD
LNNQSMIEKAHISEALSYRCMDRLLLKLHKSLA
MALAVVHTRASLGVQAPGVAVEVHISNGLPALALVGLPETTVKEARDRVRSAILNCGFTFPAKRITVNLAPADLPKEGGR
YDLSIALAILVASEQLSGDKLGDYEFLGELGLSGALRGVNGAIPAALEAQKEGRRLILPQDNRQEMSLLETGIAKVADHL
LQVCAFLQGEEDLHDVERVDPPDIYVNQPDIADIIGQEQSKRALEVAAAGGHNLLLIGPPGTGKTMLASRLPALLPPLTD
QETLETAAIASLVYNPEENGNFSRSRPFRAPHHSTSMSALVGGGSLPRPGEISLAHNGVLFLDELPEFERKVLDALRQPM
ESGEITISRARAKVRYPARTQLIAAMNPSPTGHYQGVHNRTPPQQVLRYLSRLSGPFLDRFDLSIEVPLLPPGVLSLQSQ
KLGSRARESSAQVRERVMRARERQLARSGKINAHMSSSEVEKFCELKKEDAEFLEGVLHKLGLSVRAWHRILKVARTIAD
LNNQSMIEKAHISEALSYRCMDRLLLKLHKSLA
Nucleotide
Download Length: 1542 bp
>NTDB_id=420116 GXP68_RS20485 WP_185688867.1 4478991..4480532(+) (comM) [Ewingella americana strain B6-1]
ATGGCATTAGCGGTTGTTCACACCAGAGCCTCACTTGGCGTGCAGGCTCCGGGGGTTGCTGTCGAAGTCCATATCAGCAA
CGGCTTGCCCGCTCTGGCGCTGGTTGGCTTACCGGAAACCACGGTAAAAGAGGCACGGGATCGAGTACGCAGCGCCATAC
TCAACTGCGGTTTCACCTTTCCCGCCAAACGCATCACCGTCAATCTGGCTCCCGCCGACCTGCCCAAAGAGGGTGGCCGT
TACGACTTATCCATTGCGTTAGCGATTCTGGTGGCTTCTGAGCAGCTTTCTGGGGATAAGCTGGGTGATTACGAGTTTCT
AGGGGAATTAGGCCTTTCTGGCGCGTTACGTGGCGTAAATGGCGCTATTCCCGCTGCACTAGAAGCCCAAAAAGAGGGGC
GGCGTTTAATCCTCCCGCAAGACAATCGGCAAGAAATGTCACTGCTCGAGACGGGTATCGCCAAAGTCGCCGACCATCTT
CTGCAAGTTTGTGCTTTTTTGCAGGGAGAAGAGGATTTACATGACGTCGAACGCGTCGATCCGCCCGATATCTACGTCAA
CCAGCCCGACATTGCTGACATTATTGGGCAAGAGCAGTCTAAACGAGCGCTGGAAGTCGCCGCCGCCGGTGGGCATAACC
TTTTGCTGATTGGCCCGCCGGGTACCGGGAAAACTATGCTTGCCAGCCGCCTGCCTGCGTTGCTGCCTCCGTTAACCGAC
CAAGAAACTCTAGAGACGGCGGCGATAGCCAGTCTGGTCTACAACCCTGAAGAGAATGGGAATTTCTCACGCAGCCGCCC
TTTCCGCGCCCCTCACCACAGCACCTCCATGAGTGCCTTAGTCGGCGGCGGCTCTTTACCTCGTCCGGGTGAAATTTCGT
TGGCGCACAACGGCGTGCTGTTTCTTGACGAGCTGCCGGAGTTCGAACGCAAAGTACTCGATGCCTTGCGCCAACCCATG
GAGTCAGGCGAAATCACCATCTCCCGCGCCCGCGCCAAGGTGCGTTACCCCGCCAGAACGCAACTCATCGCCGCCATGAA
TCCCAGCCCTACAGGGCATTACCAAGGCGTACACAACCGCACGCCTCCTCAGCAGGTCCTTCGCTATCTCAGCAGGCTCT
CAGGCCCCTTTCTCGACCGTTTTGATTTATCGATTGAAGTGCCTCTCCTGCCACCGGGCGTGCTGAGTTTGCAAAGCCAG
AAGCTCGGCTCTCGCGCCAGAGAGAGCAGTGCGCAGGTACGCGAGCGAGTGATGAGGGCTCGCGAGCGCCAATTAGCCCG
GTCAGGGAAAATTAATGCGCATATGAGCAGCAGTGAAGTCGAAAAATTTTGTGAGCTTAAAAAGGAGGATGCTGAATTTC
TGGAGGGGGTGCTGCACAAGCTGGGATTATCGGTTCGAGCTTGGCATCGTATTCTTAAAGTTGCGAGAACAATAGCCGAT
CTCAATAATCAGTCTATGATTGAGAAAGCTCATATCTCTGAAGCGCTAAGCTACAGATGTATGGATAGATTGCTGCTGAA
GTTACACAAAAGCCTGGCATAA
ATGGCATTAGCGGTTGTTCACACCAGAGCCTCACTTGGCGTGCAGGCTCCGGGGGTTGCTGTCGAAGTCCATATCAGCAA
CGGCTTGCCCGCTCTGGCGCTGGTTGGCTTACCGGAAACCACGGTAAAAGAGGCACGGGATCGAGTACGCAGCGCCATAC
TCAACTGCGGTTTCACCTTTCCCGCCAAACGCATCACCGTCAATCTGGCTCCCGCCGACCTGCCCAAAGAGGGTGGCCGT
TACGACTTATCCATTGCGTTAGCGATTCTGGTGGCTTCTGAGCAGCTTTCTGGGGATAAGCTGGGTGATTACGAGTTTCT
AGGGGAATTAGGCCTTTCTGGCGCGTTACGTGGCGTAAATGGCGCTATTCCCGCTGCACTAGAAGCCCAAAAAGAGGGGC
GGCGTTTAATCCTCCCGCAAGACAATCGGCAAGAAATGTCACTGCTCGAGACGGGTATCGCCAAAGTCGCCGACCATCTT
CTGCAAGTTTGTGCTTTTTTGCAGGGAGAAGAGGATTTACATGACGTCGAACGCGTCGATCCGCCCGATATCTACGTCAA
CCAGCCCGACATTGCTGACATTATTGGGCAAGAGCAGTCTAAACGAGCGCTGGAAGTCGCCGCCGCCGGTGGGCATAACC
TTTTGCTGATTGGCCCGCCGGGTACCGGGAAAACTATGCTTGCCAGCCGCCTGCCTGCGTTGCTGCCTCCGTTAACCGAC
CAAGAAACTCTAGAGACGGCGGCGATAGCCAGTCTGGTCTACAACCCTGAAGAGAATGGGAATTTCTCACGCAGCCGCCC
TTTCCGCGCCCCTCACCACAGCACCTCCATGAGTGCCTTAGTCGGCGGCGGCTCTTTACCTCGTCCGGGTGAAATTTCGT
TGGCGCACAACGGCGTGCTGTTTCTTGACGAGCTGCCGGAGTTCGAACGCAAAGTACTCGATGCCTTGCGCCAACCCATG
GAGTCAGGCGAAATCACCATCTCCCGCGCCCGCGCCAAGGTGCGTTACCCCGCCAGAACGCAACTCATCGCCGCCATGAA
TCCCAGCCCTACAGGGCATTACCAAGGCGTACACAACCGCACGCCTCCTCAGCAGGTCCTTCGCTATCTCAGCAGGCTCT
CAGGCCCCTTTCTCGACCGTTTTGATTTATCGATTGAAGTGCCTCTCCTGCCACCGGGCGTGCTGAGTTTGCAAAGCCAG
AAGCTCGGCTCTCGCGCCAGAGAGAGCAGTGCGCAGGTACGCGAGCGAGTGATGAGGGCTCGCGAGCGCCAATTAGCCCG
GTCAGGGAAAATTAATGCGCATATGAGCAGCAGTGAAGTCGAAAAATTTTGTGAGCTTAAAAAGGAGGATGCTGAATTTC
TGGAGGGGGTGCTGCACAAGCTGGGATTATCGGTTCGAGCTTGGCATCGTATTCTTAAAGTTGCGAGAACAATAGCCGAT
CTCAATAATCAGTCTATGATTGAGAAAGCTCATATCTCTGAAGCGCTAAGCTACAGATGTATGGATAGATTGCTGCTGAA
GTTACACAAAAGCCTGGCATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
63.672 |
99.805 |
0.635 |
| comM | Vibrio cholerae strain A1552 |
63.458 |
99.22 |
0.63 |
| comM | Glaesserella parasuis strain SC1401 |
62.183 |
100 |
0.622 |
| comM | Vibrio campbellii strain DS40M4 |
62.795 |
99.025 |
0.622 |
| comM | Legionella pneumophila str. Paris |
48.718 |
98.83 |
0.481 |
| comM | Legionella pneumophila strain ERS1305867 |
48.718 |
98.83 |
0.481 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
45.315 |
100 |
0.462 |