Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | R2K28_RS00180 | Genome accession | NZ_OY734020 |
| Coordinates | 41564..43072 (-) | Length | 502 a.a. |
| NCBI ID | WP_316367426.1 | Uniprot ID | - |
| Organism | Candidatus Thiodiazotropha sp. CDECU1 isolate 46184 | | |
| Function | required for natural transformation (predicted from homology) | Unclear | |
Genomic Context
Location: 36564..48072
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| R2K28_RS00165 | - | 36646..37890 (-) | 1245 | WP_316367421.1 | patatin-like phospholipase family protein | - |
| R2K28_RS00170 | - | 37905..39707 (-) | 1803 | WP_316367423.1 | oleate hydratase | - |
| R2K28_RS00175 | - | 40345..41334 (+) | 990 | WP_316367425.1 | GGDEF domain-containing protein | - |
| R2K28_RS00180 | comM | 41564..43072 (-) | 1509 | WP_316367426.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| R2K28_RS00185 | - | 43118..43474 (-) | 357 | WP_316367427.1 | accessory factor UbiK family protein | - |
| R2K28_RS00190 | - | 43716..44393 (+) | 678 | WP_316367428.1 | TorF family putative porin | - |
| R2K28_RS00195 | - | 44467..44805 (+) | 339 | WP_316367429.1 | P-II family nitrogen regulator | - |
| R2K28_RS00200 | - | 44836..46098 (+) | 1263 | WP_316367430.1 | ammonium transporter | - |
| R2K28_RS00205 | - | 46323..46796 (+) | 474 | WP_316367431.1 | DUF4124 domain-containing protein | - |
| R2K28_RS00210 | - | 46810..47865 (-) | 1056 | WP_316367432.1 | diguanylate cyclase | - |
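The coordinates in this record are 1-based and inclusive, so the gene length and the displayed context window can be checked with simple arithmetic. The sketch below is illustrative only: the 5000 bp flank is an assumption inferred from the numbers shown (41564 - 5000 = 36564 and 43072 + 5000 = 48072), not a documented parameter of the database, and the names `span_length` and `flank` are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Coordinates of R2K28_RS00180 (comM) as listed in the record.
gene_start, gene_end = 41564, 43072

gene_bp = span_length(gene_start, gene_end)  # size in bp
codons = gene_bp // 3                        # includes the stop codon
protein_aa = codons - 1                      # residues, stop codon excluded

# Assumption: the context window extends 5000 bp on each side of the gene.
flank = 5000
window = (gene_start - flank, gene_end + flank)

print(gene_bp, protein_aa, window)
```

Under these assumptions the values agree with the record: 1509 bp, 502 a.a., and a window of 36564..48072.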
Sequence
Protein
Length: 502 a.a. | Molecular weight: 54789.58 Da | Isoelectric point: 7.2351
>NTDB_id=1160743 R2K28_RS00180 WP_316367426.1 41564..43072(-) (comM) [Candidatus Thiodiazotropha sp. CDECU1 isolate 46184]
MSLAILYSRAQEGIQAPLVTVEVHLSNGLPGLSIVGLPEMAVRESKDRVRGALINSQFEFPARRITINLAPADLPKEGGR
FDLPIALGILAASNQLAADPLNHYEFTGELALSGEMRPISGILPVALTARDAGRSLILPQQNAEEAGLVSGLQCYPAKHL
LEVCSHINSVNQLEQFKGVRSPANVTKQQLDMADVYGQSHARRALEISAAGAHSLLYIGPPGTGKSMLASRLPGILPPMS
EEEALECAAIHSVANNRAFEPAQWRQRPYRAPHHTASAAALVGGGSNPKPGEISLAHCGVLFLDELPEFDRHTLEVLREP
LENGHITISRANRQVDYPSRFQMIAAMNPCPCGHLGDGSNRCHCTLDRITRYRNRISGPLLDRIDMHVEVPRQPLQINQE
SPTLEEPSDAIRRRVIDARDIQLERQGCTNQALQGVQIEQVAAPGKEGNALLHRAIEKLGLSMRAYHRILKVARTIADLE
ASPKVETAHISEAIGYRRLDRS
Nucleotide
Length: 1509 bp
>NTDB_id=1160743 R2K28_RS00180 WP_316367426.1 41564..43072(-) (comM) [Candidatus Thiodiazotropha sp. CDECU1 isolate 46184]
ATGTCACTCGCCATTCTCTATTCCCGGGCCCAAGAGGGCATCCAAGCGCCCCTGGTCACCGTCGAGGTCCACCTCTCCAA
CGGCCTGCCCGGCCTCTCCATCGTCGGCTTGCCCGAAATGGCGGTACGCGAGAGCAAGGACCGGGTCAGGGGTGCCCTGA
TCAACAGCCAGTTTGAATTTCCCGCCCGCCGCATAACCATCAACCTGGCGCCCGCCGATCTGCCGAAAGAGGGGGGGCGA
TTCGACCTTCCCATCGCCCTCGGCATCCTTGCCGCATCGAATCAACTGGCGGCGGACCCATTGAACCACTACGAATTCAC
CGGCGAGCTTGCCCTGTCCGGTGAAATGCGCCCGATCAGCGGGATTCTTCCGGTGGCGCTCACAGCCCGTGATGCGGGAC
GCTCCCTCATCCTGCCGCAACAGAATGCCGAGGAGGCGGGTCTGGTGAGCGGGCTCCAATGCTACCCGGCAAAACACCTG
CTCGAGGTCTGTTCGCACATCAATAGCGTCAACCAGCTGGAACAGTTCAAAGGCGTTCGATCACCAGCGAATGTAACAAA
GCAGCAGCTCGATATGGCCGACGTCTATGGTCAGAGCCACGCCCGGCGCGCCTTGGAGATCAGTGCCGCGGGGGCCCACT
CTCTGCTCTATATCGGCCCCCCCGGCACCGGCAAGTCGATGCTCGCCTCCCGCCTGCCCGGGATACTCCCGCCCATGAGC
GAGGAGGAGGCCCTGGAGTGTGCCGCCATCCACTCGGTGGCCAACAACCGGGCATTCGAGCCTGCCCAGTGGCGTCAAAG
ACCCTATCGCGCACCTCACCACACGGCATCGGCAGCCGCCCTGGTGGGCGGTGGCAGTAATCCGAAGCCGGGGGAGATCT
CCCTGGCCCATTGCGGGGTGCTGTTCCTGGACGAATTGCCTGAGTTCGACCGGCATACCCTGGAGGTGTTGCGCGAACCC
CTGGAGAACGGCCATATCACCATCTCCCGGGCCAATCGCCAGGTCGACTACCCATCCCGCTTTCAGATGATAGCGGCCAT
GAATCCCTGCCCCTGCGGCCACCTGGGGGACGGCAGCAACCGCTGTCACTGCACCCTGGACCGCATAACCCGCTACCGCA
ACCGCATCTCCGGCCCCCTGCTGGATCGCATCGACATGCATGTGGAAGTGCCCCGGCAGCCCCTGCAGATCAACCAGGAA
TCACCCACCCTTGAAGAACCGAGCGATGCCATTCGGCGCCGGGTTATAGATGCCCGTGATATCCAATTAGAACGACAAGG
CTGCACCAACCAGGCACTGCAGGGCGTGCAGATCGAACAGGTGGCCGCCCCGGGGAAGGAGGGTAATGCACTACTGCATC
GGGCCATCGAAAAACTCGGCCTCTCGATGCGGGCCTACCACCGGATATTGAAAGTGGCGCGTACCATCGCCGATCTGGAG
GCGAGCCCGAAGGTGGAGACTGCGCATATCAGCGAGGCGATTGGGTATCGGCGTTTGGACAGGAGTTAA
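The nucleotide entry is given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly to compare against the protein sequence. The following is a minimal sketch that assumes the standard codon assignments (which bacterial translation table 11 shares for elongation); the helper `translate` is hypothetical and only the first and last codons of the record are used here for brevity.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; any character other than
    A, C, G, or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight and last four codons of the record's nucleotide sequence.
first_codons = "ATGTCACTCGCCATTCTCTATTCC"
last_codons = "GACAGGAGTTAA"

print(translate(first_codons))  # MSLAILYS
print(translate(last_codons))   # DRS
```

Pasting the full 1509 bp sequence into `translate` should give a 502-residue string that can be compared with the protein entry above.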
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 | 53.968 | 100 | 0.542 |
| comM | Vibrio campbellii strain DS40M4 | 54.092 | 99.801 | 0.54 |
| comM | Vibrio cholerae strain A1552 | 54.092 | 99.801 | 0.54 |
| comM | Glaesserella parasuis strain SC1401 | 53.346 | 100 | 0.54 |
| comM | Legionella pneumophila str. Paris | 51.503 | 99.402 | 0.512 |
| comM | Legionella pneumophila strain ERS1305867 | 51.503 | 99.402 | 0.512 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 47.173 | 100 | 0.482 |