Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | COW1_RS12385 | Genome accession | NZ_AP024239 |
| Coordinates | 2628883..2630388 (-) | Length | 501 a.a. |
| NCBI ID | WP_096363995.1 | Uniprot ID | A0A1Z4VMG5 |
| Organism | Thiohalobacter sp. COW1 | | |
| Function | DNA uptake (predicted from homology); DNA binding and uptake | | |
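The coordinates and the protein length in the overview are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The helper names (`parse_coordinates`, `gene_length_bp`, `protein_length_aa`) are hypothetical and not part of any database tooling.

```python
import re


def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def gene_length_bp(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


start, end, strand = parse_coordinates("2628883..2630388 (-)")
length_bp = gene_length_bp(start, end)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa, strand)
```

Under these assumptions the values match the 1506 bp and 501 a.a. reported for COW1_RS12385.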
Genomic Context
Location: 2623883..2635388
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| COW1_RS12355 (TspCOW1_24530) | - | 2625565..2625885 (+) | 321 | WP_201344370.1 | helix-turn-helix transcriptional regulator | - |
| COW1_RS12360 (TspCOW1_24540) | - | 2625963..2626391 (+) | 429 | WP_096364005.1 | rhodanese-like domain-containing protein | - |
| COW1_RS12365 (TspCOW1_24550) | grxC | 2626398..2626655 (+) | 258 | WP_096364003.1 | glutaredoxin 3 | - |
| COW1_RS12370 (TspCOW1_24560) | secB | 2626763..2627233 (+) | 471 | WP_096364001.1 | protein-export chaperone SecB | - |
| COW1_RS12375 (TspCOW1_24570) | - | 2627252..2628253 (+) | 1002 | WP_096363999.1 | NAD(P)H-dependent glycerol-3-phosphate dehydrogenase | - |
| COW1_RS12380 (TspCOW1_24580) | - | 2628295..2628723 (-) | 429 | WP_096363997.1 | tetratricopeptide repeat protein | - |
| COW1_RS12385 (TspCOW1_24590) | comM | 2628883..2630388 (-) | 1506 | WP_096363995.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| COW1_RS12390 (TspCOW1_24600) | - | 2630593..2630895 (-) | 303 | WP_096363993.1 | accessory factor UbiK family protein | - |
| COW1_RS12395 (TspCOW1_24610) | glnK | 2631235..2631573 (+) | 339 | WP_096363991.1 | P-II family nitrogen regulator | - |
| COW1_RS12400 (TspCOW1_24620) | - | 2631615..2632874 (+) | 1260 | WP_096363989.1 | ammonium transporter | - |
| COW1_RS12405 (TspCOW1_24630) | - | 2632996..2633715 (+) | 720 | WP_096363987.1 | TorF family putative porin | - |
| COW1_RS12410 (TspCOW1_24640) | - | 2633992..2634567 (+) | 576 | WP_096363985.1 | L-threonylcarbamoyladenylate synthase | - |
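The listed location appears to be the comM coordinates extended by 5000 bp on each side. That flank size is inferred from the numbers on this page, not from any documented setting. A minimal sketch of the window calculation, with hypothetical helper names, might look like this:

```python
def context_window(start, end, flank=5000):
    """Extend a feature by `flank` bp on both sides (flank size is an assumption)."""
    return max(1, start - flank), end + flank


def within(window, start, end):
    """True if a feature lies entirely inside the window."""
    return window[0] <= start and end <= window[1]


# comM coordinates taken from the table above
window = context_window(2628883, 2630388)

# First and last neighbours from the table: (locus tag, start, end)
neighbours = [
    ("COW1_RS12355", 2625565, 2625885),
    ("COW1_RS12410", 2633992, 2634567),
]
all_inside = all(within(window, s, e) for _, s, e in neighbours)
print(window, all_inside)
```

With these inputs the window reproduces the location 2623883..2635388, and both outermost listed genes fall inside it.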
Sequence
Protein
Length: 501 a.a.; Molecular weight: 53452.41 Da; Isoelectric point: 8.1925
>NTDB_id=84245 COW1_RS12385 WP_096363995.1 2628883..2630388(-) (comM) [Thiohalobacter sp. COW1]
MSLAVVYSRAQLGIAAPLVSVEVHLSNGLPGLSIVGLPETAVRESKDRVRAALMNSGFEFPARRITINLAPADLPKEGGR
FDLAIALGILAASRQLPPEPLQDHEFTGELSLSGELRPVTGVLPCALQCRQAGRTLVVPEANAAEAALVADGRALQAGHL
LAVTAYLREDQSLEPARVCAGAAGPDTPLPDLAEVRGQAQARRVLEIAAAGGHSLLMVGPPGTGKTLLAQRLPGILPALD
AAEAVETAALRSICGCRIDPGAWGRRPFRAPHHTASAVALVGGGSNPRPGEISLAHNGVLFLDELAEFERRVLEALREPL
ETGSITISRAARQAEFPARFQLVAAMNPCPCGYHGDPTRACRCSPDQIRRYRGRLSGPLLDRFDLQIEVPRQALDWQSAG
GGEPSAVVRERVLAARAAQQRRAGRVNARLEVRALTRDCDLVPELQRFLQQAIERLDLSARAAHRVLRVARTIADLAGAE
ALALAHLSEALQYRVLDRPLP
Nucleotide
Length: 1506 bp
>NTDB_id=84245 COW1_RS12385 WP_096363995.1 2628883..2630388(-) (comM) [Thiohalobacter sp. COW1]
ATGTCACTGGCCGTGGTCTACAGCCGCGCCCAGCTGGGGATCGCCGCGCCGCTGGTCAGCGTCGAGGTCCACCTGAGCAA
CGGCCTGCCGGGTCTGTCCATCGTGGGTCTGCCGGAGACGGCGGTGCGCGAGAGCAAGGACCGGGTACGTGCCGCGTTGA
TGAACAGCGGTTTCGAGTTTCCGGCCCGCCGCATCACCATCAACCTGGCCCCGGCCGACCTGCCCAAGGAAGGCGGGCGT
TTTGATCTGGCCATCGCGCTGGGCATTCTGGCCGCCTCCCGACAACTTCCCCCTGAGCCGCTGCAGGATCACGAATTCAC
CGGCGAACTGTCACTGTCCGGCGAACTGCGCCCGGTCACCGGCGTGCTGCCCTGTGCCCTGCAGTGCCGGCAGGCCGGGC
GGACCCTGGTCGTGCCCGAGGCCAACGCCGCCGAGGCCGCGCTGGTCGCCGACGGGCGTGCACTGCAGGCGGGCCATCTG
CTGGCGGTGACCGCTTACCTGCGTGAAGATCAGTCATTAGAGCCGGCCCGCGTTTGCGCCGGCGCTGCGGGCCCGGACAC
CCCCTTGCCCGATCTGGCCGAGGTGCGGGGCCAGGCCCAGGCCCGCCGGGTGCTGGAGATCGCTGCCGCCGGCGGTCACA
GTCTGCTGATGGTCGGCCCGCCCGGGACCGGCAAGACCCTGCTGGCCCAGCGCCTGCCCGGCATCCTCCCGGCGCTGGAT
GCGGCCGAGGCGGTGGAGACGGCCGCCTTGCGCTCCATCTGCGGCTGCCGCATCGACCCAGGCGCCTGGGGCCGCCGTCC
CTTCCGGGCCCCGCACCACACTGCCTCGGCCGTGGCCCTGGTCGGCGGCGGATCCAATCCCCGCCCCGGCGAGATTTCGC
TGGCGCACAATGGGGTGTTGTTCCTCGACGAGCTGGCCGAGTTCGAGCGCCGGGTGCTGGAGGCGCTGCGCGAGCCGCTT
GAGACCGGCAGCATCACCATCTCGCGCGCCGCCCGTCAGGCCGAGTTCCCGGCCCGCTTTCAGCTGGTGGCCGCCATGAA
CCCCTGCCCCTGCGGCTACCACGGTGATCCCACCCGGGCCTGCCGCTGCAGCCCGGACCAGATCCGACGTTACCGCGGTC
GCCTGTCCGGGCCGCTGCTGGACCGCTTCGATCTGCAGATCGAGGTGCCGCGCCAGGCGCTGGACTGGCAGAGCGCCGGC
GGCGGCGAACCCAGCGCGGTGGTGCGGGAGCGGGTGCTGGCGGCGCGCGCAGCTCAGCAGCGTCGCGCGGGTCGGGTCAA
CGCCCGCCTGGAGGTGCGCGCCCTGACGCGGGACTGCGACCTCGTGCCGGAGCTGCAGCGCTTCCTGCAGCAGGCGATCG
AGCGTCTGGATCTGTCCGCCCGCGCCGCCCACCGGGTGCTGCGGGTGGCGCGCACCATCGCCGATCTGGCTGGCGCCGAG
GCGCTGGCGCTGGCGCATCTGAGCGAAGCACTGCAGTACCGTGTCCTCGACCGGCCGCTGCCCTGA
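The FASTA headers on this page pack several fields into one line. The sketch below splits such a header into named fields with a regular expression. The pattern is inferred from the two headers shown here and may not cover other records; `HEADER_RE` and `parse_header` are hypothetical names.

```python
import re

# Pattern inferred from the headers on this page (an assumption, not a spec)
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line):
    """Return the header fields as a dict, with coordinates as integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=84245 COW1_RS12385 WP_096363995.1 "
    "2628883..2630388(-) (comM) [Thiohalobacter sp. COW1]"
)
record = parse_header(header)
print(record["gene"], record["locus_tag"], record["strand"])
```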
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio campbellii strain DS40M4 | 55.534 | 100 | 0.561 |
| comM | Vibrio cholerae strain A1552 | 55.578 | 100 | 0.557 |
| comM | Haemophilus influenzae Rd KW20 | 54.635 | 100 | 0.553 |
| comM | Glaesserella parasuis strain SC1401 | 54.15 | 100 | 0.547 |
| comM | Legionella pneumophila str. Paris | 50.898 | 100 | 0.509 |
| comM | Legionella pneumophila strain ERS1305867 | 50.898 | 100 | 0.509 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 45.972 | 100 | 0.467 |