Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | ABZN20_RS00250 | Genome accession | NZ_CP160399 |
| Coordinates | 51521..53041 (-) | Length | 506 a.a. |
| NCBI ID | WP_367026409.1 | Uniprot ID | - |
| Organism | Methylococcus sp. ANG | | |
| Function | ssDNA binding (predicted from homology); DNA processing | | |
Genomic Context
Location: 46521..58041
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ABZN20_RS00215 (ABZN20_00215) | - | 46714..46827 (+) | 114 | WP_367027593.1 | type II toxin-antitoxin system RelE/ParE family toxin | - |
| ABZN20_RS00220 (ABZN20_00220) | - | 46835..47182 (+) | 348 | WP_218806155.1 | HigA family addiction module antitoxin | - |
| ABZN20_RS00225 (ABZN20_00225) | - | 47185..47418 (+) | 234 | WP_367026406.1 | type II toxin-antitoxin system MqsA family antitoxin | - |
| ABZN20_RS00230 (ABZN20_00230) | - | 47703..48773 (+) | 1071 | WP_367026407.1 | IS630 family transposase | - |
| ABZN20_RS00245 (ABZN20_00245) | rep | 49521..51524 (-) | 2004 | WP_367026408.1 | DNA helicase Rep | - |
| ABZN20_RS00250 (ABZN20_00250) | comM | 51521..53041 (-) | 1521 | WP_367026409.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| ABZN20_RS00255 (ABZN20_00255) | - | 53050..53301 (-) | 252 | WP_218806162.1 | accessory factor UbiK family protein | - |
| ABZN20_RS00260 (ABZN20_00260) | glnK | 53562..53900 (+) | 339 | WP_010961376.1 | P-II family nitrogen regulator | - |
| ABZN20_RS00265 (ABZN20_00265) | - | 53932..55374 (+) | 1443 | WP_305080242.1 | ammonium transporter | - |
| ABZN20_RS00270 (ABZN20_00270) | - | 55489..55941 (-) | 453 | WP_367026410.1 | DUF1810 domain-containing protein | - |
| ABZN20_RS00275 (ABZN20_00275) | trxC | 55987..56406 (-) | 420 | WP_367026411.1 | thioredoxin TrxC | - |
| ABZN20_RS00280 (ABZN20_00280) | - | 56410..56628 (-) | 219 | WP_218806165.1 | DUF2892 domain-containing protein | - |
| ABZN20_RS00285 (ABZN20_00285) | grxC | 56779..57042 (+) | 264 | WP_218806166.1 | glutaredoxin 3 | - |
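The coordinates, sizes, and protein length reported above are related by simple arithmetic, which the following minimal sketch checks for the comM entry. It assumes the coordinates are 1-based and inclusive (consistent with the 1521 bp size listed) and that the CDS ends in a stop codon. The helper names `span_length` and `protein_length` and the `flank` value are illustrative and not part of the record. The 5000 bp flank is only an observation that the Location window 46521..58041 equals the comM coordinates extended by that amount on each side.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Number of residues encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# comM coordinates as listed in the record
comm_start, comm_end = 51521, 53041

cds_bp = span_length(comm_start, comm_end)
aa = protein_length(cds_bp)

# Hypothetical flank size, chosen because it reproduces the Location window
flank = 5000
window = (comm_start - flank, comm_end + flank)

print(cds_bp, aa, window)
```

The same `span_length` check can be applied to any other row of the table, for example 46714..46827 for ABZN20_RS00215.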
Sequence
Protein
Length: 506 a.a. Molecular weight: 54133.17 Da Isoelectric Point: 7.7237
>NTDB_id=1020879 ABZN20_RS00250 WP_367026409.1 51521..53041(-) (comM) [Methylococcus sp. ANG]
MALAIVHSRARQGIEAPEVTVEVHISPGLPNLTIVGLPETAVRESKDRVRGALITTGFEFPAQRITVNLAPADLPKEGGR
FDLPIALGILAASKQLRADRLADLECVGELALTGELRPVPGALPVALQSRRAERNLIVPWDNAGEAALVSGACVLPARHL
LDVCAHLDGSRPLAAVESRTVAVPASEEPPDLAEVRGQFQAKRALEIAAAGRHNLLMLGPPGTGKSMLASRMPGILPDLT
EDEALETAAVASVSGMPFDPAAWRRPPYRAPHHTASAPALVGGGSAPKPGEISLAHNGVLFLDELPEFDRRVLEVLREPL
ESGGITISRAAQRLDFPARFQLIAAMNPCPCGYLGDASGRCHCSAEQVARYRARISGPLLDRIDMHVDVPRQDPATLLDG
APRNEETSAQVRTRVIAARERALQRSGQPNALLTPRLIERHCMPDSAGRALLEQAMARLNLSHRAYHRILKLARTIADLA
GSDAITSAHIGEAIGYRRLDRAPAAR
Nucleotide
Length: 1521 bp
>NTDB_id=1020879 ABZN20_RS00250 WP_367026409.1 51521..53041(-) (comM) [Methylococcus sp. ANG]
ATGGCGCTCGCCATCGTCCATAGCCGGGCCCGCCAGGGCATCGAGGCGCCGGAAGTCACCGTGGAAGTCCATATTTCCCC
GGGGCTCCCCAACCTCACCATCGTCGGCCTGCCGGAAACCGCCGTGCGCGAGAGCAAGGACCGGGTGCGCGGCGCGCTCA
TCACCACCGGTTTCGAATTTCCCGCCCAGCGCATCACCGTCAACCTGGCTCCGGCCGATCTGCCCAAGGAAGGCGGCCGT
TTCGACCTGCCGATCGCCCTCGGCATCCTGGCCGCTTCGAAGCAGCTCCGCGCCGATCGGCTGGCGGACCTGGAATGCGT
CGGCGAGCTGGCGCTCACCGGCGAACTGCGCCCGGTGCCCGGCGCCCTGCCGGTCGCCCTCCAGTCGCGCCGCGCCGAGC
GGAACCTGATCGTACCCTGGGACAACGCCGGCGAAGCGGCCCTGGTCTCCGGCGCCTGCGTGCTCCCGGCCCGCCATCTG
CTGGACGTGTGCGCCCATCTGGACGGCAGCCGCCCCCTCGCCGCCGTCGAATCGCGCACCGTAGCGGTTCCGGCCAGCGA
GGAGCCGCCCGATCTCGCCGAAGTCCGCGGCCAGTTCCAGGCCAAGCGCGCCCTGGAAATCGCCGCCGCCGGCCGCCACA
ACCTGCTGATGCTGGGCCCGCCCGGCACCGGCAAATCCATGCTGGCCTCGCGCATGCCCGGCATCCTGCCCGACCTGACC
GAGGACGAGGCGCTGGAAACCGCGGCGGTCGCCTCGGTCAGCGGCATGCCCTTCGACCCGGCGGCCTGGCGCCGCCCGCC
GTACCGCGCGCCCCATCACACCGCCTCCGCCCCGGCGCTGGTCGGCGGAGGAAGCGCGCCCAAGCCCGGCGAGATTTCGC
TGGCCCACAACGGCGTGCTGTTCCTGGACGAGCTGCCGGAATTCGACCGGCGCGTCCTGGAAGTCCTGCGCGAGCCCTTG
GAAAGCGGCGGCATCACCATTTCCCGCGCCGCCCAGCGGCTGGATTTTCCTGCCCGCTTCCAGCTCATCGCGGCGATGAA
CCCCTGCCCCTGCGGCTACCTGGGCGACGCCTCCGGGCGCTGCCATTGCTCGGCCGAGCAGGTTGCGCGCTACCGCGCGC
GCATCTCCGGACCACTCCTGGACCGGATCGACATGCACGTCGACGTACCCCGCCAAGACCCGGCCACGCTCCTGGACGGC
GCACCGCGGAACGAAGAAACCAGCGCCCAGGTCCGGACCCGCGTGATCGCCGCCCGCGAGCGGGCGCTCCAACGCAGCGG
CCAGCCCAACGCCCTGCTCACGCCGCGCCTGATCGAGCGCCACTGCATGCCCGACAGCGCGGGACGCGCTTTGTTGGAAC
AGGCCATGGCGCGGCTGAACCTGTCGCACCGCGCCTATCACCGCATCCTCAAGCTCGCCCGCACCATCGCCGATCTCGCC
GGGAGCGATGCGATCACCTCCGCCCACATCGGCGAAGCCATCGGCTACCGGCGTCTCGACCGGGCTCCCGCCGCCCGATG
A
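The nucleotide record is given in coding orientation (it starts with ATG and ends with TGA), so it can be compared with the protein record by direct translation. The sketch below is one way to do that, assuming the standard codon assignments; the record itself does not state which translation table was used. The names `CODON_TABLE` and `translate` are illustrative. Only the first eight codons and the last four codons of the sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed; not stated in the record)
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from FASTA-style text
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First eight codons and last four codons of the nucleotide record
first_codons = "ATGGCGCTCGCCATCGTCCATAGC"
last_codons = "GCCGCCCGATGA"

print(translate(first_codons))  # compare with the start of the protein record
print(translate(last_codons))   # compare with the end of the protein record
```

Passing the full 1521 bp sequence to `translate` in the same way should give a product that can be compared residue by residue with the 506 a.a. protein record.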
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio cholerae strain A1552 | 57.086 | 99.012 | 0.565 |
| comM | Vibrio campbellii strain DS40M4 | 56.436 | 99.802 | 0.563 |
| comM | Haemophilus influenzae Rd KW20 | 55.357 | 99.605 | 0.551 |
| comM | Glaesserella parasuis strain SC1401 | 54.274 | 99.407 | 0.54 |
| comM | Legionella pneumophila str. Paris | 49.398 | 98.419 | 0.486 |
| comM | Legionella pneumophila strain ERS1305867 | 49.398 | 98.419 | 0.486 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 46.457 | 100 | 0.466 |