Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | R0135_RS09815 | Genome accession | NZ_CP136864 |
| Coordinates | 2229395..2230885 (+) | Length | 496 a.a. |
| NCBI ID | WP_407346648.1 | Uniprot ID | - |
| Organism | Congregibacter variabilis strain IMCC43200 | | |
| Function | required for natural transformation (predicted from homology); unclear | | |
Genomic Context
Location: 2224395..2235885
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| R0135_RS09785 (R0135_09745) | - | 2224435..2224968 (-) | 534 | WP_407346643.1 | hypothetical protein | - |
| R0135_RS09790 (R0135_09750) | - | 2225235..2225783 (-) | 549 | WP_407346644.1 | PEP-CTERM sorting domain-containing protein | - |
| R0135_RS09795 (R0135_09755) | - | 2226208..2226717 (+) | 510 | WP_407346645.1 | hypothetical protein | - |
| R0135_RS09800 (R0135_09760) | - | 2227058..2228299 (-) | 1242 | WP_407346646.1 | ammonium transporter | - |
| R0135_RS09805 (R0135_09765) | glnK | 2228355..2228693 (-) | 339 | WP_040809830.1 | P-II family nitrogen regulator | - |
| R0135_RS09810 (R0135_09770) | - | 2229007..2229273 (+) | 267 | WP_407346647.1 | accessory factor UbiK family protein | - |
| R0135_RS09815 (R0135_09775) | comM | 2229395..2230885 (+) | 1491 | WP_407346648.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| R0135_RS09820 (R0135_09780) | - | 2231001..2231435 (-) | 435 | WP_407346649.1 | c-type cytochrome | - |
| R0135_RS09825 (R0135_09785) | - | 2231589..2232209 (+) | 621 | WP_407346651.1 | glutathione S-transferase family protein | - |
| R0135_RS09830 (R0135_09790) | - | 2232176..2233300 (+) | 1125 | WP_407346652.1 | DUF3080 family protein | - |
| R0135_RS09835 (R0135_09795) | - | 2233344..2234651 (+) | 1308 | WP_407346653.1 | DUF3422 family protein | - |
| R0135_RS09840 (R0135_09800) | - | 2234653..2235558 (-) | 906 | WP_407346654.1 | phytanoyl-CoA dioxygenase family protein | - |
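The Size (bp) column and the protein length reported for comM can be cross-checked from the coordinates. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates and a coding sequence that includes its stop codon (the nucleotide record ends in TGA); the helper names `span_length` and `protein_length` are hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino-acid count for a CDS that includes its stop codon (assumed)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


# comM (R0135_RS09815): 2229395..2230885 (+)
comm_bp = span_length(2229395, 2230885)
comm_aa = protein_length(comm_bp)
print(comm_bp, comm_aa)
```

For comM this gives 1491 bp and 496 a.a., matching the Overview and Sequence sections.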
Sequence
Protein
Length: 496 a.a.; Molecular weight: 53025.63 Da; Isoelectric point: 8.0104
>NTDB_id=893158 R0135_RS09815 WP_407346648.1 2229395..2230885(+) (comM) [Congregibacter variabilis strain IMCC43200]
MNYAVVLSRANQGLNAPLVRVEVHLSNGLPAFTVVGMPETAVRESKDRVRSALINSHFEFPDRRITVNLAPADLPKGGGR
FDLPIALGILCASGQLPQDALLGSECIGELALDGTLRPVRGTVAAAMAAGQSKRRILLPTDSVQSCRAIPDSTLVHSSDL
LSLCAVLRGRAQPPETSTAPLESMQAGPDLCEVSGQLVPRRALEIAAAGGHNLLLTGPPGTGKTLLANCLPGILPPPDHR
EWLTVCALHDLQGETLRGQQRAFRAPHHSASAAALVGGGSIPRPGEISMAHGGVLFLDELPEFSRHTLDMLREPLETGEI
CLARASCSIRYPARFQLIAAMNPCPCGFSGDPHKPCKCSTAQRLSYSARVSGPLLDRLDLHVRVEREDAGDLFSCGQGEN
SATVRSRVIRSKTQQHRRQGQSNSRLTGSTLIDSCKLGTEEKALVERSAAGLKLSARAVHRMLRVARSIADLAEEDSVSA
PHLQEALAYRDTVQQH
Nucleotide
Length: 1491 bp
>NTDB_id=893158 R0135_RS09815 WP_407346648.1 2229395..2230885(+) (comM) [Congregibacter variabilis strain IMCC43200]
ATGAACTACGCCGTGGTTCTATCTCGGGCCAATCAAGGCCTGAACGCGCCATTGGTGCGTGTTGAAGTACACCTCTCTAA
CGGACTGCCGGCTTTTACGGTCGTGGGGATGCCAGAGACTGCCGTGCGGGAGAGCAAAGATCGCGTGCGCAGCGCGCTGA
TCAACTCCCACTTCGAGTTTCCTGACCGGCGTATTACGGTGAACCTCGCACCTGCGGACCTTCCAAAGGGGGGAGGTCGC
TTCGATCTTCCCATCGCGCTGGGCATCCTCTGTGCGTCGGGGCAGCTACCGCAAGACGCACTGCTGGGAAGCGAGTGCAT
CGGGGAGCTGGCTCTGGATGGGACTTTACGGCCGGTGCGCGGCACTGTCGCCGCGGCAATGGCTGCCGGCCAGAGTAAAC
GCCGAATACTGCTTCCCACTGACTCCGTGCAGTCCTGTCGGGCCATACCCGACAGCACACTCGTCCATTCCTCAGACCTG
CTGTCGCTCTGTGCGGTGCTGCGCGGACGAGCACAGCCGCCCGAAACGAGTACAGCTCCACTGGAGTCCATGCAAGCTGG
GCCTGATCTTTGCGAAGTATCGGGTCAACTCGTACCGCGCCGTGCGTTGGAGATTGCCGCCGCCGGCGGACACAACCTCT
TGCTGACAGGCCCCCCAGGCACGGGTAAAACCCTTCTTGCCAATTGCTTACCGGGCATCCTGCCACCGCCCGATCACCGT
GAGTGGCTTACAGTATGCGCGCTGCATGACCTGCAAGGAGAAACGCTTCGGGGACAGCAACGCGCCTTTAGAGCACCGCA
TCACAGTGCCAGCGCTGCAGCCCTGGTCGGGGGTGGCTCCATCCCCAGGCCAGGAGAGATCTCGATGGCACACGGGGGCG
TCCTGTTTCTCGACGAACTCCCAGAGTTCTCCCGGCACACACTGGATATGCTCAGAGAGCCACTGGAGACCGGCGAGATC
TGCTTGGCCCGTGCAAGTTGCAGTATTCGCTACCCCGCCCGCTTTCAGTTGATTGCGGCGATGAACCCCTGCCCCTGCGG
ATTTTCCGGGGACCCTCACAAGCCCTGTAAATGCAGTACTGCACAACGCCTGAGCTACAGCGCGCGGGTGTCCGGCCCGT
TACTGGACCGACTAGACCTACATGTTCGCGTCGAGCGGGAAGATGCAGGCGACTTGTTTTCCTGTGGCCAGGGCGAAAAC
TCAGCGACTGTGCGCAGCCGCGTAATACGCTCAAAGACACAGCAGCATCGGCGGCAAGGCCAAAGCAACTCTCGGCTCAC
CGGTTCAACGCTTATAGACAGTTGCAAGCTTGGCACTGAAGAAAAAGCACTTGTTGAACGCAGTGCGGCAGGATTGAAAC
TGTCTGCCAGAGCAGTGCACAGGATGCTGCGCGTAGCACGCAGCATTGCCGATCTGGCAGAAGAAGATTCCGTGAGCGCC
CCCCACTTACAGGAAGCCTTAGCTTATCGCGACACGGTGCAACAACATTGA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 | 48.509 | 100 | 0.492 |
| comM | Vibrio cholerae strain A1552 | 48.283 | 99.798 | 0.482 |
| comM | Glaesserella parasuis strain SC1401 | 47.695 | 100 | 0.48 |
| comM | Vibrio campbellii strain DS40M4 | 47.879 | 99.798 | 0.478 |
| comM | Legionella pneumophila str. Paris | 43.898 | 100 | 0.45 |
| comM | Legionella pneumophila strain ERS1305867 | 43.898 | 100 | 0.45 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 39.2 | 100 | 0.395 |