Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | INQ40_RS12230 | Genome accession | NZ_CP063772 |
| Coordinates | 2686609..2688123 (+) | Length | 504 a.a. |
| NCBI ID | WP_194340640.1 | Uniprot ID | - |
| Organism | Lysobacter sp. H21R4 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 2681609..2693123
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| INQ40_RS12210 (INQ40_12210) | speE | 2683018..2683869 (+) | 852 | WP_194340639.1 | polyamine aminopropyltransferase | - |
| INQ40_RS12215 (INQ40_12215) | - | 2683876..2685801 (-) | 1926 | WP_228482027.1 | DUF4153 domain-containing protein | - |
| INQ40_RS12220 (INQ40_12220) | - | 2685785..2686123 (-) | 339 | WP_043958974.1 | P-II family nitrogen regulator | - |
| INQ40_RS12225 (INQ40_12225) | - | 2686300..2686599 (+) | 300 | WP_193984931.1 | accessory factor UbiK family protein | - |
| INQ40_RS12230 (INQ40_12230) | comM | 2686609..2688123 (+) | 1515 | WP_194340640.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| INQ40_RS12235 (INQ40_12235) | - | 2688178..2688390 (-) | 213 | WP_194340641.1 | DUF2945 domain-containing protein | - |
| INQ40_RS12240 (INQ40_12240) | aceA | 2688518..2689813 (-) | 1296 | WP_194340642.1 | isocitrate lyase | - |
| INQ40_RS12245 (INQ40_12245) | aceB | 2689867..2691549 (-) | 1683 | WP_194340643.1 | malate synthase A | - |
| INQ40_RS12250 (INQ40_12250) | - | 2691696..2692667 (+) | 972 | WP_194340644.1 | LysR family transcriptional regulator | - |
Sequence
Protein
Download Length: 504 a.a. Molecular weight: 53127.99 Da Isoelectric Point: 8.5239
>NTDB_id=496708 INQ40_RS12230 WP_194340640.1 2686609..2688123(+) (comM) [Lysobacter sp. H21R4]
MNLALVHSRARSGIRSAPVRVEVHLGGGLPSMSIVGLPETAVRESRERVRAAIQCAQFEFPARRITVNLAPADLPKGGGR
FDLPIALGILAASGQIPLEALGEYEFLGELGLTGELRAVDAVLPAALAAAQAGRKLVVPPANGAEAALVSGVETRTARTL
LEVCAMLSGHKSLPRADAAAPSRRVGPDLADVRGQAHARRALEVAAAGGHHLLLVGPPGCGKTLLASRLPGLLPVASDAE
ALDSAAVASLGGRGVDASLWRERPFRSPHHTASAVALVGGGAEPRPGEISMAHNGVLFLDELPEWSRRTLEVLREPLESG
TVTIARAARSVEFPARFQLVAAMNPCPCGWAGDPSGRCRCSPDMVLNYRARISGPLMDRIDLHVEVPRLPPSELRPDAAP
AENSDTVRERVVAARELQLARAGKANAHLSQSETGATCRLAEGDFALLERAIDTLHLSARSMHRILRVARTIADLAGSPQ
IQTTHLSEAIGYRRVDRGMPLATA
MNLALVHSRARSGIRSAPVRVEVHLGGGLPSMSIVGLPETAVRESRERVRAAIQCAQFEFPARRITVNLAPADLPKGGGR
FDLPIALGILAASGQIPLEALGEYEFLGELGLTGELRAVDAVLPAALAAAQAGRKLVVPPANGAEAALVSGVETRTARTL
LEVCAMLSGHKSLPRADAAAPSRRVGPDLADVRGQAHARRALEVAAAGGHHLLLVGPPGCGKTLLASRLPGLLPVASDAE
ALDSAAVASLGGRGVDASLWRERPFRSPHHTASAVALVGGGAEPRPGEISMAHNGVLFLDELPEWSRRTLEVLREPLESG
TVTIARAARSVEFPARFQLVAAMNPCPCGWAGDPSGRCRCSPDMVLNYRARISGPLMDRIDLHVEVPRLPPSELRPDAAP
AENSDTVRERVVAARELQLARAGKANAHLSQSETGATCRLAEGDFALLERAIDTLHLSARSMHRILRVARTIADLAGSPQ
IQTTHLSEAIGYRRVDRGMPLATA
Nucleotide
Download Length: 1515 bp
>NTDB_id=496708 INQ40_RS12230 WP_194340640.1 2686609..2688123(+) (comM) [Lysobacter sp. H21R4]
ATGAACCTGGCACTCGTGCACAGCCGTGCGCGATCGGGCATCCGTTCCGCTCCGGTCCGCGTGGAGGTGCATCTGGGCGG
CGGGCTTCCCTCGATGTCGATCGTCGGCTTGCCGGAAACCGCGGTGCGCGAATCGCGTGAGCGCGTACGTGCCGCGATCC
AGTGCGCCCAGTTCGAATTCCCGGCCCGGCGGATCACCGTCAACCTCGCCCCCGCCGACCTGCCCAAGGGCGGCGGCCGC
TTCGACCTGCCGATCGCGCTGGGGATCCTCGCCGCCAGTGGGCAGATCCCGCTGGAGGCTCTGGGTGAATACGAGTTCCT
CGGCGAGCTGGGCCTGACCGGCGAGTTGCGCGCGGTGGACGCGGTGCTGCCGGCCGCGCTGGCCGCGGCCCAGGCCGGCC
GCAAACTGGTCGTGCCGCCCGCCAACGGCGCCGAAGCCGCACTGGTCAGCGGGGTGGAGACGCGGACCGCGCGCACGCTG
CTGGAAGTGTGCGCGATGTTGTCAGGGCATAAATCGCTGCCACGCGCCGACGCTGCAGCCCCGAGTCGGCGCGTCGGACC
GGACCTGGCCGATGTGCGCGGCCAGGCGCATGCGCGCCGCGCACTGGAGGTCGCCGCGGCGGGTGGTCACCACCTTCTCC
TCGTCGGGCCGCCAGGCTGCGGCAAGACGCTGCTCGCGTCCCGCTTGCCCGGTCTGCTGCCGGTGGCCAGCGACGCCGAA
GCGCTGGATTCGGCGGCCGTCGCTTCGCTGGGCGGACGTGGCGTGGACGCGTCGCTGTGGCGAGAACGCCCGTTCCGCTC
CCCGCACCACACGGCCAGTGCGGTCGCGCTGGTCGGCGGTGGCGCCGAGCCGCGACCGGGCGAAATTTCCATGGCGCACA
ACGGCGTACTGTTCCTGGACGAGCTGCCCGAGTGGAGCCGGCGCACGCTGGAGGTGCTGCGCGAGCCACTGGAGTCGGGC
ACTGTCACCATCGCGCGCGCCGCCCGCAGCGTCGAGTTCCCGGCGCGCTTCCAGCTGGTCGCAGCCATGAACCCCTGTCC
GTGCGGGTGGGCCGGCGACCCCAGCGGGCGCTGCCGCTGCAGCCCGGACATGGTCCTGAACTACCGCGCGCGAATCTCCG
GGCCGCTGATGGACCGGATCGACCTGCACGTCGAGGTGCCGCGGCTGCCGCCATCGGAACTGCGACCGGATGCCGCGCCT
GCCGAGAACAGCGATACCGTCCGCGAGCGCGTGGTCGCCGCCCGCGAGCTGCAGCTGGCTCGTGCGGGCAAGGCCAACGC
CCACCTCAGCCAGTCAGAGACCGGTGCGACTTGTCGCTTGGCCGAGGGAGACTTTGCCCTGCTGGAACGTGCGATCGACA
CCCTCCACCTGTCGGCGCGCTCGATGCACCGGATCCTGCGGGTGGCGCGGACAATCGCCGACCTGGCCGGCAGTCCACAG
ATCCAGACCACGCACCTGAGCGAGGCGATCGGCTATCGGCGAGTGGATCGCGGGATGCCTCTGGCAACCGCCTGA
ATGAACCTGGCACTCGTGCACAGCCGTGCGCGATCGGGCATCCGTTCCGCTCCGGTCCGCGTGGAGGTGCATCTGGGCGG
CGGGCTTCCCTCGATGTCGATCGTCGGCTTGCCGGAAACCGCGGTGCGCGAATCGCGTGAGCGCGTACGTGCCGCGATCC
AGTGCGCCCAGTTCGAATTCCCGGCCCGGCGGATCACCGTCAACCTCGCCCCCGCCGACCTGCCCAAGGGCGGCGGCCGC
TTCGACCTGCCGATCGCGCTGGGGATCCTCGCCGCCAGTGGGCAGATCCCGCTGGAGGCTCTGGGTGAATACGAGTTCCT
CGGCGAGCTGGGCCTGACCGGCGAGTTGCGCGCGGTGGACGCGGTGCTGCCGGCCGCGCTGGCCGCGGCCCAGGCCGGCC
GCAAACTGGTCGTGCCGCCCGCCAACGGCGCCGAAGCCGCACTGGTCAGCGGGGTGGAGACGCGGACCGCGCGCACGCTG
CTGGAAGTGTGCGCGATGTTGTCAGGGCATAAATCGCTGCCACGCGCCGACGCTGCAGCCCCGAGTCGGCGCGTCGGACC
GGACCTGGCCGATGTGCGCGGCCAGGCGCATGCGCGCCGCGCACTGGAGGTCGCCGCGGCGGGTGGTCACCACCTTCTCC
TCGTCGGGCCGCCAGGCTGCGGCAAGACGCTGCTCGCGTCCCGCTTGCCCGGTCTGCTGCCGGTGGCCAGCGACGCCGAA
GCGCTGGATTCGGCGGCCGTCGCTTCGCTGGGCGGACGTGGCGTGGACGCGTCGCTGTGGCGAGAACGCCCGTTCCGCTC
CCCGCACCACACGGCCAGTGCGGTCGCGCTGGTCGGCGGTGGCGCCGAGCCGCGACCGGGCGAAATTTCCATGGCGCACA
ACGGCGTACTGTTCCTGGACGAGCTGCCCGAGTGGAGCCGGCGCACGCTGGAGGTGCTGCGCGAGCCACTGGAGTCGGGC
ACTGTCACCATCGCGCGCGCCGCCCGCAGCGTCGAGTTCCCGGCGCGCTTCCAGCTGGTCGCAGCCATGAACCCCTGTCC
GTGCGGGTGGGCCGGCGACCCCAGCGGGCGCTGCCGCTGCAGCCCGGACATGGTCCTGAACTACCGCGCGCGAATCTCCG
GGCCGCTGATGGACCGGATCGACCTGCACGTCGAGGTGCCGCGGCTGCCGCCATCGGAACTGCGACCGGATGCCGCGCCT
GCCGAGAACAGCGATACCGTCCGCGAGCGCGTGGTCGCCGCCCGCGAGCTGCAGCTGGCTCGTGCGGGCAAGGCCAACGC
CCACCTCAGCCAGTCAGAGACCGGTGCGACTTGTCGCTTGGCCGAGGGAGACTTTGCCCTGCTGGAACGTGCGATCGACA
CCCTCCACCTGTCGGCGCGCTCGATGCACCGGATCCTGCGGGTGGCGCGGACAATCGCCGACCTGGCCGGCAGTCCACAG
ATCCAGACCACGCACCTGAGCGAGGCGATCGGCTATCGGCGAGTGGATCGCGGGATGCCTCTGGCAACCGCCTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio campbellii strain DS40M4 |
54.724 |
100 |
0.552 |
| comM | Vibrio cholerae strain A1552 |
55.11 |
99.008 |
0.546 |
| comM | Glaesserella parasuis strain SC1401 |
53.175 |
100 |
0.532 |
| comM | Haemophilus influenzae Rd KW20 |
52.778 |
100 |
0.528 |
| comM | Legionella pneumophila str. Paris |
50.701 |
99.008 |
0.502 |
| comM | Legionella pneumophila strain ERS1305867 |
50.701 |
99.008 |
0.502 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
44.204 |
100 |
0.446 |