Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | ACJA3S_RS24990 | Genome accession | NZ_CP174199 |
| Coordinates | 5424566..5426056 (+) | Length | 496 a.a. |
| NCBI ID | WP_406820348.1 | Uniprot ID | - |
| Organism | Pseudomonas sp. KnCO4 | ||
| Function | required for natural transformation (predicted from homology) Unclear |
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 5426553..5442966 | 5424566..5426056 | flank | 497 |
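The distance in the summary row can be reproduced from the two coordinate ranges. The sketch below is illustrative only: the function name `gene_mge_distance` and the `"overlap"` label are hypothetical, and it assumes the reported distance is the plain difference between the gene end and the MGE start (5426553 - 5426056). How VRprofile2 itself classifies relative positions is not stated in this record.

```python
def gene_mge_distance(gene, mge):
    """Return (relation, distance_bp) for two inclusive, 1-based intervals.

    Assumption: distance is the coordinate difference between the nearest
    ends, which matches the 497 bp value reported in the table above.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if g_end < m_start:      # gene lies entirely before the MGE
        return "flank", m_start - g_end
    if g_start > m_end:      # gene lies entirely after the MGE
        return "flank", g_start - m_end
    return "overlap", 0      # hypothetical label for intersecting intervals


relation, distance = gene_mge_distance((5424566, 5426056), (5426553, 5442966))
print(relation, distance)
```

With the comM and genomic island coordinates from this record, the call returns the same relative position and distance shown in the table.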
Gene organization within MGE regions
Location: 5424566..5442966
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ACJA3S_RS24990 (ACJA3S_24990) | comM | 5424566..5426056 (+) | 1491 | WP_406820348.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| ACJA3S_RS24995 (ACJA3S_24995) | - | 5426553..5427023 (-) | 471 | WP_406820349.1 | hypothetical protein | - |
| ACJA3S_RS25000 (ACJA3S_25000) | - | 5427690..5428958 (+) | 1269 | WP_406820350.1 | hypothetical protein | - |
| ACJA3S_RS25005 (ACJA3S_25005) | - | 5429073..5430263 (-) | 1191 | WP_406820351.1 | hypothetical protein | - |
| ACJA3S_RS25010 (ACJA3S_25010) | - | 5430260..5431237 (-) | 978 | WP_406820352.1 | S1 family serine peptidase | - |
| ACJA3S_RS25015 (ACJA3S_25015) | - | 5431234..5432904 (-) | 1671 | WP_406820353.1 | caspase family protein | - |
| ACJA3S_RS25020 (ACJA3S_25020) | - | 5433236..5433409 (-) | 174 | Protein_4904 | nucleoid-associated protein | - |
| ACJA3S_RS25025 (ACJA3S_25025) | - | 5434004..5434285 (+) | 282 | WP_050703256.1 | type II toxin-antitoxin system Phd/YefM family antitoxin | - |
| ACJA3S_RS25030 (ACJA3S_25030) | - | 5434254..5434559 (+) | 306 | WP_406820354.1 | Txe/YoeB family addiction module toxin | - |
| ACJA3S_RS25035 (ACJA3S_25035) | - | 5434956..5435153 (+) | 198 | WP_152993855.1 | hypothetical protein | - |
| ACJA3S_RS25040 (ACJA3S_25040) | - | 5435373..5436650 (-) | 1278 | WP_406820355.1 | hypothetical protein | - |
| ACJA3S_RS25045 (ACJA3S_25045) | - | 5437257..5438060 (+) | 804 | WP_406820356.1 | hypothetical protein | - |
| ACJA3S_RS25050 (ACJA3S_25050) | - | 5438742..5441156 (-) | 2415 | WP_406820357.1 | S8 family peptidase | - |
| ACJA3S_RS25055 (ACJA3S_25055) | - | 5441181..5442164 (-) | 984 | WP_050703253.1 | AAA family ATPase | - |
Sequence
Protein
Length: 496 a.a.; Molecular weight: 52750.52 Da; Isoelectric point: 7.9061
>NTDB_id=1074974 ACJA3S_RS24990 WP_406820348.1 5424566..5426056(+) (comM) [Pseudomonas sp. KnCO4]
MSLALVHSRAQVGVQAPAVSVETHLANGLPHLTLVGLPETTVKESKDRVRSAIVNSGLNYPPRRITQNLAPADLPKDGGR
YDLAIALGILAADGQVPIAPLTELECLGELALSGKLRPVQGVLPAALAARDAGRALVVPRENAEEASLAGGLVVYAVGHL
LELVAHLNGQVPLPPYAANGLILQQRPYPDLSEVQGQLAAKRALLLAAAGAHNLLFTGPPGTGKTLLASRLPGLLPPLDE
HEALEVAAIRSVSGHTPLSSWPQRPFRHPHHSASGPALVGGGSRPQPGEITLAHHGVLFLDELPEFERRVLEVLREPLES
GEIVIARARDKVRFPARFQLVAAMNPCPCGYLGDPTGRCRCSTEQIARYRNKLSGPLLDRIDLHLTVARESTTLNNQPCG
ETSADVAAKVAEARDVQQKRQGCANAFLDLEGLRRNCGLAAADQAWLESACERLTLSLRAAHRLLKVARTLADLDGSQAI
GRAHLAEALQYRPGSS
Nucleotide
Length: 1491 bp
>NTDB_id=1074974 ACJA3S_RS24990 WP_406820348.1 5424566..5426056(+) (comM) [Pseudomonas sp. KnCO4]
ATGTCCCTAGCCCTCGTCCATAGCCGCGCCCAGGTGGGCGTACAGGCCCCAGCGGTCAGCGTCGAAACCCACCTGGCCAA
TGGCTTGCCCCATCTCACCCTGGTCGGCCTGCCGGAAACCACGGTCAAGGAAAGCAAGGACCGGGTGCGCAGCGCCATCG
TCAATTCCGGGCTGAACTACCCGCCACGGCGCATCACCCAGAACCTCGCACCCGCCGACCTGCCCAAGGATGGCGGGCGT
TACGACCTGGCCATCGCCCTGGGCATCCTGGCTGCCGATGGCCAGGTACCAATCGCTCCGCTAACCGAACTTGAATGCCT
GGGTGAACTGGCTTTGTCTGGCAAGCTGCGCCCGGTCCAGGGCGTGCTGCCCGCAGCGCTGGCAGCACGCGACGCAGGCA
GGGCGCTGGTGGTGCCGCGGGAAAACGCCGAGGAAGCCAGCCTGGCTGGCGGGCTGGTGGTGTATGCGGTGGGGCATCTG
CTGGAACTGGTCGCCCACCTGAACGGCCAGGTACCACTGCCGCCCTATGCCGCCAACGGCCTGATACTGCAGCAACGCCC
TTACCCGGACCTCAGCGAGGTGCAAGGCCAACTGGCCGCCAAGCGTGCATTGCTGCTGGCCGCGGCCGGGGCGCATAACC
TGTTGTTCACCGGGCCACCCGGCACCGGCAAGACCTTGCTCGCCAGCCGCCTGCCGGGGCTGCTGCCGCCGCTGGACGAG
CACGAGGCGCTGGAAGTGGCTGCGATCCGCTCGGTGAGTGGCCATACACCGCTGAGCAGTTGGCCGCAGCGGCCCTTTCG
CCATCCGCACCACTCGGCCTCCGGCCCGGCGTTGGTCGGTGGCGGCAGCCGACCGCAGCCGGGCGAAATCACCCTTGCCC
ACCATGGTGTGCTGTTTCTGGATGAGTTGCCGGAATTCGAGCGGCGGGTACTGGAGGTGCTGCGCGAGCCCCTGGAATCC
GGCGAGATCGTGATTGCCCGGGCCCGCGACAAGGTGCGCTTCCCCGCCCGGTTCCAGTTGGTGGCGGCAATGAATCCGTG
CCCTTGCGGCTACCTGGGGGATCCCACTGGGCGCTGTCGCTGCAGCACCGAGCAGATCGCGCGGTACCGCAACAAGCTGT
CCGGGCCGTTGCTGGACCGTATCGACCTGCACCTGACCGTGGCCCGCGAGAGCACCACGCTGAATAACCAGCCTTGTGGT
GAAACCAGTGCCGACGTCGCCGCCAAGGTTGCCGAGGCACGGGATGTCCAGCAAAAACGGCAGGGATGCGCCAATGCGTT
TCTCGACCTTGAGGGGCTGCGCCGCAATTGCGGACTGGCAGCGGCAGACCAGGCCTGGCTGGAGAGTGCGTGTGAACGGC
TGACCCTGTCGTTGCGCGCGGCGCACCGCTTGCTGAAGGTGGCGCGAACCCTGGCCGATCTGGATGGTAGCCAGGCAATT
GGCCGGGCGCACCTGGCCGAGGCCCTGCAGTACCGGCCGGGGAGCAGTTAG
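The FASTA header carries the same identifiers and coordinates as the Overview table, so the stated lengths can be cross-checked against it. The following is a minimal sketch, assuming the header layout seen in this record; the regular expression and the `parse_header` helper are hypothetical and not part of any database tooling. It derives the nucleotide length from the inclusive coordinates and the protein length by dropping the terminal stop codon.

```python
import re

HEADER = (">NTDB_id=1074974 ACJA3S_RS24990 WP_406820348.1 "
          "5424566..5426056(+) (comM) [Pseudomonas sp. KnCO4]")

# Assumed layout: id, locus tag, protein ID, start..end(strand), (gene), [organism]
PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) \[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Parse a header of the assumed layout and derive the sequence lengths."""
    match = PATTERN.match(line)
    if match is None:
        raise ValueError("header does not follow the assumed layout")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    nt_length = record["end"] - record["start"] + 1   # inclusive coordinates
    if nt_length % 3 != 0:
        raise ValueError("coding region is not a whole number of codons")
    record["nt_length"] = nt_length
    record["aa_length"] = nt_length // 3 - 1           # stop codon is not translated
    return record


record = parse_header(HEADER)
print(record["gene"], record["nt_length"], record["aa_length"])
```

For this entry the derived values agree with the 1491 bp and 496 a.a. lengths listed in the record.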
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 | 55.6 | 100 | 0.56 |
| comM | Vibrio campbellii strain DS40M4 | 55.758 | 99.798 | 0.556 |
| comM | Vibrio cholerae strain A1552 | 55.354 | 99.798 | 0.552 |
| comM | Glaesserella parasuis strain SC1401 | 53.892 | 100 | 0.544 |
| comM | Legionella pneumophila str. Paris | 49.194 | 100 | 0.492 |
| comM | Legionella pneumophila strain ERS1305867 | 49.194 | 100 | 0.492 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 47.117 | 100 | 0.478 |