Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | AACL23_RS03145 | Genome accession | NZ_OZ026471 |
| Coordinates | 661235..662755 (-) | Length | 506 a.a. |
| NCBI ID | WP_100102925.1 | Uniprot ID | - |
| Organism | Candidatus Hamiltonella endosymbiont of Tuberolachnus salignus isolate eee46e84-88ba-4997-9627-ba9a3c08c7b3 | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 656235..667755
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| AACL23_RS03130 | - | 656737..656811 (+) | 75 | Protein_606 | transposase | - |
| AACL23_RS03135 | - | 656964..659882 (-) | 2919 | WP_174888361.1 | hypothetical protein | - |
| AACL23_RS03140 | - | 660015..660818 (-) | 804 | WP_339050072.1 | metallophosphoesterase | - |
| AACL23_RS03145 | comM | 661235..662755 (-) | 1521 | WP_100102925.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| AACL23_RS03150 | kdsB | 663102..663848 (-) | 747 | WP_100096033.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| AACL23_RS03155 | - | 663848..664027 (-) | 180 | WP_012737967.1 | Trm112 family protein | - |
| AACL23_RS03160 | - | 664301..664510 (-) | 210 | WP_012737966.1 | cold shock domain-containing protein | - |
| AACL23_RS03165 | lpxK | 664992..665978 (-) | 987 | WP_012737964.1 | tetraacyldisaccharide 4'-kinase | - |
| AACL23_RS03170 | - | 666371..667648 (+) | 1278 | WP_012737963.1 | hemolysin family protein | - |
Sequence
Protein
Download Length: 506 a.a. Molecular weight: 55335.18 Da Isoelectric Point: 9.6166
>NTDB_id=1163195 AACL23_RS03145 WP_100102925.1 661235..662755(-) (comM) [Candidatus Hamiltonella endosymbiont of Tuberolachnus salignus isolate eee46e84-88ba-4997-9627-ba9a3c08c7b3]
MALAVINTRASLGVQSPAVAVEAHISNGLPCFTLVGLPETAVKEARDRVRSAILNSGFMFPAKRITVSLGPADLPKEGGR
YDLPIALAILIASEQVSGHKSVHYEFLGELALSGALRPINSTIPAALACSKANRKLILPSANSLEMTLIPEGEVMIAKHL
LEVCGFLSGEENLLSVVNNETDLTDTYADLQDVIGQQQSKRALEVAAAGGHNLLLIGPPGTGKTMLASRLIGILPALTNE
EALETAAVNSLLNIEKYPTQWRKRPFRSPHHSASIAALVGGGSLPRPGEISLAHNGVLFLDELPEFERRVLDSLREPLES
GEIIISRAMAKIRFPARVQLIAAMNPSPSGHYKGIHHRTTPQQILRYLSKLSGPFLDRFDLSIEVPLLPPGLLQMQENQG
ESSSTIRTRVLKARERQLNRTKKINAHLNNKEVAKFCHISTKDAQFLEQVLLKLGLSVRAWHRILKVARTLADLAEQENI
KKDHLAEALSYRCIDRLFLKLNKTLN
MALAVINTRASLGVQSPAVAVEAHISNGLPCFTLVGLPETAVKEARDRVRSAILNSGFMFPAKRITVSLGPADLPKEGGR
YDLPIALAILIASEQVSGHKSVHYEFLGELALSGALRPINSTIPAALACSKANRKLILPSANSLEMTLIPEGEVMIAKHL
LEVCGFLSGEENLLSVVNNETDLTDTYADLQDVIGQQQSKRALEVAAAGGHNLLLIGPPGTGKTMLASRLIGILPALTNE
EALETAAVNSLLNIEKYPTQWRKRPFRSPHHSASIAALVGGGSLPRPGEISLAHNGVLFLDELPEFERRVLDSLREPLES
GEIIISRAMAKIRFPARVQLIAAMNPSPSGHYKGIHHRTTPQQILRYLSKLSGPFLDRFDLSIEVPLLPPGLLQMQENQG
ESSSTIRTRVLKARERQLNRTKKINAHLNNKEVAKFCHISTKDAQFLEQVLLKLGLSVRAWHRILKVARTLADLAEQENI
KKDHLAEALSYRCIDRLFLKLNKTLN
Nucleotide
Download Length: 1521 bp
>NTDB_id=1163195 AACL23_RS03145 WP_100102925.1 661235..662755(-) (comM) [Candidatus Hamiltonella endosymbiont of Tuberolachnus salignus isolate eee46e84-88ba-4997-9627-ba9a3c08c7b3]
ATGGCACTGGCAGTAATCAACACTCGAGCGAGTCTGGGGGTACAGTCTCCTGCAGTTGCGGTTGAAGCGCATATCAGTAA
TGGATTGCCTTGTTTTACCCTGGTGGGCTTACCTGAAACCGCGGTAAAAGAAGCGAGAGATCGCGTACGGAGTGCGATTT
TGAACAGTGGTTTTATGTTTCCAGCCAAGCGTATTACGGTGAGTCTAGGACCTGCAGATCTACCAAAAGAAGGTGGGCGT
TACGATTTACCTATAGCTTTAGCCATATTAATAGCCTCAGAACAAGTGAGTGGTCACAAATCAGTTCATTATGAATTTTT
AGGTGAATTAGCTTTATCTGGTGCATTACGACCTATTAATAGCACAATTCCAGCGGCTCTTGCATGCTCTAAAGCAAATC
GGAAGCTGATATTACCTTCAGCTAATTCTCTAGAAATGACGTTAATCCCTGAAGGGGAGGTTATGATTGCCAAACACTTG
CTAGAGGTGTGTGGGTTTTTAAGTGGAGAAGAAAATTTACTTTCTGTTGTAAATAATGAAACTGATCTAACAGATACGTA
TGCTGATCTTCAAGATGTGATTGGTCAACAGCAGTCAAAACGAGCTCTCGAAGTGGCTGCAGCTGGGGGGCATAATTTAT
TACTAATTGGCCCACCTGGTACTGGCAAAACGATGCTGGCTAGCAGATTGATTGGCATACTACCTGCATTAACAAACGAG
GAAGCATTGGAAACCGCTGCTGTAAACAGTTTATTAAATATTGAAAAATATCCGACTCAATGGCGTAAGCGTCCTTTTCG
ATCCCCTCACCATAGTGCCTCGATTGCTGCTTTAGTGGGCGGGGGATCTTTACCTCGTCCAGGCGAAATATCTTTAGCTC
ATAATGGCGTATTATTTTTAGATGAATTACCAGAATTTGAACGCAGGGTACTAGATTCATTAAGAGAACCACTAGAATCT
GGGGAGATTATCATTTCTCGCGCCATGGCCAAGATTCGTTTTCCTGCAAGGGTACAGTTGATTGCGGCCATGAACCCAAG
CCCAAGTGGGCATTATAAAGGTATTCATCATCGAACAACCCCACAACAAATTTTACGATATCTTTCAAAACTTTCTGGTC
CTTTCCTTGATAGGTTTGATCTATCTATTGAAGTTCCTTTATTACCTCCTGGTTTATTACAAATGCAAGAAAATCAGGGT
GAAAGCAGCAGTACTATTAGAACGCGTGTTCTGAAAGCTCGTGAACGCCAATTAAATAGAACAAAAAAAATCAACGCTCA
TCTCAATAATAAAGAAGTGGCTAAATTTTGTCATATAAGCACTAAGGATGCTCAATTTTTAGAACAAGTGTTGTTAAAAT
TAGGTCTTTCCGTACGTGCTTGGCATCGCATTTTGAAAGTAGCTCGAACACTTGCGGATTTAGCTGAACAAGAAAATATT
AAGAAGGATCACCTTGCTGAAGCGCTAAGTTACCGTTGCATAGATAGATTATTTTTAAAATTAAATAAAACTTTGAATTA
A
ATGGCACTGGCAGTAATCAACACTCGAGCGAGTCTGGGGGTACAGTCTCCTGCAGTTGCGGTTGAAGCGCATATCAGTAA
TGGATTGCCTTGTTTTACCCTGGTGGGCTTACCTGAAACCGCGGTAAAAGAAGCGAGAGATCGCGTACGGAGTGCGATTT
TGAACAGTGGTTTTATGTTTCCAGCCAAGCGTATTACGGTGAGTCTAGGACCTGCAGATCTACCAAAAGAAGGTGGGCGT
TACGATTTACCTATAGCTTTAGCCATATTAATAGCCTCAGAACAAGTGAGTGGTCACAAATCAGTTCATTATGAATTTTT
AGGTGAATTAGCTTTATCTGGTGCATTACGACCTATTAATAGCACAATTCCAGCGGCTCTTGCATGCTCTAAAGCAAATC
GGAAGCTGATATTACCTTCAGCTAATTCTCTAGAAATGACGTTAATCCCTGAAGGGGAGGTTATGATTGCCAAACACTTG
CTAGAGGTGTGTGGGTTTTTAAGTGGAGAAGAAAATTTACTTTCTGTTGTAAATAATGAAACTGATCTAACAGATACGTA
TGCTGATCTTCAAGATGTGATTGGTCAACAGCAGTCAAAACGAGCTCTCGAAGTGGCTGCAGCTGGGGGGCATAATTTAT
TACTAATTGGCCCACCTGGTACTGGCAAAACGATGCTGGCTAGCAGATTGATTGGCATACTACCTGCATTAACAAACGAG
GAAGCATTGGAAACCGCTGCTGTAAACAGTTTATTAAATATTGAAAAATATCCGACTCAATGGCGTAAGCGTCCTTTTCG
ATCCCCTCACCATAGTGCCTCGATTGCTGCTTTAGTGGGCGGGGGATCTTTACCTCGTCCAGGCGAAATATCTTTAGCTC
ATAATGGCGTATTATTTTTAGATGAATTACCAGAATTTGAACGCAGGGTACTAGATTCATTAAGAGAACCACTAGAATCT
GGGGAGATTATCATTTCTCGCGCCATGGCCAAGATTCGTTTTCCTGCAAGGGTACAGTTGATTGCGGCCATGAACCCAAG
CCCAAGTGGGCATTATAAAGGTATTCATCATCGAACAACCCCACAACAAATTTTACGATATCTTTCAAAACTTTCTGGTC
CTTTCCTTGATAGGTTTGATCTATCTATTGAAGTTCCTTTATTACCTCCTGGTTTATTACAAATGCAAGAAAATCAGGGT
GAAAGCAGCAGTACTATTAGAACGCGTGTTCTGAAAGCTCGTGAACGCCAATTAAATAGAACAAAAAAAATCAACGCTCA
TCTCAATAATAAAGAAGTGGCTAAATTTTGTCATATAAGCACTAAGGATGCTCAATTTTTAGAACAAGTGTTGTTAAAAT
TAGGTCTTTCCGTACGTGCTTGGCATCGCATTTTGAAAGTAGCTCGAACACTTGCGGATTTAGCTGAACAAGAAAATATT
AAGAAGGATCACCTTGCTGAAGCGCTAAGTTACCGTTGCATAGATAGATTATTTTTAAAATTAAATAAAACTTTGAATTA
A
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
61.176 |
100 |
0.617 |
| comM | Vibrio campbellii strain DS40M4 |
60.835 |
99.407 |
0.605 |
| comM | Glaesserella parasuis strain SC1401 |
60.277 |
100 |
0.603 |
| comM | Vibrio cholerae strain A1552 |
60.636 |
99.407 |
0.603 |
| comM | Legionella pneumophila str. Paris |
50.202 |
98.024 |
0.492 |
| comM | Legionella pneumophila strain ERS1305867 |
50.202 |
98.024 |
0.492 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
45.726 |
99.407 |
0.455 |