Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | DQL22_RS02480 | Genome accession | NZ_LS483485 |
| Coordinates | 470717..472252 (-) | Length | 511 a.a. |
| NCBI ID | WP_111301461.1 | Uniprot ID | - |
| Organism | Aggregatibacter aphrophilus strain NCTC11096 | | |
| Function | required for natural transformation (predicted from homology) Unclear | | |
Genomic Context
Location: 465717..477252
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DQL22_RS02455 (NCTC11096_00486) | - | 466107..467192 (-) | 1086 | WP_111301460.1 | tetratricopeptide repeat protein | - |
| DQL22_RS02460 (NCTC11096_00487) | - | 467210..468796 (-) | 1587 | WP_005701923.1 | DUF4384 domain-containing protein | - |
| DQL22_RS02465 (NCTC11096_00488) | - | 468852..469232 (-) | 381 | WP_005701924.1 | hypothetical protein | - |
| DQL22_RS02470 (NCTC11096_00489) | - | 469295..469672 (-) | 378 | WP_225791922.1 | hypothetical protein | - |
| DQL22_RS02475 (NCTC11096_00490) | - | 469810..470178 (-) | 369 | WP_012771871.1 | hypothetical protein | - |
| DQL22_RS02480 (NCTC11096_00491) | comM | 470717..472252 (-) | 1536 | WP_111301461.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| DQL22_RS02485 (NCTC11096_00492) | deoR | 472361..473110 (-) | 750 | WP_083014242.1 | DNA-binding transcriptional repressor DeoR | - |
| DQL22_RS02490 (NCTC11096_00493) | deoC | 473135..473806 (-) | 672 | WP_083014244.1 | deoxyribose-phosphate aldolase | - |
| DQL22_RS02495 (NCTC11096_00494) | - | 474040..475407 (+) | 1368 | WP_083014246.1 | patatin-like phospholipase family protein | - |
| DQL22_RS02500 (NCTC11096_00495) | - | 475599..476000 (+) | 402 | WP_083014248.1 | pyrimidine dimer DNA glycosylase/endonuclease V | - |
| DQL22_RS02505 (NCTC11096_00496) | - | 476307..476540 (-) | 234 | WP_111301462.1 | glycine zipper 2TM domain-containing protein | - |
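The coordinates in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the database itself: the function names are hypothetical, and the 5000 bp flank is an inference from the listed numbers (470717 - 5000 = 465717 and 472252 + 5000 = 477252), not a documented parameter.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS whose length includes the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the stop codon encodes no residue


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Gene range extended by `flank` bp on each side (flank size is assumed)."""
    return max(1, start - flank), end + flank


# comM (DQL22_RS02480) coordinates as listed in the record
gene_start, gene_end = 470717, 472252
cds_bp = span_length(gene_start, gene_end)
print(cds_bp, protein_length(cds_bp), context_window(gene_start, gene_end))
```

With the values above, the computed gene length, protein length, and window agree with the 1536 bp, 511 a.a., and 465717..477252 figures given in the record.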
Sequence
Protein
Length: 511 a.a.; Molecular weight: 56093.64 Da; Isoelectric point: 9.4972
>NTDB_id=1142588 DQL22_RS02480 WP_111301461.1 470717..472252(-) (comM) [Aggregatibacter aphrophilus strain NCTC11096]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPNFTLVGLPEKTVKEAQDRVRSALLNAEFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGMLAASGYIDAEKLKQFEFIGELALTGQLRAVHGVIPAILAAKKAKRKCIIAYGNANEASLISDQETYFAHSL
LEVVQFLNNQGELPLAKDIMAQSAVDFGGENQKDLTEIIGQQHAKRALIIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDQEAIETASVASLVQNELNFHNWKQRPFRAPHHSASTPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLIAAMNPSPTGHYQGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
NTERGEPSAIVREKVLKTRNIQLERAGKINAHLTGKEIERDCKLESKDALFLESALTKLGLSVRAYHRILKVSRTIADLE
GEKCITQKHLAEALGYRAMDRFLQRLSKESS
Nucleotide
Length: 1536 bp
>NTDB_id=1142588 DQL22_RS02480 WP_111301461.1 470717..472252(-) (comM) [Aggregatibacter aphrophilus strain NCTC11096]
ATGTCTTTAGCCATCGTTTACAGTCGTGCTTCAATGGGCGTGCAGGCGCCTTTAGTGACGATTGAAGTGCATTTAAGTAA
CGGCAAGCCTAACTTTACGTTGGTTGGATTGCCGGAAAAAACTGTTAAAGAAGCACAAGATCGAGTTCGTAGTGCATTGC
TGAACGCCGAATTCAAATATCCCGCCAAACGCATTACCGTCAATCTCGCACCTGCTGATTTACCCAAAGAAGGCGGACGT
TTTGACTTGCCTATCGCTATCGGTATGCTTGCGGCTTCAGGCTATATTGACGCGGAAAAATTAAAACAATTTGAATTTAT
CGGTGAATTGGCACTAACCGGTCAACTCCGCGCGGTACATGGCGTAATTCCTGCTATTTTGGCTGCCAAAAAAGCAAAAC
GAAAATGTATTATCGCTTATGGCAATGCTAATGAAGCCTCATTGATCTCCGATCAAGAAACGTATTTTGCCCATTCATTG
CTTGAAGTTGTGCAATTTCTCAATAATCAAGGGGAACTGCCTTTGGCAAAGGACATAATGGCTCAAAGTGCGGTGGATTT
TGGCGGCGAAAATCAAAAAGATCTGACGGAGATTATCGGTCAACAACACGCCAAACGGGCGCTGATTATTGCTGCAGCCG
GGCAACATAACTTGTTATTTCTGGGCCCTCCCGGCACCGGTAAAACCATGCTTGCCAGTCGTTTAACGGGGCTGTTACCG
GAAATGACCGACCAAGAGGCCATCGAAACCGCCTCCGTTGCCAGTCTCGTGCAAAATGAACTGAATTTTCATAATTGGAA
ACAACGCCCTTTCCGCGCCCCACATCACAGCGCGTCCACGCCGGCTTTAGTGGGAGGCGGCACAATTCCAAAACCCGGCG
AAATTTCTCTCGCGCATAATGGTGTGCTTTTTTTAGACGAATTGCCTGAATTTGAACGTAAGGTGCTGGATGCTTTACGC
CAACCGTTAGAAAGTGGAGAAATCATTATTTCGCGTGCCAATGCCAAAATTCAGTTTCCGGCAAGGTTTCAGCTTATCGC
GGCGATGAACCCAAGCCCGACAGGGCATTATCAAGGGACACACAATCGTACGTCGCCGCAACAAATCATGCGGTATTTAA
ATCGTCTCTCCGGCCCATTTCTGGATCGTTTTGATTTATCCATTGAAGTGCCTTTGTTGCCACAAGGCAGTTTGCAAAAT
AATACGGAGCGTGGCGAACCTAGCGCAATCGTCCGTGAAAAAGTGTTAAAAACCCGCAACATCCAATTAGAACGTGCAGG
CAAAATAAACGCCCATTTAACCGGTAAAGAAATTGAACGCGATTGTAAACTGGAAAGCAAAGACGCGTTATTTCTTGAAA
GTGCTCTGACCAAACTGGGGCTTTCCGTACGGGCTTACCACCGAATCTTAAAAGTGTCACGCACCATTGCTGATTTAGAA
GGTGAAAAATGCATTACCCAAAAGCATTTGGCTGAAGCATTAGGCTACCGAGCAATGGATCGGTTTTTACAGAGGCTATC
GAAAGAATCAAGTTAA
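Although the gene lies on the (-) strand, the nucleotide sequence above is already given in coding orientation (it starts with ATG and ends with the stop codon TAA). As a rough consistency check, the sketch below translates the first and last codons with the standard genetic code and compares them with the ends of the protein sequence. The `translate` helper is hypothetical and written for illustration; it assumes the standard codon table and does not handle ambiguous bases.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 24 and last 15 bases of the comM coding sequence shown above
head = "ATGTCTTTAGCCATCGTTTACAGT"
tail = "AAAGAATCAAGTTAA"
print(translate(head))  # MSLAIVYS
print(translate(tail))  # KESS*
```

Both fragments agree with the protein record, which begins MSLAIVYS and ends KESS.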
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 | 86.588 | 99.217 | 0.859 |
| comM | Glaesserella parasuis strain SC1401 | 78.082 | 100 | 0.781 |
| comM | Vibrio cholerae strain A1552 | 64.902 | 99.804 | 0.648 |
| comM | Vibrio campbellii strain DS40M4 | 64.51 | 99.804 | 0.644 |
| comM | Legionella pneumophila str. Paris | 51.19 | 98.63 | 0.505 |
| comM | Legionella pneumophila strain ERS1305867 | 51.19 | 98.63 | 0.505 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 46.923 | 100 | 0.477 |