Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   DQL22_RS02480 Genome accession   NZ_LS483485
Coordinates   470717..472252 (-) Length   511 a.a.
NCBI ID   WP_111301461.1    Uniprot ID   -
Organism   Aggregatibacter aphrophilus strain NCTC11096     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 465717..477252
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQL22_RS02455 (NCTC11096_00486) - 466107..467192 (-) 1086 WP_111301460.1 tetratricopeptide repeat protein -
  DQL22_RS02460 (NCTC11096_00487) - 467210..468796 (-) 1587 WP_005701923.1 DUF4384 domain-containing protein -
  DQL22_RS02465 (NCTC11096_00488) - 468852..469232 (-) 381 WP_005701924.1 hypothetical protein -
  DQL22_RS02470 (NCTC11096_00489) - 469295..469672 (-) 378 WP_225791922.1 hypothetical protein -
  DQL22_RS02475 (NCTC11096_00490) - 469810..470178 (-) 369 WP_012771871.1 hypothetical protein -
  DQL22_RS02480 (NCTC11096_00491) comM 470717..472252 (-) 1536 WP_111301461.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  DQL22_RS02485 (NCTC11096_00492) deoR 472361..473110 (-) 750 WP_083014242.1 DNA-binding transcriptional repressor DeoR -
  DQL22_RS02490 (NCTC11096_00493) deoC 473135..473806 (-) 672 WP_083014244.1 deoxyribose-phosphate aldolase -
  DQL22_RS02495 (NCTC11096_00494) - 474040..475407 (+) 1368 WP_083014246.1 patatin-like phospholipase family protein -
  DQL22_RS02500 (NCTC11096_00495) - 475599..476000 (+) 402 WP_083014248.1 pyrimidine dimer DNA glycosylase/endonuclease V -
  DQL22_RS02505 (NCTC11096_00496) - 476307..476540 (-) 234 WP_111301462.1 glycine zipper 2TM domain-containing protein -

Sequence


Protein


Download         Length: 511 a.a.        Molecular weight: 56093.64 Da        Isoelectric Point: 9.4972

>NTDB_id=1142588 DQL22_RS02480 WP_111301461.1 470717..472252(-) (comM) [Aggregatibacter aphrophilus strain NCTC11096]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPNFTLVGLPEKTVKEAQDRVRSALLNAEFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGMLAASGYIDAEKLKQFEFIGELALTGQLRAVHGVIPAILAAKKAKRKCIIAYGNANEASLISDQETYFAHSL
LEVVQFLNNQGELPLAKDIMAQSAVDFGGENQKDLTEIIGQQHAKRALIIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDQEAIETASVASLVQNELNFHNWKQRPFRAPHHSASTPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLIAAMNPSPTGHYQGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
NTERGEPSAIVREKVLKTRNIQLERAGKINAHLTGKEIERDCKLESKDALFLESALTKLGLSVRAYHRILKVSRTIADLE
GEKCITQKHLAEALGYRAMDRFLQRLSKESS

Nucleotide


Download         Length: 1536 bp        

>NTDB_id=1142588 DQL22_RS02480 WP_111301461.1 470717..472252(-) (comM) [Aggregatibacter aphrophilus strain NCTC11096]
ATGTCTTTAGCCATCGTTTACAGTCGTGCTTCAATGGGCGTGCAGGCGCCTTTAGTGACGATTGAAGTGCATTTAAGTAA
CGGCAAGCCTAACTTTACGTTGGTTGGATTGCCGGAAAAAACTGTTAAAGAAGCACAAGATCGAGTTCGTAGTGCATTGC
TGAACGCCGAATTCAAATATCCCGCCAAACGCATTACCGTCAATCTCGCACCTGCTGATTTACCCAAAGAAGGCGGACGT
TTTGACTTGCCTATCGCTATCGGTATGCTTGCGGCTTCAGGCTATATTGACGCGGAAAAATTAAAACAATTTGAATTTAT
CGGTGAATTGGCACTAACCGGTCAACTCCGCGCGGTACATGGCGTAATTCCTGCTATTTTGGCTGCCAAAAAAGCAAAAC
GAAAATGTATTATCGCTTATGGCAATGCTAATGAAGCCTCATTGATCTCCGATCAAGAAACGTATTTTGCCCATTCATTG
CTTGAAGTTGTGCAATTTCTCAATAATCAAGGGGAACTGCCTTTGGCAAAGGACATAATGGCTCAAAGTGCGGTGGATTT
TGGCGGCGAAAATCAAAAAGATCTGACGGAGATTATCGGTCAACAACACGCCAAACGGGCGCTGATTATTGCTGCAGCCG
GGCAACATAACTTGTTATTTCTGGGCCCTCCCGGCACCGGTAAAACCATGCTTGCCAGTCGTTTAACGGGGCTGTTACCG
GAAATGACCGACCAAGAGGCCATCGAAACCGCCTCCGTTGCCAGTCTCGTGCAAAATGAACTGAATTTTCATAATTGGAA
ACAACGCCCTTTCCGCGCCCCACATCACAGCGCGTCCACGCCGGCTTTAGTGGGAGGCGGCACAATTCCAAAACCCGGCG
AAATTTCTCTCGCGCATAATGGTGTGCTTTTTTTAGACGAATTGCCTGAATTTGAACGTAAGGTGCTGGATGCTTTACGC
CAACCGTTAGAAAGTGGAGAAATCATTATTTCGCGTGCCAATGCCAAAATTCAGTTTCCGGCAAGGTTTCAGCTTATCGC
GGCGATGAACCCAAGCCCGACAGGGCATTATCAAGGGACACACAATCGTACGTCGCCGCAACAAATCATGCGGTATTTAA
ATCGTCTCTCCGGCCCATTTCTGGATCGTTTTGATTTATCCATTGAAGTGCCTTTGTTGCCACAAGGCAGTTTGCAAAAT
AATACGGAGCGTGGCGAACCTAGCGCAATCGTCCGTGAAAAAGTGTTAAAAACCCGCAACATCCAATTAGAACGTGCAGG
CAAAATAAACGCCCATTTAACCGGTAAAGAAATTGAACGCGATTGTAAACTGGAAAGCAAAGACGCGTTATTTCTTGAAA
GTGCTCTGACCAAACTGGGGCTTTCCGTACGGGCTTACCACCGAATCTTAAAAGTGTCACGCACCATTGCTGATTTAGAA
GGTGAAAAATGCATTACCCAAAAGCATTTGGCTGAAGCATTAGGCTACCGAGCAATGGATCGGTTTTTACAGAGGCTATC
GAAAGAATCAAGTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

86.588

99.217

0.859

  comM Glaesserella parasuis strain SC1401

78.082

100

0.781

  comM Vibrio cholerae strain A1552

64.902

99.804

0.648

  comM Vibrio campbellii strain DS40M4

64.51

99.804

0.644

  comM Legionella pneumophila str. Paris

51.19

98.63

0.505

  comM Legionella pneumophila strain ERS1305867

51.19

98.63

0.505

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.923

100

0.477


Multiple sequence alignment