Detailed information    

in silico: bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   QMG46_RS01650 Genome accession   NZ_AP027042
Coordinates   366971..368470 (-) Length   499 a.a.
NCBI ID   WP_281850698.1    Uniprot ID   -
Organism   Dyella sp. GSA-30     
Function   ssDNA binding (predicted from homology); DNA processing
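
The coordinates, gene length and protein length in this overview can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (the usual RefSeq convention) and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# comM coordinates as listed in this record
cds_bp = span_length(366971, 368470)
print(cds_bp, protein_length(cds_bp))
```

For comM this yields 1500 bp and 499 residues, matching the lengths listed in this record.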

Genomic Context


Location: 361971..373470
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  QMG46_RS01640 (DYGSA30_03250)   aceA   363224..364528 (+)   1305   WP_281850696.1   isocitrate lyase   -
  QMG46_RS01645 (DYGSA30_03260)   -   364622..366433 (-)   1812   WP_281850697.1   sodium:proton antiporter   -
  QMG46_RS01650 (DYGSA30_03270)   comM   366971..368470 (-)   1500   WP_281850698.1   YifB family Mg chelatase-like AAA ATPase   Machinery gene
  QMG46_RS01655 (DYGSA30_03280)   -   368671..368919 (-)   249   WP_281850699.1   accessory factor UbiK family protein   -
  QMG46_RS01660 (DYGSA30_03290)   glnK   369092..369430 (+)   339   WP_281850700.1   P-II family nitrogen regulator   -
  QMG46_RS01665 (DYGSA30_03300)   -   369427..369978 (-)   552   WP_281850701.1   DUF1453 domain-containing protein   -
  QMG46_RS01670 (DYGSA30_03310)   gpmA   370161..370907 (-)   747   WP_281850702.1   2,3-diphosphoglycerate-dependent phosphoglycerate mutase   -
  QMG46_RS01675 (DYGSA30_03320)   -   371097..371744 (+)   648   WP_281850703.1   hypothetical protein   -
  QMG46_RS01680 (DYGSA30_03330)   hemF   371785..372687 (-)   903   WP_281850704.1   oxygen-dependent coproporphyrinogen oxidase   -
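
The Location range appears to be the comM coordinates extended by 5000 bp on each side (366971 - 5000 = 361971 and 368470 + 5000 = 373470). The sketch below reproduces that window and lists the genes that fall entirely inside it. The 5000 bp flank is inferred from the numbers on this page rather than from documented behaviour, and all names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from this record, not documented


def context_window(start, end, flank=FLANK_BP):
    """Extend a 1-based inclusive range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def genes_in_window(genes, window):
    """Return locus tags whose coordinates lie fully inside the window."""
    lo, hi = window
    return [tag for tag, (s, e) in genes.items() if lo <= s and e <= hi]


# Locus tag -> (start, end), copied from the table above
genes = {
    "QMG46_RS01640": (363224, 364528),
    "QMG46_RS01645": (364622, 366433),
    "QMG46_RS01650": (366971, 368470),  # comM
    "QMG46_RS01655": (368671, 368919),
    "QMG46_RS01660": (369092, 369430),
    "QMG46_RS01665": (369427, 369978),
    "QMG46_RS01670": (370161, 370907),
    "QMG46_RS01675": (371097, 371744),
    "QMG46_RS01680": (371785, 372687),
}

window = context_window(366971, 368470)
print(window, len(genes_in_window(genes, window)))
```

All nine listed genes lie inside the computed window, which is consistent with the table showing only genes fully contained in the Location range.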

Sequence


Protein


Length: 499 a.a.   Molecular weight: 53903.56 Da   Isoelectric point: 7.6983

>NTDB_id=99242 QMG46_RS01650 WP_281850698.1 366971..368470(-) (comM) [Dyella sp. GSA-30]
MSLAVTLSRAQEGVAAPQVMVEVHLSAGLPGTHIVGLPEAAVREARDRVRVAIQSAAFEYPNRRVTVNLAPAELPKDGGR
FDLAIALGILAAGGQVPREKLDDCEFLGELALSGDLRAVSGVLPALLRARARGRRVVVPRANASEAALIPEVDVRVADTL
AEVCGWLHGAQDLSTPSTAMEWGSDDLGPDLSDVRGQLQARRALEITATGGHHLLLVGPPGTGKTMLAERLPSILPPLTE
SEALETCAVLSVAGQQVDPKFWRRRPFRSPHHTASAVALVGGGSEPRPGEISLAHNGVLFLDELPEFTRHVLDVLREPLE
SGQIMISRASRQSAFPAQFQLIAAMNPCPCGYAGDPRQRCRCTPDQIQRYRNRVSGPLLDRIDLSVEVPRVPVSEIGAPR
AAQDEDSATVRARVLKARHQALMRAGRPNAEISTRELERDCALGPAERRWFDAALERLGLSARAYHRTLRVARTIADLDG
GAAALDRSHLAEALQYRRF
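
The FASTA header above packs several fields into one line. The following is a minimal parsing sketch that assumes every header follows the exact layout seen in this record (`NTDB_id`, locus tag, protein ID, coordinates with strand, gene name in parentheses, organism in square brackets); the field names in the result are illustrative.

```python
import re

HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) "
    r"\[(?P<organism>.+)\]$"
)


def parse_record(text: str) -> dict:
    """Parse one FASTA record: header fields plus the joined sequence."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    match = HEADER_RE.match(lines[0])
    if match is None:
        raise ValueError("header does not follow the assumed layout")
    record = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        record[key] = int(record[key])
    # Sequence lines are wrapped at a fixed width, so join them back together
    record["sequence"] = "".join(lines[1:])
    return record


example = """>NTDB_id=99242 QMG46_RS01650 WP_281850698.1 366971..368470(-) (comM) [Dyella sp. GSA-30]
MSLAVTLSRAQEGVAAPQVM
VEVHLSAGLP
"""
rec = parse_record(example)
print(rec["locus_tag"], rec["gene"], rec["strand"], len(rec["sequence"]))
```

The example uses only the first 30 residues of the sequence to keep the snippet short; the same function works on the full wrapped record.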

Nucleotide


Length: 1500 bp

>NTDB_id=99242 QMG46_RS01650 WP_281850698.1 366971..368470(-) (comM) [Dyella sp. GSA-30]
ATGAGCCTTGCCGTCACATTGAGCCGCGCCCAGGAAGGGGTTGCAGCGCCGCAGGTGATGGTCGAAGTGCATCTTTCCGC
CGGCCTGCCGGGAACGCATATCGTGGGTTTGCCCGAGGCTGCCGTACGCGAGGCGCGTGATCGCGTGCGTGTGGCGATCC
AGAGCGCGGCCTTTGAGTATCCCAATCGACGGGTCACGGTAAACCTGGCTCCGGCCGAGTTACCCAAGGACGGTGGCCGT
TTTGACCTGGCCATTGCGCTGGGCATTCTGGCCGCTGGCGGACAGGTGCCGCGCGAGAAGCTGGACGACTGCGAATTTCT
TGGTGAGCTGGCCCTGTCCGGCGACTTGCGCGCGGTATCGGGTGTGCTGCCGGCGCTGCTGCGGGCACGGGCGCGTGGCC
GTCGCGTGGTGGTTCCGCGCGCCAATGCTTCCGAGGCTGCTCTCATCCCTGAAGTCGACGTACGCGTTGCCGATACGCTC
GCCGAGGTATGCGGCTGGCTGCATGGCGCACAGGACTTGTCGACGCCCAGCACTGCGATGGAATGGGGCAGCGACGATCT
GGGGCCAGACTTGTCCGATGTACGTGGGCAATTGCAGGCTCGCCGGGCATTGGAAATTACCGCCACGGGTGGGCATCACC
TGTTGCTGGTCGGGCCTCCCGGTACCGGCAAGACCATGCTCGCCGAGCGCTTACCCAGCATTCTGCCGCCGCTGACCGAG
TCGGAGGCGCTGGAAACCTGTGCGGTACTTTCGGTAGCCGGGCAGCAGGTCGATCCGAAATTCTGGCGCCGGCGGCCGTT
TCGTTCGCCGCACCATACGGCATCGGCCGTGGCACTGGTGGGTGGCGGCTCGGAGCCGCGTCCCGGAGAGATTTCGCTGG
CTCACAACGGGGTGCTGTTCCTCGACGAGCTGCCTGAATTTACCCGGCACGTCCTCGATGTCTTGCGTGAGCCGCTGGAG
TCGGGACAGATCATGATTTCCCGGGCATCACGCCAGTCGGCGTTCCCGGCGCAGTTTCAGCTGATCGCCGCGATGAATCC
CTGTCCCTGCGGCTATGCCGGCGATCCACGCCAGCGCTGCCGCTGTACGCCGGATCAGATCCAGCGCTATCGCAACCGTG
TTTCCGGTCCGCTGCTCGACCGCATCGACCTGTCCGTGGAGGTGCCGCGCGTGCCCGTGTCGGAGATCGGTGCGCCGCGT
GCGGCGCAGGACGAAGACTCGGCGACCGTTCGTGCACGTGTTCTCAAGGCCCGGCATCAGGCGCTGATGCGGGCAGGGCG
CCCGAATGCCGAAATAAGCACCCGCGAACTCGAGCGCGACTGCGCCCTGGGGCCCGCCGAAAGGCGCTGGTTCGATGCGG
CGCTGGAGCGCCTGGGCCTGTCGGCCCGCGCCTACCATCGCACCCTGCGGGTCGCTCGCACGATCGCCGACCTGGACGGT
GGTGCGGCGGCGCTCGATCGCAGTCATCTGGCCGAGGCGCTCCAGTACCGGCGGTTTTAA
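
Although comM lies on the minus strand, the nucleotide sequence above is already given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly and compared with the protein sequence. The following is a minimal sketch using the standard codon assignments, which translation table 11 shares for all non-start codons; the function name is illustrative, and a library such as Biopython could be used instead.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 nucleotides of the comM sequence in this record
print(translate("ATGAGCCTTGCCGTCACATTGAGCCGCGCC"))
print(translate("CAGTACCGGCGGTTTTAA"))
```

The two fragments translate to MSLAVTLSRA and QYRRF, which match the start and end of the protein sequence listed in this record.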


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comM   Vibrio cholerae strain A1552   54.527   99.599   0.543
  comM   Vibrio campbellii strain DS40M4   53.707   100   0.537
  comM   Glaesserella parasuis strain SC1401   53.293   100   0.535
  comM   Haemophilus influenzae Rd KW20   52.789   100   0.531
  comM   Legionella pneumophila str. Paris   49.799   99.8   0.497
  comM   Legionella pneumophila strain ERS1305867   49.799   99.8   0.497
  RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   45.149   100   0.457


Multiple sequence alignment