Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comM   Type   Machinery gene
Locus tag   HPQ68_RS04270 Genome accession   NZ_CP053748
Coordinates   926361..927878 (+) Length   505 a.a.
NCBI ID   WP_255756603.1    Uniprot ID   -
Organism   Massilia sp. erpn     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 921361..932878
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HPQ68_RS04240 (HPQ68_04230) - 921594..923156 (-) 1563 WP_304665262.1 ammonium transporter -
  HPQ68_RS04245 (HPQ68_04235) - 923168..923506 (-) 339 WP_050409507.1 P-II family nitrogen regulator -
  HPQ68_RS04250 (HPQ68_04240) - 923518..924273 (-) 756 WP_255756601.1 TorF family putative porin -
  HPQ68_RS04255 (HPQ68_04245) - 924477..924722 (+) 246 WP_050412521.1 accessory factor UbiK family protein -
  HPQ68_RS04260 (HPQ68_04250) - 924777..925142 (-) 366 WP_255756602.1 hypothetical protein -
  HPQ68_RS04265 (HPQ68_04255) dkgB 925355..926161 (+) 807 WP_255758222.1 2,5-didehydrogluconate reductase DkgB -
  HPQ68_RS04270 (HPQ68_04260) comM 926361..927878 (+) 1518 WP_255756603.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  HPQ68_RS04275 (HPQ68_04265) - 928043..930994 (+) 2952 WP_255756604.1 hypothetical protein -
  HPQ68_RS04280 (HPQ68_04270) - 930991..931464 (+) 474 WP_255756605.1 hypothetical protein -
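
The Size (bp) column follows from the inclusive coordinates as end - start + 1, and the 1518 bp comM coding sequence corresponds to 505 codons plus one stop codon. A minimal Python sketch of that check, assuming rows are laid out as in the table above; the `ROW` pattern and the `row_is_consistent` helper are illustrative names, not part of the database:

```python
import re

# Matches "start..end (strand) size" as it appears in the rows above.
ROW = re.compile(r"(\d+)\.\.(\d+) \(([+-])\) (\d+)")


def row_is_consistent(line: str) -> bool:
    """Return True when the listed size equals end - start + 1."""
    match = ROW.search(line)
    if match is None:
        raise ValueError(f"no coordinates found in: {line!r}")
    start, end, size = int(match[1]), int(match[2]), int(match[4])
    return end - start + 1 == size


# Three rows copied from the table (trailing product text omitted).
rows = [
    "HPQ68_RS04265 (HPQ68_04255) dkgB 925355..926161 (+) 807 WP_255758222.1",
    "HPQ68_RS04270 (HPQ68_04260) comM 926361..927878 (+) 1518 WP_255756603.1",
    "HPQ68_RS04275 (HPQ68_04265) - 928043..930994 (+) 2952 WP_255756604.1",
]
all_consistent = all(row_is_consistent(r) for r in rows)

# A 1518 bp CDS holds 506 codons; dropping the stop codon leaves 505 residues.
comm_residues = 1518 // 3 - 1
```

The same arithmetic shows that the displayed window, 921361..932878, extends 5000 bp on either side of the comM coordinates.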

Sequence


Protein


Length: 505 a.a.        Molecular weight: 54738.96 Da        Isoelectric Point: 8.9154

>NTDB_id=446882 HPQ68_RS04270 WP_255756603.1 926361..927878(+) (comM) [Massilia sp. erpn]
MSLAVLRSRALAGMDAPEVGVEVHLANGLPAFHIVGLAETEVKEARDRVRAAIQNAGYEMPAQRITVSLAPADLPKESGR
FDLPIAIGILAASGQIPTRGLEQYEFAGELSLTGALRPIRGALAMTFAMQRKHASRTRAFILPLENADEAALVREALIHP
ARSLSQVCSHLCAVTGEGRLPRHIAATLPLAPCYPDFAEVKGQRVAKRALEIAAAGGHSILLVGPPGSGKTMLASRFAGL
LPHMTDEEALEAAAVQSLSGKFQAALWKVRPYRAPHHTSSGVALVGGGSVPHPGEISLAHCGVLFLDELTEFDRRVLEVL
REPMESGKVTISRAARQADFPARFQLIAAMNPCPCGYLGHETLACRCPPDVVLRYQGRISGPLLDRIDMQIEVGAVSAER
LAANADGETSGVIAARVEAAFQRQLARQKKSNQQLNSAEVDRWCRLERKAETTLRSSMQQFHWSARAWHRILRVARTIAD
LDGTATIADRHVSEAIQYRRALRER
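
The FASTA header above packs the record identifier, locus tag, protein accession, coordinates, strand, gene name and organism into one line. A small Python sketch for splitting it into fields, assuming every header follows the layout seen in this single record; the `FastaHeader` type and `parse_header` function are hypothetical names introduced for illustration:

```python
import re
from typing import NamedTuple


class FastaHeader(NamedTuple):
    ntdb_id: int
    locus_tag: str
    protein_id: str
    start: int
    end: int
    strand: str
    gene: str
    organism: str


# Layout inferred from this record only; other records may differ.
HEADER = re.compile(
    r"^>NTDB_id=(\d+) (\S+) (\S+) (\d+)\.\.(\d+)\(([+-])\) \((\S+)\) \[(.+)\]$"
)


def parse_header(line: str) -> FastaHeader:
    """Split a header line into typed fields, or raise ValueError."""
    match = HEADER.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    ntdb_id, locus, protein, start, end, strand, gene, organism = match.groups()
    return FastaHeader(
        int(ntdb_id), locus, protein, int(start), int(end), strand, gene, organism
    )


header = parse_header(
    ">NTDB_id=446882 HPQ68_RS04270 WP_255756603.1 "
    "926361..927878(+) (comM) [Massilia sp. erpn]"
)
```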

Nucleotide


Length: 1518 bp

>NTDB_id=446882 HPQ68_RS04270 WP_255756603.1 926361..927878(+) (comM) [Massilia sp. erpn]
ATGAGCCTCGCCGTTCTCCGCAGCCGCGCCCTGGCCGGAATGGACGCCCCCGAAGTGGGCGTCGAAGTGCATCTGGCGAA
TGGCCTGCCAGCCTTTCATATCGTCGGCCTGGCCGAGACCGAAGTGAAGGAGGCGCGCGACCGGGTGCGCGCGGCAATCC
AGAACGCGGGCTATGAGATGCCGGCCCAGCGCATCACCGTCAGCCTGGCGCCGGCCGACCTGCCGAAGGAGTCCGGACGC
TTCGACCTGCCCATCGCCATTGGCATTCTGGCCGCATCGGGCCAGATTCCCACGCGCGGCCTGGAACAGTACGAGTTTGC
CGGCGAGCTGTCGCTGACAGGCGCACTGCGCCCCATCCGCGGCGCGCTGGCCATGACCTTCGCCATGCAGCGCAAGCACG
CCAGCAGGACACGCGCCTTTATCCTGCCGCTGGAAAACGCCGACGAAGCGGCCCTGGTGCGCGAAGCGCTGATCCACCCG
GCCCGTTCGCTGTCGCAGGTGTGCAGCCATCTCTGCGCCGTGACCGGGGAAGGGCGCCTGCCGCGCCACATCGCGGCAAC
CCTGCCGCTGGCGCCCTGCTATCCCGACTTTGCCGAAGTCAAAGGCCAGCGCGTCGCCAAGCGCGCATTGGAAATCGCGG
CCGCCGGCGGCCACTCTATCCTGCTGGTGGGGCCTCCCGGCAGCGGCAAGACCATGCTGGCCAGCCGCTTCGCCGGCCTG
CTGCCGCATATGACAGATGAAGAGGCGCTGGAGGCTGCTGCGGTGCAATCCCTGAGCGGCAAGTTCCAGGCCGCGCTCTG
GAAGGTACGCCCCTATCGCGCGCCTCACCACACCTCGTCGGGCGTGGCCCTGGTCGGTGGTGGCAGCGTGCCACATCCGG
GCGAGATTTCACTGGCCCATTGCGGCGTGCTCTTTCTCGACGAGCTGACCGAATTTGACCGGCGCGTGCTGGAAGTGCTG
CGCGAGCCGATGGAGTCGGGCAAAGTCACCATCTCGCGCGCGGCGCGGCAGGCCGATTTTCCGGCGCGCTTCCAGCTGAT
TGCCGCCATGAACCCCTGTCCCTGCGGCTATCTCGGCCACGAAACGCTGGCCTGCCGCTGTCCGCCCGACGTGGTGCTGC
GCTACCAGGGCCGCATTTCCGGCCCGCTGCTGGACCGCATCGACATGCAGATCGAAGTGGGCGCCGTGTCGGCCGAACGC
CTGGCGGCGAATGCGGACGGCGAAACATCGGGCGTCATCGCGGCACGTGTGGAAGCGGCCTTTCAACGCCAGCTGGCACG
TCAGAAGAAAAGCAATCAGCAATTGAACAGCGCTGAAGTCGACCGCTGGTGCCGGCTCGAACGCAAGGCCGAAACAACGC
TGCGCAGCTCGATGCAGCAGTTCCACTGGTCGGCACGCGCCTGGCACCGCATCCTGCGCGTGGCGCGCACCATCGCCGAC
CTGGACGGCACGGCCACCATCGCCGACCGCCACGTCAGCGAAGCAATCCAGTACCGGCGCGCCCTGCGCGAGCGCTGA
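
The nucleotide record can be checked against the protein record by translating it codon by codon. The sketch below is a minimal Python example that assumes the standard codon assignments and reads the sequence in frame from its first base; `translate` is an illustrative helper, and only the first 30 and last 18 bases of the sequence above are used to keep the example short:

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the comM sequence above.
head = translate("ATGAGCCTCGCCGTTCTCCGCAGCCGCGCC")
tail = translate("GCCCTGCGCGAGCGCTGA")
```

Here `head` should match the first ten residues of the protein record and `tail` its last five, with the final TGA acting as the stop codon.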


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comM   Vibrio cholerae strain A1552   51.373   100   0.519
  comM   Vibrio campbellii strain DS40M4   51.594   99.406   0.513
  comM   Haemophilus influenzae Rd KW20   50.098   100   0.507
  comM   Glaesserella parasuis strain SC1401   50   100   0.503
  comM   Legionella pneumophila str. Paris   49.307   100   0.493
  comM   Legionella pneumophila strain ERS1305867   49.307   100   0.493
  RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   43.75   100   0.444