Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: MJ904_RS10795
Genome accession: NZ_CP092780
Coordinates: 2382045..2383559 (-)
Length: 504 a.a.
NCBI ID: WP_240737415.1
UniProt ID: -
Organism: Massilia sp. MB5
Function: DNA uptake (predicted from homology)
DNA binding and uptake
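
The coordinates and the protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the gene is a stop codon; neither is stated in the record, and the helper name `span_length` is purely illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


# Coordinates of comM as given in the overview.
gene_bp = span_length(2382045, 2383559)

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_aa = codons - 1

print(gene_bp, remainder, protein_aa)  # prints: 1515 0 504
```

Under these assumptions the span works out to 1515 bp and 504 residues, which agrees with the lengths reported for this entry.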

Genomic Context


Location: 2377045..2388559
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
--- | --- | --- | --- | --- | --- | ---
MJ904_RS28140 | - | 2377410..2377514 (-) | 105 | WP_307732629.1 | DUF1840 family protein | -
MJ904_RS28145 | - | 2377610..2377759 (-) | 150 | Protein_2124 | DUF1840 family protein | -
MJ904_RS10785 (MJ904_10785) | - | 2377848..2378348 (-) | 501 | WP_240737408.1 | hypothetical protein | -
MJ904_RS10790 (MJ904_10790) | - | 2378569..2382003 (+) | 3435 | WP_240737410.1 | hypothetical protein | -
MJ904_RS10795 (MJ904_10795) | comM | 2382045..2383559 (-) | 1515 | WP_240737415.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene
MJ904_RS10800 (MJ904_10800) | dkgB | 2383759..2384565 (-) | 807 | WP_050409511.1 | 2,5-didehydrogluconate reductase DkgB | -
MJ904_RS10805 (MJ904_10805) | - | 2384799..2385011 (-) | 213 | WP_240737416.1 | hypothetical protein | -
MJ904_RS10810 (MJ904_10810) | - | 2385179..2385511 (+) | 333 | WP_050409509.1 | hypothetical protein | -
MJ904_RS10815 (MJ904_10815) | - | 2385566..2385811 (-) | 246 | WP_050412521.1 | accessory factor UbiK family protein | -
MJ904_RS10820 (MJ904_10820) | - | 2386015..2386770 (+) | 756 | WP_050409508.1 | TorF family putative porin | -
MJ904_RS10825 (MJ904_10825) | - | 2386782..2387120 (+) | 339 | WP_050409507.1 | P-II family nitrogen regulator | -
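
As a sanity check on the context listing, the sketch below recomputes each size from its coordinates and measures how far the displayed window extends beyond comM on each side. The row tuples are copied from the listing; the function name `size_mismatches` and the inclusive-coordinate convention are assumptions made for illustration, and the flank width is an observation from the numbers rather than something the record states.

```python
# (locus tag, start, end, listed size in bp), copied from the listing above.
ROWS = [
    ("MJ904_RS28140", 2377410, 2377514, 105),
    ("MJ904_RS28145", 2377610, 2377759, 150),
    ("MJ904_RS10785", 2377848, 2378348, 501),
    ("MJ904_RS10790", 2378569, 2382003, 3435),
    ("MJ904_RS10795", 2382045, 2383559, 1515),
    ("MJ904_RS10800", 2383759, 2384565, 807),
    ("MJ904_RS10805", 2384799, 2385011, 213),
    ("MJ904_RS10810", 2385179, 2385511, 333),
    ("MJ904_RS10815", 2385566, 2385811, 246),
    ("MJ904_RS10820", 2386015, 2386770, 756),
    ("MJ904_RS10825", 2386782, 2387120, 339),
]

WINDOW = (2377045, 2388559)  # "Location" line of the genomic context
GENE = (2382045, 2383559)    # comM coordinates


def size_mismatches(rows):
    """Return locus tags whose listed size differs from end - start + 1."""
    return [tag for tag, start, end, size in rows if end - start + 1 != size]


upstream_flank = GENE[0] - WINDOW[0]
downstream_flank = WINDOW[1] - GENE[1]

print(size_mismatches(ROWS))             # prints: []
print(upstream_flank, downstream_flank)  # prints: 5000 5000
```

All listed sizes agree with their coordinates, and the window appears to span 5,000 bp on either side of the gene.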

Sequence


Protein


Length: 504 a.a.; Molecular weight: 54818.12 Da; Isoelectric point: 9.0610

>NTDB_id=660277 MJ904_RS10795 WP_240737415.1 2382045..2383559(-) (comM) [Massilia sp. MB5]
MSLAVLRSRALAGMEAPEVGVEVHLANGLPAFHIVGLAETEVKEARDRVRAAIQNAGYEMPAQRITVSLAPADLPKESGR
FDLPIAIGILAASGQIPTRGLEQYEFAGELSLTGALRPIRGALAMTFAMQRSRAKARAFILPMENADEAALVREALIHPA
RSLSQVCSHLCAVSEEGRLPRHIAAALPLAPCYPDFAEVKGQQLAKRALEIAAAGGHSILLVGPPGSGKTMLASRFAGLL
PRMTDEEALEAAAVQSLSGKFHAESWKVRPYRSPHHTSSGVALVGGGSVPRPGEISLAHCGVLFLDELTEFDRRVLEVLR
EPMESGKVTISRAARQADFPARFQLIAAMNPCPCGYLGHASQACRCPPDVILRYQGRLSGPLLDRIDMQIEVGAVAPEYL
VEEASGEASSIIAARVEAAFQRQLARQKKSNQRLNTAEIDRWCRLERKAESTLRRSMNKFHWSARAYHRVLRVARTIADL
DGASVIADRHVSEAIQYRRALRER
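
The FASTA header used for this entry packs several fields into one line. A minimal sketch of how it might be parsed is shown below; the field layout is inferred from this single header, so the regular expression and the field names (`ntdb_id`, `locus_tag`, and so on) are assumptions and not a documented format.

```python
import re

# Pattern inferred from the one header in this record (an assumption):
# >NTDB_id=<id> <locus tag> <protein id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise ValueError if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Coordinates are more useful as integers.
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=660277 MJ904_RS10795 WP_240737415.1 "
    "2382045..2383559(-) (comM) [Massilia sp. MB5]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```

Headers from other entries may deviate from this layout, in which case `parse_header` raises `ValueError` instead of guessing.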

Nucleotide


Length: 1515 bp

>NTDB_id=660277 MJ904_RS10795 WP_240737415.1 2382045..2383559(-) (comM) [Massilia sp. MB5]
ATGAGCCTCGCCGTTCTCCGCAGCCGCGCGCTGGCCGGGATGGAAGCGCCGGAAGTCGGCGTCGAAGTGCATCTGGCGAA
TGGCCTGCCGGCCTTTCATATCGTCGGCCTGGCCGAGACCGAAGTGAAGGAAGCACGCGACCGGGTGCGCGCTGCGATCC
AGAACGCGGGCTACGAGATGCCGGCCCAGCGCATTACCGTCAGCCTGGCGCCCGCCGATCTGCCCAAAGAGTCGGGCCGT
TTCGACCTGCCCATTGCCATCGGCATTCTGGCCGCATCGGGCCAGATTCCCACGCGCGGCCTGGAACAATATGAATTTGC
CGGCGAATTGTCGCTGACGGGCGCGTTGCGTCCGATACGCGGCGCGCTGGCGATGACCTTCGCCATGCAGCGCAGCCGCG
CCAAGGCGCGCGCCTTCATTCTGCCCATGGAAAACGCCGACGAGGCGGCGCTGGTGCGCGAAGCGCTGATCCATCCAGCC
CGCTCGCTGTCGCAGGTATGCAGCCATCTGTGCGCCGTGAGCGAAGAGGGGCGCTTGCCGCGCCATATTGCGGCTGCCCT
GCCGCTGGCGCCCTGCTACCCCGATTTCGCCGAAGTCAAAGGCCAGCAGTTGGCCAAACGCGCGCTGGAGATAGCCGCGG
CGGGCGGTCACAGCATCCTGCTGGTTGGCCCGCCCGGCAGCGGCAAGACCATGCTGGCCAGCCGCTTCGCCGGCCTGCTG
CCACGCATGACGGATGAGGAAGCGCTGGAAGCGGCGGCGGTGCAGTCCCTGAGCGGCAAATTTCATGCGGAGAGCTGGAA
GGTTAGGCCCTATCGTTCGCCTCACCATACATCGTCGGGCGTCGCCCTGGTCGGCGGCGGCAGTGTGCCGCGTCCCGGCG
AGATTTCGCTGGCCCATTGCGGCGTCCTCTTTCTCGACGAACTGACCGAGTTTGACCGGCGCGTGCTGGAGGTGCTGCGC
GAACCGATGGAATCGGGCAAGGTCACCATTTCGCGCGCAGCGCGCCAGGCGGACTTTCCCGCGCGCTTTCAGTTGATCGC
GGCGATGAATCCTTGTCCTTGCGGCTATCTCGGCCATGCATCCCAAGCCTGCCGCTGCCCACCCGATGTGATCCTGCGCT
ACCAGGGACGCCTCTCCGGCCCGCTGCTGGACCGTATCGATATGCAGATCGAAGTCGGCGCCGTGGCACCCGAATACCTG
GTGGAGGAAGCGAGCGGCGAAGCATCCAGCATCATCGCGGCGCGCGTGGAAGCGGCCTTTCAGCGCCAACTGGCGCGTCA
GAAGAAGAGCAACCAGCGCCTCAACACCGCTGAAATCGACCGCTGGTGCCGCCTCGAACGCAAAGCCGAATCGACGCTGC
GCCGTTCGATGAACAAGTTCCACTGGTCGGCGCGCGCCTATCACCGCGTCCTGCGTGTGGCACGCACGATCGCCGACCTG
GATGGCGCCAGCGTTATCGCCGACCGCCATGTCAGCGAAGCGATCCAGTACCGGCGTGCCCTGCGCGAGCGCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
--- | --- | --- | --- | ---
comM | Vibrio campbellii strain DS40M4 | 51.992 | 99.603 | 0.518
comM | Vibrio cholerae strain A1552 | 51.8 | 99.206 | 0.514
comM | Haemophilus influenzae Rd KW20 | 50.196 | 100 | 0.508
comM | Glaesserella parasuis strain SC1401 | 50.495 | 100 | 0.506
comM | Legionella pneumophila str. Paris | 49.307 | 100 | 0.494
comM | Legionella pneumophila strain ERS1305867 | 49.307 | 100 | 0.494
RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 44.922 | 100 | 0.456