Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: QEN43_RS20465
Genome accession: NZ_OX458333
Coordinates: 4644625..4646142 (+)
Length: 505 a.a.
NCBI ID: WP_026610080.1
UniProt ID: -
Organism: Methylocaldum szegediense isolate Msz(Nor)
Function: ssDNA binding (predicted from homology)
DNA processing
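
As a quick consistency check on the coordinates and lengths in this overview, the sketch below recomputes the gene span from the 1-based inclusive coordinates and derives the expected protein length. It assumes the final codon is a stop codon that is not counted in the protein; the function name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates of comM (QEN43_RS20465) as listed in the record.
gene_bp = span_length(4644625, 4646142)

# One codon is 3 bp; the last codon is assumed to be a stop codon.
protein_aa = gene_bp // 3 - 1

print(gene_bp, protein_aa)
```

Under that assumption the numbers agree with the record: a 1518 bp gene and a 505 a.a. protein.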

Genomic Context


Location: 4639625..4651142
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
QEN43_RS21880 | - | 4640032..4640328 (+) | 297 | Protein_4074 | AMIN domain-containing protein | -
QEN43_RS20440 (MSZNOR_4812) | - | 4640473..4641702 (+) | 1230 | WP_036268677.1 | transporter | -
QEN43_RS20445 (MSZNOR_4813) | - | 4641877..4642185 (+) | 309 | WP_084161838.1 | hypothetical protein | -
QEN43_RS20450 (MSZNOR_4814) | amt | 4642328..4643713 (-) | 1386 | WP_396662040.1 | ammonium transporter | -
QEN43_RS20455 (MSZNOR_4815) | glnK | 4643735..4644073 (-) | 339 | WP_026610078.1 | P-II family nitrogen regulator | -
QEN43_RS20460 (MSZNOR_4817) | - | 4644353..4644604 (+) | 252 | WP_026610079.1 | accessory factor UbiK family protein | -
QEN43_RS20465 (MSZNOR_4818) | comM | 4644625..4646142 (+) | 1518 | WP_026610080.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene
QEN43_RS20470 (MSZNOR_4819) | rep | 4646142..4648151 (+) | 2010 | WP_026610081.1 | DNA helicase Rep | -
QEN43_RS20480 (MSZNOR_4821) | - | 4649227..4649610 (-) | 384 | WP_036268155.1 | hypothetical protein | -
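
The numbers in this section suggest that the Location window is the comM gene extended by 5,000 bp on each side, and that comM and the downstream rep gene share one base pair. That is an inference from the listed coordinates, not something the record states. A minimal sketch of the arithmetic, with illustrative variable names:

```python
# All coordinates are 1-based and inclusive, copied from the record.
GENE = (4644625, 4646142)    # comM
WINDOW = (4639625, 4651142)  # "Location" line of the genomic context
REP = (4646142, 4648151)     # downstream neighbour rep

upstream_flank = GENE[0] - WINDOW[0]
downstream_flank = WINDOW[1] - GENE[1]

# Number of positions shared by comM and rep (0 if they do not overlap).
overlap_bp = max(0, min(GENE[1], REP[1]) - max(GENE[0], REP[0]) + 1)

print(upstream_flank, downstream_flank, overlap_bp)
```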

Sequence


Protein


Length: 505 a.a.; Molecular weight: 54453.54 Da; Isoelectric point: 7.3374

>NTDB_id=1158480 QEN43_RS20465 WP_026610080.1 4644625..4646142(+) (comM) [Methylocaldum szegediense isolate Msz(Nor)]
MSLAVVFTRGRQGIEAPLVTVEVHISNGLPSLSIVGLPETAVKESKDRVRGALLNCQFEFPLQRITVNMAPADIPKEGGR
FDLAIGLGILAASGQIRSEALRDIECVGELSLSGDLRPISGVLPVAIQAKQAGRALIVPAENAGEAVLAEGAKILPARHL
LEVCAHLNGQQSIPFEPSEASAAVPEDLPDFADVHGHFHAKRALEVAAAGRHNIIMLGPPGTGKSMLAARLPSILPPLTD
EEALESAAIASVSDLPFDPRRWRIPPFRAPHHTASAPALVGGGGNPKPGEISLAHNGVLFLDELPEFDRKVLEVLREPLE
TGAITISRVARQVDFPARFQLVAAMNPCPCGYLGDASGRCRCTAEQVQRYRSRISGPLLDRIDIHVEVPRVSQDVLRNGT
PGGEQTSADIRRRVLAAREIARQRTGKPNAFMTPQEIKRYCRLSDEGHRLLEQATEKLGLSHRAYHRILKLARTIADLAG
EEDIAVSHLSEAIGFRRLDRVPTGL
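
The FASTA header above packs several fields into one line. The sketch below pulls them apart with a regular expression. The pattern is inferred from this single header and is an assumption, not a documented format of the database, so other records may need adjustments.

```python
import re

HEADER = (">NTDB_id=1158480 QEN43_RS20465 WP_026610080.1 "
          "4644625..4646142(+) (comM) "
          "[Methylocaldum szegediense isolate Msz(Nor)]")

# Hypothetical pattern: id, locus tag, protein ID, coordinates with strand,
# gene name in parentheses, organism in square brackets.
PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) \[(?P<organism>.+)\]$"
)

fields = PATTERN.match(HEADER).groupdict()
for key, value in fields.items():
    print(f"{key}: {value}")
```

All captured values are strings; convert `start` and `end` with `int()` before doing coordinate arithmetic.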

Nucleotide


Length: 1518 bp

>NTDB_id=1158480 QEN43_RS20465 WP_026610080.1 4644625..4646142(+) (comM) [Methylocaldum szegediense isolate Msz(Nor)]
ATGTCCCTGGCCGTCGTTTTCACTCGGGGCAGGCAAGGCATCGAGGCGCCGCTGGTCACCGTCGAGGTACATATTTCCAA
CGGTCTGCCCAGCCTATCCATCGTCGGCCTTCCGGAAACCGCGGTAAAAGAAAGCAAGGATCGGGTGCGAGGCGCGCTGC
TGAACTGCCAGTTCGAATTTCCGCTCCAGCGTATCACCGTGAACATGGCGCCTGCCGACATTCCCAAGGAAGGCGGCCGC
TTCGATTTGGCCATTGGACTGGGCATCCTGGCGGCGTCGGGGCAGATCAGGAGCGAAGCGCTTCGGGACATCGAATGCGT
CGGCGAGCTTTCCTTAAGCGGTGATCTCCGCCCCATCAGCGGCGTATTGCCGGTCGCAATTCAGGCCAAGCAGGCGGGAC
GCGCCCTAATCGTTCCCGCGGAGAACGCGGGCGAGGCGGTTTTGGCCGAAGGCGCGAAAATCCTCCCGGCGCGCCATCTG
CTCGAAGTTTGCGCGCATTTGAACGGGCAACAGTCGATTCCGTTCGAACCTTCCGAGGCATCCGCTGCTGTTCCGGAGGA
TCTGCCCGATTTCGCCGACGTGCACGGCCATTTTCACGCCAAGCGGGCTTTGGAAGTCGCCGCCGCCGGTAGGCATAACA
TCATTATGCTGGGACCGCCCGGGACCGGTAAGTCGATGCTGGCAGCCCGGCTTCCGAGCATCCTCCCGCCCTTGACTGAC
GAAGAGGCCCTGGAAAGCGCAGCGATCGCATCGGTCAGCGATCTGCCTTTCGATCCGCGGCGCTGGCGTATCCCGCCGTT
TCGCGCACCGCACCATACCGCCTCAGCCCCGGCTTTGGTCGGCGGCGGCGGCAACCCCAAGCCGGGCGAAATTTCCCTGG
CGCACAATGGCGTGCTTTTTCTCGACGAACTGCCCGAATTCGATCGTAAGGTGTTAGAGGTCTTGAGAGAACCCTTGGAA
ACCGGGGCCATCACGATTTCCCGCGTCGCGCGCCAGGTCGATTTCCCAGCCCGGTTCCAGCTCGTCGCCGCCATGAACCC
GTGCCCCTGCGGCTATCTGGGTGATGCCTCGGGACGCTGTCGTTGCACGGCGGAACAGGTGCAGCGTTATCGCTCCCGGA
TTTCCGGGCCCTTGCTCGACCGGATCGACATTCACGTCGAAGTGCCGCGGGTTTCCCAAGACGTGCTGCGCAATGGGACA
CCCGGCGGTGAACAGACGAGCGCCGACATCCGTCGCCGCGTGCTCGCAGCTCGGGAAATCGCCCGGCAGCGGACCGGTAA
ACCGAATGCCTTCATGACTCCACAAGAAATCAAGCGATATTGCCGGCTGAGCGACGAGGGGCACCGATTGCTCGAGCAGG
CCACGGAAAAACTCGGACTGTCCCATCGTGCCTATCACCGTATCCTGAAGCTCGCGCGCACCATTGCCGATCTTGCCGGC
GAGGAGGACATCGCCGTTTCGCATCTCAGCGAAGCGATCGGGTTCCGGCGTCTGGACCGCGTTCCGACGGGCCTCTGA
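
To see how the nucleotide record relates to the protein record, the sketch below translates the first ten codons and the last four codons of the sequence above. It builds the codon table from the conventional TCAG ordering and assumes standard amino acid assignments, which bacterial genes use for internal codons. `translate` is an illustrative helper, not a database function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order: TTT, TTC, TTA, TTG, TCT, ...
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGTCCCTGGCCGTCGTTTTCACTCGGGGC"  # first 30 nt of the gene
last_codons = "ACGGGCCTCTGA"                     # last 12 nt of the gene

print(translate(first_codons))  # MSLAVVFTRG
print(translate(last_codons))   # TGL*
```

The two results match the start (MSLAVVFTRG) and end (TGL) of the protein sequence listed in the Protein section, with the final TGA read as the stop codon.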


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comM | Vibrio cholerae strain A1552 | 54.691 | 99.208 | 0.543
comM | Haemophilus influenzae Rd KW20 | 53.861 | 100 | 0.539
comM | Glaesserella parasuis strain SC1401 | 53.267 | 100 | 0.533
comM | Vibrio campbellii strain DS40M4 | 52.495 | 99.208 | 0.521
RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 48.438 | 100 | 0.491
comM | Legionella pneumophila str. Paris | 49.402 | 99.406 | 0.491
comM | Legionella pneumophila strain ERS1305867 | 49.402 | 99.406 | 0.491


Multiple sequence alignment