Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comM   Type   Machinery gene
Locus tag   QUE75_RS02965 Genome accession   NZ_AP028062
Coordinates   650803..652329 (+) Length   508 a.a.
NCBI ID   WP_106694444.1    Uniprot ID   -
Organism   Marinobacter shengliensis strain D49     
Function   required for natural transformation (predicted from homology)
Unclear

Related MGE


Note: This gene co-localizes with a putative mobile genetic element (MGE) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 584677..651271 650803..652329 flank -468
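
The "Distance (bp)" value in the row above can be reproduced from the two coordinate ranges. The following is a minimal sketch that assumes the distance is the gene start minus the MGE end (an assumption inferred from the listed values, not a documented VRprofile2 rule). The helper names `parse_span`, `signed_distance` and `overlap_bp` are hypothetical.

```python
def parse_span(text):
    """Parse 'start..end' into a (start, end) tuple of 1-based inclusive coordinates."""
    start, end = text.split("..")
    return int(start), int(end)


def signed_distance(mge, gene):
    """Gene start minus MGE end (assumed convention).

    A negative value means the gene starts before the MGE ends,
    i.e. the gene straddles the MGE boundary.
    """
    return gene[0] - mge[1]


def overlap_bp(mge, gene):
    """Number of bases shared by the two inclusive ranges (0 if disjoint)."""
    lo = max(mge[0], gene[0])
    hi = min(mge[1], gene[1])
    return max(0, hi - lo + 1)


mge = parse_span("584677..651271")   # ICE coordinates from the summary row
gene = parse_span("650803..652329")  # comM coordinates from the summary row

distance = signed_distance(mge, gene)
overlap = overlap_bp(mge, gene)
print(distance, overlap)
```

Under this convention the gene begins inside the predicted ICE and extends past its right-hand end, which would be consistent with the "flank" label.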


Gene organization within MGE regions


Location: 584677..652329
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QUE75_RS02665 (MAALD49_05270) gshA 586154..587710 (+) 1557 WP_286253431.1 glutamate--cysteine ligase -
  QUE75_RS02670 (MAALD49_05280) rmf 587824..588036 (+) 213 WP_058091935.1 ribosome modulation factor -
  QUE75_RS02675 (MAALD49_05290) fliL 588038..588463 (-) 426 WP_286253432.1 flagellar basal body-associated protein FliL -
  QUE75_RS02680 (MAALD49_05300) - 588531..589037 (+) 507 WP_138436841.1 disulfide bond formation protein B -
  QUE75_RS02685 (MAALD49_05310) rsd 589143..589622 (+) 480 WP_286253434.1 sigma D regulator -
  QUE75_RS02690 (MAALD49_05320) - 589693..590406 (-) 714 WP_286253435.1 FKBP-type peptidyl-prolyl cis-trans isomerase -
  QUE75_RS02695 (MAALD49_05330) - 590530..590913 (-) 384 WP_072678671.1 hypothetical protein -
  QUE75_RS02700 (MAALD49_05340) - 590910..591443 (-) 534 WP_286253446.1 DUF2390 domain-containing protein -
  QUE75_RS02705 (MAALD49_05350) - 591488..593434 (+) 1947 WP_286253448.1 ATP-binding cassette domain-containing protein -
  QUE75_RS02710 (MAALD49_05360) ppx 593436..594962 (-) 1527 WP_286253450.1 exopolyphosphatase -
  QUE75_RS02715 (MAALD49_05370) trxA 595248..595574 (+) 327 WP_011784020.1 thioredoxin TrxA -
  QUE75_RS02720 (MAALD49_05380) - 595669..596115 (-) 447 WP_286253460.1 molybdenum cofactor biosynthesis protein MoaE -
  QUE75_RS02725 (MAALD49_05390) moaD 596117..596377 (-) 261 WP_286253462.1 molybdopterin converting factor subunit 1 -
  QUE75_RS02730 (MAALD49_05400) glp 596380..597594 (-) 1215 WP_286253465.1 gephyrin-like molybdotransferase Glp -
  QUE75_RS02735 (MAALD49_05410) moaB 597607..598179 (-) 573 WP_061332513.1 molybdenum cofactor biosynthesis protein B -
  QUE75_RS02740 (MAALD49_05420) rho 598547..599809 (+) 1263 WP_058091947.1 transcription termination factor Rho -
  QUE75_RS02745 (MAALD49_05430) ubiD 600032..601519 (+) 1488 WP_061332514.1 4-hydroxy-3-polyprenylbenzoate decarboxylase -
  QUE75_RS02750 (MAALD49_05440) - 601533..602525 (+) 993 WP_286253469.1 2Fe-2S iron-sulfur cluster-binding protein -
  QUE75_RS02755 (MAALD49_05450) - 602587..603828 (-) 1242 WP_061332516.1 heme biosynthesis HemY N-terminal domain-containing protein -
  QUE75_RS02760 (MAALD49_05460) - 603825..604937 (-) 1113 WP_286253472.1 uroporphyrinogen-III C-methyltransferase -
  QUE75_RS02765 (MAALD49_05470) - 604924..605763 (-) 840 WP_286253474.1 uroporphyrinogen-III synthase -
  QUE75_RS02770 (MAALD49_05480) hemC 605744..606685 (-) 942 WP_061332518.1 hydroxymethylbilane synthase -
  QUE75_RS02775 (MAALD49_05490) - 606771..607526 (-) 756 WP_061332519.1 LytTR family DNA-binding domain-containing protein -
  QUE75_RS02780 (MAALD49_05500) - 607557..608645 (-) 1089 WP_396631684.1 sensor histidine kinase -
  QUE75_RS02785 (MAALD49_05510) argH 608705..610120 (+) 1416 WP_106694422.1 argininosuccinate lyase -
  QUE75_RS02790 (MAALD49_05520) - 610271..612358 (+) 2088 WP_286253480.1 DUF2235 domain-containing protein -
  QUE75_RS02795 (MAALD49_05530) - 612355..612960 (+) 606 WP_318036579.1 DUF2931 family protein -
  QUE75_RS02800 (MAALD49_05540) - 612979..615420 (-) 2442 WP_286253482.1 EAL domain-containing protein -
  QUE75_RS02805 (MAALD49_05550) - 615560..618409 (+) 2850 WP_286253483.1 class I adenylate cyclase -
  QUE75_RS02810 (MAALD49_05560) - 618482..618652 (+) 171 WP_138436827.1 LPS translocon maturation chaperone LptM -
  QUE75_RS02815 (MAALD49_05570) lysA 618697..619947 (+) 1251 WP_106694427.1 diaminopimelate decarboxylase -
  QUE75_RS02820 (MAALD49_05580) dapF 619962..620876 (+) 915 WP_106694528.1 diaminopimelate epimerase -
  QUE75_RS02825 (MAALD49_05590) - 620894..621646 (+) 753 WP_061332525.1 DUF484 family protein -
  QUE75_RS02830 (MAALD49_05600) xerC 621682..622614 (+) 933 WP_106694529.1 tyrosine recombinase XerC -
  QUE75_RS02835 (MAALD49_05610) - 622626..624020 (+) 1395 WP_072678636.1 GGDEF domain-containing protein -
  QUE75_RS02840 (MAALD49_05620) - 624030..624263 (-) 234 WP_286253492.1 DUF2789 domain-containing protein -
  QUE75_RS02845 (MAALD49_05630) - 624361..627117 (+) 2757 WP_286253493.1 ATP-binding protein -
  QUE75_RS02850 (MAALD49_05640) - 627120..627485 (-) 366 WP_061332529.1 hypothetical protein -
  QUE75_RS02855 (MAALD49_05650) - 627717..628325 (+) 609 WP_286253496.1 hypothetical protein -
  QUE75_RS02860 (MAALD49_05660) - 628465..630543 (+) 2079 WP_286255587.1 patatin-like phospholipase family protein -
  QUE75_RS02865 (MAALD49_05670) - 630564..631226 (-) 663 WP_286253498.1 GntR family transcriptional regulator -
  QUE75_RS02870 (MAALD49_05680) cls 631304..632740 (-) 1437 WP_286253500.1 cardiolipin synthase -
  QUE75_RS02875 (MAALD49_05690) - 632737..633777 (-) 1041 WP_286253501.1 AraC family transcriptional regulator -
  QUE75_RS02880 (MAALD49_05700) - 634165..634956 (+) 792 WP_286253503.1 alpha/beta fold hydrolase -
  QUE75_RS02885 (MAALD49_05710) - 635059..636144 (+) 1086 WP_286255589.1 SMP-30/gluconolactonase/LRE family protein -
  QUE75_RS02890 (MAALD49_05720) - 636141..636605 (+) 465 WP_072678627.1 thioesterase family protein -
  QUE75_RS02895 (MAALD49_05730) - 636683..637663 (+) 981 WP_286253507.1 NADP-dependent oxidoreductase -
  QUE75_RS02900 (MAALD49_05740) tcdA 637660..638457 (-) 798 WP_106694438.1 tRNA cyclic N6-threonylcarbamoyladenosine(37) synthase TcdA -
  QUE75_RS02905 (MAALD49_05750) - 638715..640490 (-) 1776 WP_106694439.1 acyl-CoA dehydrogenase C-terminal domain-containing protein -
  QUE75_RS02910 (MAALD49_05760) - 640667..641401 (+) 735 WP_286253511.1 SDR family NAD(P)-dependent oxidoreductase -
  QUE75_RS02915 (MAALD49_05770) - 641420..642364 (-) 945 WP_286253513.1 sodium-dependent bicarbonate transport family permease -
  QUE75_RS02920 (MAALD49_05780) - 642539..643177 (-) 639 WP_286253515.1 ChrR family anti-sigma-E factor -
  QUE75_RS02925 (MAALD49_05790) - 643174..643773 (-) 600 WP_061332542.1 sigma-70 family RNA polymerase sigma factor -
  QUE75_RS02930 (MAALD49_05800) - 644001..645047 (+) 1047 WP_286253517.1 AraC family transcriptional regulator -
  QUE75_RS02935 (MAALD49_05810) speA 645051..646961 (-) 1911 WP_286253519.1 biosynthetic arginine decarboxylase -
  QUE75_RS02940 (MAALD49_05820) speE 647191..648072 (+) 882 WP_061332546.1 polyamine aminopropyltransferase -
  QUE75_RS02945 (MAALD49_05830) - 648165..648359 (+) 195 WP_061332547.1 DUF6316 family protein -
  QUE75_RS02950 (MAALD49_05840) - 648436..649683 (-) 1248 WP_061332548.1 ammonium transporter -
  QUE75_RS02955 (MAALD49_05850) glnK 649755..650093 (-) 339 WP_058091350.1 P-II family nitrogen regulator -
  QUE75_RS02960 (MAALD49_05860) - 650478..650729 (+) 252 WP_061332549.1 accessory factor UbiK family protein -
  QUE75_RS02965 (MAALD49_05870) comM 650803..652329 (+) 1527 WP_106694444.1 YifB family Mg chelatase-like AAA ATPase Machinery gene

Sequence


Protein


Length: 508 a.a.        Molecular weight: 53541.40 Da        Isoelectric point: 7.8410

>NTDB_id=103454 QUE75_RS02965 WP_106694444.1 650803..652329(+) (comM) [Marinobacter shengliensis strain D49]
MLAIVHSRASIGVSAPAVTVEVHLSGGLPALSIVGLPETGVRESKDRVRSALINAGFEFPARRITINLAPADLPKEGGRF
DLPIALGILAASGQIPAESLKDLEFIGELSLDGALRPLKGVLPAVLAARGAGRALLLPQDNAEEAALASDDDVFAASHIL
TVCEHLSGRARISPVARAQPEAGPRDEGLDLADVRGQQVPRRALEVAAAGGHNLLLFGPPGTGKSMLASRLPGILPALDD
AAAMEVASVHSVAGLPLKPGGWRQAPFRSPHHTASAVALVGGGSSPRPGEISLAHRGVLFLDELPEFQRRVLEVLREPME
TGEISISRAARQVTFPARFQVVAAMNPCPCGYSGHPTMECQCTPQQVMRYRSRISGPLLDRFDLHVEVPVQAGGVLLGAG
ETGESSASVRERVLRARARQSERGVLNAALAGKALHEASHLNAESEKLLSGAMEKLGLSARALHRILRVARTLADLDGQP
AVTRNYLMEALGYRQLDRQQGQSSVVSA

Nucleotide


Length: 1527 bp

>NTDB_id=103454 QUE75_RS02965 WP_106694444.1 650803..652329(+) (comM) [Marinobacter shengliensis strain D49]
ATGCTTGCTATTGTCCATTCCCGTGCCAGTATCGGTGTGTCTGCACCGGCAGTGACTGTTGAAGTGCATCTGTCTGGCGG
ACTGCCGGCCCTGTCGATTGTTGGATTGCCGGAAACCGGTGTGCGTGAGAGCAAAGACCGGGTTCGCAGTGCCTTGATCA
ACGCCGGGTTCGAATTCCCCGCCCGTCGAATCACCATTAACCTGGCCCCCGCAGACCTGCCCAAAGAGGGTGGACGCTTC
GACCTGCCCATCGCCCTTGGCATCCTCGCCGCTTCCGGGCAGATTCCCGCCGAAAGTCTCAAAGACCTCGAGTTCATTGG
CGAGTTGTCCCTGGATGGCGCATTACGGCCCCTGAAAGGCGTATTGCCGGCGGTACTGGCGGCAAGAGGCGCCGGTCGCG
CGCTGTTGCTTCCCCAAGACAATGCTGAAGAGGCCGCGCTGGCCAGCGATGATGACGTGTTTGCCGCCAGCCACATACTG
ACCGTCTGTGAGCATTTGTCTGGTCGCGCCCGGATATCCCCGGTAGCTCGAGCCCAGCCAGAAGCCGGGCCCCGGGACGA
GGGGCTGGACCTCGCGGACGTCCGGGGCCAGCAGGTTCCCCGACGGGCCCTGGAAGTGGCGGCTGCCGGCGGGCACAATC
TTCTGCTGTTTGGCCCGCCAGGCACCGGCAAAAGCATGCTGGCCAGCCGGCTGCCCGGCATACTGCCAGCCCTGGACGAC
GCCGCGGCTATGGAGGTGGCCAGTGTGCATTCAGTCGCCGGCCTTCCCCTCAAGCCCGGTGGTTGGCGGCAGGCTCCATT
CCGCTCCCCGCACCATACCGCTTCGGCGGTGGCGCTGGTGGGAGGTGGTAGCAGCCCAAGGCCCGGAGAAATCTCCCTGG
CCCATCGAGGCGTACTGTTTCTTGACGAGTTGCCAGAGTTCCAGCGCCGCGTGTTGGAGGTATTGCGGGAACCTATGGAA
ACCGGTGAAATTTCCATCAGTCGGGCTGCACGGCAGGTCACTTTTCCTGCCCGATTCCAGGTGGTTGCTGCTATGAACCC
TTGCCCGTGCGGCTACAGTGGGCATCCGACCATGGAGTGTCAGTGTACGCCGCAGCAGGTCATGCGCTATCGCTCCCGGA
TATCCGGGCCGTTGCTGGACCGGTTTGACCTGCACGTTGAAGTGCCGGTGCAGGCCGGGGGCGTGTTGTTGGGGGCGGGT
GAAACCGGAGAGTCCAGCGCCAGCGTCCGGGAACGGGTGCTACGGGCCCGGGCCAGGCAGTCAGAACGGGGCGTGCTCAA
TGCGGCCCTTGCTGGCAAGGCGTTGCACGAGGCCAGCCATCTGAATGCGGAGAGCGAGAAACTGCTGTCCGGAGCCATGG
AGAAACTTGGGTTGTCTGCGCGGGCGTTGCATCGGATTCTGCGAGTGGCCCGCACCCTGGCGGACCTGGATGGCCAGCCA
GCGGTAACCCGAAACTATCTGATGGAAGCGCTCGGCTACCGGCAACTGGACCGTCAACAGGGGCAAAGCTCGGTTGTCTC
CGCTTGA
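
The coordinates in the FASTA header can be cross-checked against the stated nucleotide and protein lengths. This is a small sketch, assuming the header always carries a `start..end(strand)` token in the form shown above. The function name `parse_header` is hypothetical.

```python
import re

header = (">NTDB_id=103454 QUE75_RS02965 WP_106694444.1 "
          "650803..652329(+) (comM) [Marinobacter shengliensis strain D49]")


def parse_header(line):
    """Extract (start, end, strand) from a 'start..end(strand)' token in the header."""
    match = re.search(r"(\d+)\.\.(\d+)\(([+-])\)", line)
    if match is None:
        raise ValueError("no coordinate token found in header")
    return int(match.group(1)), int(match.group(2)), match.group(3)


start, end, strand = parse_header(header)

# Coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end - start + 1

# The coding sequence ends with a stop codon (TGA here), which is not translated.
protein_len = span_bp // 3 - 1

print(span_bp, protein_len, strand)
```

The span of 1527 bp corresponds to 509 codons, and dropping the terminal stop codon gives the 508 a.a. reported for the protein.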


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20 55.777 98.819 0.551
  comM Glaesserella parasuis strain SC1401 55.777 98.819 0.551
  comM Vibrio cholerae strain A1552 55.422 98.031 0.543
  comM Vibrio campbellii strain DS40M4 54.819 98.031 0.537
  comM Legionella pneumophila str. Paris 49.597 97.638 0.484
  comM Legionella pneumophila strain ERS1305867 49.597 97.638 0.484
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 47.151 100 0.472
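
The Ha-value is not defined in this record. The listed values appear consistent with the product of the identity and coverage fractions, and the sketch below checks that reading against the table. Treat the formula as an assumption rather than the database's documented definition. The function name `ha_value` is hypothetical.

```python
# (protein, organism, identities %, coverage %, listed Ha-value), copied from the table
rows = [
    ("comM", "Haemophilus influenzae Rd KW20", 55.777, 98.819, 0.551),
    ("comM", "Glaesserella parasuis strain SC1401", 55.777, 98.819, 0.551),
    ("comM", "Vibrio cholerae strain A1552", 55.422, 98.031, 0.543),
    ("comM", "Vibrio campbellii strain DS40M4", 54.819, 98.031, 0.537),
    ("comM", "Legionella pneumophila str. Paris", 49.597, 97.638, 0.484),
    ("comM", "Legionella pneumophila strain ERS1305867", 49.597, 97.638, 0.484),
    ("RA0C_RS07335", "Riemerella anatipestifer ATCC 11845 = DSM 15868", 47.151, 100.0, 0.472),
]


def ha_value(identity_pct, coverage_pct):
    """Assumed formula: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Collect rows whose listed value differs from the assumed formula
# by more than the rounding of a three-decimal figure allows.
mismatches = [
    row for row in rows
    if abs(ha_value(row[2], row[3]) - row[4]) >= 0.001
]

for protein, organism, ident, cov, listed in rows:
    print(f"{protein}\t{organism}\t{ha_value(ident, cov):.3f}\t{listed}")
```

If the assumed formula holds, `mismatches` stays empty for every row in the table.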

