Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   MQG43_RS01500 Genome accession   NZ_CP093910
Coordinates   295748..296452 (+) Length   234 a.a.
NCBI ID   WP_309137068.1    Uniprot ID   -
Organism   Aggregatibacter actinomycetemcomitans strain 27R     
Function   require for natural transformation (predicted from homology)   
Unclear

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 293038..315003 295748..296452 within 0


Gene organization within MGE regions


Location: 293038..315003
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MQG43_RS01490 - 293038..294957 (-) 1920 WP_005559215.1 Mu transposase C-terminal domain-containing protein -
  MQG43_RS01495 - 294950..295612 (-) 663 WP_005548207.1 heteromeric transposase endonuclease subunit TnsA -
  MQG43_RS01500 comM 295748..296452 (+) 705 WP_309137068.1 ATP-binding protein Machinery gene
  MQG43_RS01505 - 296464..297282 (-) 819 WP_005548203.1 divergent polysaccharide deacetylase family protein -
  MQG43_RS01510 envC 297288..298547 (-) 1260 WP_032999579.1 murein hydrolase activator EnvC -
  MQG43_RS01515 - 298756..300414 (-) 1659 WP_005548199.1 phospho-sugar mutase -
  MQG43_RS01520 - 300720..301403 (+) 684 WP_005548197.1 2,3-diphosphoglycerate-dependent phosphoglycerate mutase -
  MQG43_RS01525 - 301488..301982 (-) 495 WP_005548195.1 YfbU family protein -
  MQG43_RS01530 - 302000..302743 (-) 744 WP_032996149.1 VacJ family lipoprotein -
  MQG43_RS01535 nikA 302842..304425 (-) 1584 WP_258271245.1 nickel ABC transporter substrate-binding protein -
  MQG43_RS01540 hslU 304491..305822 (-) 1332 WP_005538655.1 HslU--HslV peptidase ATPase subunit -
  MQG43_RS01545 hslV 305843..306370 (-) 528 WP_005548189.1 ATP-dependent protease subunit HslV -
  MQG43_RS01550 aphA 306585..307292 (+) 708 WP_005548187.1 acid phosphatase AphA -
  MQG43_RS01555 - 307674..307991 (-) 318 Protein_302 DUF4298 domain-containing protein -
  MQG43_RS01560 mtr 308270..309514 (-) 1245 WP_258271246.1 tryptophan permease -
  MQG43_RS01565 ppa 309676..310203 (+) 528 WP_005548178.1 inorganic diphosphatase -
  MQG43_RS01570 - 310279..311052 (+) 774 WP_005548176.1 M48 family metallopeptidase -
  MQG43_RS01575 ribB 311732..312376 (+) 645 WP_005548174.1 3,4-dihydroxy-2-butanone-4-phosphate synthase -
  MQG43_RS01580 trpS 312463..313467 (-) 1005 WP_005575514.1 tryptophan--tRNA ligase -
  MQG43_RS01585 ftsY 313642..315003 (-) 1362 WP_005548170.1 signal recognition particle-docking protein FtsY -

Sequence


Protein


Download         Length: 234 a.a.        Molecular weight: 26218.05 Da        Isoelectric Point: 9.8867

>NTDB_id=665386 MQG43_RS01500 WP_309137068.1 295748..296452(+) (comM) [Aggregatibacter actinomycetemcomitans strain 27R]
MPLICNFDSGTIPKPGEISLAHNGVFFLDELPEFERKVLDALRQPLESGEIIISRANAKIQFPAKFQLIAAMNPSPTGHY
QGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQNSGDRGESSATVREKVLKTRAIQLQRAGKINAHLNSKE
IERDCKLTDKDAQFLENALTKLGLSVRAYHRILKVSRTIADLDGEQHINQRHLAEALGYRAMDRLLQKLAKNSA

Nucleotide


Download         Length: 705 bp        

>NTDB_id=665386 MQG43_RS01500 WP_309137068.1 295748..296452(+) (comM) [Aggregatibacter actinomycetemcomitans strain 27R]
ATGCCACTTATTTGTAACTTTGACAGCGGCACGATCCCCAAACCGGGCGAAATTTCCCTCGCTCACAATGGCGTGTTTTT
CTTAGATGAGTTGCCCGAATTTGAACGCAAAGTGCTGGACGCGCTGCGTCAACCATTGGAAAGTGGCGAAATTATTATTT
CCCGCGCCAACGCCAAGATTCAATTTCCGGCAAAATTTCAACTGATTGCCGCGATGAATCCCAGCCCGACAGGGCATTAT
CAAGGCACGCATAATCGCACCTCACCACAACAAGTTATGCGTTATCTGAATCGGTTATCCGGCCCTTTTTTAGATCGCTT
CGATTTATCCATTGAAGTGCCCTTATTACCACAAGGCAGCTTACAAAATAGCGGCGATCGTGGCGAATCCAGCGCAACGG
TGCGAGAAAAAGTCCTGAAAACCCGTGCCATTCAACTTCAGCGCGCCGGCAAAATTAATGCCCATTTAAATAGCAAAGAA
ATCGAACGGGACTGCAAACTAACCGACAAAGACGCACAGTTCCTGGAAAATGCCCTGACTAAACTGGGGCTTTCCGTACG
AGCGTACCACCGCATACTGAAAGTGTCCCGCACCATTGCCGATTTAGATGGAGAACAACACATCAACCAGCGCCATCTGG
CCGAAGCTCTGGGTTATCGGGCAATGGACAGGTTGTTGCAAAAATTGGCGAAAAACTCGGCGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

86.522

98.291

0.85

  comM Glaesserella parasuis strain SC1401

80.603

99.145

0.799

  comM Vibrio cholerae strain A1552

67.421

94.444

0.637

  comM Vibrio campbellii strain DS40M4

65.455

94.017

0.615

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

51.556

96.154

0.496

  comM Legionella pneumophila str. Paris

51.402

91.453

0.47

  comM Legionella pneumophila strain ERS1305867

51.402

91.453

0.47