Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   MQG37_RS01500 Genome accession   NZ_CP093911
Coordinates   295939..296643 (+) Length   234 a.a.
NCBI ID   WP_005575487.1    Uniprot ID   -
Organism   Aggregatibacter actinomycetemcomitans strain 16R     
Function   require for natural transformation (predicted from homology)   
Unclear

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 293229..315194 295939..296643 within 0


Gene organization within MGE regions


Location: 293229..315194
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MQG37_RS01490 - 293229..295148 (-) 1920 Protein_288 Mu transposase C-terminal domain-containing protein -
  MQG37_RS01495 - 295141..295803 (-) 663 WP_005548207.1 heteromeric transposase endonuclease subunit TnsA -
  MQG37_RS01500 comM 295939..296643 (+) 705 WP_005575487.1 ATP-binding protein Machinery gene
  MQG37_RS01505 - 296655..297473 (-) 819 WP_005548203.1 divergent polysaccharide deacetylase family protein -
  MQG37_RS01510 envC 297479..298738 (-) 1260 WP_258286306.1 murein hydrolase activator EnvC -
  MQG37_RS01515 - 298947..300605 (-) 1659 WP_005548199.1 phospho-sugar mutase -
  MQG37_RS01520 - 300911..301594 (+) 684 WP_005548197.1 2,3-diphosphoglycerate-dependent phosphoglycerate mutase -
  MQG37_RS01525 - 301679..302173 (-) 495 WP_005548195.1 YfbU family protein -
  MQG37_RS01530 - 302191..302934 (-) 744 WP_258286307.1 VacJ family lipoprotein -
  MQG37_RS01535 nikA 303033..304616 (-) 1584 WP_005548193.1 nickel ABC transporter substrate-binding protein -
  MQG37_RS01540 hslU 304682..306013 (-) 1332 WP_005538655.1 HslU--HslV peptidase ATPase subunit -
  MQG37_RS01545 hslV 306034..306561 (-) 528 WP_005548189.1 ATP-dependent protease subunit HslV -
  MQG37_RS01550 aphA 306776..307483 (+) 708 WP_005548187.1 acid phosphatase AphA -
  MQG37_RS01555 - 307865..308182 (-) 318 Protein_301 DUF4298 domain-containing protein -
  MQG37_RS01560 mtr 308461..309705 (-) 1245 WP_005548180.1 tryptophan permease -
  MQG37_RS01565 ppa 309867..310394 (+) 528 WP_005548178.1 inorganic diphosphatase -
  MQG37_RS01570 - 310470..311243 (+) 774 WP_005548176.1 M48 family metallopeptidase -
  MQG37_RS01575 ribB 311923..312567 (+) 645 WP_005548174.1 3,4-dihydroxy-2-butanone-4-phosphate synthase -
  MQG37_RS01580 trpS 312654..313658 (-) 1005 WP_005548172.1 tryptophan--tRNA ligase -
  MQG37_RS01585 ftsY 313833..315194 (-) 1362 WP_005548170.1 signal recognition particle-docking protein FtsY -

Sequence


Protein


Download         Length: 234 a.a.        Molecular weight: 26184.03 Da        Isoelectric Point: 9.8867

>NTDB_id=665411 MQG37_RS01500 WP_005575487.1 295939..296643(+) (comM) [Aggregatibacter actinomycetemcomitans strain 16R]
MPLICNFDSGTIPKPGEISLAHNGVLFLDELPEFERKVLDALRQPLESGEIIISRANAKIQFPAKFQLIAAMNPSPTGHY
QGTHNRTSPQQVMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQNSGDRGESSATVREKVLKTRAIQLQRAGKINAHLNSKE
IERDCKLTDKDAQFLENALTKLGLSVRAYHRILKVSRTIADLDGEQHINQRHLAEALGYRAMDRLLQKLAKNSA

Nucleotide


Download         Length: 705 bp        

>NTDB_id=665411 MQG37_RS01500 WP_005575487.1 295939..296643(+) (comM) [Aggregatibacter actinomycetemcomitans strain 16R]
ATGCCACTTATTTGTAACTTTGACAGCGGCACGATCCCCAAACCGGGCGAAATTTCCCTCGCTCACAATGGCGTGCTTTT
CTTAGATGAGTTGCCCGAATTTGAACGCAAAGTGCTGGACGCGCTGCGTCAACCATTGGAAAGTGGCGAAATTATTATTT
CCCGCGCCAACGCCAAGATTCAATTTCCGGCAAAATTTCAACTGATTGCCGCGATGAATCCCAGCCCGACAGGGCATTAT
CAAGGCACGCATAATCGCACCTCACCACAACAAGTTATGCGTTATCTGAATCGGTTATCCGGCCCTTTTTTAGATCGCTT
CGATTTATCCATTGAAGTGCCCTTATTACCACAAGGCAGCTTACAAAATAGCGGCGATCGTGGCGAATCCAGCGCAACGG
TGCGAGAAAAAGTCCTGAAAACCCGTGCCATTCAACTTCAGCGCGCCGGCAAAATTAATGCCCATTTAAATAGCAAAGAA
ATCGAACGGGACTGCAAACTAACCGACAAAGACGCACAGTTCCTGGAAAATGCCCTGACTAAACTGGGGCTTTCCGTACG
AGCGTACCACCGCATACTGAAAGTGTCCCGCACCATTGCCGATTTAGATGGAGAACAACACATCAACCAGCGCCATCTGG
CCGAAGCTCTGGGTTATCGGGCAATGGACAGGTTGTTGCAAAAATTGGCGAAAAACTCGGCGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

86.957

98.291

0.855

  comM Glaesserella parasuis strain SC1401

81.034

99.145

0.803

  comM Vibrio cholerae strain A1552

67.873

94.444

0.641

  comM Vibrio campbellii strain DS40M4

65.909

94.017

0.62

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

52

96.154

0.5

  comM Legionella pneumophila str. Paris

51.869

91.453

0.474

  comM Legionella pneumophila strain ERS1305867

51.869

91.453

0.474