Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   G3M63_RS05525 Genome accession   NZ_CP048629
Coordinates   1197452..1198957 (-) Length   501 a.a.
NCBI ID   WP_163982104.1    Uniprot ID   A0A6C0TNR6
Organism   Pseudomonas sp. OIL-1     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 1193568..1204586 1197452..1198957 within 0


Gene organization within MGE regions


Location: 1193568..1204586
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G3M63_RS05515 (G3M63_05515) rep 1193568..1195574 (-) 2007 WP_150300044.1 DNA helicase Rep -
  G3M63_RS05520 (G3M63_05520) - 1195755..1197443 (+) 1689 WP_150300045.1 bifunctional diguanylate cyclase/phosphodiesterase -
  G3M63_RS05525 (G3M63_05525) comM 1197452..1198957 (-) 1506 WP_163982104.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  G3M63_RS05530 (G3M63_05530) - 1198988..1199245 (-) 258 WP_150300047.1 accessory factor UbiK family protein -
  G3M63_RS05535 (G3M63_05535) glnK 1199788..1200126 (+) 339 WP_093396590.1 P-II family nitrogen regulator -
  G3M63_RS05540 (G3M63_05540) - 1200173..1201417 (+) 1245 WP_178100951.1 ammonium transporter -
  G3M63_RS05545 (G3M63_05545) - 1201511..1201927 (+) 417 WP_163982105.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  G3M63_RS05550 (G3M63_05550) - 1201979..1202434 (+) 456 WP_150300050.1 hypothetical protein -
  G3M63_RS05555 (G3M63_05555) - 1202611..1202922 (+) 312 WP_150300051.1 hypothetical protein -
  G3M63_RS05560 (G3M63_05560) - 1202926..1203648 (-) 723 WP_163982106.1 HAD family hydrolase -
  G3M63_RS05565 (G3M63_05565) xerC 1203645..1204568 (-) 924 WP_163984191.1 tyrosine recombinase XerC -

Sequence


Protein


Download         Length: 501 a.a.        Molecular weight: 54028.91 Da        Isoelectric Point: 7.5388

>NTDB_id=421965 G3M63_RS05525 WP_163982104.1 1197452..1198957(-) (comM) [Pseudomonas sp. OIL-1]
MSLAIVHSRAKLGVQAPAVTVEVHLANGMPSLTLVGLPETAVRESKDRVRSALQNARLEFPGLKRITINLAPADLPKEGG
RFDLAIALGLLSASGQIPADVLDQCECLGELALSGELRPVQGVLPAALACRDQGRTLLLPRANAMEAALVRGLHIVAADH
LLELCEHLRGERTLEPVSPNPAPELPRYHADLDEVQGQAQAKRALIVAASGGHNLLLFGPPGTGKTLLASRLPGILPQLV
EGEALEVAAIQSVCGQGPLQTWPTRPFRAPHHGASAAALVGGGSQPRPGEISQAHRGVLFLDELPEFDRKVLEMLREPME
SGEIHIARAREQLSFPARFQLIAAMNPCPCGYQGDPSGRCRCSAEQVERYRGKLSGPLLDRIDLHVAMHNEAFSLANKEP
GKPNSAEAAAQVAAARQQQQQRQGRLNAHLDLAGLRDHCPLKDEDARWMERAIEQLGLSHRASHRALKVARTLADLDNQT
AIGRAHLAEALQYRQLDRRPS

Nucleotide


Download         Length: 1506 bp        

>NTDB_id=421965 G3M63_RS05525 WP_163982104.1 1197452..1198957(-) (comM) [Pseudomonas sp. OIL-1]
ATGTCCCTGGCGATTGTCCACAGCCGCGCCAAACTAGGCGTGCAGGCGCCTGCCGTCACCGTTGAGGTGCATCTGGCCAA
TGGAATGCCCAGCCTGACACTGGTCGGGTTGCCTGAAACCGCCGTGCGCGAAAGCAAGGATCGCGTACGCAGCGCCCTGC
AGAATGCCCGCCTCGAATTCCCCGGCCTGAAGCGCATCACCATCAATCTCGCGCCCGCTGACCTGCCCAAGGAAGGCGGA
CGCTTCGATCTGGCGATTGCACTGGGCCTGTTATCCGCCAGCGGTCAGATTCCCGCCGACGTGCTGGACCAGTGCGAATG
CCTGGGCGAGCTGGCGCTGTCCGGCGAGTTACGCCCGGTTCAGGGCGTGTTGCCAGCGGCGCTGGCGTGTCGGGACCAGG
GCCGCACGCTGTTGTTACCACGCGCCAATGCCATGGAAGCGGCATTAGTCCGGGGCTTGCACATTGTCGCCGCCGACCAC
CTGTTGGAGCTGTGCGAACACCTGCGCGGCGAACGCACCCTCGAACCAGTATCACCAAACCCTGCGCCAGAGCTGCCTCG
CTACCATGCCGACCTTGACGAGGTGCAGGGTCAGGCGCAGGCCAAACGCGCGCTGATCGTCGCGGCCAGCGGCGGTCACA
ATTTACTTTTATTTGGCCCTCCGGGTACGGGGAAAACCCTGCTGGCGAGTCGCCTACCCGGAATCCTCCCGCAGCTGGTC
GAGGGCGAAGCACTGGAAGTGGCGGCCATCCAATCGGTGTGTGGGCAGGGCCCGCTACAGACCTGGCCAACACGGCCCTT
CCGTGCGCCCCATCATGGTGCATCTGCTGCAGCGCTAGTCGGCGGCGGCAGTCAGCCGCGCCCCGGAGAGATCAGCCAGG
CCCACCGGGGCGTACTGTTTCTCGACGAATTGCCCGAGTTTGATCGCAAGGTCCTTGAGATGCTGCGTGAGCCGATGGAG
TCAGGCGAGATACACATCGCTCGCGCACGTGAACAACTGAGCTTCCCTGCCCGTTTTCAATTGATCGCAGCGATGAACCC
ATGCCCCTGCGGATACCAGGGAGACCCATCCGGACGCTGTCGCTGCTCCGCTGAGCAAGTGGAACGCTACCGCGGCAAAC
TGTCCGGCCCGCTGCTGGACCGGATCGACCTGCACGTCGCCATGCACAATGAAGCCTTCAGCCTTGCCAACAAAGAGCCT
GGCAAACCCAACAGCGCGGAAGCCGCCGCCCAGGTAGCCGCTGCACGCCAACAGCAGCAACAGCGCCAGGGTAGATTGAA
CGCGCATCTGGATCTGGCGGGCTTGCGTGACCATTGCCCGTTGAAGGATGAGGATGCACGCTGGATGGAGCGCGCTATAG
AGCAGCTGGGTTTATCCCACCGCGCCAGCCATCGGGCATTGAAAGTGGCGCGGACATTGGCGGATCTTGATAATCAGACG
GCTATCGGCCGAGCGCATCTGGCGGAGGCGTTGCAATACCGGCAACTGGATAGAAGGCCCAGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6C0TNR6

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

54.871

100

0.551

  comM Vibrio cholerae strain A1552

54.8

99.8

0.547

  comM Glaesserella parasuis strain SC1401

52.838

100

0.539

  comM Haemophilus influenzae Rd KW20

52.871

100

0.533

  comM Legionella pneumophila str. Paris

49.602

100

0.497

  comM Legionella pneumophila strain ERS1305867

49.602

100

0.497

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.365

100

0.471


Multiple sequence alignment