Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   C4J94_RS26520 Genome accession   NZ_CP027727
Coordinates   5766119..5767615 (+) Length   498 a.a.
NCBI ID   WP_124388738.1    Uniprot ID   A0A3G7YKG6
Organism   Pseudomonas sp. R5-89-07     
Function   ssDNA binding (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 5750465..5794555 5766119..5767615 within 0


Gene organization within MGE regions


Location: 5750465..5794555
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C4J94_RS26425 (C4J94_5298) - 5751813..5752472 (-) 660 WP_124388723.1 glutathione S-transferase family protein -
  C4J94_RS26430 (C4J94_5299) - 5752599..5752883 (+) 285 WP_124388724.1 hypothetical protein -
  C4J94_RS26435 (C4J94_5300) - 5752980..5753234 (+) 255 WP_003195403.1 TIGR02647 family protein -
  C4J94_RS26440 (C4J94_5301) - 5753403..5756243 (+) 2841 WP_124388725.1 class I adenylate cyclase -
  C4J94_RS26445 (C4J94_5302) rnk 5756240..5756653 (-) 414 WP_124388726.1 nucleoside diphosphate kinase regulator -
  C4J94_RS26450 (C4J94_5303) - 5756785..5757003 (-) 219 WP_124388727.1 DUF1289 domain-containing protein -
  C4J94_RS26455 (C4J94_5304) cyaY 5757006..5757338 (-) 333 WP_124388728.1 iron donor protein CyaY -
  C4J94_RS26460 (C4J94_5305) - 5757577..5757744 (+) 168 WP_003195417.1 lipoprotein -
  C4J94_RS26465 (C4J94_5306) lysA 5757754..5759001 (+) 1248 WP_124388729.1 diaminopimelate decarboxylase -
  C4J94_RS26470 (C4J94_5307) dapF 5759005..5759835 (+) 831 WP_124388730.1 diaminopimelate epimerase -
  C4J94_RS26475 (C4J94_5308) - 5759849..5760565 (+) 717 WP_124388731.1 DUF484 family protein -
  C4J94_RS26480 (C4J94_5309) xerC 5760568..5761467 (+) 900 WP_124388732.1 tyrosine recombinase XerC -
  C4J94_RS26485 (C4J94_5310) - 5761464..5762159 (+) 696 WP_124388733.1 HAD family hydrolase -
  C4J94_RS26490 (C4J94_5311) sutA 5762346..5762678 (-) 333 WP_124388734.1 transcriptional regulator SutA -
  C4J94_RS26495 (C4J94_5312) - 5762781..5763206 (-) 426 WP_124388735.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  C4J94_RS26500 (C4J94_5313) - 5763417..5764757 (-) 1341 WP_124388736.1 ammonium transporter -
  C4J94_RS26505 (C4J94_5314) glnK 5764792..5765130 (-) 339 WP_002555808.1 P-II family nitrogen regulator -
  C4J94_RS26510 (C4J94_5315) - 5765553..5765816 (+) 264 WP_124388737.1 accessory factor UbiK family protein -
  C4J94_RS28220 - 5765859..5765954 (-) 96 Protein_5193 N-acetyltransferase -
  C4J94_RS26520 (C4J94_5316) comM 5766119..5767615 (+) 1497 WP_124388738.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  C4J94_RS26525 (C4J94_5317) - 5767648..5769624 (-) 1977 WP_124388739.1 methyl-accepting chemotaxis protein -
  C4J94_RS26530 (C4J94_5318) - 5769802..5770722 (-) 921 WP_124388740.1 LysR substrate-binding domain-containing protein -
  C4J94_RS26535 (C4J94_5319) - 5770874..5772277 (+) 1404 WP_124388741.1 NorM family multidrug efflux MATE transporter -
  C4J94_RS26540 (C4J94_5320) - 5772320..5773984 (-) 1665 WP_124388742.1 bifunctional diguanylate cyclase/phosphodiesterase -
  C4J94_RS26545 (C4J94_5321) rep 5774249..5776258 (+) 2010 WP_124388743.1 DNA helicase Rep -
  C4J94_RS26550 (C4J94_5322) - 5776317..5776886 (+) 570 WP_003176890.1 xanthine phosphoribosyltransferase -
  C4J94_RS26555 (C4J94_5323) - 5777011..5778933 (-) 1923 WP_124388744.1 acetyl-CoA hydrolase/transferase C-terminal domain-containing protein -
  C4J94_RS26560 (C4J94_5324) - 5779071..5779481 (-) 411 WP_124389046.1 cytochrome c5 family protein -
  C4J94_RS26565 (C4J94_5325) - 5779662..5780210 (-) 549 WP_003176893.1 cupin domain-containing protein -
  C4J94_RS26570 (C4J94_5326) alr 5780327..5781400 (-) 1074 WP_124388745.1 alanine racemase -
  C4J94_RS26575 (C4J94_5327) - 5781478..5781831 (-) 354 WP_124388746.1 RidA family protein -
  C4J94_RS26580 (C4J94_5328) dadA 5781803..5783104 (-) 1302 WP_124388747.1 D-amino acid dehydrogenase -
  C4J94_RS26585 (C4J94_5329) - 5783261..5783749 (+) 489 WP_003176896.1 Lrp/AsnC ligand binding domain-containing protein -
  C4J94_RS26590 (C4J94_5330) - 5783814..5784167 (-) 354 WP_124388748.1 YkgJ family cysteine cluster protein -
  C4J94_RS26595 (C4J94_5331) - 5784282..5785574 (+) 1293 WP_124388749.1 FAD-binding oxidoreductase -
  C4J94_RS26600 (C4J94_5332) - 5785575..5785799 (-) 225 WP_124388750.1 DUF1127 domain-containing protein -
  C4J94_RS26605 (C4J94_5333) - 5785966..5787390 (+) 1425 WP_124388751.1 PLP-dependent aminotransferase family protein -
  C4J94_RS26610 (C4J94_5334) - 5787453..5789021 (+) 1569 WP_124388752.1 phospholipase D family protein -
  C4J94_RS26615 (C4J94_5335) - 5789036..5790178 (-) 1143 WP_124388753.1 MFS transporter -
  C4J94_RS26620 (C4J94_5336) - 5790344..5791837 (-) 1494 WP_124388754.1 aldehyde dehydrogenase -
  C4J94_RS26625 (C4J94_5337) - 5792067..5792429 (+) 363 WP_124388755.1 cupin domain-containing protein -
  C4J94_RS26630 (C4J94_5338) rpmG 5792503..5792658 (-) 156 WP_003176906.1 50S ribosomal protein L33 -
  C4J94_RS26635 (C4J94_5339) rpmB 5792670..5792903 (-) 234 WP_003176907.1 50S ribosomal protein L28 -

Sequence


Protein


Download         Length: 498 a.a.        Molecular weight: 53504.86 Da        Isoelectric Point: 8.0906

>NTDB_id=279492 C4J94_RS26520 WP_124388738.1 5766119..5767615(+) (comM) [Pseudomonas sp. R5-89-07]
MSLAIVHSRAQIGVEAPVVTVEVHMANGLPSLTLVGLPETAVKESKDRVRSAILNSALQYPARRITLNLAPADLPKDGGR
FDLAIALGILAASMQVPALMLDDVECLGELALSGEVRAVKGVLPAALAARKAGRTLIVPRANAEEACLASGLKVIAVDHL
LQVVAHLNGHVPIEPYKSDGLLYLNKPYPDLNEVQGQLAAKRALLIAAAGAHNLLFSGPPGTGKTLLASRLPGLLPPLSE
QEALEVAAIQSVVSLAPLSHWPHRPFRQPHHSASGPALVGGGSKPQPGEITLAHHGVLFLDELPEFDRKVLEVLREPLES
GHIVISRARDRVSFPARFQLVAAMNPCPCGYMGEPSGRCRCTPEQIQRYRNKLSGPLLDRIDLHLTVAREATALSPAQQT
GDNTAKTAALVADARERQQRRQGCANAFLDLPGLREYCTLAKVDEGWLESACERLTLSLRAAHRLLKVARTLADLEQTEA
IARHHLQEALQYRPAAIN

Nucleotide


Download         Length: 1497 bp        

>NTDB_id=279492 C4J94_RS26520 WP_124388738.1 5766119..5767615(+) (comM) [Pseudomonas sp. R5-89-07]
ATGTCCCTCGCCATCGTCCACAGCCGCGCCCAGATCGGCGTTGAAGCTCCCGTCGTTACCGTCGAAGTGCATATGGCCAA
TGGCTTGCCATCCCTGACCCTGGTGGGTTTGCCGGAAACCGCGGTCAAGGAAAGCAAGGACCGCGTTCGCAGCGCCATTC
TCAACTCGGCCCTGCAATACCCAGCGCGGCGCATCACGCTCAACCTCGCGCCCGCCGACCTGCCCAAGGACGGCGGGCGG
TTTGACCTGGCGATTGCCCTGGGGATCCTGGCGGCCAGCATGCAGGTGCCGGCATTGATGCTCGATGACGTCGAGTGCCT
GGGAGAGTTGGCGCTGTCCGGCGAGGTACGCGCGGTGAAAGGCGTGTTGCCGGCCGCGCTGGCGGCGCGCAAGGCCGGGC
GCACCCTGATAGTGCCCAGGGCGAACGCTGAGGAAGCTTGCCTGGCGTCAGGGTTGAAGGTGATTGCGGTAGATCACTTG
CTGCAGGTGGTGGCGCACTTGAATGGGCATGTACCGATCGAGCCCTACAAGTCCGACGGCTTGCTGTATTTGAATAAGCC
TTACCCGGACCTCAATGAAGTACAGGGCCAGTTGGCCGCCAAGCGTGCCTTGCTGATCGCCGCGGCGGGGGCGCATAACC
TGTTATTCAGCGGGCCGCCCGGCACCGGCAAGACCTTGCTCGCCAGCCGCCTGCCGGGCTTGCTGCCACCGCTCAGCGAG
CAGGAAGCACTGGAAGTCGCGGCGATTCAATCCGTGGTCAGCCTGGCCCCGCTAAGTCATTGGCCGCACCGGCCGTTCCG
CCAGCCGCACCACTCGGCGTCCGGCCCCGCGCTGGTCGGCGGAGGATCGAAGCCGCAACCGGGGGAAATCACCCTGGCCC
ATCACGGCGTGTTGTTCCTGGACGAGTTGCCGGAGTTCGACCGCAAAGTCCTGGAGGTACTGCGCGAGCCGCTCGAGTCG
GGACATATTGTGATTTCCCGCGCGCGGGACCGGGTGAGTTTCCCGGCGCGCTTCCAACTGGTCGCGGCGATGAACCCCTG
CCCTTGCGGTTATATGGGCGAACCCAGCGGCCGTTGTCGGTGCACGCCGGAACAGATTCAACGCTACCGCAACAAGCTCT
CGGGGCCGCTATTGGACCGCATCGATTTGCACCTGACCGTGGCGCGGGAGGCCACGGCACTGAGCCCGGCCCAACAGACC
GGCGACAACACTGCGAAAACCGCCGCCCTGGTTGCCGACGCACGGGAGCGCCAGCAACGACGCCAGGGCTGCGCCAATGC
GTTCCTCGATCTGCCGGGGCTGCGCGAGTACTGCACGCTGGCAAAGGTCGATGAAGGCTGGCTGGAAAGTGCATGTGAGC
GACTGACACTGTCGCTGCGCGCGGCTCATAGGCTGCTCAAGGTGGCGCGCACTTTGGCGGACCTCGAACAAACAGAAGCA
ATCGCTCGCCACCATCTGCAAGAGGCCTTGCAGTACCGTCCGGCGGCGATCAATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A3G7YKG6

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

56.566

99.398

0.562

  comM Vibrio campbellii strain DS40M4

56.364

99.398

0.56

  comM Glaesserella parasuis strain SC1401

54.89

100

0.552

  comM Haemophilus influenzae Rd KW20

54.8

100

0.55

  comM Legionella pneumophila str. Paris

51.008

99.598

0.508

  comM Legionella pneumophila strain ERS1305867

51.008

99.598

0.508

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.123

100

0.466


Multiple sequence alignment