Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   RS899_RS25490 Genome accession   NZ_CP135919
Coordinates   5561675..5563165 (+) Length   496 a.a.
NCBI ID   WP_065348187.1    Uniprot ID   A0A1B8VE92
Organism   Pseudomonas sp. C9-3     
Function   require for natural transformation (predicted from homology)   
Unclear

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 5550063..5589828 5561675..5563165 within 0


Gene organization within MGE regions


Location: 5550063..5589828
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  RS899_RS25410 rnk 5552133..5552537 (-) 405 WP_045215433.1 nucleoside diphosphate kinase regulator -
  RS899_RS25415 - 5552721..5552927 (-) 207 WP_045215430.1 DUF1289 domain-containing protein -
  RS899_RS25420 cyaY 5552905..5553240 (-) 336 WP_024762102.1 iron donor protein CyaY -
  RS899_RS25425 - 5553237..5554298 (-) 1062 WP_315808355.1 AraC family transcriptional regulator -
  RS899_RS25430 lptM 5554452..5554592 (+) 141 WP_017520757.1 LPS translocon maturation chaperone LptM -
  RS899_RS25440 - 5554684..5554910 (+) 227 Protein_5043 diaminopimelate decarboxylase -
  RS899_RS25445 dapF 5554924..5555754 (+) 831 WP_054910832.1 diaminopimelate epimerase -
  RS899_RS25450 - 5555766..5556467 (+) 702 WP_045215418.1 DUF484 family protein -
  RS899_RS25455 xerC 5556521..5557426 (+) 906 WP_024762097.1 tyrosine recombinase XerC -
  RS899_RS25460 - 5557423..5558115 (+) 693 WP_315808356.1 HAD family hydrolase -
  RS899_RS25465 sutA 5558293..5558625 (-) 333 WP_037011556.1 transcriptional regulator SutA -
  RS899_RS25470 - 5558704..5559129 (-) 426 WP_054910834.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  RS899_RS25475 - 5559236..5560570 (-) 1335 WP_045215404.1 ammonium transporter -
  RS899_RS25480 glnK 5560612..5560950 (-) 339 WP_003457590.1 P-II family nitrogen regulator -
  RS899_RS25485 - 5561354..5561635 (+) 282 WP_054910835.1 accessory factor UbiK family protein -
  RS899_RS25490 comM 5561675..5563165 (+) 1491 WP_065348187.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  RS899_RS25495 - 5563185..5563874 (-) 690 WP_315808357.1 M15 family metallopeptidase -
  RS899_RS25500 - 5564014..5564244 (-) 231 WP_315808358.1 DUF1127 domain-containing protein -
  RS899_RS25505 - 5564429..5565877 (+) 1449 WP_038803555.1 PLP-dependent aminotransferase family protein -
  RS899_RS25510 - 5565960..5566763 (+) 804 WP_315808359.1 TlpA disulfide reductase family protein -
  RS899_RS25515 - 5567140..5567403 (+) 264 WP_315808360.1 hypothetical protein -
  RS899_RS25520 - 5567400..5568569 (+) 1170 WP_315808361.1 hypothetical protein -
  RS899_RS25525 - 5568688..5569164 (-) 477 WP_315808362.1 Bro-N domain-containing protein -
  RS899_RS25530 - 5569901..5570953 (-) 1053 WP_024762088.1 hypothetical protein -
  RS899_RS25535 - 5571138..5572046 (-) 909 WP_315808363.1 AraC family transcriptional regulator -
  RS899_RS25540 - 5572095..5573015 (-) 921 WP_315808364.1 LysR substrate-binding domain-containing protein -
  RS899_RS25545 - 5573153..5574547 (+) 1395 WP_315808365.1 NorM family multidrug efflux MATE transporter -
  RS899_RS25550 - 5574544..5576220 (-) 1677 WP_054910076.1 putative bifunctional diguanylate cyclase/phosphodiesterase -
  RS899_RS25555 rep 5576513..5578522 (+) 2010 WP_315808366.1 DNA helicase Rep -
  RS899_RS25560 - 5578867..5579442 (+) 576 WP_024762083.1 xanthine phosphoribosyltransferase -
  RS899_RS25565 - 5579594..5581369 (-) 1776 WP_315810475.1 acetyl-CoA hydrolase/transferase C-terminal domain-containing protein -
  RS899_RS25570 - 5581635..5582063 (-) 429 WP_054910073.1 c-type cytochrome -
  RS899_RS25575 - 5582236..5582784 (-) 549 WP_233297557.1 cupin domain-containing protein -
  RS899_RS25580 alr 5583088..5584167 (-) 1080 WP_233297558.1 alanine racemase -
  RS899_RS25585 - 5584250..5584603 (-) 354 WP_315808367.1 RidA family protein -
  RS899_RS25590 dadA 5584578..5585873 (-) 1296 WP_024762077.1 D-amino acid dehydrogenase -
  RS899_RS25595 - 5586235..5586579 (-) 345 WP_024762076.1 YdbL family protein -
  RS899_RS25600 - 5586594..5586785 (-) 192 WP_024762075.1 YnbE family lipoprotein -
  RS899_RS25605 - 5586803..5589367 (-) 2565 WP_315808368.1 YdbH domain-containing protein -

Sequence


Protein


Download         Length: 496 a.a.        Molecular weight: 52799.60 Da        Isoelectric Point: 7.7655

>NTDB_id=887680 RS899_RS25490 WP_065348187.1 5561675..5563165(+) (comM) [Pseudomonas sp. C9-3]
MSLAIVHSRAQVGVEAPGVIVEAHLANGLPSLTLVGLPEGAVKESKDRVRSALLNAGFDFPNRRITLNLAPADLPKDGGR
FDLAIALGILAASGQLADAAGLDELECLGELALSGTLRPVPGVLPAALAARAAGRALVVPKENAEEASLASGLTVYAVGH
LLELAAHFSGQERLRPYEANGLMRVTPPYPDLSEVQGQAAAKRALLVAAAGAHNLLFSGPPGTGKTLLASRLPGLMPPLD
EDEALQVAAIHSVAGRGPLTHWPQRPFRQPHHTASAPALVGGGSRPQPGEITLAHEGVLFLDELPEFDRKVLEVLREPLE
GGEIVIARANGRVRFPARFQLVAAMNPCPCGYLGDPSGRCRCSPEQIQRYRAKLSGPLLDRIDLHLTVNRESTTLAPSGP
STSSAELATQVAAARQRQLARQGCANAFLTLKKMHQYCALSPEDQAWLEKAGERLNLSLRALHRILKVARTLADLQQEDR
IERPHLAEALQYRAGH

Nucleotide


Download         Length: 1491 bp        

>NTDB_id=887680 RS899_RS25490 WP_065348187.1 5561675..5563165(+) (comM) [Pseudomonas sp. C9-3]
ATGTCCCTCGCCATCGTCCATAGCCGCGCCCAGGTCGGCGTGGAAGCGCCCGGCGTCATCGTCGAGGCACACCTGGCCAA
CGGCCTGCCTTCCCTGACCCTGGTCGGCCTGCCCGAAGGCGCCGTGAAAGAGAGCAAGGACCGCGTGCGCAGCGCCCTGC
TCAACGCCGGCTTCGATTTCCCCAACCGACGCATCACCCTGAACCTCGCCCCGGCCGACCTGCCGAAGGATGGCGGGCGT
TTCGACCTGGCCATCGCCCTGGGCATCCTCGCCGCCAGCGGCCAGTTGGCCGACGCTGCCGGGCTGGATGAACTGGAATG
CCTGGGCGAACTGGCGCTGTCCGGCACTTTGCGCCCGGTCCCCGGGGTATTACCTGCCGCCCTGGCTGCCCGCGCCGCCG
GCCGCGCCCTGGTGGTGCCGAAGGAGAACGCCGAAGAGGCCAGCCTCGCCAGCGGCCTGACCGTCTACGCCGTCGGCCAC
CTGCTGGAGCTGGCCGCGCACTTCAGCGGCCAGGAGCGACTGCGTCCTTATGAGGCGAACGGCCTGATGCGGGTGACGCC
GCCCTATCCGGACCTGTCGGAAGTCCAGGGCCAGGCCGCCGCCAAGCGCGCGCTATTGGTCGCCGCTGCCGGGGCGCACA
ACCTGCTGTTCAGCGGTCCGCCCGGCACCGGCAAGACCCTGCTCGCCAGCCGCCTGCCCGGCCTGATGCCGCCGCTGGAC
GAAGATGAGGCGTTGCAGGTTGCCGCGATCCATTCCGTGGCAGGTCGCGGGCCGCTGACTCACTGGCCACAGCGCCCCTT
CAGACAACCGCACCACACCGCCTCCGCACCGGCGCTGGTCGGCGGGGGAAGCCGCCCGCAGCCGGGCGAGATCACACTGG
CGCACGAAGGTGTGTTGTTCCTGGATGAATTGCCGGAGTTCGATCGCAAGGTCCTGGAGGTTCTCCGCGAACCGCTGGAA
GGGGGCGAAATTGTCATCGCCCGCGCCAACGGCCGTGTGCGCTTCCCGGCACGCTTCCAGTTGGTGGCAGCGATGAACCC
CTGCCCCTGCGGCTATCTCGGCGACCCCAGCGGGCGTTGCCGCTGCTCCCCGGAACAGATCCAGCGCTACCGCGCCAAGC
TCTCCGGGCCGTTGCTGGACCGCATCGACCTGCACCTCACCGTCAACCGGGAAAGCACCACCCTTGCCCCCAGCGGGCCC
AGCACCAGCAGCGCGGAACTGGCCACTCAGGTCGCCGCTGCCCGCCAGCGCCAGCTCGCCCGCCAGGGCTGCGCAAACGC
GTTTCTTACGCTGAAGAAAATGCACCAATATTGTGCGTTAAGCCCGGAAGACCAGGCCTGGCTGGAAAAGGCCGGCGAAC
GCCTGAACCTGTCACTGCGCGCCCTGCACCGGATTCTCAAGGTCGCCCGCACCCTTGCCGACCTGCAGCAGGAAGACCGC
ATCGAGCGCCCGCACCTGGCCGAAGCGCTGCAATACCGTGCGGGGCATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1B8VE92

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

55.315

100

0.567

  comM Vibrio cholerae strain A1552

55.734

100

0.558

  comM Vibrio campbellii strain DS40M4

55.131

100

0.552

  comM Glaesserella parasuis strain SC1401

53.846

100

0.55

  comM Legionella pneumophila str. Paris

48.419

100

0.494

  comM Legionella pneumophila strain ERS1305867

48.419

100

0.494

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

47.421

100

0.482