Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   D3880_RS21620 Genome accession   NZ_CP032419
Coordinates   4748735..4750231 (+) Length   498 a.a.
NCBI ID   WP_119895464.1    Uniprot ID   A0A385Z9F1
Organism   Pseudomonas cavernae strain K2W31S-8     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 4719200..4775870 4748735..4750231 within 0


Gene organization within MGE regions


Location: 4719200..4775870
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D3880_RS21445 (D3880_21450) - 4719373..4720380 (-) 1008 WP_119895432.1 AlgP family protein -
  D3880_RS21450 (D3880_21455) - 4720509..4721153 (+) 645 WP_119895433.1 FKBP-type peptidyl-prolyl cis-trans isomerase -
  D3880_RS21455 (D3880_21460) rsd 4721166..4721639 (-) 474 WP_119895434.1 sigma D regulator -
  D3880_RS21460 (D3880_21465) - 4721919..4722410 (-) 492 WP_119895435.1 disulfide bond formation protein B -
  D3880_RS21465 (D3880_21470) - 4722523..4723776 (-) 1254 WP_119895436.1 heme biosynthesis protein HemY -
  D3880_RS21470 (D3880_21475) - 4723773..4724891 (-) 1119 WP_119895437.1 uroporphyrinogen-III C-methyltransferase -
  D3880_RS21475 (D3880_21480) - 4724908..4725678 (-) 771 WP_119895438.1 uroporphyrinogen-III synthase -
  D3880_RS21480 (D3880_21485) hemC 4725675..4726616 (-) 942 WP_119895807.1 hydroxymethylbilane synthase -
  D3880_RS21490 (D3880_21495) - 4726883..4727629 (-) 747 WP_119895440.1 LytR/AlgR family response regulator transcription factor -
  D3880_RS21495 (D3880_21500) - 4727626..4728708 (-) 1083 WP_119895441.1 sensor histidine kinase -
  D3880_RS21500 (D3880_21505) argH 4728966..4730360 (+) 1395 WP_119895442.1 argininosuccinate lyase -
  D3880_RS21505 (D3880_21510) - 4730424..4730909 (+) 486 WP_119895443.1 GNAT family N-acetyltransferase -
  D3880_RS21510 (D3880_21515) - 4730922..4731215 (+) 294 WP_119895444.1 hypothetical protein -
  D3880_RS21515 (D3880_21520) - 4731334..4731585 (+) 252 WP_119895445.1 TIGR02647 family protein -
  D3880_RS21520 (D3880_21525) - 4731676..4734525 (+) 2850 WP_119895446.1 class I adenylate cyclase -
  D3880_RS21525 (D3880_21530) rnk 4734634..4735041 (-) 408 WP_119895808.1 nucleoside diphosphate kinase regulator -
  D3880_RS21530 (D3880_21535) - 4735243..4735407 (-) 165 WP_119895447.1 DUF1289 domain-containing protein -
  D3880_RS21535 (D3880_21540) cyaY 4735424..4735756 (-) 333 WP_119895448.1 iron donor protein CyaY -
  D3880_RS21540 (D3880_21545) - 4735796..4736854 (-) 1059 WP_119895449.1 AraC family transcriptional regulator -
  D3880_RS21545 (D3880_21550) lptM 4737000..4737140 (+) 141 WP_119895450.1 LPS translocon maturation chaperone LptM -
  D3880_RS21550 (D3880_21555) lysA 4737154..4738401 (+) 1248 WP_119895451.1 diaminopimelate decarboxylase -
  D3880_RS21555 (D3880_21560) dapF 4738412..4739242 (+) 831 WP_119895452.1 diaminopimelate epimerase -
  D3880_RS21560 (D3880_21565) - 4739239..4739940 (+) 702 WP_119895453.1 DUF484 family protein -
  D3880_RS21565 (D3880_21570) xerC 4740001..4740900 (+) 900 WP_119895454.1 tyrosine recombinase XerC -
  D3880_RS21570 (D3880_21575) - 4740897..4741592 (+) 696 WP_119895455.1 HAD family hydrolase -
  D3880_RS21575 (D3880_21580) rfbB 4741791..4742873 (+) 1083 WP_119895456.1 dTDP-glucose 4,6-dehydratase -
  D3880_RS21580 (D3880_21585) rfbA 4742870..4743748 (+) 879 WP_119895457.1 glucose-1-phosphate thymidylyltransferase RfbA -
  D3880_RS21585 (D3880_21590) rfbC 4743745..4744287 (+) 543 WP_119895458.1 dTDP-4-dehydrorhamnose 3,5-epimerase -
  D3880_RS21590 (D3880_21595) rfbD 4744290..4745177 (+) 888 WP_119895459.1 dTDP-4-dehydrorhamnose reductase -
  D3880_RS21595 (D3880_21600) sutA 4745338..4745667 (-) 330 WP_119895460.1 transcriptional regulator SutA -
  D3880_RS21600 (D3880_21605) - 4745743..4746168 (-) 426 WP_119895461.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  D3880_RS21605 (D3880_21610) - 4746342..4747673 (-) 1332 WP_119895462.1 ammonium transporter -
  D3880_RS21610 (D3880_21615) glnK 4747707..4748045 (-) 339 WP_003096476.1 P-II family nitrogen regulator -
  D3880_RS21615 (D3880_21620) - 4748441..4748701 (+) 261 WP_119895463.1 accessory factor UbiK family protein -
  D3880_RS21620 (D3880_21625) comM 4748735..4750231 (+) 1497 WP_119895464.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  D3880_RS21625 (D3880_21630) - 4750332..4750883 (-) 552 WP_119895465.1 SRPBCC family protein -
  D3880_RS21630 (D3880_21635) - 4750993..4751439 (-) 447 WP_119895466.1 YybH family protein -
  D3880_RS21635 (D3880_21640) - 4751455..4752252 (-) 798 WP_119895467.1 DUF899 domain-containing protein -
  D3880_RS22835 - 4752239..4752388 (-) 150 WP_162935029.1 hypothetical protein -
  D3880_RS21640 (D3880_21645) - 4752399..4752743 (-) 345 WP_119895468.1 YciI family protein -
  D3880_RS21645 (D3880_21650) - 4753007..4754254 (-) 1248 WP_119895809.1 RNA polymerase sigma factor -
  D3880_RS21650 (D3880_21655) - 4754341..4755261 (-) 921 WP_119895469.1 LysR substrate-binding domain-containing protein -
  D3880_RS21655 (D3880_21660) - 4755389..4756774 (+) 1386 WP_119895470.1 NorM family multidrug efflux MATE transporter -
  D3880_RS21660 (D3880_21665) - 4756945..4758615 (-) 1671 WP_119895471.1 putative bifunctional diguanylate cyclase/phosphodiesterase -
  D3880_RS21665 (D3880_21670) rep 4758889..4760910 (+) 2022 WP_119895472.1 DNA helicase Rep -
  D3880_RS21670 (D3880_21675) - 4761133..4761705 (+) 573 WP_119895473.1 xanthine phosphoribosyltransferase -
  D3880_RS21675 (D3880_21680) - 4762004..4762426 (-) 423 WP_119895474.1 c-type cytochrome -
  D3880_RS21680 (D3880_21685) - 4762596..4763144 (-) 549 WP_119895475.1 cupin domain-containing protein -
  D3880_RS21685 (D3880_21690) alr 4763239..4764312 (-) 1074 WP_119895476.1 alanine racemase -
  D3880_RS21690 (D3880_21695) - 4764379..4764732 (-) 354 WP_119895477.1 RidA family protein -
  D3880_RS21695 (D3880_21700) dadA 4764707..4766005 (-) 1299 WP_119895478.1 D-amino acid dehydrogenase -
  D3880_RS21700 (D3880_21705) - 4766161..4766649 (+) 489 WP_119895479.1 Lrp/AsnC ligand binding domain-containing protein -
  D3880_RS21705 (D3880_21710) - 4766743..4767096 (-) 354 WP_119895480.1 YkgJ family cysteine cluster protein -
  D3880_RS21710 (D3880_21715) - 4767270..4768601 (+) 1332 WP_119895481.1 NAD(P)/FAD-dependent oxidoreductase -
  D3880_RS21715 (D3880_21720) - 4768619..4769761 (-) 1143 WP_119895482.1 MFS transporter -
  D3880_RS21720 (D3880_21725) - 4769911..4771404 (-) 1494 WP_119895483.1 aldehyde dehydrogenase -
  D3880_RS21725 (D3880_21730) - 4771629..4771994 (+) 366 WP_119895484.1 cupin domain-containing protein -
  D3880_RS21730 (D3880_21735) rpmG 4772139..4772294 (-) 156 WP_108108871.1 50S ribosomal protein L33 -
  D3880_RS21735 (D3880_21740) rpmB 4772305..4772541 (-) 237 WP_003291677.1 50S ribosomal protein L28 -
  D3880_RS21740 (D3880_21745) - 4772905..4774482 (+) 1578 WP_119895485.1 ABC transporter substrate-binding protein -
  D3880_RS21745 (D3880_21750) radC 4774524..4775198 (-) 675 WP_119895486.1 RadC family protein -

Sequence


Protein


Download         Length: 498 a.a.        Molecular weight: 52700.50 Da        Isoelectric Point: 7.9098

>NTDB_id=315717 D3880_RS21620 WP_119895464.1 4748735..4750231(+) (comM) [Pseudomonas cavernae strain K2W31S-8]
MSLAIVHSRALVGVEAPAVSVEAHLANGLPALALVGLPETSVKESKDRVRSAILNCGFDFPPRRITLNLAPADLPKDGGR
FDLAIALGILAASGQIPAAALSESECLGELALSGALRPVQGVLPAALAARAAGRTLIVPKANAEEASLASGLSVLAAEHL
LELAAHFNGSQPLAPYQAQGLLRQTPPYPDLAEVQGQQAAKRALLVAASGGHNLLLSGPPGTGKTLLASRLPGLLPPLDE
TEALEVAAIHSVAIHAPLEAWPQRPFRQPHHSASGPALVGGGSRPQPGEITLAHQGVLFLDELPEFDRKVLEVLREPLES
GHIVIARARDKVRFPARFQLVAAMNPCPCGYLHDPSGRCRCTPEQIQRYRGKLSGPLLDRIDLHLTVARESTSLQAPAEP
GQDSASVAARVAAARQRQLRRQGCANAVLDLTGLRQHCSLRADDQGWLEQACERLNLSLRAAHRLLKVARTLADLDQQDA
IGRPHLAEALQYRPSAAP

Nucleotide


Download         Length: 1497 bp        

>NTDB_id=315717 D3880_RS21620 WP_119895464.1 4748735..4750231(+) (comM) [Pseudomonas cavernae strain K2W31S-8]
ATGTCCCTGGCCATCGTCCACAGCCGCGCACTGGTCGGCGTCGAAGCGCCGGCCGTCAGCGTCGAAGCCCATCTGGCCAA
CGGCCTGCCCGCACTGGCGCTGGTCGGCCTGCCGGAAACCAGCGTCAAGGAAAGCAAGGACCGCGTGCGCAGCGCGATCC
TCAACTGCGGTTTCGATTTTCCGCCACGGCGCATCACCCTCAACCTGGCGCCTGCCGACCTACCCAAGGACGGCGGGCGT
TTCGACCTGGCCATCGCCCTCGGCATCCTCGCCGCCAGCGGCCAGATTCCCGCCGCGGCGCTGAGCGAAAGCGAATGCCT
CGGCGAACTGGCGCTGTCCGGCGCGCTGCGCCCGGTGCAGGGCGTGCTGCCGGCCGCCTTGGCCGCACGCGCGGCGGGAC
GCACCCTGATAGTGCCGAAAGCCAATGCCGAGGAAGCCAGCCTGGCCTCGGGACTCAGCGTACTGGCCGCCGAGCATCTG
CTGGAACTGGCGGCCCACTTCAACGGCAGCCAGCCGCTCGCGCCCTATCAGGCCCAGGGCCTGTTGCGGCAGACGCCGCC
CTACCCGGACCTCGCCGAGGTGCAGGGCCAGCAAGCGGCCAAGCGCGCCCTGCTGGTCGCCGCCAGTGGCGGCCATAACC
TGTTGCTCAGTGGCCCGCCGGGCACCGGCAAGACCCTGCTGGCCAGCCGCCTGCCCGGCCTGCTACCGCCGCTGGACGAA
ACGGAAGCCCTGGAGGTCGCCGCGATCCATTCGGTGGCTATCCATGCGCCGCTGGAGGCCTGGCCGCAGCGACCGTTTCG
CCAGCCGCATCACAGTGCCTCGGGGCCGGCGCTGGTCGGTGGCGGCAGCCGCCCGCAGCCGGGCGAAATCACCCTGGCGC
ACCAGGGCGTGCTGTTTCTCGACGAGCTGCCGGAGTTCGACCGCAAGGTGCTGGAGGTGCTGCGCGAGCCGCTGGAAAGC
GGCCATATCGTCATCGCTCGGGCCCGCGACAAGGTGCGCTTCCCGGCGCGCTTCCAGCTGGTGGCGGCGATGAACCCCTG
CCCCTGCGGCTACCTGCACGACCCCAGCGGGCGCTGCCGCTGCACCCCGGAACAGATTCAGCGCTACCGCGGCAAGCTGT
CCGGGCCACTGCTCGACCGCATCGACCTGCACCTCACCGTGGCGCGCGAGAGCACCTCGCTGCAGGCCCCCGCCGAACCC
GGCCAGGACAGCGCCAGCGTCGCCGCCCGGGTCGCCGCCGCGCGTCAGCGCCAACTGCGCCGCCAGGGCTGCGCCAACGC
CGTGCTCGACCTGACTGGCCTGCGTCAGCACTGCAGCTTGCGAGCCGACGACCAAGGCTGGCTGGAGCAGGCCTGCGAGC
GCCTCAACCTGTCGCTGCGCGCCGCCCACCGGCTGCTCAAGGTCGCACGTACCCTGGCCGATCTCGACCAACAGGACGCC
ATCGGCCGGCCGCATCTGGCCGAGGCGCTGCAGTACCGGCCAAGCGCGGCGCCGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A385Z9F1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

56.162

99.398

0.558

  comM Vibrio cholerae strain A1552

55.556

99.398

0.552

  comM Glaesserella parasuis strain SC1401

54.709

100

0.548

  comM Haemophilus influenzae Rd KW20

54.4

100

0.546

  comM Legionella pneumophila str. Paris

50.791

100

0.516

  comM Legionella pneumophila strain ERS1305867

50.791

100

0.516

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.614

100

0.47


Multiple sequence alignment