Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   CWR53_RS21905 Genome accession   NZ_CP025035
Coordinates   4865695..4867185 (+) Length   496 a.a.
NCBI ID   WP_058602804.1    Uniprot ID   A0AAD0B5C6
Organism   Pseudomonas sp. SGAir0191     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 4856634..4901035 4865695..4867185 within 0


Gene organization within MGE regions


Location: 4856634..4901035
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CWR53_RS21860 (CWR53_21865) dapF 4856828..4857658 (+) 831 WP_058602810.1 diaminopimelate epimerase -
  CWR53_RS21865 (CWR53_21870) - 4857672..4858382 (+) 711 WP_058638862.1 DUF484 family protein -
  CWR53_RS21870 (CWR53_21875) xerC 4858382..4859278 (+) 897 WP_100852476.1 tyrosine recombinase XerC -
  CWR53_RS21875 (CWR53_21880) - 4859278..4859973 (+) 696 WP_100852477.1 HAD family hydrolase -
  CWR53_RS21880 (CWR53_21885) sutA 4860243..4860572 (-) 330 WP_023532471.1 transcriptional regulator SutA -
  CWR53_RS21885 (CWR53_21890) - 4861079..4862410 (-) 1332 WP_058602806.1 ammonium transporter -
  CWR53_RS21890 (CWR53_21895) glnK 4862458..4862796 (-) 339 WP_002555808.1 P-II family nitrogen regulator -
  CWR53_RS21895 (CWR53_21900) - 4863160..4863417 (+) 258 WP_100852478.1 accessory factor UbiK family protein -
  CWR53_RS21900 (CWR53_21905) - 4863453..4865462 (-) 2010 WP_100852696.1 DUF4034 domain-containing protein -
  CWR53_RS21905 (CWR53_21910) comM 4865695..4867185 (+) 1491 WP_058602804.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  CWR53_RS21910 (CWR53_21915) - 4867295..4868302 (-) 1008 WP_138031838.1 hypothetical protein -
  CWR53_RS21915 (CWR53_21920) - 4868644..4869420 (+) 777 WP_100852480.1 alpha/beta fold hydrolase -
  CWR53_RS21920 (CWR53_21925) - 4869420..4870217 (+) 798 WP_058602802.1 SDR family NAD(P)-dependent oxidoreductase -
  CWR53_RS21925 (CWR53_21930) - 4870348..4871259 (+) 912 WP_100852481.1 LysR substrate-binding domain-containing protein -
  CWR53_RS21930 (CWR53_21935) - 4871461..4873302 (-) 1842 WP_058602800.1 amidohydrolase -
  CWR53_RS21935 (CWR53_21940) - 4873380..4874198 (-) 819 WP_100852482.1 alpha/beta fold hydrolase -
  CWR53_RS21940 (CWR53_21945) - 4874236..4874802 (-) 567 WP_100852483.1 antibiotic biosynthesis monooxygenase -
  CWR53_RS21945 (CWR53_21950) ycaC 4874916..4875545 (-) 630 WP_100852484.1 isochorismate family cysteine hydrolase YcaC -
  CWR53_RS21950 (CWR53_21955) - 4875769..4877196 (+) 1428 WP_058638853.1 mechanosensitive ion channel family protein -
  CWR53_RS21955 (CWR53_21960) - 4877226..4878512 (-) 1287 WP_058602796.1 FAD-binding oxidoreductase -
  CWR53_RS21960 (CWR53_21965) - 4878672..4880162 (-) 1491 WP_058602795.1 aldehyde dehydrogenase family protein -
  CWR53_RS21965 (CWR53_21970) - 4880296..4881186 (+) 891 WP_100852485.1 LysR family transcriptional regulator -
  CWR53_RS21970 (CWR53_21975) - 4881323..4882717 (+) 1395 WP_100852486.1 VOC family protein -
  CWR53_RS23620 - 4882756..4883616 (-) 861 WP_375335507.1 methyl-accepting chemotaxis protein -
  CWR53_RS23625 - 4883659..4884654 (-) 996 Protein_4308 Cache 3/Cache 2 fusion domain-containing protein -
  CWR53_RS21980 (CWR53_21985) - 4884810..4885736 (-) 927 WP_058602791.1 LysR substrate-binding domain-containing protein -
  CWR53_RS21985 (CWR53_21990) - 4885891..4887279 (+) 1389 WP_100852487.1 NorM family multidrug efflux MATE transporter -
  CWR53_RS21990 (CWR53_21995) - 4887328..4888995 (-) 1668 WP_100852488.1 bifunctional diguanylate cyclase/phosphodiesterase -
  CWR53_RS21995 (CWR53_22000) rep 4889286..4891295 (+) 2010 WP_058602788.1 DNA helicase Rep -
  CWR53_RS22000 (CWR53_22005) xpt 4891358..4891930 (+) 573 WP_023532789.1 xanthine phosphoribosyltransferase -
  CWR53_RS22005 (CWR53_22010) - 4891990..4893846 (-) 1857 WP_100852489.1 acetyl-CoA hydrolase/transferase C-terminal domain-containing protein -
  CWR53_RS22010 (CWR53_22020) - 4894522..4894971 (-) 450 WP_228385158.1 cytochrome c5 family protein -
  CWR53_RS22015 (CWR53_22025) - 4895081..4895629 (-) 549 WP_023532512.1 cupin domain-containing protein -
  CWR53_RS22020 (CWR53_22030) alr 4895724..4896797 (-) 1074 WP_058638844.1 alanine racemase -
  CWR53_RS22025 (CWR53_22035) dadA 4896878..4898182 (-) 1305 WP_023532358.1 D-amino acid dehydrogenase -
  CWR53_RS22030 (CWR53_22040) dadR 4898337..4898825 (+) 489 WP_003258963.1 transcriptional regulator DadR -
  CWR53_RS22035 (CWR53_22045) - 4898848..4899201 (-) 354 WP_023532519.1 YkgJ family cysteine cluster protein -
  CWR53_RS22040 (CWR53_22050) - 4899304..4900596 (+) 1293 WP_100852491.1 FAD-binding oxidoreductase -
  CWR53_RS22045 (CWR53_22055) - 4900603..4900833 (-) 231 WP_078479345.1 DUF1127 domain-containing protein -

Sequence


Protein


Download         Length: 496 a.a.        Molecular weight: 52674.39 Da        Isoelectric Point: 7.5368

>NTDB_id=257553 CWR53_RS21905 WP_058602804.1 4865695..4867185(+) (comM) [Pseudomonas sp. SGAir0191]
MSLALVHSRAQVGVQAPAVSVETHLANGLPHLTLVGLPETTVKESKDRVRSAIVNSGLNYPARRITQNLAPADLPKDGGR
YDLAIALGILAADAQVPVAALTELECLGELALSGTLRPVQGVLPAALAAREAGRALVVPRENAEEASLASGLVVYAVGHL
LELVAHLNGQVPLPAYAANGLLLDPRPYPDLSEVQGQLAAKRALLLAAAGAHNLLFTGPPGTGKTLLASRLPGLLPPLDE
HEALEVAAIQSVSGHTPLSSWPQRPFRHPHHSASGPALVGGGSRPQPGEITLAHHGVLFLDELPEFERRVLEVLREPLES
GEIVIARARDKVRFPARFQLVAAMNPCPCGYLGDPTGRCRCSTEQISRYRNKLSGPLLDRIDLHLTVARESTTLNNRPCG
QSSADVALQVAHARDVQQRRQGCANAFLDLDGLRQHCSLAAPDQAWLEGACERLTLSLRAAHRLLKVARTLADLEGCTAI
GRAHLAEALQYRPGGG

Nucleotide


Download         Length: 1491 bp        

>NTDB_id=257553 CWR53_RS21905 WP_058602804.1 4865695..4867185(+) (comM) [Pseudomonas sp. SGAir0191]
ATGTCTTTAGCCCTTGTCCACAGCCGCGCCCAGGTAGGCGTGCAGGCCCCGGCCGTCAGCGTCGAAACCCACTTGGCCAA
TGGCTTGCCTCATCTCACATTGGTCGGCTTGCCCGAAACCACGGTCAAGGAAAGCAAGGACCGCGTGCGCAGTGCCATCG
TCAACAGTGGCCTGAATTACCCAGCGCGGCGCATCACGCAAAACCTCGCACCCGCCGACCTGCCCAAGGACGGTGGCCGC
TATGACCTGGCCATCGCCCTGGGCATCCTGGCGGCCGACGCCCAGGTACCCGTTGCCGCCCTGACCGAGCTCGAATGCCT
GGGCGAGCTGGCGCTGTCAGGCACGTTGCGGCCGGTGCAGGGCGTCTTGCCGGCAGCACTCGCCGCACGCGAAGCTGGCC
GAGCGCTGGTAGTACCCCGTGAAAATGCCGAGGAAGCCAGCCTTGCCAGCGGATTGGTGGTGTATGCCGTTGGCCATCTA
CTGGAGCTGGTGGCCCACCTGAACGGGCAGGTACCGCTACCTGCCTATGCCGCCAATGGCTTGCTCCTGGACCCTCGCCC
CTATCCCGACCTGAGCGAGGTGCAGGGGCAACTGGCCGCCAAACGCGCACTGCTACTGGCCGCTGCCGGGGCGCACAACC
TGCTTTTCACCGGGCCGCCGGGCACTGGCAAAACCTTGTTGGCCAGCCGCCTGCCGGGCCTGCTGCCACCGCTGGATGAG
CATGAAGCGCTGGAAGTCGCCGCCATCCAGTCGGTCAGCGGCCACACCCCGCTGAGCAGCTGGCCGCAGCGACCGTTTCG
CCACCCTCACCACTCCGCCTCCGGGCCTGCGCTGGTGGGTGGCGGAAGCCGGCCGCAGCCCGGGGAAATCACCTTGGCCC
ACCACGGGGTGCTGTTTCTGGACGAGTTGCCCGAATTCGAGCGTCGGGTACTGGAAGTGCTGCGTGAACCCTTGGAGTCT
GGCGAGATCGTGATCGCCAGGGCGCGGGATAAAGTGCGCTTCCCTGCCCGCTTTCAGTTGGTCGCCGCCATGAACCCCTG
CCCTTGCGGCTACCTGGGCGACCCCACAGGGCGGTGCCGTTGCAGTACCGAACAAATCTCCCGTTATCGCAACAAACTGT
CAGGCCCGCTGCTGGACCGGATCGACTTGCACCTGACCGTGGCGCGCGAGAGCACCACACTCAACAATCGGCCCTGCGGG
CAAAGCAGTGCCGACGTCGCCCTCCAGGTCGCGCACGCCCGGGACGTGCAACAGCGTCGGCAAGGGTGCGCCAATGCATT
CCTCGACCTGGACGGGCTGCGCCAGCACTGTAGCTTGGCAGCACCTGATCAGGCTTGGCTGGAAGGCGCCTGTGAACGGC
TGACGCTGTCGCTGCGAGCAGCACACCGATTGCTCAAGGTCGCCAGGACACTGGCAGACCTTGAGGGATGCACGGCGATT
GGCCGGGCGCATCTGGCCGAGGCCCTGCAGTACCGTCCTGGGGGTGGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

55.556

99.798

0.554

  comM Haemophilus influenzae Rd KW20

55

100

0.554

  comM Vibrio cholerae strain A1552

54.949

99.798

0.548

  comM Glaesserella parasuis strain SC1401

53.8

100

0.542

  comM Legionella pneumophila str. Paris

48.992

100

0.49

  comM Legionella pneumophila strain ERS1305867

48.992

100

0.49

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.521

100

0.472


Multiple sequence alignment