Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   MOP87_RS20345 Genome accession   NZ_CP093940
Coordinates   4529746..4531236 (+) Length   496 a.a.
NCBI ID   WP_242809936.1    Uniprot ID   -
Organism   Pseudomonas sp. I3-I5     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4524746..4536236
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MOP87_RS20325 (MOP87_20325) - 4525130..4526461 (-) 1332 WP_023532646.1 ammonium transporter -
  MOP87_RS20330 (MOP87_20330) glnK 4526509..4526847 (-) 339 WP_002555808.1 P-II family nitrogen regulator -
  MOP87_RS20335 (MOP87_20335) - 4527211..4527468 (+) 258 WP_027913405.1 accessory factor UbiK family protein -
  MOP87_RS20340 (MOP87_20340) - 4527504..4529513 (-) 2010 WP_110606422.1 DUF4034 domain-containing protein -
  MOP87_RS20345 (MOP87_20345) comM 4529746..4531236 (+) 1491 WP_242809936.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  MOP87_RS20350 (MOP87_20350) - 4531449..4531916 (+) 468 WP_242809937.1 helix-turn-helix domain-containing protein -
  MOP87_RS20355 (MOP87_20355) - 4531913..4532488 (+) 576 WP_242809938.1 PIN domain-containing protein -
  MOP87_RS20360 (MOP87_20360) - 4532669..4533445 (+) 777 WP_027913400.1 alpha/beta hydrolase -
  MOP87_RS20365 (MOP87_20365) - 4533445..4534242 (+) 798 WP_028686766.1 SDR family oxidoreductase -
  MOP87_RS20370 (MOP87_20370) - 4534371..4535282 (+) 912 WP_110606350.1 LysR substrate-binding domain-containing protein -

Sequence


Protein


Download         Length: 496 a.a.        Molecular weight: 52605.35 Da        Isoelectric Point: 7.6868

>NTDB_id=665762 MOP87_RS20345 WP_242809936.1 4529746..4531236(+) (comM) [Pseudomonas sp. I3-I5]
MSLALVHSRAQVGVQAPAVSVETHLANGLPHLTLVGLPETTVKESKDRVRSAIVNSGLNYPARRITQNLAPADLPKDGGR
YDLAIALGILAADAQVPVATLAELECLGELALSGALRPVQGVLPAALAARQAGRALVVPRENAEEASLASGLVVYAVGHL
LELVAHLNGQVPLAAYAANGLLLNPCPYPDLSEVQGQLAAKRALLLAAAGAHNLLFTGPPGTGKTLLASRLPGLLPPLDE
HEALEVAAIQSVSGHTPLNSWPQRPFRHPHHSASGPALVGGGSRPQPGEITLAHHGVLFLDELPEFERRVLEVLREPLES
GEIVIARARDKVRFPARFQLVAAMNPCPCGYLGDPTGRCRCSTEQISRYRNKLSGPLLDRIDLHLTVARESTTLNNQPCG
QSSADVALQVAHAREVQQRRQGCANAFLNLDGLRQHCSLAAPDQAWLEGACERLTLSLRAAHRLLKVARTLADLEGCTAI
GRAHLAEALQYRPGSG

Nucleotide


Download         Length: 1491 bp        

>NTDB_id=665762 MOP87_RS20345 WP_242809936.1 4529746..4531236(+) (comM) [Pseudomonas sp. I3-I5]
ATGTCTTTAGCCCTTGTTCACAGCCGCGCCCAGGTAGGCGTGCAGGCCCCGGCCGTCAGCGTTGAAACCCACCTGGCCAA
TGGCTTGCCTCATCTCACACTGGTCGGCTTGCCCGAAACCACGGTCAAGGAAAGCAAGGACCGCGTGCGCAGTGCCATCG
TCAACAGTGGCCTGAACTACCCAGCCCGGCGCATCACGCAAAACCTCGCTCCGGCCGACCTGCCCAAGGACGGTGGCCGC
TACGACCTGGCCATCGCCCTGGGCATCCTGGCGGCCGATGCCCAGGTGCCCGTGGCCACCCTGGCCGAGCTCGAATGCCT
GGGCGAGCTGGCACTGTCGGGCGCGTTGCGGCCGGTACAGGGCGTCTTGCCCGCAGCGCTTGCCGCACGCCAGGCGGGGC
GGGCACTGGTAGTGCCCCGTGAAAATGCCGAGGAAGCCAGCCTTGCCAGTGGCCTGGTGGTGTATGCCGTAGGCCATCTA
TTGGAGCTGGTGGCCCACCTGAACGGACAGGTACCGCTAGCTGCCTATGCCGCCAATGGCTTGCTCCTGAACCCTTGCCC
CTATCCCGACCTGAGCGAGGTGCAGGGGCAACTGGCGGCCAAACGCGCGCTGCTACTGGCCGCTGCCGGGGCGCACAACC
TGCTTTTCACCGGGCCGCCAGGTACTGGCAAAACCTTGTTGGCCAGCCGCCTGCCGGGCCTGCTGCCACCGCTGGATGAG
CATGAAGCGCTGGAGGTCGCCGCCATCCAGTCGGTCAGCGGGCACACCCCGCTGAACAGCTGGCCACAACGGCCGTTTCG
CCACCCTCATCATTCCGCCTCCGGACCTGCGCTGGTGGGTGGCGGAAGCCGGCCGCAGCCCGGGGAAATCACCTTGGCCC
ACCACGGGGTGCTGTTTCTGGACGAGTTGCCCGAATTCGAGCGCCGGGTACTGGAAGTGCTGCGTGAACCCCTGGAATCT
GGCGAGATCGTAATCGCCAGGGCGCGGGATAAAGTGCGCTTTCCCGCCCGCTTTCAGCTCGTCGCTGCCATGAACCCCTG
CCCTTGCGGCTACCTGGGCGACCCTACAGGGCGATGCCGCTGCAGCACCGAGCAGATATCCCGTTATCGCAACAAACTGT
CAGGCCCGCTACTGGACCGGATCGACTTGCACCTGACCGTGGCGCGCGAGAGCACCACGCTCAACAACCAGCCTTGCGGG
CAAAGCAGCGCCGACGTCGCCCTTCAAGTCGCGCACGCCCGGGAGGTGCAACAACGTCGGCAAGGGTGCGCCAATGCATT
TCTCAACCTCGATGGCCTGCGCCAGCACTGTAGCTTGGCAGCACCTGATCAGGCTTGGTTGGAAGGCGCCTGTGAACGAT
TGACGTTGTCGCTGCGAGCAGCACACCGATTGCTCAAGGTCGCCAGAACACTGGCAGACCTCGAGGGATGCACGGCAATT
GGCCGGGCGCACCTGGCTGAGGCCCTGCAGTACCGTCCCGGGAGTGGTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

55.8

100

0.563

  comM Vibrio campbellii strain DS40M4

55.444

100

0.554

  comM Vibrio cholerae strain A1552

55.354

99.798

0.552

  comM Glaesserella parasuis strain SC1401

53.8

100

0.542

  comM Legionella pneumophila str. Paris

49.899

100

0.5

  comM Legionella pneumophila strain ERS1305867

49.899

100

0.5

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.521

100

0.472