Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   LG197_RS04050 Genome accession   NZ_CP084714
Coordinates   898963..899643 (+) Length   226 a.a.
NCBI ID   WP_274397077.1    Uniprot ID   -
Organism   Pseudomonas asiatica strain P1     
Function   require for natural transformation (predicted from homology)   
Unclear

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IScluster/Tn 895155..898266 898963..899643 flank 697


Gene organization within MGE regions


Location: 895155..899643
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LG197_RS04030 (LG197_04040) - 895155..896135 (-) 981 WP_024086516.1 IS5-like element ISPa26 family transposase -
  LG197_RS04035 (LG197_04045) - 896178..897080 (-) 903 WP_274396172.1 helix-turn-helix domain-containing protein -
  LG197_RS04040 (LG197_04050) - 897286..898266 (+) 981 WP_038410028.1 IS5-like element ISPa16 family transposase -
  LG197_RS04045 (LG197_04055) - 898402..898935 (-) 534 WP_274396175.1 TnsA endonuclease N-terminal domain-containing protein -
  LG197_RS04050 (LG197_04060) comM 898963..899643 (+) 681 WP_274397077.1 ATP-binding protein Machinery gene

Sequence


Protein


Download         Length: 226 a.a.        Molecular weight: 25003.50 Da        Isoelectric Point: 7.2723

>NTDB_id=612672 LG197_RS04050 WP_274397077.1 898963..899643(+) (comM) [Pseudomonas asiatica strain P1]
MFSFYDVFYSGSSRPQPGEITLAHHGVLFLDELPEFERRVLEVLREPLESGEIVIARARDKVRFPARFQLVAAMNPCPCG
YLGDPTGRCRCSTEQIARYRNKLSGPLLDRIDLHLTVARESTTLNNQPCGETSADVAAKVAEARELQQRRQGCANAFLDL
EGLRCHCGLAAADQAWLEGACERLTLSLRAAHRLLKVARTLADLEGNQAIGRAHLAEALQYRPGSS

Nucleotide


Download         Length: 681 bp        

>NTDB_id=612672 LG197_RS04050 WP_274397077.1 898963..899643(+) (comM) [Pseudomonas asiatica strain P1]
ATTTTCTCATTTTACGATGTGTTTTACAGCGGCAGCAGCCGGCCGCAACCGGGCGAAATCACCCTCGCCCACCATGGCGT
GCTGTTTCTGGATGAACTACCTGAGTTCGAGCGTCGCGTGCTGGAAGTGTTGCGCGAACCGCTGGAGTCGGGCGAGATCG
TGATTGCCCGGGCCCGCGACAAGGTGCGCTTCCCCGCACGTTTCCAGCTGGTGGCAGCAATGAACCCCTGCCCCTGCGGC
TACCTGGGCGACCCCACTGGCCGCTGCCGCTGTAGCACCGAGCAGATCGCGCGGTATCGCAACAAGCTGTCCGGGCCATT
GCTGGACCGTATCGACCTGCACCTGACCGTGGCCCGGGAGAGCACCACGCTGAACAACCAGCCCTGCGGCGAGACCAGTG
CCGATGTGGCGGCCAAGGTAGCCGAGGCACGTGAACTGCAACAGCGGCGGCAAGGGTGTGCGAATGCATTTCTCGACCTT
GAGGGGTTGCGCTGCCATTGTGGGTTGGCGGCGGCGGACCAGGCCTGGTTGGAGGGGGCCTGTGAGCGGCTGACCCTGTC
GTTACGTGCGGCTCACCGGTTACTGAAGGTGGCGCGGACGCTGGCGGATCTGGAAGGTAACCAGGCAATTGGCCGGGCGC
ATCTGGCCGAGGCCCTGCAGTACCGGCCGGGGAGCAGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

55.349

95.133

0.527

  comM Glaesserella parasuis strain SC1401

52.968

96.903

0.513

  comM Vibrio cholerae strain A1552

52.535

96.018

0.504

  comM Vibrio campbellii strain DS40M4

51.152

96.018

0.491

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

49.77

96.018

0.478

  comM Legionella pneumophila str. Paris

47.706

96.46

0.46

  comM Legionella pneumophila strain ERS1305867

47.706

96.46

0.46