Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   R9X41_RS22365 Genome accession   NZ_CP137854
Coordinates   4839738..4841276 (+) Length   512 a.a.
NCBI ID   WP_318632627.1    Uniprot ID   -
Organism   Xylophilus sp. GOD-11R     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4834738..4846276
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R9X41_RS22345 (R9X41_22345) - 4835640..4836578 (-) 939 WP_318632624.1 SMP-30/gluconolactonase/LRE family protein -
  R9X41_RS22350 (R9X41_22350) amt 4836741..4838291 (-) 1551 WP_318632625.1 ammonium transporter -
  R9X41_RS22355 (R9X41_22355) glnK 4838317..4838655 (-) 339 WP_213954512.1 P-II family nitrogen regulator -
  R9X41_RS22360 (R9X41_22360) - 4838714..4839448 (-) 735 WP_318632626.1 TorF family putative porin -
  R9X41_RS22365 (R9X41_22365) comM 4839738..4841276 (+) 1539 WP_318632627.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  R9X41_RS22370 (R9X41_22370) - 4841289..4842749 (-) 1461 WP_318632628.1 D-aminoacylase -
  R9X41_RS22375 (R9X41_22375) - 4843068..4844057 (-) 990 WP_318632629.1 tripartite tricarboxylate transporter substrate binding protein -
  R9X41_RS22380 (R9X41_22380) - 4844142..4845221 (-) 1080 WP_318632630.1 threonine/serine dehydratase -
  R9X41_RS22385 (R9X41_22385) - 4845221..4846204 (-) 984 WP_318632631.1 N-carbamoyl-D-amino-acid hydrolase -

Sequence


Protein


Download         Length: 512 a.a.        Molecular weight: 53820.55 Da        Isoelectric Point: 9.4025

>NTDB_id=900922 R9X41_RS22365 WP_318632627.1 4839738..4841276(+) (comM) [Xylophilus sp. GOD-11R]
MSIAIVQSRALQGMQAAAVTVEVHLANGLPSFTLVGLADVEVKEARERVRSALLNAGLEFPHNKRITVNLAPADLPKDSG
RFDLPIALGILAASGQVDASQLAGWEFAGELSLSGELRPVRGALAMALALRRGSPDTRLVLPTVSAGEAALVPGREIYGA
RHLLDVVARFLPAGTEGPADPPEGWSRPGAQALATQPVYQDLGDVRGQAGAKRALEIAAAGGHSLLMVGPPGSGKSMLAQ
RFAGLLPPMSLDEALESAAIASLGGPFDLARWGQRPTRSPHHTASAVALVGGGSPPRPGEISMAHRGVLFLDELPEFARP
ALEALREPLETGQISISRAARRSEFPARFQLVAAMNPTPCGGLGTRENPYRSSPEQIQRYQARISGPLLDRIDLQIEVPA
LTPADLLQAPPGEDTATIRARVHAAREHALGRQGKANHALQAAEVETHAALDAPALAFLHRTATKLGWSARSTHRVMRVA
RTIADLAGATAVGVGHLAEAAQYRRALLVPQA

Nucleotide


Download         Length: 1539 bp        

>NTDB_id=900922 R9X41_RS22365 WP_318632627.1 4839738..4841276(+) (comM) [Xylophilus sp. GOD-11R]
ATGTCGATCGCGATTGTGCAAAGCCGGGCCCTGCAAGGAATGCAGGCCGCCGCGGTGACCGTGGAAGTGCATCTGGCCAA
CGGCTTGCCGAGCTTCACGCTGGTGGGTTTGGCCGATGTCGAAGTGAAGGAGGCCCGGGAGCGGGTGCGCTCGGCGCTGC
TGAACGCGGGGCTGGAATTTCCCCACAACAAGCGCATCACCGTGAACCTCGCGCCGGCCGATCTGCCCAAGGATTCCGGC
CGCTTCGACCTGCCGATCGCACTCGGGATTCTGGCGGCCAGCGGCCAGGTGGACGCATCGCAACTGGCCGGTTGGGAATT
CGCCGGCGAGCTCTCGCTCTCGGGTGAACTGCGGCCGGTGCGCGGGGCGCTTGCGATGGCGCTGGCCCTGCGGCGCGGCT
CGCCCGACACGCGGCTCGTACTGCCCACCGTGAGCGCTGGCGAAGCCGCGCTGGTGCCGGGCCGCGAAATCTACGGCGCG
CGCCATCTGCTCGACGTCGTCGCCCGCTTTTTGCCGGCCGGTACCGAAGGCCCGGCCGATCCGCCCGAAGGCTGGTCGCG
ACCCGGCGCGCAGGCGCTGGCCACGCAGCCGGTCTACCAGGATCTCGGCGACGTGCGAGGCCAGGCCGGCGCCAAGCGCG
CGCTGGAGATCGCCGCGGCCGGCGGCCACAGCCTGTTGATGGTGGGCCCGCCGGGTTCGGGCAAGTCGATGCTGGCGCAG
CGCTTCGCCGGCCTGCTGCCGCCGATGTCGCTCGACGAGGCACTGGAAAGCGCCGCCATCGCCAGCCTCGGTGGCCCGTT
CGACCTGGCCCGCTGGGGCCAGCGCCCGACCCGGTCGCCGCACCACACCGCCAGCGCGGTCGCGCTGGTGGGCGGAGGCT
CGCCACCACGGCCGGGCGAGATATCGATGGCGCACCGCGGGGTCTTGTTTCTGGACGAGCTTCCGGAGTTCGCCCGACCT
GCGCTGGAGGCGCTGCGCGAGCCGCTCGAGACGGGTCAGATCAGCATCTCGCGGGCTGCACGCCGGTCTGAATTCCCGGC
ACGCTTTCAGCTCGTGGCGGCGATGAATCCCACGCCCTGCGGCGGTCTCGGCACCCGCGAGAACCCGTACCGGTCGAGCC
CGGAGCAGATCCAGCGCTACCAGGCACGCATCTCCGGCCCGCTGCTCGACCGGATCGACCTGCAGATCGAAGTGCCCGCC
CTGACACCGGCCGACCTGCTGCAGGCGCCGCCCGGCGAAGACACCGCCACCATCCGGGCCCGGGTGCATGCGGCCCGCGA
ACACGCTTTGGGGCGACAGGGCAAGGCCAACCACGCCCTGCAGGCGGCCGAGGTGGAAACGCACGCCGCGCTCGATGCGC
CGGCCCTGGCCTTTCTGCACCGCACCGCCACCAAGCTGGGCTGGTCGGCGCGCAGCACGCACCGGGTGATGCGGGTGGCG
CGCACCATCGCCGACCTGGCCGGCGCCACCGCCGTCGGCGTCGGACATCTGGCAGAAGCGGCTCAGTACCGGCGAGCCCT
GCTCGTACCCCAGGCGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

52.734

100

0.527

  comM Glaesserella parasuis strain SC1401

51.765

99.609

0.516

  comM Vibrio cholerae strain A1552

52.174

98.828

0.516

  comM Vibrio campbellii strain DS40M4

50.395

98.828

0.498

  comM Legionella pneumophila str. Paris

47.665

100

0.479

  comM Legionella pneumophila strain ERS1305867

47.665

100

0.479

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.96

98.633

0.434