Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   WGR79_RS25060 Genome accession   NZ_CP148970
Coordinates   5074445..5075965 (+) Length   506 a.a.
NCBI ID   WP_060416009.1    Uniprot ID   -
Organism   Klebsiella pneumoniae strain FUJ80029     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 5069445..5080965
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  WGR79_RS25035 (WGR79_24950) ilvE 5071005..5071934 (-) 930 WP_002883171.1 branched-chain-amino-acid transaminase -
  WGR79_RS25040 (WGR79_24955) ilvM 5071951..5072208 (-) 258 WP_002883170.1 acetolactate synthase 2 small subunit -
  WGR79_RS25045 (WGR79_24960) ilvG 5072205..5073851 (-) 1647 WP_002883142.1 acetolactate synthase 2 catalytic subunit -
  WGR79_RS25050 ilvX 5073854..5073907 (-) 54 WP_201281575.1 peptide IlvX -
  WGR79_RS25055 (WGR79_24965) ilvL 5073992..5074090 (-) 99 WP_001311244.1 ilv operon leader peptide -
  WGR79_RS25060 (WGR79_24970) comM 5074445..5075965 (+) 1521 WP_060416009.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  WGR79_RS25065 (WGR79_24975) - 5075991..5076329 (-) 339 WP_004146520.1 DUF413 domain-containing protein -
  WGR79_RS25070 (WGR79_24980) hdfR 5076448..5077269 (+) 822 WP_004177958.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55159.54 Da        Isoelectric Point: 8.0945

>NTDB_id=956408 WGR79_RS25060 WP_060416009.1 5074445..5075965(+) (comM) [Klebsiella pneumoniae strain FUJ80029]
MSLAIVYTRAALGIEAPLITVEVHLSNGLPGLTMVGLPETTVKEARDRVRSALINSGYAFPAKKITINLAPADLPKEGGR
YDLPIALALLVASEQLNTTRLNQYEFVGELALTGGLRGVPGAIPSAMEAIKAGRRIVVSSDNAAEVGLIGGSDCLVADHL
QEVCAFLAGQTSLSPPLAEAPARDERYEDLLDVIGQQQGKRALEIVAAGGHNLLLIGPPGTGKTMLASRLPGLLPPLSNQ
EALESAAIQSLVNLHTAKTQWRQRPFRAPHHSASLAAMVGGGSIPVPGEISLAHNGVLFLDELPEFERRVLDALREPIES
GKIHISRSRAKIDYPARFQLIAAMNPSPTGHYQGKHNRASPEQTLRYLGRLSGPFLDRFDLSLEIPLPPPGILSQGSQGE
ESSATVRQRVLAARERQMLRQNKLNAHLENREMKNCCRLRREDAVWLEQTLTQLGLSIRAWQRLLKVARTIADLAEVEEI
ERCHLQEALSYRAIDRMLNHLQKMMA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=956408 WGR79_RS25060 WP_060416009.1 5074445..5075965(+) (comM) [Klebsiella pneumoniae strain FUJ80029]
ATGTCGCTCGCTATCGTCTATACTCGCGCGGCGCTCGGTATCGAAGCGCCATTGATTACCGTTGAGGTTCATCTCAGCAA
CGGTCTTCCTGGTCTAACTATGGTCGGGCTGCCGGAAACCACCGTGAAAGAGGCCCGCGACCGGGTCCGCAGCGCCCTGA
TCAACAGCGGCTACGCTTTTCCTGCGAAGAAGATAACCATTAACCTGGCGCCAGCGGATCTGCCCAAAGAAGGCGGACGA
TACGATCTGCCCATCGCTCTCGCGCTTCTCGTTGCCTCAGAGCAGCTCAACACGACGCGACTGAATCAATATGAGTTTGT
GGGCGAACTCGCCCTTACAGGTGGGTTACGAGGCGTTCCAGGGGCGATCCCCAGCGCAATGGAGGCCATCAAAGCCGGCC
GGCGCATTGTCGTCTCCTCTGACAATGCGGCGGAGGTCGGCCTGATCGGCGGCAGCGATTGTCTGGTCGCCGACCATCTG
CAGGAGGTTTGCGCATTTCTTGCGGGGCAGACATCGCTTTCGCCGCCTCTCGCCGAGGCGCCTGCTCGGGATGAACGCTA
CGAAGATCTGCTCGATGTTATCGGCCAGCAGCAGGGCAAACGAGCGCTGGAGATTGTGGCCGCCGGTGGTCACAATCTGC
TCCTGATAGGCCCGCCTGGGACCGGGAAAACCATGCTAGCCAGCCGACTCCCCGGTCTCCTGCCGCCATTAAGCAATCAG
GAAGCGCTGGAGAGCGCGGCCATACAGAGTCTGGTCAACCTCCACACCGCAAAGACGCAGTGGCGTCAGAGGCCGTTCCG
CGCCCCCCACCATAGCGCCTCGCTGGCAGCGATGGTGGGCGGCGGCTCGATACCGGTCCCCGGTGAGATTTCCCTGGCCC
ATAATGGCGTGCTGTTTCTTGATGAACTGCCGGAGTTTGAGCGGCGGGTACTGGATGCGCTACGCGAACCCATTGAGTCA
GGCAAGATCCACATATCACGCTCGCGCGCCAAAATTGACTATCCGGCGCGCTTTCAGCTTATTGCAGCGATGAATCCAAG
CCCGACAGGACATTATCAGGGTAAACATAATCGTGCATCGCCGGAGCAGACATTGCGCTACCTTGGACGCCTGTCAGGCC
CCTTCCTCGACCGCTTCGATCTTTCCTTAGAGATCCCATTGCCACCGCCAGGAATACTGAGTCAGGGCTCGCAGGGCGAA
GAATCGAGCGCAACGGTCCGGCAGCGGGTGCTGGCGGCGCGTGAACGACAAATGCTCAGGCAAAATAAACTCAATGCCCA
TCTTGAGAATCGTGAAATGAAGAACTGCTGTCGCTTAAGGCGGGAGGATGCTGTCTGGCTGGAACAGACGCTAACGCAGC
TGGGGCTTTCTATTCGCGCCTGGCAGCGTCTGTTAAAGGTTGCGAGAACCATTGCCGATCTGGCAGAGGTTGAAGAGATT
GAACGCTGTCATTTGCAGGAGGCGCTCAGCTATCGGGCAATTGATCGGATGCTCAACCATCTGCAGAAAATGATGGCGTA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

60.078

100

0.607

  comM Glaesserella parasuis strain SC1401

59.725

100

0.601

  comM Vibrio cholerae strain A1552

60.04

99.407

0.597

  comM Vibrio campbellii strain DS40M4

59.524

99.605

0.593

  comM Legionella pneumophila str. Paris

50.201

98.419

0.494

  comM Legionella pneumophila strain ERS1305867

50.201

98.419

0.494

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.488

100

0.447