Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   WGU05_RS25045 Genome accession   NZ_CP148918
Coordinates   5070511..5072031 (+) Length   506 a.a.
NCBI ID   WP_060416009.1    Uniprot ID   -
Organism   Klebsiella pneumoniae strain FUJ80049     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 5065511..5077031
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  WGU05_RS25020 (WGU05_24940) ilvE 5067071..5068000 (-) 930 WP_002883171.1 branched-chain-amino-acid transaminase -
  WGU05_RS25025 (WGU05_24945) ilvM 5068017..5068274 (-) 258 WP_002883170.1 acetolactate synthase 2 small subunit -
  WGU05_RS25030 (WGU05_24950) ilvG 5068271..5069917 (-) 1647 WP_002883142.1 acetolactate synthase 2 catalytic subunit -
  WGU05_RS25035 ilvX 5069920..5069973 (-) 54 WP_201281575.1 peptide IlvX -
  WGU05_RS25040 (WGU05_24955) ilvL 5070058..5070156 (-) 99 WP_001311244.1 ilv operon leader peptide -
  WGU05_RS25045 (WGU05_24960) comM 5070511..5072031 (+) 1521 WP_060416009.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  WGU05_RS25050 (WGU05_24965) - 5072057..5072395 (-) 339 WP_004146520.1 DUF413 domain-containing protein -
  WGU05_RS25055 (WGU05_24970) hdfR 5072514..5073335 (+) 822 WP_004177958.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55159.54 Da        Isoelectric Point: 8.0945

>NTDB_id=956217 WGU05_RS25045 WP_060416009.1 5070511..5072031(+) (comM) [Klebsiella pneumoniae strain FUJ80049]
MSLAIVYTRAALGIEAPLITVEVHLSNGLPGLTMVGLPETTVKEARDRVRSALINSGYAFPAKKITINLAPADLPKEGGR
YDLPIALALLVASEQLNTTRLNQYEFVGELALTGGLRGVPGAIPSAMEAIKAGRRIVVSSDNAAEVGLIGGSDCLVADHL
QEVCAFLAGQTSLSPPLAEAPARDERYEDLLDVIGQQQGKRALEIVAAGGHNLLLIGPPGTGKTMLASRLPGLLPPLSNQ
EALESAAIQSLVNLHTAKTQWRQRPFRAPHHSASLAAMVGGGSIPVPGEISLAHNGVLFLDELPEFERRVLDALREPIES
GKIHISRSRAKIDYPARFQLIAAMNPSPTGHYQGKHNRASPEQTLRYLGRLSGPFLDRFDLSLEIPLPPPGILSQGSQGE
ESSATVRQRVLAARERQMLRQNKLNAHLENREMKNCCRLRREDAVWLEQTLTQLGLSIRAWQRLLKVARTIADLAEVEEI
ERCHLQEALSYRAIDRMLNHLQKMMA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=956217 WGU05_RS25045 WP_060416009.1 5070511..5072031(+) (comM) [Klebsiella pneumoniae strain FUJ80049]
ATGTCGCTCGCTATCGTCTATACTCGCGCGGCGCTCGGTATCGAAGCGCCATTGATTACCGTTGAGGTTCATCTCAGCAA
CGGTCTTCCTGGTCTAACTATGGTCGGGCTGCCGGAAACCACCGTGAAAGAGGCCCGCGACCGGGTCCGCAGCGCCCTGA
TCAACAGCGGCTACGCTTTTCCTGCGAAGAAGATAACCATTAACCTGGCGCCAGCGGATCTGCCCAAAGAAGGCGGACGA
TACGATCTGCCCATCGCTCTCGCGCTTCTCGTTGCCTCAGAGCAGCTCAACACGACGCGACTGAATCAATATGAGTTTGT
GGGCGAACTCGCCCTTACAGGTGGGTTACGAGGCGTTCCAGGGGCGATCCCCAGCGCAATGGAGGCCATCAAAGCCGGCC
GGCGCATTGTCGTCTCCTCTGACAATGCGGCGGAGGTCGGCCTGATCGGCGGCAGCGATTGTCTGGTCGCCGACCATCTG
CAGGAGGTTTGCGCATTTCTTGCGGGGCAGACATCGCTTTCGCCGCCTCTCGCCGAGGCGCCTGCTCGGGATGAACGCTA
CGAAGATCTGCTCGATGTTATCGGCCAGCAGCAGGGCAAACGAGCGCTGGAGATTGTGGCCGCCGGTGGTCACAATCTGC
TCCTGATAGGCCCGCCTGGGACCGGGAAAACCATGCTAGCCAGCCGACTCCCCGGTCTCCTGCCGCCATTAAGCAATCAG
GAAGCGCTGGAGAGCGCGGCCATACAGAGTCTGGTCAACCTCCACACCGCAAAGACGCAGTGGCGTCAGAGGCCGTTCCG
CGCCCCCCACCATAGCGCCTCGCTGGCAGCGATGGTGGGCGGCGGCTCGATACCGGTCCCCGGTGAGATTTCCCTGGCCC
ATAATGGCGTGCTGTTTCTTGATGAACTGCCGGAGTTTGAGCGGCGGGTACTGGATGCGCTACGCGAACCCATTGAGTCA
GGCAAGATCCACATATCACGCTCGCGCGCCAAAATTGACTATCCGGCGCGCTTTCAGCTTATTGCAGCGATGAATCCAAG
CCCGACAGGACATTATCAGGGTAAACATAATCGTGCATCGCCGGAGCAGACATTGCGCTACCTTGGACGCCTGTCAGGCC
CCTTCCTCGACCGCTTCGATCTTTCCTTAGAGATCCCATTGCCACCGCCAGGAATACTGAGTCAGGGCTCGCAGGGCGAA
GAATCGAGCGCAACGGTCCGGCAGCGGGTGCTGGCGGCGCGTGAACGACAAATGCTCAGGCAAAATAAACTCAATGCCCA
TCTTGAGAATCGTGAAATGAAGAACTGCTGTCGCTTAAGGCGGGAGGATGCTGTCTGGCTGGAACAGACGCTAACGCAGC
TGGGGCTTTCTATTCGCGCCTGGCAGCGTCTGTTAAAGGTTGCGAGAACCATTGCCGATCTGGCAGAGGTTGAAGAGATT
GAACGCTGTCATTTGCAGGAGGCGCTCAGCTATCGGGCAATTGATCGGATGCTCAACCATCTGCAGAAAATGATGGCGTA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

60.078

100

0.607

  comM Glaesserella parasuis strain SC1401

59.725

100

0.601

  comM Vibrio cholerae strain A1552

60.04

99.407

0.597

  comM Vibrio campbellii strain DS40M4

59.524

99.605

0.593

  comM Legionella pneumophila str. Paris

50.201

98.419

0.494

  comM Legionella pneumophila strain ERS1305867

50.201

98.419

0.494

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.488

100

0.447