Detailed information    

in silico   Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   AM447_RS10670   Genome accession   NZ_CP020067
Coordinates   2212181..2213701 (+)   Length   506 a.a.
NCBI ID   WP_004181640.1   UniProt ID   A0A0C7KD86
Organism   Klebsiella pneumoniae strain AR_0068
Function   required for natural transformation (predicted from homology)
Unclear
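
The coordinates and the protein length listed above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function name `gene_lengths` is illustrative and not part of the database.

```python
def gene_lengths(start, end):
    """Return (nucleotide length, protein length) for a coding region.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute a residue.
    aa_len = nt_len // 3 - 1
    return nt_len, aa_len


nt_len, aa_len = gene_lengths(2212181, 2213701)
print(nt_len, aa_len)  # 1521 506
```

The result matches the 1521 bp and 506 a.a. lengths reported for this entry.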

Genomic Context


Location: 2207181..2218701
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
AM447_RS10650 (AM447_11120)   ilvE   2208741..2209670 (-)   930   WP_002883171.1   branched-chain-amino-acid transaminase   -
AM447_RS10655 (AM447_11125)   ilvM   2209687..2209944 (-)   258   WP_002883170.1   acetolactate synthase 2 small subunit   -
AM447_RS10660 (AM447_11130)   ilvG   2209941..2211587 (-)   1647   WP_002883142.1   acetolactate synthase 2 catalytic subunit   -
AM447_RS31965   ilvX   2211590..2211643 (-)   54   WP_201281575.1   peptide IlvX   -
AM447_RS10665 (AM447_11135)   ilvL   2211728..2211826 (-)   99   WP_001311244.1   ilv operon leader peptide   -
AM447_RS10670 (AM447_11145)   comM   2212181..2213701 (+)   1521   WP_004181640.1   YifB family Mg chelatase-like AAA ATPase   Machinery gene
AM447_RS10675 (AM447_11150)   -   2213727..2214065 (-)   339   WP_004146520.1   DUF413 domain-containing protein   -
AM447_RS10680 (AM447_11155)   hdfR   2214184..2215005 (+)   822   WP_004181639.1   HTH-type transcriptional regulator HdfR   -
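
The table values can be cross-checked against each other. The sketch below, with values copied from the rows above, verifies that every listed size equals `end - start + 1` and computes how far the displayed location extends beyond comM on each side. Reading the displayed location as a fixed flanking window around the gene is an inference from the numbers, not something the page states; the names `FEATURES`, `TARGET`, and `WINDOW` are illustrative.

```python
# (gene, start, end, listed size in bp), copied from the genomic context table
FEATURES = [
    ("ilvE", 2208741, 2209670, 930),
    ("ilvM", 2209687, 2209944, 258),
    ("ilvG", 2209941, 2211587, 1647),
    ("ilvX", 2211590, 2211643, 54),
    ("ilvL", 2211728, 2211826, 99),
    ("comM", 2212181, 2213701, 1521),
    ("DUF413", 2213727, 2214065, 339),
    ("hdfR", 2214184, 2215005, 822),
]
TARGET = (2212181, 2213701)   # comM coordinates
WINDOW = (2207181, 2218701)   # "Location" shown for the genomic context


def span(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1


# Every listed size should agree with its coordinates.
size_ok = all(span(s, e) == size for _, s, e, size in FEATURES)

# Distance from the window edges to the gene on each side.
flank = (TARGET[0] - WINDOW[0], WINDOW[1] - TARGET[1])

print(size_ok, flank)  # True (5000, 5000)
```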

Sequence


Protein


Length: 506 a.a.   Molecular weight: 55186.61 Da   Isoelectric point: 8.6038

>NTDB_id=220796 AM447_RS10670 WP_004181640.1 2212181..2213701(+) (comM) [Klebsiella pneumoniae strain AR_0068]
MSLAIVYTRAALGIEAPLITVEVHLSNGLPGLTMVGLPETTVKEARDRVRSALINSGYAFPAKKITINLAPADLPKEGGR
YDLPIALALLVASEQLNTTRLNQYEFVGELALTGGLRGVPGAIPSAMEAIKAGRRIVVSSDNAAEVGLIGGSDCLVADHL
QEVCAFLAGQTSLSPPLAEAPARDERYEDLLDVIGQQQGKRALEIVAAGGHNLLLIGPPGTGKTMLASRLPGLLPPLSNQ
EALESAAIQSLVNLHTAKTRWRQRPFRAPHHSASLAAMVGGGSIPVPGEISLAHNGVLFLDELPEFERRVLDALREPIES
GKIHISRSRAKINYPARFQLIAAMNPSPTGHYQGKHNRASPEQTLRYLGRLSGPFLDRFDLSLEIPLPPPGILSQGSQGE
ESSATVRQRVLAARERQMLRQNKLNAHLENREMKNCCRLRREDAVWLEQTLTQLGLSIRAWQRLLKVARTIADLAEVEEI
ERCHLQEALSYRAIDRMLNHLQKMMA
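
The FASTA header used for this record packs several fields into one line. A minimal sketch of how it could be split into named fields follows; the regular expression is written against this one header and is an assumption about the general layout, not a documented format, and `parse_header` is a hypothetical helper.

```python
import re

HEADER = (">NTDB_id=220796 AM447_RS10670 WP_004181640.1 "
          "2212181..2213701(+) (comM) [Klebsiella pneumoniae strain AR_0068]")

# Field layout inferred from the header above (assumption).
PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>.+)\]$"
)


def parse_header(line):
    """Split a header line into a dict of fields; raise if it does not match."""
    match = PATTERN.match(line.strip())
    if match is None:
        raise ValueError("header does not match the expected layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


record = parse_header(HEADER)
print(record["gene"], record["locus_tag"], record["strand"])  # comM AM447_RS10670 +
```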

Nucleotide


Length: 1521 bp

>NTDB_id=220796 AM447_RS10670 WP_004181640.1 2212181..2213701(+) (comM) [Klebsiella pneumoniae strain AR_0068]
ATGTCGCTCGCTATCGTCTATACTCGCGCGGCGCTCGGTATCGAAGCGCCATTGATTACCGTTGAGGTTCATCTCAGCAA
CGGTCTTCCTGGTCTAACTATGGTCGGGCTGCCGGAAACCACCGTGAAAGAGGCCCGCGACCGGGTCCGCAGCGCCCTGA
TCAACAGCGGCTACGCTTTTCCTGCGAAGAAGATAACCATTAACCTGGCGCCAGCGGATCTGCCCAAAGAAGGCGGACGA
TACGATCTGCCCATCGCTCTCGCGCTTCTCGTTGCCTCAGAGCAGCTCAACACGACGCGACTGAATCAATATGAGTTTGT
GGGCGAACTCGCCCTTACAGGTGGGTTACGAGGCGTTCCAGGGGCGATCCCCAGCGCAATGGAGGCCATCAAAGCCGGCC
GGCGCATTGTCGTCTCCTCTGACAATGCGGCGGAGGTCGGCCTGATCGGCGGCAGCGATTGTCTGGTCGCCGACCATCTG
CAGGAGGTTTGCGCATTTCTTGCGGGGCAGACATCGCTTTCGCCGCCTCTCGCCGAGGCGCCTGCTCGGGATGAACGCTA
CGAAGATCTGCTCGATGTTATCGGCCAGCAGCAGGGCAAACGAGCGCTGGAGATTGTGGCCGCCGGTGGTCACAATCTGC
TCCTGATAGGCCCGCCCGGGACCGGGAAAACCATGCTAGCCAGCCGACTCCCCGGTCTCCTGCCGCCATTAAGCAATCAG
GAAGCGCTGGAGAGCGCGGCCATACAGAGTCTGGTCAACCTCCACACCGCAAAGACGCGGTGGCGTCAGAGGCCGTTCCG
CGCCCCCCACCATAGCGCCTCGCTGGCAGCGATGGTGGGCGGCGGCTCGATACCGGTCCCCGGTGAGATTTCCCTGGCCC
ATAATGGCGTGCTGTTTCTTGATGAACTGCCGGAGTTTGAGCGGCGGGTACTGGATGCGCTACGCGAACCCATTGAGTCA
GGCAAGATCCACATATCACGCTCGCGCGCCAAAATTAATTATCCGGCGCGCTTTCAGCTTATTGCAGCGATGAATCCAAG
CCCGACAGGACATTATCAGGGTAAACATAATCGTGCATCGCCGGAGCAGACATTGCGCTACCTTGGACGCCTGTCAGGCC
CCTTCCTCGACCGCTTCGATCTTTCCTTAGAGATCCCATTGCCGCCGCCAGGAATACTGAGTCAGGGCTCGCAGGGCGAA
GAATCGAGCGCAACGGTCCGGCAGCGGGTGCTGGCGGCGCGTGAACGACAAATGCTCAGGCAAAATAAACTCAATGCCCA
TCTTGAGAATCGTGAAATGAAGAACTGCTGTCGCTTAAGGCGGGAGGATGCTGTCTGGCTGGAACAGACGCTAACGCAGC
TGGGGCTTTCTATTCGCGCCTGGCAGCGTCTGTTAAAGGTTGCGAGAACCATTGCCGATCTGGCAGAGGTTGAAGAGATT
GAACGCTGTCATTTGCAGGAGGCGCTCAGCTATCGGGCAATTGATCGGATGCTCAACCATCTGCAGAAAATGATGGCGTA
A
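
The nucleotide and protein sequences can be checked against each other by translation. The sketch below translates the first line of the nucleotide record and its last four codons using the standard genetic code; the `translate` helper is illustrative and ignores any trailing bases that do not form a full codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate DNA in frame 1; leftover bases at the end are dropped."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 80 nt of the nucleotide record (26 full codons plus 2 bases).
first_line = ("ATGTCGCTCGCTATCGTCTATACTCGCGCGGCGCTCGGTATCGAAGCGCC"
              "ATTGATTACCGTTGAGGTTCATCTCAGCAA")
# Last 12 nt of the nucleotide record.
last_codons = "ATGATGGCGTAA"

print(translate(first_line))   # MSLAIVYTRAALGIEAPLITVEVHLS
print(translate(last_codons))  # MMA*
```

The first output matches the start of the protein sequence and the second matches its final residues followed by the stop codon.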


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0C7KD86

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comM   Haemophilus influenzae Rd KW20   60.078   100   0.607
comM   Glaesserella parasuis strain SC1401   59.725   100   0.601
comM   Vibrio cholerae strain A1552   60.04   99.407   0.597
comM   Vibrio campbellii strain DS40M4   59.524   99.605   0.593
comM   Legionella pneumophila str. Paris   50.201   98.419   0.494
comM   Legionella pneumophila strain ERS1305867   50.201   98.419   0.494
RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   44.685   100   0.449


Multiple sequence alignment