Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   BC497_RS20545 Genome accession   NZ_CP018307
Coordinates   4163406..4164926 (+) Length   506 a.a.
NCBI ID   WP_012969207.1    Uniprot ID   A0A1F2LT85
Organism   Klebsiella variicola strain X39     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4158406..4169926
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BC497_RS20525 (BC497_20500) - 4159963..4160892 (-) 930 WP_008807980.1 branched-chain amino acid transaminase -
  BC497_RS20530 (BC497_20505) ilvM 4160912..4161169 (-) 258 WP_055323049.1 acetolactate synthase 2 small subunit -
  BC497_RS20535 (BC497_20510) ilvG 4161166..4162812 (-) 1647 WP_008807982.1 acetolactate synthase 2 catalytic subunit -
  BC497_RS33360 ilvX 4162815..4162868 (-) 54 WP_201281575.1 peptide IlvX -
  BC497_RS20540 (BC497_20515) ilvL 4162953..4163051 (-) 99 WP_001311244.1 ilv operon leader peptide -
  BC497_RS20545 (BC497_20520) comM 4163406..4164926 (+) 1521 WP_012969207.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  BC497_RS20550 (BC497_20525) - 4164951..4165289 (-) 339 WP_004886877.1 DUF413 domain-containing protein -
  BC497_RS20555 (BC497_20530) hdfR 4165408..4166229 (+) 822 WP_012543255.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55099.54 Da        Isoelectric Point: 8.2651

>NTDB_id=207916 BC497_RS20545 WP_012969207.1 4163406..4164926(+) (comM) [Klebsiella variicola strain X39]
MSLAIVYTRAALGIEAPLITIEVHLSSGLPGLTMVGLPETTVKEARDRVRSALINSGYAFPAKKITINLAPADLPKEGGR
YDLPIALALLVASEQLNTTRLNQYEFVGELALTGGLRGVPGAIPSAMEAIKAGRRIVVSSDNAAEVGLIGGSDCLVADHL
QQVCAFLAGQASLSPPLADVPAVDERCEDLRDVIGQQQGKRALEIVAAGGHNLLLIGPPGTGKTMLACRLPGLLPPLSNQ
EALESTAIQSLVNLQTAKTRWRQRPFRAPHHSASLAAMVGGGSIPVPGEISLAHNGVLFLDELPEFERRVLDALREPIES
GKIHISRSRAKIDYPARFQLIAAMNPSPTGHYQGKHNRASPEQTLRYLGRLSGPFLDRFDLSLEIPLPPPGILSQGTQGE
ESSATVRQRVLAARERQMLRQNKLNAHLENREMKSCCHLRQEDALWLEQTLTQLGLSIRAWQRLLKVARTIADLAEAEEI
ERRHLQEALSYRAIDRMLNHLQKMMA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=207916 BC497_RS20545 WP_012969207.1 4163406..4164926(+) (comM) [Klebsiella variicola strain X39]
ATGTCGCTCGCCATCGTTTACACACGCGCGGCGCTCGGTATCGAAGCGCCATTGATTACTATCGAGGTTCATCTCAGCAG
CGGTCTGCCTGGCTTGACCATGGTCGGGCTACCGGAGACCACCGTGAAAGAGGCCCGCGACCGGGTACGCAGCGCCTTGA
TCAACAGCGGCTACGCTTTCCCTGCGAAGAAAATAACCATTAACCTGGCACCCGCGGATCTGCCCAAAGAGGGCGGGCGA
TATGACCTTCCCATCGCTCTCGCGCTTCTCGTCGCCTCAGAGCAGCTCAACACTACCCGATTGAATCAATATGAGTTTGT
GGGCGAACTCGCCCTTACAGGCGGCTTACGCGGCGTTCCAGGGGCGATCCCCAGCGCAATGGAAGCCATCAAAGCCGGCC
GGCGCATTGTCGTCTCCTCTGACAATGCGGCTGAGGTGGGTCTGATCGGCGGCAGCGATTGCCTGGTCGCCGACCATCTG
CAACAGGTGTGCGCGTTTCTCGCAGGGCAAGCATCGCTCTCGCCGCCTCTCGCCGACGTGCCCGCCGTTGATGAACGCTG
TGAAGATCTACGTGATGTTATCGGCCAACAACAAGGCAAGCGAGCGCTGGAGATTGTGGCGGCCGGTGGCCACAACCTGC
TGCTGATTGGCCCGCCCGGTACCGGCAAAACAATGCTTGCCTGCCGGCTCCCCGGCCTTTTGCCGCCATTAAGCAACCAG
GAAGCGCTGGAGAGCACGGCCATTCAAAGTCTGGTCAACCTCCAGACTGCGAAGACCCGGTGGCGTCAGAGGCCGTTTCG
CGCCCCTCACCATAGCGCCTCGCTGGCAGCGATGGTAGGCGGAGGCTCAATACCTGTCCCCGGCGAGATTTCACTGGCCC
ATAATGGCGTCCTATTTCTTGATGAACTGCCAGAGTTTGAACGACGGGTGCTGGACGCGCTGCGTGAACCCATTGAGTCA
GGCAAGATCCACATATCACGATCGCGCGCCAAGATTGACTATCCGGCGCGTTTTCAGCTTATCGCGGCAATGAACCCAAG
CCCTACCGGGCATTATCAGGGCAAACATAATCGCGCGTCGCCTGAGCAGACATTGCGCTACCTTGGGCGCCTGTCCGGCC
CTTTTCTCGACCGCTTCGATCTTTCTTTAGAGATCCCGCTCCCGCCGCCGGGGATACTGAGCCAGGGCACGCAGGGCGAA
GAATCGAGTGCGACGGTCCGTCAGCGTGTGCTGGCGGCACGTGAACGTCAAATGCTCAGGCAAAATAAGCTGAATGCGCA
TCTTGAGAATCGTGAAATGAAGAGCTGCTGTCACTTGCGCCAGGAGGATGCGCTTTGGCTGGAACAGACGTTAACGCAGC
TGGGGCTTTCTATACGGGCCTGGCAGCGTCTGTTAAAGGTGGCAAGAACCATTGCCGATCTGGCAGAAGCTGAAGAGATT
GAACGCCGCCATTTGCAGGAGGCGCTCAGCTATCGGGCAATAGACCGGATGCTAAACCATCTGCAGAAAATGATGGCGTA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1F2LT85

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

59.883

100

0.605

  comM Glaesserella parasuis strain SC1401

59.766

100

0.605

  comM Vibrio cholerae strain A1552

59.245

99.407

0.589

  comM Vibrio campbellii strain DS40M4

58.532

99.605

0.583

  comM Legionella pneumophila str. Paris

49.497

98.221

0.486

  comM Legionella pneumophila strain ERS1305867

49.497

98.221

0.486

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.862

100

0.449


Multiple sequence alignment