Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: SOH37_RS25195
Genome accession: NZ_CP151664
Coordinates: 5230785..5232305 (+)
Length: 506 a.a.
NCBI ID: WP_110236129.1
UniProt ID: -
Organism: Klebsiella variicola strain 2023EP-00289
Function: required for natural transformation (predicted from homology)
Unclear
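
The coordinates and the protein length in this overview are mutually consistent, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive and that the nucleotide span includes the stop codon; the page does not state either convention, although the numbers fit both. The function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


comm_bp = span_length(5230785, 5232305)  # comM coordinates from this entry
comm_aa = protein_length(comm_bp)
print(comm_bp, comm_aa)  # prints: 1521 506
```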

Genomic Context


Location: 5225785..5237305
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SOH37_RS25175 (SOH37_25175) - 5227342..5228271 (-) 930 WP_008807980.1 branched-chain amino acid transaminase -
  SOH37_RS25180 (SOH37_25180) ilvM 5228291..5228548 (-) 258 WP_008807981.1 acetolactate synthase 2 small subunit -
  SOH37_RS25185 (SOH37_25185) ilvG 5228545..5230191 (-) 1647 WP_008807982.1 acetolactate synthase 2 catalytic subunit -
  SOH37_RS25190 (SOH37_25190) ilvL 5230332..5230430 (-) 99 WP_001311244.1 ilv operon leader peptide -
  SOH37_RS25195 (SOH37_25195) comM 5230785..5232305 (+) 1521 WP_110236129.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  SOH37_RS25200 (SOH37_25200) - 5232330..5232668 (-) 339 WP_004886877.1 DUF413 domain-containing protein -
  SOH37_RS25205 (SOH37_25205) hdfR 5232787..5233608 (+) 822 WP_012543255.1 HTH-type transcriptional regulator HdfR -
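
The Location window appears to be the comM coordinates extended by 5,000 bp on each side. That flank size is inferred from the numbers on this page, not from any stated parameter. A small sketch that rebuilds the window and checks the Size (bp) column against the coordinates, again assuming 1-based inclusive ranges:

```python
FLANK = 5000  # assumption: inferred from this page, not a documented setting

gene_start, gene_end = 5230785, 5232305  # comM
window = (gene_start - FLANK, gene_end + FLANK)

# (locus tag, start, end, size in bp) copied from the table above
neighbours = [
    ("SOH37_RS25175", 5227342, 5228271, 930),
    ("SOH37_RS25180", 5228291, 5228548, 258),
    ("SOH37_RS25185", 5228545, 5230191, 1647),
    ("SOH37_RS25190", 5230332, 5230430, 99),
    ("SOH37_RS25195", 5230785, 5232305, 1521),
    ("SOH37_RS25200", 5232330, 5232668, 339),
    ("SOH37_RS25205", 5232787, 5233608, 822),
]

for tag, start, end, size in neighbours:
    # Inclusive coordinates: size = end - start + 1
    assert end - start + 1 == size, tag
    # Every listed gene lies entirely inside the window
    assert window[0] <= start and end <= window[1], tag

print(window)  # prints: (5225785, 5237305)
```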

Sequence


Protein


Length: 506 a.a.        Molecular weight: 55085.52 Da        Isoelectric Point: 8.2651

>NTDB_id=979611 SOH37_RS25195 WP_110236129.1 5230785..5232305(+) (comM) [Klebsiella variicola strain 2023EP-00289]
MSLAIVYTRAALGIEAPLITIEVHLSSGLPGLTMVGLPETTVKEARDRVRSALINSGYAFPAKKITINLAPADLPKEGGR
YDLPIALALLVASEQLNTTRLNQYEFVGELALTGGLRGVPGAIPSAMEAIKAGRRIVVSSDNAAEVGLIGGSDCLIADHL
QQVCAFLAGQASLSPPLADVPAVDERCEDLRDVIGQQQGKRALEIVAAGGHNLLLIGPPGTGKTMLACRLPGLLPPLSNQ
EALESTAIQSLVNLQTAKTRWRQRPFRAPHHSASLAAMVGGGSIPAPGEISLAHNGVLFLDELPEFERRVLDALREPIES
GKIHISRSRAKIDYPARFQLIAAMNPSPTGHYQGKHNRASPEQTLRYLGRLSGPFLDRFDLSLEIPLPPPGILSQGTQGE
ESSATVRQRVLAARERQMLRQNKLNAHLENREMKSCCHLRQEDALWLEQTLTQLGLSIRAWQRLLKVARTIADLAEAEEI
ERRHLQEALSYRAIDRMLNHLQKMMA
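
The FASTA header packs several of the Overview fields into one line. The sketch below pulls them apart with a regular expression. The pattern is inferred from this single header and is an assumption about the layout, not a documented format; `parse_header` and the field names are hypothetical.

```python
import re

HEADER = (">NTDB_id=979611 SOH37_RS25195 WP_110236129.1 "
          "5230785..5232305(+) (comM) "
          "[Klebsiella variicola strain 2023EP-00289]")

# Assumed layout: id, locus tag, protein ID, start..end(strand), (gene), [organism]
PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header of the shape shown above into named fields."""
    match = PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


record = parse_header(HEADER)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```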

Nucleotide


Length: 1521 bp

>NTDB_id=979611 SOH37_RS25195 WP_110236129.1 5230785..5232305(+) (comM) [Klebsiella variicola strain 2023EP-00289]
ATGTCGCTCGCCATCGTTTACACACGCGCGGCGCTCGGTATCGAAGCGCCATTGATTACTATCGAGGTTCATCTCAGCAG
CGGTCTGCCTGGCTTGACCATGGTCGGGCTACCGGAGACCACCGTGAAAGAGGCCCGCGACCGGGTACGCAGCGCCTTGA
TCAACAGCGGCTACGCTTTCCCTGCGAAGAAAATAACCATTAACCTGGCACCCGCGGATCTGCCCAAAGAGGGCGGGCGA
TATGACCTTCCCATCGCGCTCGCGCTTCTCGTCGCCTCAGAGCAGCTCAACACTACCCGATTGAATCAATATGAGTTTGT
GGGCGAACTCGCCCTTACAGGCGGCTTACGCGGCGTTCCAGGGGCGATCCCCAGCGCAATGGAAGCCATCAAAGCCGGCC
GGCGCATTGTCGTCTCCTCTGACAATGCGGCTGAGGTGGGTCTGATCGGCGGCAGCGATTGCCTGATCGCCGACCATCTG
CAACAGGTGTGCGCGTTTCTCGCAGGGCAAGCATCGCTCTCGCCGCCTCTCGCCGACGTGCCCGCCGTTGATGAACGCTG
TGAAGATCTACGTGATGTTATCGGCCAACAACAAGGCAAGCGAGCGCTGGAGATTGTGGCGGCCGGTGGCCACAACCTGC
TGCTGATTGGCCCGCCCGGTACCGGCAAAACAATGCTTGCCTGCCGGCTCCCCGGCCTTTTGCCGCCATTAAGCAACCAG
GAAGCGCTGGAGAGCACGGCCATTCAAAGTCTGGTCAACCTCCAGACTGCGAAGACCCGGTGGCGTCAGAGGCCGTTTCG
CGCCCCTCACCATAGCGCCTCGCTGGCAGCGATGGTGGGCGGTGGCTCAATACCTGCCCCAGGCGAGATTTCACTGGCCC
ATAATGGCGTCCTATTTCTTGATGAGCTACCGGAGTTTGAGCGGCGGGTGCTGGATGCGCTACGTGAGCCCATTGAGTCA
GGCAAGATCCACATATCGCGATCGCGCGCCAAGATTGACTATCCGGCGCGTTTTCAGCTTATCGCGGCAATGAACCCAAG
CCCTACCGGGCATTATCAGGGCAAACATAATCGCGCGTCGCCTGAGCAGACATTGCGCTACCTTGGGCGCCTGTCCGGCC
CTTTTCTCGACCGCTTCGATCTTTCTTTAGAGATCCCGCTCCCGCCGCCGGGGATACTGAGCCAGGGCACGCAGGGCGAA
GAATCGAGTGCGACGGTCCGTCAGCGTGTGCTGGCGGCACGTGAACGTCAAATGCTCAGGCAAAATAAGCTGAATGCGCA
TCTTGAGAATCGTGAAATGAAGAGCTGCTGTCACTTGCGCCAGGAGGATGCGCTTTGGCTGGAACAGACGTTAACGCAGC
TGGGGCTTTCTATACGGGCCTGGCAGCGTCTGTTAAAGGTGGCAAGAACCATTGCCGATCTGGCAGAAGCTGAAGAGATT
GAACGCCGCCATTTGCAGGAGGCGCTCAGCTATCGGGCAATAGACCGGATGCTAAACCATCTGCAGAAAATGATGGCGTA
A
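
The nucleotide record can be checked against the protein record by translating it. The sketch below uses the standard codon assignments and translates only the first 30 and last 21 bases of the sequence above, to keep the example short. The `translate` helper is illustrative and does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_30 = "ATGTCGCTCGCCATCGTTTACACACGCGCG"  # start of the comM CDS
last_21 = "CTGCAGAAAATGATGGCGTAA"            # end of the comM CDS

print(translate(first_30))  # prints: MSLAIVYTRA
print(translate(last_21))   # prints: LQKMMA*
```

Both fragments agree with the start (MSLAIVYTRA...) and end (...LQKMMA) of the protein sequence, with the final TAA read as the stop codon.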


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20 59.883 100 0.605
  comM Glaesserella parasuis strain SC1401 59.766 100 0.605
  comM Vibrio cholerae strain A1552 59.245 99.407 0.589
  comM Vibrio campbellii strain DS40M4 58.532 99.605 0.583
  comM Legionella pneumophila str. Paris 49.497 98.221 0.486
  comM Legionella pneumophila strain ERS1305867 49.497 98.221 0.486
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 44.862 100 0.449
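
The page does not define the Ha-value. In the rows listed here it tracks the product of fractional identity and fractional coverage closely, so the sketch below uses that product as an estimate. This is an observation from the table, not the database's stated formula. It reproduces the reported value to three decimals for the Vibrio, Legionella and Riemerella rows, but gives about 0.599 and 0.598 for the first two rows, which are both reported as 0.605.

```python
# (protein, organism, identity %, coverage %, reported Ha-value) from the table above
hits = [
    ("comM", "Haemophilus influenzae Rd KW20", 59.883, 100.0, 0.605),
    ("comM", "Glaesserella parasuis strain SC1401", 59.766, 100.0, 0.605),
    ("comM", "Vibrio cholerae strain A1552", 59.245, 99.407, 0.589),
    ("comM", "Vibrio campbellii strain DS40M4", 58.532, 99.605, 0.583),
    ("comM", "Legionella pneumophila str. Paris", 49.497, 98.221, 0.486),
    ("comM", "Legionella pneumophila strain ERS1305867", 49.497, 98.221, 0.486),
    ("RA0C_RS07335", "Riemerella anatipestifer ATCC 11845 = DSM 15868",
     44.862, 100.0, 0.449),
]


def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Assumed estimate: fractional identity multiplied by fractional coverage."""
    return (identity_pct / 100) * (coverage_pct / 100)


for protein, organism, identity, coverage, reported in hits:
    estimate = identity_times_coverage(identity, coverage)
    print(f"{organism}: estimate {estimate:.3f}, reported {reported:.3f}")
```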


Multiple sequence alignment