Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   U1R81_RS27220 Genome accession   NZ_CP141632
Coordinates   5558995..5560515 (+) Length   506 a.a.
NCBI ID   WP_012969207.1    Uniprot ID   A0A1F2LT85
Organism   Klebsiella variicola strain 353     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 5553995..5565515
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  U1R81_RS27200 (U1R81_27200) - 5555552..5556481 (-) 930 WP_324725343.1 branched-chain amino acid transaminase -
  U1R81_RS27205 (U1R81_27205) ilvM 5556501..5556758 (-) 258 WP_008807981.1 acetolactate synthase 2 small subunit -
  U1R81_RS27210 (U1R81_27210) ilvG 5556755..5558401 (-) 1647 WP_008807982.1 acetolactate synthase 2 catalytic subunit -
  U1R81_RS30775 ilvX 5558404..5558457 (-) 54 WP_201281575.1 peptide IlvX -
  U1R81_RS27215 (U1R81_27215) ilvL 5558542..5558640 (-) 99 WP_001311244.1 ilv operon leader peptide -
  U1R81_RS27220 (U1R81_27220) comM 5558995..5560515 (+) 1521 WP_012969207.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  U1R81_RS27225 (U1R81_27225) - 5560540..5560878 (-) 339 WP_004886877.1 DUF413 domain-containing protein -
  U1R81_RS27230 (U1R81_27230) hdfR 5560997..5561818 (+) 822 WP_012543255.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55099.54 Da        Isoelectric Point: 8.2651

>NTDB_id=917781 U1R81_RS27220 WP_012969207.1 5558995..5560515(+) (comM) [Klebsiella variicola strain 353]
MSLAIVYTRAALGIEAPLITIEVHLSSGLPGLTMVGLPETTVKEARDRVRSALINSGYAFPAKKITINLAPADLPKEGGR
YDLPIALALLVASEQLNTTRLNQYEFVGELALTGGLRGVPGAIPSAMEAIKAGRRIVVSSDNAAEVGLIGGSDCLVADHL
QQVCAFLAGQASLSPPLADVPAVDERCEDLRDVIGQQQGKRALEIVAAGGHNLLLIGPPGTGKTMLACRLPGLLPPLSNQ
EALESTAIQSLVNLQTAKTRWRQRPFRAPHHSASLAAMVGGGSIPVPGEISLAHNGVLFLDELPEFERRVLDALREPIES
GKIHISRSRAKIDYPARFQLIAAMNPSPTGHYQGKHNRASPEQTLRYLGRLSGPFLDRFDLSLEIPLPPPGILSQGTQGE
ESSATVRQRVLAARERQMLRQNKLNAHLENREMKSCCHLRQEDALWLEQTLTQLGLSIRAWQRLLKVARTIADLAEAEEI
ERRHLQEALSYRAIDRMLNHLQKMMA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=917781 U1R81_RS27220 WP_012969207.1 5558995..5560515(+) (comM) [Klebsiella variicola strain 353]
ATGTCGCTCGCCATCGTTTACACACGCGCGGCGCTCGGTATCGAAGCGCCATTGATTACTATCGAGGTTCATCTCAGCAG
CGGTCTGCCTGGCTTGACCATGGTCGGGCTACCGGAGACCACCGTGAAAGAGGCCCGCGACCGGGTACGCAGCGCCTTGA
TCAACAGCGGCTACGCTTTCCCTGCGAAGAAAATAACCATTAACCTGGCACCCGCGGATCTGCCCAAAGAGGGCGGGCGA
TATGACCTTCCCATCGCTCTCGCGCTTCTCGTCGCCTCAGAGCAGCTCAACACTACCCGATTGAATCAATATGAGTTTGT
GGGCGAACTCGCCCTTACAGGCGGCTTACGCGGCGTTCCAGGGGCGATCCCCAGCGCAATGGAAGCCATCAAAGCCGGCC
GGCGCATTGTCGTCTCCTCTGACAATGCGGCTGAGGTGGGTCTGATCGGCGGCAGCGATTGCCTGGTCGCCGACCATCTG
CAACAGGTGTGCGCGTTTCTCGCAGGGCAAGCATCACTCTCGCCGCCTCTCGCCGACGTGCCCGCCGTTGATGAACGCTG
TGAAGATCTACGTGATGTTATCGGCCAACAACAAGGCAAGCGAGCGCTGGAGATTGTGGCGGCCGGTGGCCACAACCTGC
TCCTGATTGGCCCGCCCGGTACCGGCAAAACAATGCTTGCCTGCCGGCTCCCCGGCCTTTTGCCGCCATTAAGCAACCAG
GAAGCGCTGGAGAGCACGGCCATTCAAAGTCTGGTCAACCTCCAGACTGCGAAGACCCGGTGGCGTCAGAGGCCGTTTCG
CGCCCCTCACCATAGCGCCTCGCTGGCAGCGATGGTAGGCGGAGGCTCAATACCTGTCCCCGGCGAGATTTCACTGGCCC
ATAATGGCGTGCTGTTTCTTGATGAACTGCCAGAGTTTGAACGACGGGTGCTGGACGCGCTGCGTGAACCCATTGAGTCA
GGCAAGATCCACATATCACGATCGCGCGCCAAGATTGACTATCCGGCGCGTTTTCAGCTTATCGCGGCAATGAACCCAAG
CCCTACCGGGCATTATCAGGGCAAACATAATCGCGCGTCACCTGAGCAGACATTGCGCTACCTTGGGCGCCTGTCCGGCC
CTTTTCTCGACCGCTTCGATCTTTCTTTAGAGATCCCGCTCCCGCCGCCGGGGATACTGAGCCAGGGCACGCAGGGCGAA
GAATCGAGTGCGACGGTCCGTCAGCGTGTGCTGGCGGCACGTGAACGTCAAATGCTCAGGCAAAATAAGCTGAATGCGCA
TCTTGAGAATCGTGAAATGAAGAGCTGCTGTCACTTGCGGCAGGAGGATGCGCTTTGGCTGGAACAGACGTTAACGCAGC
TGGGGCTTTCTATACGGGCCTGGCAGCGTCTGTTAAAGGTGGCAAGAACCATTGCCGATCTGGCAGAAGCTGAAGAGATT
GAACGCCGCCATTTGCAGGAGGCGCTCAGCTATCGGGCAATAGACCGGATGCTAAACCATCTGCAGAAAATGATGGCGTA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1F2LT85

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

59.883

100

0.605

  comM Glaesserella parasuis strain SC1401

59.766

100

0.605

  comM Vibrio cholerae strain A1552

59.245

99.407

0.589

  comM Vibrio campbellii strain DS40M4

58.532

99.605

0.583

  comM Legionella pneumophila str. Paris

49.497

98.221

0.486

  comM Legionella pneumophila strain ERS1305867

49.497

98.221

0.486

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.862

100

0.449