Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   KI232_RS22705 Genome accession   NZ_AP024333
Coordinates   4897347..4898870 (+) Length   507 a.a.
NCBI ID   WP_167864774.1    Uniprot ID   -
Organism   Erwinia rhapontici strain MAFF 311155     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4892347..4903870
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KI232_RS22680 (ERHA55_49740) - 4893923..4894849 (-) 927 WP_133846880.1 branched-chain amino acid transaminase -
  KI232_RS22685 (ERHA55_49750) ilvM 4894868..4895125 (-) 258 WP_133846881.1 acetolactate synthase 2 small subunit -
  KI232_RS22690 (ERHA55_49770) ilvG 4895122..4896767 (-) 1646 Protein_4507 acetolactate synthase 2 catalytic subunit -
  KI232_RS22695 ilvL 4896908..4897006 (-) 99 WP_105595116.1 ilv operon leader peptide -
  KI232_RS22700 (ERHA55_49780) - 4897140..4897277 (-) 138 WP_159338527.1 hypothetical protein -
  KI232_RS22705 (ERHA55_49790) comM 4897347..4898870 (+) 1524 WP_167864774.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  KI232_RS22710 (ERHA55_49800) - 4898903..4899241 (-) 339 WP_062749119.1 DUF413 domain-containing protein -
  KI232_RS22715 (ERHA55_49810) hdfR 4899363..4900190 (+) 828 WP_133846884.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 507 a.a.        Molecular weight: 54631.40 Da        Isoelectric Point: 7.8471

>NTDB_id=84861 KI232_RS22705 WP_167864774.1 4897347..4898870(+) (comM) [Erwinia rhapontici strain MAFF 311155]
MSLSVANTRAALGIQAPLVSVEVHLSNGLPALSLVGLPETTVKEARDRVRSAIINSGFTFPAKRITVNLAPADLPKEGGR
YDLPIAIAILAASEQIPAAKLAGYEFLGELALNGALRGVQGSIPAAVAALNAGRQLILSADNQNDVGLIQHGKSLIASHL
LEVCAFLHGRTQLEEAQSNDLVCLSHVTGDLNEIIGQQQAKRALEVTAAGGHNLLLIGPPGTGKTMLASRLNGLMPPLSD
REALESASVASLLGSGDLHRQWRQRPFRSPHHSSSLYALVGGGSLPKPGEISLAHNGILFLDELPEFERRALDALREPLE
SGEISISRARAKITYPARFQLIAAMNPSPTGHYSGPHNRSSPQQTLRYLSRLSGPFLDRFDLSLEVPLLPPGMLSTQQGN
GESSQQVRERVLAARERQINRCGKVNAVMNNNDIKACCTLTPEDAQWLEQVLNQLGLSVRAWQRLLKVARTIADLAGENE
VSREHLTEAVSYRGIDRLLIHLHNSLQ

Nucleotide


Download         Length: 1524 bp        

>NTDB_id=84861 KI232_RS22705 WP_167864774.1 4897347..4898870(+) (comM) [Erwinia rhapontici strain MAFF 311155]
ATGTCGCTATCTGTTGCCAATACCCGTGCCGCATTGGGCATTCAGGCGCCGCTGGTGTCTGTTGAAGTTCACCTGAGTAA
TGGCCTGCCCGCTTTGTCACTGGTTGGTCTGCCCGAAACGACAGTTAAAGAAGCCCGGGACCGGGTGCGCAGCGCCATCA
TTAACAGCGGTTTCACCTTTCCTGCAAAACGCATTACCGTCAATCTTGCCCCCGCCGATCTCCCGAAAGAAGGCGGCAGG
TATGACTTGCCTATCGCTATAGCCATCCTGGCAGCTTCAGAGCAGATTCCTGCGGCGAAACTTGCCGGGTATGAGTTCCT
CGGTGAGTTAGCCCTCAACGGCGCGCTACGTGGCGTACAGGGCTCAATTCCTGCAGCCGTTGCGGCACTCAATGCAGGAA
GGCAACTCATCCTCTCAGCGGATAACCAAAATGATGTTGGCCTTATCCAGCACGGGAAAAGTTTAATCGCCAGCCATCTT
CTTGAGGTCTGTGCTTTCCTGCATGGCAGAACGCAGCTTGAAGAAGCCCAGAGCAACGACCTGGTCTGTCTGTCGCACGT
TACAGGGGATTTAAACGAGATTATCGGCCAGCAGCAGGCCAAGCGCGCACTGGAGGTAACAGCAGCCGGTGGGCACAACC
TGCTCTTAATTGGCCCACCCGGCACGGGGAAGACGATGCTCGCCTCACGGTTGAACGGTCTGATGCCGCCTCTAAGCGAT
CGTGAGGCCCTGGAGAGTGCCAGCGTAGCCAGTCTGCTAGGCAGTGGCGATCTGCACCGTCAGTGGCGCCAGAGGCCATT
CCGTTCCCCTCATCACAGTTCATCTCTGTATGCTCTTGTCGGCGGTGGTTCACTGCCTAAGCCAGGTGAGATTTCCCTTG
CCCACAACGGCATACTGTTTCTTGATGAGCTTCCGGAGTTTGAACGCCGCGCATTGGATGCACTGCGTGAACCGCTTGAA
TCAGGGGAGATCAGTATTTCGCGGGCACGTGCCAAAATCACCTATCCGGCACGCTTTCAGCTGATCGCCGCAATGAACCC
CAGCCCAACCGGGCATTATAGCGGCCCACATAATCGAAGTTCACCCCAGCAGACATTACGTTATCTCAGCCGTCTGTCTG
GCCCTTTTCTCGATCGTTTTGATCTCTCTCTTGAAGTGCCGTTACTACCACCAGGGATGTTGAGTACACAACAGGGTAAT
GGAGAGTCCAGTCAGCAGGTGCGTGAACGTGTGCTGGCCGCACGGGAACGCCAGATCAATCGCTGCGGCAAAGTGAATGC
GGTAATGAACAACAACGATATCAAGGCATGCTGTACACTTACGCCTGAGGATGCTCAATGGCTGGAGCAAGTGCTGAACC
AACTGGGGCTGTCAGTGCGGGCATGGCAACGTTTGCTGAAGGTGGCACGTACGATTGCCGATCTGGCAGGGGAGAACGAG
GTTAGCAGGGAGCATTTAACGGAAGCCGTTAGTTACCGAGGGATCGACCGCCTGCTGATTCATCTGCATAACAGCCTGCA
GTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

62.157

100

0.625

  comM Glaesserella parasuis strain SC1401

61.736

100

0.617

  comM Vibrio cholerae strain A1552

61.155

99.014

0.606

  comM Vibrio campbellii strain DS40M4

60.757

99.014

0.602

  comM Legionella pneumophila str. Paris

50.202

97.83

0.491

  comM Legionella pneumophila strain ERS1305867

50.202

97.83

0.491

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.556

97.83

0.436


Multiple sequence alignment