Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   EAM_RS00785 Genome accession   NC_013971
Coordinates   185282..186802 (-) Length   506 a.a.
NCBI ID   WP_004154893.1    Uniprot ID   A0A830ZW65
Organism   Erwinia amylovora ATCC 49946     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 180282..191802
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EAM_RS00775 (EAM_0148) hdfR 183964..184791 (-) 828 WP_004154889.1 HTH-type transcriptional regulator HdfR -
  EAM_RS00780 (EAM_0149) - 184912..185250 (+) 339 WP_004154890.1 DUF413 domain-containing protein -
  EAM_RS00785 (EAM_0150) comM 185282..186802 (-) 1521 WP_004154893.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  EAM_RS19065 - 187137..187235 (+) 99 WP_099258353.1 IlvGEDA operon leader peptide -
  EAM_RS00790 (EAM_0151) ilvG 187376..189022 (+) 1647 WP_013035760.1 acetolactate synthase 2 catalytic subunit -
  EAM_RS00795 (EAM_0152) ilvM 189019..189276 (+) 258 WP_004154896.1 acetolactate synthase 2 small subunit -
  EAM_RS00800 (EAM_0153) - 189295..190221 (+) 927 WP_004154897.1 branched-chain amino acid transaminase -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55103.24 Da        Isoelectric Point: 7.4176

>NTDB_id=36801 EAM_RS00785 WP_004154893.1 185282..186802(-) (comM) [Erwinia amylovora ATCC 49946]
MSLSVAYTRAAIGIQAPLVSVEVHLSNGLPALSLVGLPETTVKEARDRVRSAILNSGFFFPAKRITVSLAPADLPKEGGR
YDLPIAVAILAASEQVPAEKLIQYEFLGELALTGTLRGVQGATPAALAALDARRQLILSAENQHDVGLIRHGESLIATHL
LEVCAFLHGKAPLDAAHCEPEESLPSTGDLNEIIGQQQAKRALEITAAGGHNLLLIGPPGTGKTMLASRLSGLMPPLSDR
EALESASLASLISGSDFRHNWRQRPFRAPHHSASLYALVGGGSLPKPGEISLAHNGVLFLDELPEFERRALDALREPLES
GEISISRARAKITYPARFQLIAAMNPSPTGHYRGPHNRSSPQQTLRYLSRLSGPFLDRFDISLEVPLLPAGMMSAQHGES
ESSHQVRERVLLARERQLARCNKMNAAMSNQEIRACCKLTPEDAEWLERVMIQLGLSVRAWQRILKVARTIADMAGEPWI
SREHLTEAVSYRAIDRLLIHLQNSLD

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=36801 EAM_RS00785 WP_004154893.1 185282..186802(-) (comM) [Erwinia amylovora ATCC 49946]
ATGTCGCTATCAGTTGCTTACACCCGTGCCGCGATTGGCATCCAGGCACCACTGGTGTCCGTGGAAGTTCATCTCAGTAA
TGGCCTTCCTGCATTATCACTGGTTGGATTGCCGGAAACCACCGTTAAGGAGGCTCGTGACAGAGTGCGCAGCGCCATCC
TTAACAGCGGTTTTTTTTTCCCGGCTAAACGGATAACCGTTAGCCTGGCACCGGCCGACCTGCCTAAAGAAGGCGGCAGA
TATGACTTACCTATCGCTGTCGCCATTCTCGCGGCTTCAGAACAGGTTCCGGCAGAAAAACTTATTCAGTATGAATTTCT
GGGTGAACTGGCCCTCACAGGCACGCTACGTGGCGTACAGGGCGCTACTCCCGCTGCGCTGGCGGCGTTAGACGCTCGTC
GGCAACTGATTCTCTCAGCCGAGAATCAGCATGATGTTGGGTTGATTCGGCATGGCGAGAGCCTGATTGCCACCCATCTT
CTGGAGGTGTGCGCCTTTTTACATGGTAAAGCACCACTGGATGCTGCGCACTGCGAACCTGAAGAATCCCTGCCGTCCAC
GGGAGATTTAAATGAAATTATCGGCCAACAGCAGGCAAAGCGCGCGCTGGAGATCACGGCAGCGGGAGGGCACAATCTGT
TGCTCATCGGCCCGCCGGGAACCGGCAAAACGATGCTGGCATCCAGACTTAGTGGCCTGATGCCGCCCCTGAGCGATCGT
GAAGCGTTAGAGAGCGCCAGCCTTGCCAGTCTGATATCCGGCAGCGATTTTCGGCATAACTGGCGCCAGAGACCCTTCCG
CGCCCCTCACCATAGCGCATCACTGTATGCGCTGGTGGGTGGAGGATCACTGCCTAAACCCGGGGAAATTTCGCTGGCCC
ATAACGGGGTACTGTTTCTGGATGAGCTGCCCGAGTTTGAACGCCGTGCGCTGGATGCACTGCGTGAACCGCTTGAATCG
GGAGAGATTAGCATTTCGCGAGCCAGAGCAAAAATTACTTATCCGGCCCGCTTCCAGCTGATTGCAGCAATGAACCCCAG
CCCAACCGGACATTATCGCGGCCCGCATAATCGCTCATCACCCCAGCAGACGCTGCGCTATCTGAGCCGCCTCTCCGGCC
CATTCCTTGACCGTTTTGATATTTCTCTGGAAGTACCACTGCTGCCAGCAGGAATGATGAGTGCTCAACATGGCGAAAGT
GAGTCCAGCCATCAGGTGCGCGAACGGGTGCTGTTGGCCCGTGAACGGCAGCTGGCGCGCTGTAATAAAATGAATGCAGC
AATGAGCAACCAGGAAATCCGGGCTTGTTGTAAACTGACGCCTGAGGATGCTGAATGGCTGGAGCGGGTAATGATCCAGC
TGGGTCTGTCGGTACGCGCATGGCAACGCATTCTGAAAGTTGCCCGCACTATCGCAGACATGGCCGGGGAGCCATGGATT
AGTCGGGAACACCTGACCGAAGCCGTTAGCTACCGGGCTATCGACCGTTTACTGATCCATCTACAAAATAGTCTGGATTA
G


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A830ZW65

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

62.598

100

0.628

  comM Glaesserella parasuis strain SC1401

61.144

100

0.613

  comM Vibrio cholerae strain A1552

61.355

99.209

0.609

  comM Vibrio campbellii strain DS40M4

60.956

99.209

0.605

  comM Legionella pneumophila str. Paris

50.402

98.419

0.496

  comM Legionella pneumophila strain ERS1305867

50.402

98.419

0.496

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.676

100

0.437


Multiple sequence alignment