Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   D0N50_RS21540 Genome accession   NZ_CP031695
Coordinates   4620443..4621963 (+) Length   506 a.a.
NCBI ID   WP_151038566.1    Uniprot ID   -
Organism   Erwinia billingiae strain TH88     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4615443..4626963
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D0N50_RS21520 (D0N50_21540) - 4617012..4617938 (-) 927 WP_013200223.1 branched-chain amino acid transaminase -
  D0N50_RS21525 (D0N50_21545) ilvM 4617958..4618215 (-) 258 WP_041691853.1 acetolactate synthase 2 small subunit -
  D0N50_RS21530 (D0N50_21550) ilvG 4618212..4619858 (-) 1647 WP_151038565.1 acetolactate synthase 2 catalytic subunit -
  D0N50_RS21535 (D0N50_21555) ilvL 4619999..4620097 (-) 99 WP_071822090.1 ilv operon leader peptide -
  D0N50_RS21540 (D0N50_21560) comM 4620443..4621963 (+) 1521 WP_151038566.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  D0N50_RS21545 (D0N50_21565) - 4621989..4622327 (-) 339 WP_151038567.1 DUF413 domain-containing protein -
  D0N50_RS21550 (D0N50_21570) hdfR 4622449..4623273 (+) 825 WP_041692181.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55170.45 Da        Isoelectric Point: 8.2707

>NTDB_id=310457 D0N50_RS21540 WP_151038566.1 4620443..4621963(+) (comM) [Erwinia billingiae strain TH88]
MSLSLAYTRAAIGIEAPLVLVEVHLSNGLPALSLVGLPETTVKEARDRVRSAIINCGFTFPAKRITVNLAPADLPKEGGR
YDLPIAIAILAASEQIPAEKLGQYEFLGELALTGALRGVQGAIPAALAATQAQRQLILSSDNLEEVGMIHEAKSLLSGHL
LNVCHFLAGHTTLEEARCELPEVGWQGGDLSDIVGQQQAKRALEICAAGGHNLLLIGPPGTGKTMLATRLTGLMPSLSDE
EALESAAIASLVSTGVLHQQWRQRPFRAPHHTSSRYALVGGGSMPKPGEISLAHNGILFLDELPEFDRKTLDALREPLES
GEICISRARAKVTYPARFQLIAAMNPSPTGHYQGIHNRSTPQQTLRYLNKLSGPFLDRFDLSLEVPLLPPGTLSRQNANS
ESSTTIRLRVIAARQRQLDRAGKVNALIQPKEIKRDCRITQQDAQWLEEVLNQLGLSVRAWQRILKVARTIADLGKKESI
QREHLHEALSYRGIDRLLSHLQKSLE

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=310457 D0N50_RS21540 WP_151038566.1 4620443..4621963(+) (comM) [Erwinia billingiae strain TH88]
ATGTCGCTATCTCTCGCCTATACCCGAGCCGCTATTGGTATCGAGGCGCCATTGGTGTTGGTTGAAGTTCATCTCAGTAA
TGGCCTGCCCGCGCTTTCACTGGTGGGTCTGCCTGAAACGACGGTAAAAGAAGCCCGCGATCGGGTACGCAGTGCCATCA
TCAACTGTGGCTTTACCTTTCCCGCTAAACGCATCACCGTCAATCTTGCGCCAGCCGATCTTCCGAAAGAAGGAGGAAGA
TACGATCTTCCTATCGCAATAGCGATTCTGGCGGCCTCAGAGCAGATTCCTGCGGAGAAGTTGGGGCAATATGAATTCCT
GGGAGAGTTAGCTTTAACAGGTGCTCTCCGTGGCGTACAGGGCGCTATTCCCGCAGCGCTGGCAGCCACTCAGGCACAGC
GCCAGCTGATCCTGTCGAGCGATAACCTTGAAGAGGTCGGCATGATCCACGAGGCGAAAAGCTTGCTGAGCGGTCATCTG
CTAAACGTTTGCCATTTTCTGGCCGGCCACACAACGCTTGAAGAGGCACGATGTGAGTTACCTGAGGTAGGCTGGCAAGG
AGGAGACCTCAGTGACATCGTGGGTCAGCAACAGGCGAAGCGGGCACTGGAAATTTGTGCCGCCGGTGGGCATAACCTGT
TACTGATTGGGCCGCCCGGCACAGGGAAGACCATGCTTGCCACCAGGCTAACCGGGTTAATGCCATCATTAAGCGATGAG
GAGGCCCTGGAGAGCGCCGCCATTGCCAGCCTGGTCAGCACAGGTGTCCTGCATCAACAATGGCGACAGCGCCCATTCCG
GGCCCCTCATCACACGTCATCACGCTATGCGCTGGTCGGTGGCGGTTCAATGCCAAAACCCGGCGAGATTTCGCTGGCAC
ACAATGGCATTCTGTTTCTGGATGAGCTGCCAGAGTTTGATCGAAAAACGCTGGATGCGCTGAGAGAACCGCTGGAGTCT
GGTGAGATTTGTATTTCACGCGCGCGGGCGAAAGTGACCTACCCGGCGCGCTTTCAGTTAATAGCCGCAATGAACCCCAG
CCCGACAGGTCATTATCAGGGTATCCATAATCGCAGCACGCCCCAGCAGACGCTTCGCTACCTCAACAAACTGTCCGGGC
CGTTCCTCGATCGCTTCGATCTGTCGCTTGAAGTCCCTCTGCTTCCACCTGGCACACTGAGCAGGCAAAACGCGAACAGT
GAAAGCAGCACCACTATTCGGCTGAGGGTGATTGCAGCCCGACAACGCCAGCTTGATCGTGCCGGAAAGGTTAATGCCCT
GATTCAGCCCAAAGAAATTAAGCGCGATTGCCGGATAACGCAGCAGGATGCACAGTGGCTGGAAGAGGTGTTAAATCAGT
TAGGTCTGTCAGTTCGCGCCTGGCAGCGTATTCTGAAGGTGGCACGGACGATTGCAGATTTAGGAAAGAAGGAGAGTATT
CAGCGGGAGCACCTGCACGAGGCGCTAAGTTATCGGGGGATCGACAGGTTACTTAGCCATCTACAGAAAAGTCTCGAATG
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

61.66

100

0.617

  comM Glaesserella parasuis strain SC1401

61.144

100

0.613

  comM Vibrio cholerae strain A1552

60.437

99.407

0.601

  comM Vibrio campbellii strain DS40M4

59.562

99.209

0.591

  comM Legionella pneumophila str. Paris

49.703

99.802

0.496

  comM Legionella pneumophila strain ERS1305867

49.703

99.802

0.496

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.533

99.407

0.443


Multiple sequence alignment