Detailed information    

In silico: bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   VRB37_RS21550 Genome accession   NZ_CP142209
Coordinates   4584789..4586309 (+) Length   506 a.a.
NCBI ID   WP_151038566.1    Uniprot ID   -
Organism   Erwinia billingiae strain W05_1     
Function   required for natural transformation (predicted from homology)
Unclear

Genomic Context


Location: 4579789..4591309
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  VRB37_RS21530 (VRB37_21530) - 4581359..4582285 (-) 927 WP_013200223.1 branched-chain amino acid transaminase -
  VRB37_RS21535 (VRB37_21535) ilvM 4582305..4582562 (-) 258 WP_338572135.1 acetolactate synthase 2 small subunit -
  VRB37_RS21540 (VRB37_21540) ilvG 4582559..4584205 (-) 1647 WP_338572138.1 acetolactate synthase 2 catalytic subunit -
  VRB37_RS21545 (VRB37_21545) ilvL 4584345..4584443 (-) 99 WP_071822090.1 ilv operon leader peptide -
  VRB37_RS21550 (VRB37_21550) comM 4584789..4586309 (+) 1521 WP_151038566.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  VRB37_RS21555 (VRB37_21555) - 4586335..4586673 (-) 339 WP_151038567.1 DUF413 domain-containing protein -
  VRB37_RS21560 (VRB37_21560) hdfR 4586795..4587619 (+) 825 WP_041692181.1 HTH-type transcriptional regulator HdfR -
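
The coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the 5000 bp flank is an assumption inferred from comparing the Location line with the comM coordinates, not a documented parameter of the database.

```python
# Coordinates copied from the comM row above (1-based, inclusive).
GENE_START, GENE_END = 4584789, 4586309
FLANK = 5000  # assumption: flank size inferred from the Location line


def gene_length_bp(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Residue count of a CDS: one residue per codon, minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


def context_window(start, end, flank=FLANK):
    """Genomic window extending `flank` bp on each side of the feature."""
    return max(1, start - flank), end + flank


length_bp = gene_length_bp(GENE_START, GENE_END)
print(length_bp)                              # 1521
print(protein_length_aa(length_bp))           # 506
print(context_window(GENE_START, GENE_END))   # (4579789, 4591309)
```

The three printed values agree with the Size (bp) column, the protein length in the Overview, and the Location line.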

Sequence


Protein


Length: 506 a.a.        Molecular weight: 55170.45 Da        Isoelectric Point: 8.2707

>NTDB_id=923266 VRB37_RS21550 WP_151038566.1 4584789..4586309(+) (comM) [Erwinia billingiae strain W05_1]
MSLSLAYTRAAIGIEAPLVLVEVHLSNGLPALSLVGLPETTVKEARDRVRSAIINCGFTFPAKRITVNLAPADLPKEGGR
YDLPIAIAILAASEQIPAEKLGQYEFLGELALTGALRGVQGAIPAALAATQAQRQLILSSDNLEEVGMIHEAKSLLSGHL
LNVCHFLAGHTTLEEARCELPEVGWQGGDLSDIVGQQQAKRALEICAAGGHNLLLIGPPGTGKTMLATRLTGLMPSLSDE
EALESAAIASLVSTGVLHQQWRQRPFRAPHHTSSRYALVGGGSMPKPGEISLAHNGILFLDELPEFDRKTLDALREPLES
GEICISRARAKVTYPARFQLIAAMNPSPTGHYQGIHNRSTPQQTLRYLNKLSGPFLDRFDLSLEVPLLPPGTLSRQNANS
ESSTTIRLRVIAARQRQLDRAGKVNALIQPKEIKRDCRITQQDAQWLEEVLNQLGLSVRAWQRILKVARTIADLGKKESI
QREHLHEALSYRGIDRLLSHLQKSLE
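
The FASTA header above packs several fields into one line. A minimal sketch of pulling them apart follows, assuming the layout seen in this record (NTDB_id, locus tag, protein ID, coordinates with strand, gene name in parentheses, organism in square brackets). The pattern and the `parse_header` helper are hypothetical and may not cover every header the database emits.

```python
import re

# Pattern written against the header layout seen in this record only.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Split a header line into a dict; raise ValueError if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Coordinates are more useful as integers than as strings.
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=923266 VRB37_RS21550 WP_151038566.1 "
    "4584789..4586309(+) (comM) [Erwinia billingiae strain W05_1]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```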

Nucleotide


Length: 1521 bp

>NTDB_id=923266 VRB37_RS21550 WP_151038566.1 4584789..4586309(+) (comM) [Erwinia billingiae strain W05_1]
ATGTCGCTATCTCTCGCCTATACCCGAGCCGCTATTGGTATCGAGGCGCCATTGGTGTTGGTTGAAGTCCATCTCAGTAA
TGGCCTGCCCGCGCTTTCACTGGTGGGTCTGCCTGAAACGACGGTAAAAGAAGCCCGCGATCGGGTACGCAGTGCCATCA
TCAACTGTGGCTTTACCTTTCCCGCTAAACGCATCACCGTCAATCTTGCGCCAGCCGATCTTCCCAAAGAAGGAGGAAGA
TACGATCTTCCTATCGCAATAGCGATTCTGGCGGCCTCAGAGCAGATTCCTGCGGAGAAGTTGGGGCAATATGAATTCCT
GGGAGAGTTAGCTTTAACAGGTGCTCTCCGTGGCGTACAGGGCGCCATTCCCGCAGCGCTGGCAGCCACTCAGGCACAGC
GCCAGCTGATCTTGTCGAGCGATAATCTTGAAGAGGTCGGCATGATCCACGAGGCGAAAAGCTTGCTGAGCGGTCATCTA
TTAAACGTTTGCCATTTTCTGGCCGGCCACACAACGCTTGAAGAGGCACGATGTGAGTTACCTGAGGTAGGCTGGCAAGG
AGGAGACCTCAGTGACATCGTGGGTCAGCAACAGGCGAAGCGGGCACTGGAAATTTGTGCCGCCGGTGGGCATAACCTGT
TACTGATTGGGCCGCCCGGCACAGGGAAGACAATGCTTGCCACCAGGCTAACCGGGTTAATGCCATCATTAAGCGATGAG
GAGGCCCTGGAGAGCGCCGCCATTGCCAGCCTGGTCAGCACAGGTGTCCTGCATCAACAATGGCGACAGCGCCCATTCCG
GGCCCCTCATCACACGTCATCACGCTATGCGCTGGTCGGTGGCGGTTCAATGCCAAAACCCGGCGAGATTTCGCTGGCAC
ACAATGGCATTCTGTTTCTGGATGAGCTGCCAGAGTTTGATCGAAAAACGCTGGATGCGCTGAGAGAACCGCTGGAGTCT
GGTGAGATTTGTATTTCACGCGCGCGGGCGAAAGTGACCTACCCGGCGCGCTTTCAGTTAATAGCCGCAATGAACCCCAG
CCCGACAGGTCATTATCAGGGTATCCATAATCGCAGCACGCCCCAGCAGACGCTTCGCTACCTCAACAAACTGTCCGGGC
CGTTCCTCGATCGCTTCGATCTGTCGCTTGAAGTCCCTCTGCTTCCACCTGGCACACTGAGCAGGCAAAACGCGAACAGT
GAAAGCAGCACCACTATTCGGCTGAGGGTGATTGCAGCCCGACAACGCCAGCTTGATCGTGCCGGAAAGGTTAATGCCCT
GATTCAGCCCAAAGAAATTAAGCGCGATTGCCGGATAACGCAGCAGGATGCACAGTGGCTGGAAGAGGTGTTAAATCAGT
TAGGTCTGTCAGTTCGCGCCTGGCAGCGTATTCTGAAGGTGGCCCGGACGATTGCAGATTTAGGAAAGAAGGAGAGTATT
CAGCGGGAGCACCTGCACGAGGCGCTAAGTTATCGGGGGATCGACAGGTTACTTAGCCATCTACAGAAAAGTCTCGAATG
A
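
The nucleotide and protein sequences in this record can be checked against each other by translation. The sketch below uses the standard codon assignments (an assumption; it ignores alternative start codons and any other organism-specific detail) and translates only the first six and last five codons copied from the sequence above. The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon table built in TCAG order (assumption: standard assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


print(translate("ATGTCGCTATCTCTCGCC"))  # first six codons: MSLSLA
print(translate("AAAAGTCTCGAATGA"))     # last five codons: KSLE*
```

Both fragments match the start (MSLSLA) and end (KSLE) of the protein sequence listed above, with the final TGA read as the stop codon.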


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20 61.66 100 0.617
  comM Glaesserella parasuis strain SC1401 61.144 100 0.613
  comM Vibrio cholerae strain A1552 60.437 99.407 0.601
  comM Vibrio campbellii strain DS40M4 59.562 99.209 0.591
  comM Legionella pneumophila str. Paris 49.703 99.802 0.496
  comM Legionella pneumophila strain ERS1305867 49.703 99.802 0.496
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 44.533 99.407 0.443