Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   ENTER_RS21860 Genome accession   NZ_LR881936
Coordinates   4624786..4626306 (+) Length   506 a.a.
NCBI ID   WP_202561368.1    Uniprot ID   -
Organism   Enterobacter cancerogenus strain UPC1     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4619786..4631306
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ENTER_RS21835 (ENTER_4447) - 4621352..4622281 (-) 930 WP_006179207.1 branched-chain amino acid transaminase -
  ENTER_RS21840 (ENTER_4448) ilvM 4622300..4622563 (-) 264 WP_006179206.1 acetolactate synthase 2 small subunit -
  ENTER_RS21845 (ENTER_4449) ilvG 4622560..4624206 (-) 1647 WP_202561367.1 acetolactate synthase 2 catalytic subunit -
  ENTER_RS21850 ilvX 4624209..4624259 (-) 51 WP_202562291.1 peptide IlvX -
  ENTER_RS21855 ilvL 4624346..4624444 (-) 99 WP_001311244.1 ilv operon leader peptide -
  ENTER_RS21860 (ENTER_4451) comM 4624786..4626306 (+) 1521 WP_202561368.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  ENTER_RS21865 (ENTER_4452) - 4626339..4626677 (-) 339 WP_006179202.1 DUF413 domain-containing protein -
  ENTER_RS21870 (ENTER_4453) hdfR 4626796..4627617 (+) 822 WP_141113597.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55033.11 Da        Isoelectric Point: 7.6079

>NTDB_id=1132676 ENTER_RS21860 WP_202561368.1 4624786..4626306(+) (comM) [Enterobacter cancerogenus strain UPC1]
MSLSVVYTRAALGVKAPLISVEVHLSNGLPGLTLVGLPETTVKEARDRVRSAIINSGYNFPAKKITINLAPADLPKEGGR
YDLPIAIALLAASEQLNASRLGSYEFIGELALTGALRGVPGAISGALEAIRAGRQIIVANENAAEVSLIAEKGCLVAGHL
QEVCAYLEKRHELAEPEECHDTLPAPADDLSDILGQEQGKRALEITAAGGHNLLLIGPPGTGKTMLASRLSGLLPPLNNH
EALESAAILSLVNATSLHKQWRRRPFRSPHHSASLTAMVGGGAIPGPGEISLAHNGILFLDELPEFERRVLDALREPIES
GEIHISRTRAKISYPAKFQLVAAMNPSPTGHYQGNHNRSTPEQTLRYLGKLSGPFLDRFDLSLEIPLPPPGLLRHPDARG
ESTAQVRERVIAAQERQYARQGRLNARLDNTGIRQWCPLKTEDAGWLEETLQRFGLSIRAWQRLLKVARTVADMEGCPDI
ERQHLQEALSYRAIDRLLLHLQKLLA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=1132676 ENTER_RS21860 WP_202561368.1 4624786..4626306(+) (comM) [Enterobacter cancerogenus strain UPC1]
ATGTCACTGTCAGTTGTCTATACGCGCGCGGCGCTCGGCGTGAAAGCTCCGCTCATTTCCGTTGAGGTGCATTTGAGCAA
CGGTTTACCCGGGCTAACGCTCGTGGGCCTGCCGGAAACGACGGTCAAAGAGGCCCGGGATCGGGTGCGCAGTGCAATTA
TCAATAGCGGTTATAACTTCCCAGCGAAGAAGATCACTATCAACCTCGCCCCAGCCGATTTGCCAAAGGAGGGCGGACGA
TATGATTTACCTATTGCTATCGCGCTTCTCGCCGCCTCTGAGCAGCTTAATGCCTCAAGGTTAGGCTCATATGAGTTCAT
CGGTGAACTGGCGCTTACAGGCGCGCTAAGAGGCGTTCCCGGTGCGATCTCGGGTGCGCTGGAAGCCATCCGCGCAGGAC
GGCAAATCATTGTGGCGAATGAGAATGCAGCAGAAGTTAGCCTCATAGCTGAGAAAGGCTGCCTGGTCGCCGGGCATTTG
CAGGAGGTTTGCGCGTATCTGGAAAAACGGCACGAGCTGGCTGAGCCGGAAGAGTGTCACGACACGCTGCCGGCACCCGC
AGACGATCTGAGCGACATCCTGGGCCAGGAGCAGGGGAAGCGAGCGTTAGAGATCACTGCTGCAGGTGGACATAATCTTT
TGCTCATCGGCCCGCCCGGGACGGGTAAAACCATGCTGGCAAGCAGGCTGAGCGGTTTGCTGCCTCCGCTCAACAACCAT
GAAGCCCTCGAAAGCGCGGCTATTTTAAGCCTGGTCAATGCGACGTCCCTGCACAAACAGTGGCGACGCCGCCCTTTCCG
ATCGCCGCACCACAGCGCCTCTCTCACGGCGATGGTAGGAGGCGGGGCAATCCCGGGGCCAGGTGAAATTTCACTGGCAC
ACAATGGCATTCTGTTTCTGGATGAACTGCCGGAATTTGAGCGGCGCGTGCTGGACGCGCTGCGCGAGCCGATAGAGTCG
GGAGAAATTCATATCTCCCGCACGCGGGCCAAAATTAGCTATCCCGCTAAATTTCAGCTGGTTGCGGCAATGAACCCCAG
CCCGACAGGGCATTACCAGGGCAATCACAACCGCTCGACGCCCGAGCAGACGCTGCGCTATCTGGGAAAACTCTCCGGCC
CCTTTCTCGACCGGTTTGATTTATCTCTTGAGATCCCTCTTCCTCCGCCTGGGCTACTTCGACACCCCGACGCCAGAGGT
GAAAGCACAGCGCAGGTGCGCGAACGGGTTATTGCTGCACAGGAGCGGCAGTACGCGCGTCAGGGCAGGCTCAACGCGCG
TCTTGATAACACGGGTATACGCCAGTGGTGTCCTCTCAAAACCGAAGACGCAGGCTGGCTGGAAGAGACGTTGCAAAGGT
TTGGGCTTTCCATTCGCGCATGGCAGCGCCTGTTGAAAGTGGCAAGAACTGTCGCCGATATGGAGGGATGCCCCGATATT
GAGAGGCAGCATTTGCAGGAGGCACTGAGCTATCGGGCGATTGATCGCCTGCTGCTTCATCTGCAAAAGCTGCTGGCATA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

58.447

100

0.595

  comM Glaesserella parasuis strain SC1401

58.58

100

0.587

  comM Vibrio campbellii strain DS40M4

58.532

99.605

0.583

  comM Vibrio cholerae strain A1552

58.765

99.209

0.583

  comM Legionella pneumophila str. Paris

50.198

99.605

0.5

  comM Legionella pneumophila strain ERS1305867

50.198

99.605

0.5

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.393

100

0.435


Multiple sequence alignment