Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comM   Type   Machinery gene
Locus tag   CWI88_RS22045 Genome accession   NZ_CP025225
Coordinates   4621000..4622520 (+) Length   506 a.a.
NCBI ID   WP_101738486.1    Uniprot ID   -
Organism   Enterobacter cancerogenus strain CR-Eb1     
Function   required for natural transformation (predicted from homology)
Unclear
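
The coordinates and the reported lengths above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a stop codon that is not counted in the protein length. Neither assumption is stated on this page, and the function name is illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


# comM coordinates as listed in the overview: 4621000..4622520 (+)
cds_bp = span_length(4621000, 4622520)

# Assumption: the last codon is a stop codon, so it adds no residue.
protein_aa = cds_bp // 3 - 1

print(cds_bp, protein_aa)  # 1521 506
```

Under these assumptions the result agrees with the 506 a.a. length reported for the protein.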

Genomic Context


Location: 4616000..4627520
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
CWI88_RS22025 (CWI88_22030) | - | 4617566..4618495 (-) | 930 | WP_006179207.1 | branched-chain amino acid transaminase | -
CWI88_RS22030 (CWI88_22035) | ilvM | 4618514..4618777 (-) | 264 | WP_006179206.1 | acetolactate synthase 2 small subunit | -
CWI88_RS22035 (CWI88_22040) | ilvG | 4618774..4620420 (-) | 1647 | WP_101738485.1 | acetolactate synthase 2 catalytic subunit | -
CWI88_RS22040 (CWI88_22045) | ilvL | 4620560..4620658 (-) | 99 | WP_001311244.1 | ilv operon leader peptide | -
CWI88_RS22045 (CWI88_22050) | comM | 4621000..4622520 (+) | 1521 | WP_101738486.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene
CWI88_RS22050 (CWI88_22055) | - | 4622553..4622891 (-) | 339 | WP_006179202.1 | DUF413 domain-containing protein | -
CWI88_RS22055 (CWI88_22060) | hdfR | 4623010..4623831 (+) | 822 | WP_006179201.1 | HTH-type transcriptional regulator HdfR | -
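
The context window and the per-gene sizes in this table can be checked against the coordinates. The sketch below is illustrative only. That the window is the comM range extended by a fixed flank on each side is inferred from the numbers, not documented here, and the variable names are made up for this example.

```python
GENE = (4621000, 4622520)    # comM coordinates
WINDOW = (4616000, 4627520)  # "Location" of the genomic context

# (locus tag, start, end, reported size in bp), copied from the table above
NEIGHBORS = [
    ("CWI88_RS22025", 4617566, 4618495, 930),
    ("CWI88_RS22030", 4618514, 4618777, 264),
    ("CWI88_RS22035", 4618774, 4620420, 1647),
    ("CWI88_RS22040", 4620560, 4620658, 99),
    ("CWI88_RS22045", 4621000, 4622520, 1521),
    ("CWI88_RS22050", 4622553, 4622891, 339),
    ("CWI88_RS22055", 4623010, 4623831, 822),
]

# Distance from each window edge to the corresponding gene edge.
flanks = (GENE[0] - WINDOW[0], WINDOW[1] - GENE[1])

# A reported size should equal end - start + 1 for inclusive coordinates.
sizes_ok = all(end - start + 1 == size for _, start, end, size in NEIGHBORS)

# Every listed gene should fall entirely inside the window.
inside_ok = all(WINDOW[0] <= start and end <= WINDOW[1]
                for _, start, end, _ in NEIGHBORS)

print(flanks, sizes_ok, inside_ok)  # (5000, 5000) True True
```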

Sequence


Protein


Length: 506 a.a.        Molecular weight: 55064.26 Da        Isoelectric point: 8.1043

>NTDB_id=259151 CWI88_RS22045 WP_101738486.1 4621000..4622520(+) (comM) [Enterobacter cancerogenus strain CR-Eb1]
MSLSVVYTRAALGVKAPLISVEVHLSNGLPGLTLVGLPETTVKEARDRVRSAIINSGYTFPAKKITINLAPADLPKEGGR
YDLPIAIALLAASEQLNASRLGSYEFIGELALTGALRGVPGAISGALEAIRAGRQIIVANENAAEVSLIAEKGCLVAGHL
QEVCAYLEKRHELAEPEECHDTLPAPADDLSDILGQEQGKRALEITAAGGHNLLLIGPPGTGKTMLASRLSGLLPPLNNN
EALESAAILSLVNARSLHKQWRRRPFRSPHHSASLTAMVGGGSIPAPGEISLAHNGILFLDELPEFERRVLDALREPIES
GEIHISRTRAKISYPAKFQLVAAMNPSPTGHYQGNHNRSTPEQTLRYLGKLSGPFLDRFDLSLEIPLPPPGLLRHPDARG
ESTAQVRERVIAAQERQYVRQGRLNARLDNTGIRQWCPLKIEDAGWLEETLQRFGLSIRAWQRLLKVARTVADMEGCPGI
ERQHLQEALSYRAIDRLLLHLQKLLA
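
The FASTA header of this record packs several fields into one line. A minimal parsing sketch follows, assuming the field order seen in the headers on this page (NTDB id, locus tag, protein ID, coordinates with strand, gene name, organism). This layout is inferred from the record itself and is not a documented format, and `parse_header` is a hypothetical helper.

```python
import re

HEADER = (">NTDB_id=259151 CWI88_RS22045 WP_101738486.1 "
          "4621000..4622520(+) (comM) "
          "[Enterobacter cancerogenus strain CR-Eb1]")

# Field order is an assumption based on the headers shown on this page.
PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) "
    r"\[(?P<organism>.*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if the layout differs."""
    match = PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header layout: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


record = parse_header(HEADER)
print(record["gene"], record["start"], record["end"], record["strand"])
```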

Nucleotide


Length: 1521 bp

>NTDB_id=259151 CWI88_RS22045 WP_101738486.1 4621000..4622520(+) (comM) [Enterobacter cancerogenus strain CR-Eb1]
ATGTCACTGTCAGTTGTCTATACGCGCGCGGCGCTCGGCGTGAAAGCTCCGCTCATTTCCGTTGAGGTGCATTTGAGCAA
CGGTTTACCCGGGCTAACGCTCGTGGGCCTGCCGGAAACGACCGTCAAAGAGGCCCGGGATCGGGTGCGCAGTGCAATTA
TCAATAGCGGTTATACCTTCCCGGCGAAGAAGATCACTATCAACCTCGCCCCAGCCGATTTGCCAAAGGAGGGCGGACGA
TATGATTTACCTATTGCTATCGCGCTTCTCGCCGCCTCTGAGCAGCTTAATGCCTCAAGGTTAGGCTCATATGAGTTCAT
CGGGGAACTGGCGCTTACAGGCGCGCTAAGAGGCGTTCCCGGTGCGATCTCGGGTGCGCTGGAAGCCATCCGCGCAGGAC
GGCAAATCATTGTGGCGAATGAGAATGCAGCAGAAGTTAGCCTCATAGCTGAGAAAGGCTGCCTGGTCGCCGGGCATTTG
CAGGAGGTTTGCGCGTATCTGGAAAAACGGCACGAGCTGGCTGAGCCGGAAGAGTGTCACGACACGCTGCCGGCACCCGC
AGACGATCTGAGCGACATCCTGGGCCAGGAGCAGGGGAAGCGAGCGTTAGAGATTACTGCTGCAGGTGGACATAATCTTT
TGCTCATCGGCCCGCCCGGGACGGGTAAAACCATGCTGGCAAGCAGGCTGAGCGGTTTGCTGCCTCCGCTCAACAACAAT
GAAGCCCTCGAAAGCGCGGCTATTTTAAGCCTGGTCAATGCGAGGTCCTTGCACAAACAATGGCGACGTCGCCCTTTCCG
ATCGCCGCACCATAGCGCCTCTCTCACGGCGATGGTAGGAGGCGGGTCAATCCCGGCGCCAGGTGAAATTTCACTGGCAC
ACAATGGCATTCTGTTTCTGGATGAACTGCCGGAATTTGAGCGGCGCGTGCTGGACGCGCTGCGCGAGCCGATAGAGTCG
GGAGAAATTCATATCTCCCGCACGCGGGCCAAAATTAGCTATCCCGCTAAATTTCAGCTGGTTGCGGCAATGAACCCCAG
CCCGACAGGGCATTACCAGGGCAATCACAACCGCTCGACGCCCGAGCAGACGCTGCGCTATCTGGGAAAACTCTCCGGCC
CCTTTCTCGACCGGTTTGATTTATCTCTTGAGATCCCTCTTCCTCCGCCTGGGCTACTTCGACACCCCGACGCCAGAGGT
GAAAGTACAGCACAGGTGCGCGAACGGGTTATTGCTGCACAGGAGCGGCAGTACGTGCGTCAGGGCAGGCTTAACGCGCG
TCTTGATAACACGGGTATACGCCAGTGGTGTCCTCTCAAAATCGAAGACGCAGGCTGGCTGGAAGAGACGTTGCAAAGAT
TTGGGCTTTCCATTCGCGCATGGCAGCGCCTGTTGAAAGTGGCAAGAACTGTCGCCGATATGGAGGGATGCCCCGGTATT
GAGAGGCAGCATTTGCAGGAGGCACTGAGCTATCGAGCGATTGATCGCCTGCTTCTTCATCTGCAAAAGCTGCTGGCATA
A
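
The nucleotide record can be spot-checked against the protein record by translating its first and last codons. The sketch below assumes the standard codon assignments (bacterial translation table 11 uses the same codon-to-amino-acid mapping) and uses only the first 30 and last 12 nucleotides copied from the sequence above. The helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


CDS_START = "ATGTCACTGTCAGTTGTCTATACGCGCGCG"  # first 30 nt of the record
CDS_END = "CTGCTGGCATAA"                      # last 12 nt of the record

print(translate(CDS_START))  # MSLSVVYTRA
print(translate(CDS_END))    # LLA*
```

Both fragments agree with the protein record, which begins with MSLSVVYTRA and ends with LLA.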


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comM | Haemophilus influenzae Rd KW20 | 58.171 | 100 | 0.591
comM | Glaesserella parasuis strain SC1401 | 58.777 | 100 | 0.589
comM | Vibrio campbellii strain DS40M4 | 58.532 | 99.605 | 0.583
comM | Vibrio cholerae strain A1552 | 58.566 | 99.209 | 0.581
comM | Legionella pneumophila str. Paris | 50 | 99.605 | 0.498
comM | Legionella pneumophila strain ERS1305867 | 50 | 99.605 | 0.498
RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 43.59 | 100 | 0.437


Multiple sequence alignment