Detailed information    

in silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: CCF14_RS07225
Genome accession: NZ_CP024282
Coordinates: 1418814..1420334 (+)
Length: 506 a.a.
NCBI ID: WP_059259706.1
UniProt ID: -
Organism: Escherichia albertii strain 2014C-4356
Function: required for natural transformation (predicted from homology)
Unclear
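
The coordinates and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (which is what the 1521 bp size listed for this locus implies) and a coding sequence that ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
# Hypothetical helpers; the numeric values are copied from the record above.

def cds_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residue count for a CDS that ends in exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


comm_bp = cds_length_bp(1418814, 1420334)
comm_aa = protein_length_aa(comm_bp)
print(comm_bp, comm_aa)  # 1521 506
```

Under those assumptions the 1521 bp span corresponds to 507 codons, i.e. 506 residues plus the stop codon, which agrees with the length given above.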

Genomic Context


Location: 1413814..1425334
Locus tag      Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                    Description
CCF14_RS07200  ilvE       1415368..1416297 (-)  930        WP_000208522.1  branched-chain-amino-acid transaminase     -
CCF14_RS07205  ilvM       1416317..1416580 (-)  264        WP_000983255.1  acetolactate synthase 2 small subunit      -
CCF14_RS07210  ilvG       1416577..1418223 (-)  1647       WP_059259704.1  acetolactate synthase 2 catalytic subunit  -
CCF14_RS28080  ilvX       1418226..1418276 (-)  51         WP_001387183.1  peptide IlvX                               -
CCF14_RS07215  ilvL       1418363..1418461 (-)  99         WP_001311244.1  ilv operon leader peptide                  -
CCF14_RS07225  comM       1418814..1420334 (+)  1521       WP_059259706.1  YifB family Mg chelatase-like AAA ATPase   Machinery gene
CCF14_RS07230  maoP       1420360..1420698 (-)  339        WP_000840999.1  macrodomain Ori organization protein MaoP  -
CCF14_RS07235  hdfR       1420817..1421656 (+)  840        WP_059259708.1  HTH-type transcriptional regulator HdfR    -
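
The Location line appears to describe a window extending 5,000 bp on either side of comM (1418814 - 5000 = 1413814 and 1420334 + 5000 = 1425334). That flank size is inferred from the numbers in this record, not stated by the database. A minimal sketch that rebuilds the window and checks each listed size against its coordinates, using hypothetical names throughout:

```python
# FLANK_BP is an assumption inferred from the Location line, not a documented setting.
FLANK_BP = 5000
COMM = (1418814, 1420334)
WINDOW = (COMM[0] - FLANK_BP, COMM[1] + FLANK_BP)

# (gene name, start, end, listed size in bp), copied from the table above.
GENES = [
    ("ilvE", 1415368, 1416297, 930),
    ("ilvM", 1416317, 1416580, 264),
    ("ilvG", 1416577, 1418223, 1647),
    ("ilvX", 1418226, 1418276, 51),
    ("ilvL", 1418363, 1418461, 99),
    ("comM", 1418814, 1420334, 1521),
    ("maoP", 1420360, 1420698, 339),
    ("hdfR", 1420817, 1421656, 840),
]


def size_bp(start: int, end: int) -> int:
    """Feature size for 1-based, inclusive coordinates."""
    return end - start + 1


# Genes whose listed size disagrees with their coordinates.
mismatches = [g for g, s, e, n in GENES if size_bp(s, e) != n]
# Genes that extend beyond the reconstructed window.
outside = [g for g, s, e, _ in GENES if s < WINDOW[0] or e > WINDOW[1]]

print(WINDOW, mismatches, outside)
```

For the rows listed here both lists come back empty, so every size matches its coordinates and every gene lies inside the window.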

Sequence


Protein


Length: 506 a.a.   Molecular weight: 55192.40 Da   Isoelectric point: 9.0024

>NTDB_id=253324 CCF14_RS07225 WP_059259706.1 1418814..1420334(+) (comM) [Escherichia albertii strain 2014C-4356]
MSLSIVHTRAALGVNAPPITVEVHISKGLPGLTMVGLPETTVKEARDRVRSAIINSGYEYPAKKITINLAPADLPKEGGR
YDLPIAIALLAASEQLTANKLHEYELVGELALTGALRGVPGAISSATEAIKSGRKIIVAKDNEAEVGLISGEGCLIADHL
QTVCAFLEGKHALERPKPTDAVSRALQHDLSDVVGQEQGKRGLEITAAGGHNLLLIGPPGTGKTMLASRINGLLPDLSNE
EALESAAILSLVNAESVQKQWRQRPFRSPHHSASLTAMVGGGAIPGPGEISLAHNGVLFLDELPEFERRTLDALREPIES
GQIHLSRTRAKITYPARFQLVAAMNPSPTGHYQGNHNRCTPEQTLRYLNRLSGPFLDRFDLSLEIPLPPPGILSKAVVQG
ENSATVKQRIIAARERQFKRQNKLNARLDSPEIRQYCKLKSNDAQWLEETLIHLGLSIRAWQRLLKVSRTIADIEQSDVI
TRQHLQEAVSYRAIDRLLIHLQKLLT

Nucleotide


Length: 1521 bp

>NTDB_id=253324 CCF14_RS07225 WP_059259706.1 1418814..1420334(+) (comM) [Escherichia albertii strain 2014C-4356]
ATGTCACTATCAATTGTTCATACTCGCGCAGCTCTGGGAGTAAATGCTCCGCCGATCACTGTTGAGGTACATATCAGCAA
AGGTCTACCCGGATTAACAATGGTGGGCTTACCAGAAACGACGGTAAAGGAAGCGCGTGATCGCGTACGCAGCGCCATTA
TCAATAGCGGATATGAATATCCGGCGAAAAAAATCACTATTAACCTTGCACCTGCCGATCTGCCGAAAGAAGGAGGGCGA
TATGATTTACCTATCGCTATTGCTTTGCTGGCGGCCTCTGAACAGCTTACAGCCAATAAGTTACATGAATATGAATTAGT
CGGTGAACTGGCGCTTACAGGAGCTTTGCGTGGCGTTCCCGGCGCAATTTCCAGTGCAACTGAAGCCATTAAGTCGGGCA
GAAAGATTATCGTCGCGAAAGATAACGAAGCTGAAGTAGGGTTAATTAGTGGCGAAGGATGCCTGATAGCCGATCATCTA
CAAACTGTCTGTGCGTTTCTGGAAGGTAAGCACGCTCTCGAACGCCCGAAACCAACTGATGCAGTATCCCGGGCGCTACA
ACATGATCTCAGTGATGTTGTCGGTCAGGAGCAAGGAAAGCGAGGACTGGAAATTACCGCCGCTGGCGGGCACAACCTTT
TACTGATTGGGCCGCCGGGAACAGGTAAAACAATGCTCGCCAGCCGTATTAATGGCCTGTTGCCAGATTTAAGCAATGAA
GAGGCGCTGGAGAGTGCCGCAATATTAAGCCTGGTAAATGCTGAATCAGTACAAAAACAATGGCGGCAGCGTCCGTTCCG
CTCACCGCATCACAGTGCGTCGTTAACTGCGATGGTGGGCGGTGGTGCAATTCCTGGGCCTGGTGAAATTTCGCTGGCGC
ATAACGGCGTGCTTTTTCTTGATGAACTCCCTGAATTTGAACGACGTACATTGGATGCCTTGCGGGAACCCATAGAGTCA
GGACAGATCCATCTTTCACGCACACGAGCAAAAATAACTTATCCAGCCCGTTTCCAGCTTGTCGCGGCGATGAATCCCAG
CCCAACCGGACATTATCAGGGGAACCATAACCGCTGCACGCCAGAACAAACATTACGTTATCTCAACCGACTTTCTGGCC
CCTTTCTCGACCGCTTCGATCTCTCACTGGAGATCCCACTACCACCACCCGGTATTTTAAGTAAAGCGGTAGTTCAGGGG
GAAAATAGCGCCACCGTTAAACAACGCATCATAGCCGCCAGAGAGCGCCAATTTAAGCGGCAGAATAAGCTGAACGCCAG
GCTGGATAGCCCGGAGATACGCCAATACTGCAAACTTAAAAGCAACGATGCACAGTGGCTGGAGGAAACGTTGATACACC
TGGGATTATCGATTCGTGCCTGGCAACGACTACTGAAGGTATCAAGAACTATTGCTGATATTGAGCAATCTGACGTCATT
ACTCGCCAGCATTTACAGGAAGCGGTGAGCTATCGGGCTATCGACCGCCTGCTCATCCATTTGCAAAAACTGCTGACGTA
A
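
The nucleotide and protein records can be compared by translating the coding sequence. The sketch below is one way to do that, assuming the standard codon assignments (which the bacterial table shares for elongation codons) and a hypothetical `translate` helper. Only the first 30 and last 12 bases of the sequence above are used, to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the comM sequence shown above.
head = "ATGTCACTATCAATTGTTCATACTCGCGCA"
tail = "CTGCTGACGTAA"
print(translate(head))  # MSLSIVHTRA
print(translate(tail))  # LLT
```

The two fragments translate to the first ten and the last three residues of the protein sequence listed above, with TAA as the stop codon.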


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein       Organism                                         Identities (%)  Coverage (%)  Ha-value
comM          Haemophilus influenzae Rd KW20                   58.708          100           0.593
comM          Glaesserella parasuis strain SC1401              58.777          100           0.589
comM          Vibrio cholerae strain A1552                     58.765          99.209        0.583
comM          Vibrio campbellii strain DS40M4                  57.57           99.209        0.571
comM          Legionella pneumophila str. Paris                48.193          98.419        0.474
comM          Legionella pneumophila strain ERS1305867         48.193          98.419        0.474
RA0C_RS07335  Riemerella anatipestifer ATCC 11845 = DSM 15868  43.874          100           0.439


Multiple sequence alignment