Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: GXP68_RS20485
Genome accession: NZ_CP048243
Coordinates: 4478991..4480532 (+)
Length: 513 a.a.
NCBI ID: WP_185688867.1
UniProt ID: -
Organism: Ewingella americana strain B6-1
Function: required for natural transformation (predicted from homology)
Unclear

Genomic Context


Location: 4473991..4485532
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GXP68_RS20460 (GXP68_20440) - 4474164..4475093 (-) 930 WP_034794715.1 branched-chain amino acid transaminase -
  GXP68_RS20465 (GXP68_20445) ilvM 4475110..4475373 (-) 264 WP_034794713.1 acetolactate synthase 2 small subunit -
  GXP68_RS20470 (GXP68_20450) ilvG 4475370..4477016 (-) 1647 WP_185688865.1 acetolactate synthase 2 catalytic subunit -
  GXP68_RS20475 (GXP68_20455) ilvL 4477154..4477255 (-) 102 WP_015699153.1 ilv operon leader peptide -
  GXP68_RS20480 (GXP68_20460) - 4477387..4478625 (-) 1239 WP_185688866.1 MFS transporter -
  GXP68_RS20485 (GXP68_20465) comM 4478991..4480532 (+) 1542 WP_185688867.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  GXP68_RS20490 (GXP68_20470) - 4480558..4480896 (-) 339 WP_034794706.1 DUF413 domain-containing protein -
  GXP68_RS20495 (GXP68_20475) hdfR 4481015..4481842 (+) 828 WP_034794703.1 HTH-type transcriptional regulator HdfR -
  GXP68_RS24055 - 4481861..4481992 (-) 132 WP_272901188.1 hypothetical protein -
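
The coordinates in this record are 1-based and inclusive, so the sizes in the table follow from simple arithmetic. The sketch below checks the comM entry. The function names are illustrative, and the 5000 bp flank is an assumption inferred from the Location window shown above, not something the record states.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def flank_window(start: int, end: int, flank: int) -> tuple[int, int]:
    """Window extending `flank` bp on each side (clamped at position 1)."""
    return max(1, start - flank), end + flank


# comM coordinates taken from the record above
start, end = 4478991, 4480532

length_bp = feature_length(start, end)   # size of the coding sequence
protein_len = length_bp // 3 - 1         # codons minus the stop codon

print(length_bp, protein_len)            # 1542 513
print(flank_window(start, end, 5000))    # (4473991, 4485532), assumed 5 kb flank
```

The same `feature_length` call reproduces the other sizes in the table, for example 4474164..4475093 gives 930 bp.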

Sequence


Protein


Length: 513 a.a.        Molecular weight: 55871.23 Da        Isoelectric Point: 7.8964

>NTDB_id=420116 GXP68_RS20485 WP_185688867.1 4478991..4480532(+) (comM) [Ewingella americana strain B6-1]
MALAVVHTRASLGVQAPGVAVEVHISNGLPALALVGLPETTVKEARDRVRSAILNCGFTFPAKRITVNLAPADLPKEGGR
YDLSIALAILVASEQLSGDKLGDYEFLGELGLSGALRGVNGAIPAALEAQKEGRRLILPQDNRQEMSLLETGIAKVADHL
LQVCAFLQGEEDLHDVERVDPPDIYVNQPDIADIIGQEQSKRALEVAAAGGHNLLLIGPPGTGKTMLASRLPALLPPLTD
QETLETAAIASLVYNPEENGNFSRSRPFRAPHHSTSMSALVGGGSLPRPGEISLAHNGVLFLDELPEFERKVLDALRQPM
ESGEITISRARAKVRYPARTQLIAAMNPSPTGHYQGVHNRTPPQQVLRYLSRLSGPFLDRFDLSIEVPLLPPGVLSLQSQ
KLGSRARESSAQVRERVMRARERQLARSGKINAHMSSSEVEKFCELKKEDAEFLEGVLHKLGLSVRAWHRILKVARTIAD
LNNQSMIEKAHISEALSYRCMDRLLLKLHKSLA
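
The FASTA header above packs several fields into one line. A minimal sketch of how it could be split with a regular expression is shown here. The field layout is inferred from this single header, so the pattern and the key names in the returned dictionary are assumptions, not a documented format.

```python
import re

# Pattern inferred from the one header in this record (an assumption):
# >NTDB_id=<id> <locus_tag> <protein_id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) "
    r"\[(?P<organism>.+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise ValueError if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Convert numeric fields so they can be used in coordinate arithmetic
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=420116 GXP68_RS20485 WP_185688867.1 "
    "4478991..4480532(+) (comM) [Ewingella americana strain B6-1]"
)
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```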

Nucleotide


Length: 1542 bp

>NTDB_id=420116 GXP68_RS20485 WP_185688867.1 4478991..4480532(+) (comM) [Ewingella americana strain B6-1]
ATGGCATTAGCGGTTGTTCACACCAGAGCCTCACTTGGCGTGCAGGCTCCGGGGGTTGCTGTCGAAGTCCATATCAGCAA
CGGCTTGCCCGCTCTGGCGCTGGTTGGCTTACCGGAAACCACGGTAAAAGAGGCACGGGATCGAGTACGCAGCGCCATAC
TCAACTGCGGTTTCACCTTTCCCGCCAAACGCATCACCGTCAATCTGGCTCCCGCCGACCTGCCCAAAGAGGGTGGCCGT
TACGACTTATCCATTGCGTTAGCGATTCTGGTGGCTTCTGAGCAGCTTTCTGGGGATAAGCTGGGTGATTACGAGTTTCT
AGGGGAATTAGGCCTTTCTGGCGCGTTACGTGGCGTAAATGGCGCTATTCCCGCTGCACTAGAAGCCCAAAAAGAGGGGC
GGCGTTTAATCCTCCCGCAAGACAATCGGCAAGAAATGTCACTGCTCGAGACGGGTATCGCCAAAGTCGCCGACCATCTT
CTGCAAGTTTGTGCTTTTTTGCAGGGAGAAGAGGATTTACATGACGTCGAACGCGTCGATCCGCCCGATATCTACGTCAA
CCAGCCCGACATTGCTGACATTATTGGGCAAGAGCAGTCTAAACGAGCGCTGGAAGTCGCCGCCGCCGGTGGGCATAACC
TTTTGCTGATTGGCCCGCCGGGTACCGGGAAAACTATGCTTGCCAGCCGCCTGCCTGCGTTGCTGCCTCCGTTAACCGAC
CAAGAAACTCTAGAGACGGCGGCGATAGCCAGTCTGGTCTACAACCCTGAAGAGAATGGGAATTTCTCACGCAGCCGCCC
TTTCCGCGCCCCTCACCACAGCACCTCCATGAGTGCCTTAGTCGGCGGCGGCTCTTTACCTCGTCCGGGTGAAATTTCGT
TGGCGCACAACGGCGTGCTGTTTCTTGACGAGCTGCCGGAGTTCGAACGCAAAGTACTCGATGCCTTGCGCCAACCCATG
GAGTCAGGCGAAATCACCATCTCCCGCGCCCGCGCCAAGGTGCGTTACCCCGCCAGAACGCAACTCATCGCCGCCATGAA
TCCCAGCCCTACAGGGCATTACCAAGGCGTACACAACCGCACGCCTCCTCAGCAGGTCCTTCGCTATCTCAGCAGGCTCT
CAGGCCCCTTTCTCGACCGTTTTGATTTATCGATTGAAGTGCCTCTCCTGCCACCGGGCGTGCTGAGTTTGCAAAGCCAG
AAGCTCGGCTCTCGCGCCAGAGAGAGCAGTGCGCAGGTACGCGAGCGAGTGATGAGGGCTCGCGAGCGCCAATTAGCCCG
GTCAGGGAAAATTAATGCGCATATGAGCAGCAGTGAAGTCGAAAAATTTTGTGAGCTTAAAAAGGAGGATGCTGAATTTC
TGGAGGGGGTGCTGCACAAGCTGGGATTATCGGTTCGAGCTTGGCATCGTATTCTTAAAGTTGCGAGAACAATAGCCGAT
CTCAATAATCAGTCTATGATTGAGAAAGCTCATATCTCTGAAGCGCTAAGCTACAGATGTATGGATAGATTGCTGCTGAA
GTTACACAAAAGCCTGGCATAA
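
The nucleotide and protein sequences above can be cross-checked by translating the coding sequence. The sketch below builds the standard codon table (the bacterial table assigns the same amino acids to each codon) and translates two short fragments copied from the sequence: its first 21 bases and its last 12 bases. The `translate` helper is illustrative and is not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()      # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):     # ignore any trailing partial codon
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCATTAGCGGTTGTTCAC"))   # MALAVVH (start of the protein)
print(translate("AGCCTGGCATAA"))            # SLA (end of the protein, then stop)
```

Passing the full 1542 bp sequence to `translate` should yield the 513-residue protein, since 1542 bp is 514 codons including the stop.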


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20 63.672 99.805 0.635
  comM Vibrio cholerae strain A1552 63.458 99.22 0.63
  comM Glaesserella parasuis strain SC1401 62.183 100 0.622
  comM Vibrio campbellii strain DS40M4 62.795 99.025 0.622
  comM Legionella pneumophila str. Paris 48.718 98.83 0.481
  comM Legionella pneumophila strain ERS1305867 48.718 98.83 0.481
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 45.315 100 0.462


Multiple sequence alignment