Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   E5E97_RS03725 Genome accession   NZ_CP038513
Coordinates   743749..745263 (+) Length   504 a.a.
NCBI ID   WP_118880121.1    Uniprot ID   -
Organism   Aeromonas sp. 2692-1     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 738749..750263
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E5E97_RS03710 (E5E97_03730) - 740514..741458 (-) 945 WP_118880123.1 branched-chain amino acid transaminase -
  E5E97_RS03715 (E5E97_03735) ilvM 741471..741722 (-) 252 WP_005306810.1 acetolactate synthase 2 small subunit -
  E5E97_RS03720 (E5E97_03740) ilvG 741719..743365 (-) 1647 WP_118880122.1 acetolactate synthase 2 catalytic subunit -
  E5E97_RS03725 (E5E97_03750) comM 743749..745263 (+) 1515 WP_118880121.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  E5E97_RS03730 (E5E97_03755) - 745384..746298 (+) 915 WP_029302342.1 acyltransferase -
  E5E97_RS03735 (E5E97_03760) - 746292..747248 (+) 957 WP_171279888.1 acyltransferase -
  E5E97_RS03740 (E5E97_03765) - 747194..747679 (-) 486 WP_101149667.1 DUF523 domain-containing protein -
  E5E97_RS03745 (E5E97_03770) - 747779..748429 (-) 651 WP_011707866.1 DNA mismatch repair protein MutT -
  E5E97_RS03750 (E5E97_03775) - 748505..749140 (+) 636 WP_011707867.1 nicotinamidase -

Sequence


Protein


Download         Length: 504 a.a.        Molecular weight: 54768.69 Da        Isoelectric Point: 7.2612

>NTDB_id=354925 E5E97_RS03725 WP_118880121.1 743749..745263(+) (comM) [Aeromonas sp. 2692-1]
MSLAVVYSRASLGVAAPQVTVEVHLSNGLPAFNMVGLPETSVKESRDRVRSALLNGNFEFPSKHITVNLAPADLPKEGGR
FDLAIAIGILAASKQIPAKYLLDHEFLGELALTGEIRPVLGVLPAVLACRDAGRTLLVPRENGPEASLIQDAEVRTAHQL
LAVTAWLAGQYELPLPDPQTTDALPDVPDLQDVIGQSQAKRALEIAAAGSHNLLFIGPPGTGKSMLASRLPGILPPLSEQ
EAQQTAAIHSIGGLTPRAGHWHHRPYRTPHHSASAVALVGGGSHPRPGEISLAHNGVLFLDELPEFERKVLDSLREPLET
GHITISRAARQVDFPARFQLVGAMNPSPCGHYGDGQTRSSPDQILRYLGKLSGPFLDRFDLTVEVPLLPKGSLTGKAERG
ESSQQIRERVLAARERMLSRNGKLNNLLDSREIEEICRLSPQDAEFLENAIQKLGLSIRAWHRILRVSRTIADLAGWQTI
EKEHLIEALGYRAMDRLLSRLRSG

Nucleotide


Download         Length: 1515 bp        

>NTDB_id=354925 E5E97_RS03725 WP_118880121.1 743749..745263(+) (comM) [Aeromonas sp. 2692-1]
ATGTCATTAGCTGTGGTTTATAGCCGTGCCAGCTTGGGTGTTGCGGCCCCGCAAGTGACGGTGGAAGTCCATTTGTCCAA
CGGGCTGCCCGCCTTCAACATGGTGGGCCTGCCGGAAACCTCGGTGAAGGAGTCGCGGGATCGGGTGCGCAGCGCCCTGC
TCAACGGCAATTTCGAGTTCCCGAGCAAGCACATCACGGTCAATCTGGCCCCCGCCGATCTGCCCAAGGAAGGCGGTCGT
TTCGATCTGGCCATCGCCATCGGCATCCTCGCCGCTTCCAAGCAGATACCCGCAAAATACCTGCTCGATCACGAATTTTT
AGGCGAACTGGCCCTGACCGGCGAGATCCGCCCCGTGCTCGGCGTGCTGCCCGCCGTGCTCGCCTGCCGCGATGCAGGGC
GCACCCTGCTGGTACCGCGGGAGAACGGTCCGGAGGCTTCCCTCATCCAAGACGCCGAAGTGCGCACCGCCCACCAGCTG
TTGGCCGTCACCGCCTGGCTGGCAGGCCAGTACGAACTGCCACTACCGGATCCCCAGACCACGGATGCCCTGCCCGATGT
GCCGGACCTGCAGGACGTGATCGGCCAGTCCCAGGCCAAGCGGGCGCTGGAGATTGCCGCCGCCGGCAGCCACAACCTGC
TGTTCATCGGCCCGCCCGGCACCGGCAAGAGCATGCTGGCCAGCCGCTTGCCCGGCATCTTGCCGCCGCTGAGCGAACAG
GAAGCGCAGCAGACCGCCGCCATTCACTCCATCGGCGGCCTCACCCCGCGCGCCGGTCACTGGCATCACAGACCCTACCG
TACGCCGCACCACAGCGCCTCGGCGGTGGCGCTGGTGGGGGGTGGCAGCCACCCGCGGCCCGGTGAAATATCCCTGGCCC
ACAACGGGGTGCTGTTTCTGGATGAACTGCCCGAGTTCGAGCGCAAGGTGCTCGACTCCCTGCGCGAGCCGCTGGAGACC
GGCCACATCACCATCAGCCGGGCCGCCCGTCAGGTGGATTTTCCCGCCCGCTTCCAGCTGGTCGGCGCCATGAACCCCAG
CCCTTGTGGACACTATGGCGATGGCCAGACCCGCTCCAGCCCGGATCAGATCCTGCGCTACCTTGGCAAGCTCTCCGGCC
CCTTTCTCGACCGCTTCGACCTGACGGTGGAGGTGCCACTGCTGCCCAAGGGCAGCCTGACCGGCAAGGCAGAGCGGGGA
GAGTCGAGCCAGCAGATCCGCGAGCGGGTACTGGCGGCGCGGGAGCGCATGCTGAGCCGCAACGGCAAGCTCAACAACTT
GCTTGATAGCCGTGAAATCGAAGAAATTTGCCGCTTATCGCCGCAGGATGCCGAGTTTCTGGAGAACGCCATCCAGAAGC
TGGGGCTCAGCATCCGGGCCTGGCACCGCATCCTGCGGGTGTCGCGCACCATAGCCGATCTGGCAGGATGGCAGACCATC
GAGAAGGAGCACCTGATCGAGGCGCTCGGCTACCGGGCCATGGACCGGCTGTTGTCACGGCTGCGCAGCGGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

63.347

99.603

0.631

  comM Vibrio campbellii strain DS40M4

62.351

99.603

0.621

  comM Haemophilus influenzae Rd KW20

61.858

100

0.621

  comM Glaesserella parasuis strain SC1401

60.355

100

0.607

  comM Legionella pneumophila str. Paris

50.794

100

0.508

  comM Legionella pneumophila strain ERS1305867

50.794

100

0.508

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.443

100

0.466


Multiple sequence alignment