Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   GWG05_RS20100 Genome accession   NZ_CP047982
Coordinates   4325441..4326958 (+) Length   505 a.a.
NCBI ID   WP_180907105.1    Uniprot ID   -
Organism   Aeromonas caviae strain 1507-17068     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 4320441..4331958
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GWG05_RS20085 (GWG05_20065) - 4322199..4323143 (-) 945 WP_052815778.1 branched-chain amino acid transaminase -
  GWG05_RS20090 (GWG05_20070) ilvM 4323156..4323413 (-) 258 WP_180907104.1 acetolactate synthase 2 small subunit -
  GWG05_RS20095 (GWG05_20075) ilvG 4323410..4325056 (-) 1647 WP_039039681.1 acetolactate synthase 2 catalytic subunit -
  GWG05_RS20100 (GWG05_20085) comM 4325441..4326958 (+) 1518 WP_180907105.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  GWG05_RS20105 (GWG05_20090) - 4327091..4327990 (+) 900 WP_049636080.1 acyltransferase -
  GWG05_RS20110 (GWG05_20095) - 4327998..4328954 (+) 957 WP_039039684.1 acyltransferase -
  GWG05_RS20115 (GWG05_20100) - 4328900..4329385 (-) 486 WP_049636081.1 DUF523 domain-containing protein -
  GWG05_RS20120 (GWG05_20105) - 4329462..4330112 (-) 651 WP_049636082.1 NUDIX hydrolase -
  GWG05_RS20125 (GWG05_20110) - 4330187..4330822 (+) 636 WP_041214225.1 nicotinamidase -

Sequence


Protein


Download         Length: 505 a.a.        Molecular weight: 54746.64 Da        Isoelectric Point: 7.4702

>NTDB_id=418986 GWG05_RS20100 WP_180907105.1 4325441..4326958(+) (comM) [Aeromonas caviae strain 1507-17068]
MSLAVVYSRASLGIAAPQVTVEVHLSNGLPAFNMVGLPETSVKESRDRVRSALLNGNFEFPSKHITVNLAPADLPKEGGR
FDLAIAIGILAASKQIPARYLLDHEFLGELALTGEIRPVLGVLPSVLACRDAGRTLLVPRENGPEASLIQDAEVRTAHQL
LAVTAWLAGQYELPLPEPQSAETLPDVPDLQDVIGQSQAKRALEIAAAGSHNLLFIGPPGTGKSMLASRLPGILPPLSEQ
EAQQTAAIHSIGGLTPRAGHWHHRPYRTPHHSASAVALVGGGTHPRPGEISLAHNGVLFLDELPEFERKVLDSLREPLET
GHITISRAARQVDFPARFQLVGAMNPSPCGHYGDGQTRSSPDQILRYLGKLSGPFLDRFDLTVEVPLLPRGSLTGKVERG
ESSQQIRERVLGARERMLSRSGKLNNLLDSREIEDVCRLSPQDAEFLEGAIQKLGLSIRAWHRILRVSRTIADLAGHATI
ERAHLIEALGYRAMDRLLSRLRGGQ

Nucleotide


Download         Length: 1518 bp        

>NTDB_id=418986 GWG05_RS20100 WP_180907105.1 4325441..4326958(+) (comM) [Aeromonas caviae strain 1507-17068]
ATGTCATTAGCTGTTGTTTATAGCCGTGCCAGCCTGGGGATAGCGGCGCCGCAAGTCACGGTGGAGGTGCACCTCTCCAA
CGGCCTGCCCGCCTTCAACATGGTGGGGCTGCCGGAAACCTCGGTGAAGGAGTCGCGGGATCGGGTGCGCAGCGCCCTGC
TCAACGGCAATTTCGAGTTCCCGAGCAAACACATCACGGTCAACCTGGCCCCTGCGGATCTGCCCAAAGAGGGGGGTCGC
TTCGACCTGGCCATCGCCATCGGCATTCTCGCCGCATCCAAGCAGATACCCGCAAGATACCTGCTCGATCACGAATTTTT
AGGGGAATTGGCCCTGACGGGCGAGATCCGCCCCGTGCTTGGGGTGCTCCCCTCGGTGCTCGCCTGCCGCGACGCAGGGC
GCACCCTGCTGGTCCCACGGGAGAACGGTCCGGAGGCTTCCCTCATACAGGATGCCGAGGTGCGCACCGCCCACCAGCTG
CTGGCGGTCACCGCCTGGCTGGCGGGCCAGTACGAACTGCCGCTGCCGGAACCCCAGAGCGCGGAGACCCTGCCCGACGT
GCCGGATCTGCAGGATGTGATCGGTCAGTCCCAGGCGAAGCGGGCGCTGGAGATCGCCGCAGCTGGCAGCCACAACCTGC
TGTTCATCGGCCCCCCCGGCACCGGCAAGAGCATGTTGGCCAGCCGTCTGCCCGGCATTCTGCCACCGCTGAGCGAACAG
GAGGCGCAGCAGACCGCCGCCATTCACTCCATCGGCGGCCTCACCCCCCGCGCCGGCCACTGGCACCACAGGCCGTATCG
CACGCCCCATCACAGCGCCTCGGCGGTGGCGCTGGTGGGCGGGGGTACCCATCCCAGGCCTGGCGAAATTTCCCTCGCCC
ACAACGGGGTACTGTTTCTGGATGAATTGCCGGAGTTCGAGCGCAAGGTGCTCGACTCCCTGCGCGAGCCGCTGGAAACC
GGGCACATTACCATCAGTCGCGCCGCCCGCCAGGTGGATTTTCCCGCCCGCTTCCAGCTGGTGGGTGCCATGAATCCCAG
CCCCTGCGGCCATTATGGCGACGGCCAGACCCGCTCCAGCCCGGATCAGATCCTGCGCTATCTGGGCAAGCTCTCCGGCC
CCTTTCTCGATCGGTTCGATCTGACGGTAGAGGTCCCGCTCTTGCCCAGGGGGAGCCTGACCGGCAAGGTGGAGCGGGGG
GAATCGAGCCAGCAGATACGCGAACGGGTGCTGGGGGCGCGAGAGCGCATGCTGAGTCGCAGCGGAAAACTCAACAACCT
GCTGGATAGTCGTGAAATCGAAGATGTTTGCAGATTATCGCCACAGGATGCCGAGTTTCTGGAAGGTGCCATCCAGAAGC
TGGGGCTCAGCATCCGGGCCTGGCACCGCATCCTGCGGGTATCGCGCACCATCGCCGATCTGGCGGGGCACGCCACCATC
GAGAGAGCGCATCTGATCGAAGCCCTGGGCTATCGTGCCATGGATAGGCTGTTGTCGCGGCTGCGCGGGGGCCAGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

63.745

99.406

0.634

  comM Haemophilus influenzae Rd KW20

61.66

100

0.618

  comM Vibrio campbellii strain DS40M4

61.952

99.406

0.616

  comM Glaesserella parasuis strain SC1401

60.63

100

0.61

  comM Legionella pneumophila str. Paris

50.794

99.802

0.507

  comM Legionella pneumophila strain ERS1305867

50.794

99.802

0.507

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.972

100

0.463


Multiple sequence alignment