Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   C2U39_RS20925 Genome accession   NZ_CP026222
Coordinates   4452426..4453943 (+) Length   505 a.a.
NCBI ID   WP_042016430.1    Uniprot ID   -
Organism   Aeromonas sp. ASNIH3     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 4447426..4458943
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C2U39_RS20905 (C2U39_20910) - 4449184..4450128 (-) 945 WP_039039680.1 branched-chain amino acid transaminase -
  C2U39_RS20910 (C2U39_20915) ilvM 4450141..4450398 (-) 258 WP_010676086.1 acetolactate synthase 2 small subunit -
  C2U39_RS20915 (C2U39_20920) ilvG 4450395..4452041 (-) 1647 WP_039039681.1 acetolactate synthase 2 catalytic subunit -
  C2U39_RS20925 (C2U39_20930) comM 4452426..4453943 (+) 1518 WP_042016430.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  C2U39_RS20930 (C2U39_20935) - 4454076..4454975 (+) 900 WP_103255147.1 acyltransferase -
  C2U39_RS20935 (C2U39_20940) - 4454983..4455939 (+) 957 WP_039039684.1 acyltransferase -
  C2U39_RS20940 (C2U39_20945) - 4455885..4456370 (-) 486 WP_103255145.1 DUF523 domain-containing protein -
  C2U39_RS20945 (C2U39_20950) - 4456447..4457097 (-) 651 WP_029314880.1 DNA mismatch repair protein MutT -
  C2U39_RS20950 (C2U39_20955) - 4457172..4457807 (+) 636 WP_069652407.1 nicotinamidase -

Sequence


Protein


Download         Length: 505 a.a.        Molecular weight: 54686.63 Da        Isoelectric Point: 7.4701

>NTDB_id=267902 C2U39_RS20925 WP_042016430.1 4452426..4453943(+) (comM) [Aeromonas sp. ASNIH3]
MSLAVVYSRASLGIAAPQVTVEVHLSNGLPAFNMVGLPETSVKESRDRVRSALLNGNFEFPAKHITVNLAPADLPKEGGR
FDLAIAIGILAASKQIPAKYLLDHEFLGELALTGEIRPVLGVLPSVLACRDAGRTLLVPRENGPEASLIQDAEVRTAHQL
LAVTAWLAGQYELPLPEPQSAEILPDVPDLQDVIGQSQAKRALEIAAAGSHNLLFIGPPGTGKSMLASRLPGILPPLSEQ
EAQQTAAIHSIGGLTPRAGHWHHRPYRTPHHSASAVALVGGGTHPRPGEISLAHNGVLFLDELPEFERKVLDSLREPLET
GHITISRAARQVDFPARFQLVGAMNPSPCGHYGDGQTRSSPDQILRYLGKLSGPFLDRFDLTVEVPLLPRGSLTGKAERG
ESSQQIRERVLGARERMLSRSGKLNNLLDSREIEDVCRLSPQDAEFLEGAIQKLGLSIRAWHRILRVSRTIADLAGHATI
ERAHLIEALGYRAMDRLLSRLRGGQ

Nucleotide


Download         Length: 1518 bp        

>NTDB_id=267902 C2U39_RS20925 WP_042016430.1 4452426..4453943(+) (comM) [Aeromonas sp. ASNIH3]
ATGTCATTAGCTGTTGTTTATAGCCGTGCCAGCCTGGGGATAGCGGCGCCGCAAGTCACGGTGGAGGTGCACCTCTCCAA
CGGCCTGCCCGCCTTCAACATGGTGGGGTTGCCGGAAACCTCGGTGAAAGAGTCGCGGGATCGGGTGCGCAGCGCCCTGC
TCAACGGCAATTTCGAATTTCCGGCCAAGCACATCACGGTCAACCTGGCCCCTGCGGATCTGCCCAAAGAGGGGGGCCGC
TTCGACCTGGCCATCGCCATCGGCATTCTCGCCGCATCCAAGCAGATACCCGCAAAATACCTGCTCGATCACGAATTTTT
AGGGGAGTTGGCCCTGACGGGCGAGATCCGCCCCGTGCTTGGGGTGCTCCCCTCGGTGCTCGCCTGCCGCGACGCAGGGC
GCACCCTGCTGGTCCCACGGGAGAACGGTCCGGAGGCTTCCCTCATCCAGGATGCCGAGGTGCGCACCGCCCACCAGCTG
CTGGCGGTCACCGCCTGGCTGGCGGGCCAGTACGAACTACCGCTGCCGGAACCCCAGAGCGCGGAGATCCTGCCCGACGT
GCCGGATCTGCAGGATGTGATCGGTCAGTCCCAGGCGAAGCGGGCGCTGGAGATCGCCGCAGCTGGCAGCCACAACCTGC
TGTTCATCGGCCCCCCCGGCACCGGCAAGAGCATGCTGGCCAGCCGTCTGCCCGGCATTCTGCCACCGCTGAGCGAACAG
GAGGCGCAGCAGACCGCAGCCATTCACTCCATCGGCGGCCTCACTCCCCGCGCCGGCCACTGGCACCACAGGCCGTATCG
CACTCCCCATCACAGCGCCTCGGCGGTGGCGCTGGTGGGCGGGGGTACCCATCCCAGGCCTGGCGAAATTTCCCTCGCCC
ACAACGGAGTCCTGTTTCTGGATGAACTGCCGGAGTTCGAGCGCAAGGTGCTCGACTCCCTTCGCGAGCCGCTGGAAACC
GGGCACATTACCATCAGTCGTGCCGCCCGCCAGGTGGATTTTCCCGCCCGCTTCCAGCTGGTGGGTGCCATGAATCCCAG
CCCCTGCGGCCATTATGGCGACGGCCAGACCCGCTCCAGCCCGGATCAGATCCTGCGCTATCTGGGCAAGCTCTCCGGCC
CCTTTCTCGACCGGTTCGATCTGACGGTAGAGGTCCCGCTCTTGCCCAGGGGGAGCCTGACCGGCAAGGCGGAGCGGGGG
GAATCGAGCCAGCAGATACGCGAACGGGTGCTGGGGGCGCGAGAGCGCATGCTGAGTCGCAGCGGAAAACTCAACAATCT
GCTGGATAGTCGTGAAATCGAAGATGTTTGCAGATTATCGCCACAGGATGCCGAGTTTCTGGAAGGTGCCATCCAGAAGC
TGGGGCTCAGCATCCGGGCCTGGCACCGCATCCTGCGGGTATCGCGCACCATCGCCGACCTGGCGGGGCACGCCACCATC
GAGAGAGCCCATCTGATCGAAGCCCTGGGCTATCGTGCCATGGACAGGCTGTTGTCGCGGCTGCGCGGGGGCCAGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

63.944

99.406

0.636

  comM Haemophilus influenzae Rd KW20

61.569

100

0.622

  comM Vibrio campbellii strain DS40M4

62.151

99.406

0.618

  comM Glaesserella parasuis strain SC1401

60.433

100

0.608

  comM Legionella pneumophila str. Paris

51.411

98.218

0.505

  comM Legionella pneumophila strain ERS1305867

51.411

98.218

0.505

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.776

100

0.461


Multiple sequence alignment