Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: C2U40_RS22000
Genome accession: NZ_CP026217
Coordinates: 4683473..4684987 (+)
Length: 504 a.a.
NCBI ID: WP_103827658.1
UniProt ID: -
Organism: Aeromonas sp. ASNIH4
Function: ssDNA binding (predicted from homology); DNA processing
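
The coordinates and the protein length in this record agree with each other, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and that the final codon of the span is a stop codon; the function name is illustrative and not part of any database tooling.

```python
def gene_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a 1-based inclusive span."""
    nt_len = end - start + 1  # inclusive coordinates, so add 1
    if nt_len % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    aa_len = nt_len // 3 - 1  # assumption: the last codon is a stop codon
    return nt_len, aa_len


print(gene_lengths(4683473, 4684987))  # (1515, 504)
```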

Genomic Context


Location: 4678473..4689987
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C2U40_RS21980 (C2U40_21985) - 4680238..4681182 (-) 945 WP_024945526.1 branched-chain amino acid transaminase -
  C2U40_RS21985 (C2U40_21990) ilvM 4681195..4681446 (-) 252 WP_005306810.1 acetolactate synthase 2 small subunit -
  C2U40_RS21990 (C2U40_21995) ilvG 4681443..4683089 (-) 1647 WP_103827657.1 acetolactate synthase 2 catalytic subunit -
  C2U40_RS22000 (C2U40_22005) comM 4683473..4684987 (+) 1515 WP_103827658.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  C2U40_RS22005 (C2U40_22010) - 4685108..4686022 (+) 915 WP_029302342.1 acyltransferase -
  C2U40_RS22010 (C2U40_22015) - 4686016..4686972 (+) 957 WP_045525913.1 acyltransferase -
  C2U40_RS22015 (C2U40_22020) - 4686918..4687400 (-) 483 WP_103827660.1 DUF523 domain-containing protein -
  C2U40_RS22020 (C2U40_22025) - 4687397..4688047 (-) 651 WP_011707866.1 DNA mismatch repair protein MutT -
  C2U40_RS22025 (C2U40_22030) - 4688123..4688758 (+) 636 WP_103827661.1 nicotinamidase -
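
The Location range above equals the comM span extended by 5000 bp on each side, and the table coordinates give the spacing to the flanking genes directly. The sketch below works through both calculations. The 5000 bp flank is inferred from the numbers in this record rather than from documented behaviour, and the helper names are hypothetical.

```python
FLANK = 5000  # assumption: inferred from the Location line, not documented


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a 1-based inclusive span by `flank` bases on both sides."""
    return max(1, start - flank), end + flank


def gap(left_end: int, right_start: int) -> int:
    """Bases strictly between two features; a negative value means overlap."""
    return right_start - left_end - 1


print(context_window(4683473, 4684987))  # (4678473, 4689987)
print(gap(4683089, 4683473))  # ilvG end to comM start: 383
print(gap(4684987, 4685108))  # comM end to downstream acyltransferase: 120
```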

Sequence


Protein


Length: 504 a.a.        Molecular weight: 54708.63 Da        Isoelectric point: 7.4494

>NTDB_id=267874 C2U40_RS22000 WP_103827658.1 4683473..4684987(+) (comM) [Aeromonas sp. ASNIH4]
MSLAVVYSRASLGVAAPQVTVEVHLSNGLPAFNMVGLPETSVKESRDRVRSALLNGNFEFPSKHITVNLAPADLPKEGGR
FDLAIAIGILAASKQIPAKYLLDHEFLGELALTGEIRPVLGVLPAVLACRDAGRTLLVPRENGPEASLIQDAEVRTAHQL
LAVTAWLAGQYELPLPDPQSTEALPDVPDLQDVIGQSQAKRALEIAAAGSHNLLFIGPPGTGKSMLASRLPGILPPLSEQ
EAQQTAAIHSIGGLTPRAGHWHHRPYRTPHHSASAVALVGGGSHPRPGEISLAHNGVLFLDELPEFERKVLDSLREPLET
GHITISRAARQVDFPARFQLVGAMNPSPCGHYGDGQTRSSPDQILRYLGKLSGPFLDRFDLTVEVPLLPKGSLTGKAERG
ESSQQIRERVLAARERMLSRNGKLNNLLDSREIEEICRLSPQDAEFLENAIQKLGLSIRAWHRILRVSRTIADLAGRQAI
EKEHLIEALGYRAMDRLLSRLRSG
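
The FASTA header of this record packs several fields into one line. The sketch below shows one way to split it with a regular expression. The field names (`ntdb_id`, `locus_tag`, and so on) are assumptions based on how the header reads in this record, not an official format specification, so headers from other records may need a looser pattern.

```python
import re

# Pattern inferred from the single header shown in this record (assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (
    ">NTDB_id=267874 C2U40_RS22000 WP_103827658.1 "
    "4683473..4684987(+) (comM) [Aeromonas sp. ASNIH4]"
)
print(parse_header(header)["gene"])  # comM
```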

Nucleotide


Length: 1515 bp

>NTDB_id=267874 C2U40_RS22000 WP_103827658.1 4683473..4684987(+) (comM) [Aeromonas sp. ASNIH4]
ATGTCATTAGCTGTGGTTTATAGCCGTGCCAGCTTAGGTGTCGCGGCCCCGCAAGTGACGGTGGAGGTACACCTCTCCAA
CGGTTTGCCCGCCTTCAACATGGTGGGCCTGCCGGAAACCTCGGTGAAGGAGTCGCGGGATCGGGTGCGCAGCGCCCTGC
TCAACGGCAATTTCGAGTTCCCGAGCAAACACATCACGGTCAATCTGGCCCCCGCCGATCTGCCCAAGGAGGGGGGCCGC
TTCGATCTGGCCATCGCCATCGGCATTCTCGCGGCTTCCAAGCAGATACCTGCAAAATACCTGCTAGATCACGAATTTTT
AGGCGAACTGGCCCTGACCGGCGAGATCCGTCCCGTGCTCGGGGTGCTGCCCGCCGTGCTCGCCTGCCGCGATGCGGGGC
GCACTCTGCTGGTGCCGCGAGAGAACGGCCCCGAAGCCTCACTGATCCAGGACGCCGAAGTGCGCACCGCCCATCAGCTG
CTGGCCGTCACCGCCTGGCTGGCGGGCCAGTACGAGCTGCCACTGCCGGATCCCCAGAGCACGGAGGCCCTGCCCGATGT
GCCGGACCTGCAGGACGTGATCGGCCAGTCTCAGGCCAAGCGGGCGCTGGAGATCGCTGCCGCCGGCAGCCACAACCTGC
TGTTCATCGGCCCGCCCGGCACCGGCAAGAGCATGCTGGCCAGCCGCTTGCCCGGTATCTTGCCGCCCCTTAGCGAACAG
GAAGCGCAGCAGACCGCCGCCATCCATTCCATCGGCGGCCTCACCCCGCGCGCCGGTCACTGGCATCACAGGCCCTATCG
CACGCCCCATCACAGCGCCTCGGCGGTGGCGCTGGTGGGGGGTGGCAGCCACCCAAGGCCCGGCGAAATTTCGTTGGCCC
ACAATGGGGTGCTGTTTCTGGATGAACTGCCCGAGTTCGAGCGCAAGGTGCTCGACTCCCTGCGCGAGCCGCTGGAGACC
GGCCATATCACCATCAGTCGGGCCGCCCGCCAGGTGGATTTTCCCGCCCGCTTCCAGCTGGTCGGCGCCATGAATCCCAG
CCCTTGCGGCCACTATGGCGATGGCCAGACCCGCTCCAGCCCGGATCAGATCCTGCGCTACCTCGGCAAGCTCTCCGGCC
CCTTTCTCGACCGCTTCGACCTGACGGTGGAGGTGCCGCTGCTGCCCAAGGGGAGTCTCACCGGCAAGGCGGAGCGGGGG
GAGTCGAGCCAGCAGATCCGCGAACGGGTGCTGGCGGCGCGGGAGCGTATGCTGAGCCGCAACGGCAAGCTCAACAACCT
GCTTGATAGCCGTGAAATCGAAGAAATTTGCCGCTTATCGCCGCAGGATGCCGAGTTTCTGGAGAACGCCATCCAGAAGC
TGGGGCTCAGCATCCGGGCCTGGCACCGCATCCTGCGGGTGTCGCGCACCATAGCCGATCTGGCAGGACGGCAAGCCATC
GAGAAGGAGCACCTGATCGAGGCGCTCGGTTACCGGGCCATGGACCGGCTGCTGTCGCGGCTGCGCAGCGGCTGA
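
The nucleotide and protein sequences can be cross-checked by translating the coding sequence. The following is a minimal sketch that assumes the standard codon assignments (which the bacterial code shares) and translation in frame from the first base; `translate` is an illustrative helper, not a function from any particular library. It is shown on the first ten codons and the last three codons of the sequence above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate DNA in frame 1; raises KeyError on non-ACGT characters."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCATTAGCTGTGGTTTATAGCCGTGCC"))  # MSLAVVYSRA
print(translate("AGCGGCTGA"))  # SG (TGA is the stop codon)
```

Both outputs match the corresponding ends of the protein sequence listed in this record.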


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein         Organism                                                 Identities (%)   Coverage (%)   Ha-value
  comM          Vibrio cholerae strain A1552                             63.546           99.603         0.633
  comM          Vibrio campbellii strain DS40M4                          62.55            99.603         0.623
  comM          Haemophilus influenzae Rd KW20                           62.055           100            0.623
  comM          Glaesserella parasuis strain SC1401                      60.552           100            0.609
  comM          Legionella pneumophila str. Paris                        50.794           100            0.508
  comM          Legionella pneumophila strain ERS1305867                 50.794           100            0.508
  RA0C_RS07335  Riemerella anatipestifer ATCC 11845 = DSM 15868          46.443           100            0.466


Multiple sequence alignment