Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   R2E40_RS00645 Genome accession   NZ_CP137631
Coordinates   149772..151286 (-) Length   504 a.a.
NCBI ID   WP_318164514.1    Uniprot ID   -
Organism   Aeromonas sp. CD     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 144772..156286
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R2E40_RS00620 (R2E40_00620) - 145894..146529 (-) 636 WP_045788782.1 nicotinamidase -
  R2E40_RS00625 (R2E40_00625) - 146605..147255 (+) 651 WP_011707866.1 NUDIX hydrolase -
  R2E40_RS00630 (R2E40_00630) - 147356..147841 (+) 486 WP_318164512.1 DUF523 domain-containing protein -
  R2E40_RS00635 (R2E40_00635) - 147787..148743 (-) 957 WP_016352357.1 acyltransferase -
  R2E40_RS00640 (R2E40_00640) - 148737..149651 (-) 915 WP_318164513.1 acyltransferase -
  R2E40_RS00645 (R2E40_00645) comM 149772..151286 (-) 1515 WP_318164514.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  R2E40_RS00650 (R2E40_00650) ilvG 151670..153316 (+) 1647 WP_318164515.1 acetolactate synthase 2 catalytic subunit -
  R2E40_RS00655 (R2E40_00655) ilvM 153313..153564 (+) 252 WP_005306810.1 acetolactate synthase 2 small subunit -
  R2E40_RS00660 (R2E40_00660) - 153577..154521 (+) 945 WP_011707859.1 branched-chain amino acid transaminase -

Sequence


Protein


Download         Length: 504 a.a.        Molecular weight: 54511.40 Da        Isoelectric Point: 7.6884

>NTDB_id=899543 R2E40_RS00645 WP_318164514.1 149772..151286(-) (comM) [Aeromonas sp. CD]
MSLAVVYSRASLGVAAPQVTVEVHLSNGLPAFNMVGLPETSVKESRDRVRSALLNGNFEFPSKHITVNLAPADLPKEGGR
FDLAIAIGILAASKQIPAKYLLDHEFLGELALTGEIRPVLGVLPAVLACRDAGRTLLVPRENGPEASLIQDAEVRTAHQL
LAVTAWLAGQYELPLPDPQTTDALPDVPDLQDVIGQSQAKRALEIAAAGSHNLLFIGPPGTGKSMLASRLPGILPPLSEQ
EAQQTAAIHSIGGLTPRAGHWHHRPYRTPHHSASAVALVGGGSHPRPGEISLAHNGVLFLDELPEFERKVLDSLREPLET
GHITISRAARQVDFPARFQLVGAMNPSPCGHYGDGQTRSSPDQILRYLSKLSGPFLDRFDLTVEVPLLPKGSLTGKAERG
ESSQQIRERVLGARERMLSRNGKLNNLLDSREIEGICRLSSQDAEFLENAIQKLGLSIRAWHRILRVSRTIADLAGQPAI
GKEHLIEALGYRAMDRLLSRLRSG

Nucleotide


Download         Length: 1515 bp        

>NTDB_id=899543 R2E40_RS00645 WP_318164514.1 149772..151286(-) (comM) [Aeromonas sp. CD]
ATGTCATTAGCTGTGGTTTATAGCCGTGCCAGCTTGGGTGTCGCGGCCCCGCAAGTGACGGTGGAGGTGCACCTCTCCAA
CGGTTTGCCCGCCTTCAACATGGTGGGCCTGCCGGAAACCTCGGTGAAGGAGTCGCGGGATCGGGTGCGCAGCGCCCTGC
TCAACGGCAATTTCGAGTTCCCGAGCAAACACATCACGGTCAATCTGGCCCCCGCCGATCTGCCCAAGGAGGGGGGCCGC
TTCGATCTGGCCATCGCCATCGGCATTCTCGCAGCTTCCAAGCAAATACCCGCAAAATACCTGCTCGATCACGAATTTTT
AGGTGAACTGGCCCTGACCGGCGAGATCCGTCCCGTGCTCGGCGTGCTGCCCGCCGTGCTCGCCTGCCGCGATGCGGGTC
GCACCCTGCTGGTACCACGAGAAAACGGCCCCGAGGCCTCGCTAATCCAGGATGCCGAGGTGCGTACCGCCCATCAGCTG
CTGGCCGTTACCGCCTGGCTGGCAGGCCAGTACGAGCTGCCGCTGCCGGATCCCCAGACCACGGATGCCCTGCCCGATGT
GCCGGACCTGCAGGACGTGATCGGCCAGTCCCAGGCCAAGCGGGCGCTGGAGATCGCCGCCGCCGGCAGCCACAACCTGC
TGTTCATCGGCCCGCCAGGCACCGGCAAGAGCATGCTGGCCAGCCGCTTGCCCGGCATCTTGCCACCGCTCAGCGAACAG
GAGGCACAGCAGACCGCCGCCATTCACTCCATCGGCGGCCTCACCCCGCGCGCCGGTCACTGGCATCACAGGCCCTATCG
AACGCCGCACCACAGCGCCTCGGCGGTGGCGCTGGTGGGAGGTGGCAGTCACCCGCGGCCCGGTGAAATTTCGTTGGCCC
ACAACGGGGTGCTGTTTCTGGATGAACTGCCCGAGTTCGAGCGCAAGGTGCTCGACTCCCTGCGCGAGCCGCTGGAGACC
GGCCACATCACCATCAGCCGGGCCGCCCGCCAGGTGGATTTCCCCGCCCGCTTCCAGCTGGTCGGCGCCATGAACCCCAG
CCCTTGCGGCCACTATGGCGATGGCCAGACCCGCTCCAGCCCGGATCAGATCCTGCGCTACCTCAGCAAGCTCTCCGGCC
CCTTTCTCGACCGCTTCGACCTGACGGTAGAGGTGCCACTGCTGCCCAAGGGCAGCCTGACCGGCAAGGCGGAGCGGGGG
GAGTCGAGCCAGCAGATCCGAGAACGGGTGCTGGGGGCGCGGGAGCGCATGCTGAGCCGCAACGGCAAACTCAACAACCT
GCTTGATAGCCGTGAAATCGAAGGAATTTGCCGCTTGTCGTCACAGGATGCCGAGTTTCTGGAGAACGCCATCCAGAAGC
TGGGGCTCAGCATCCGGGCCTGGCACCGCATCCTGCGGGTGTCGCGCACCATAGCCGATCTGGCCGGACAACCCGCCATC
GGCAAGGAGCACCTGATCGAGGCGCTCGGCTACCGGGCCATGGACCGGCTGCTGTCACGGCTGCGCAGCGGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

63.546

99.603

0.633

  comM Haemophilus influenzae Rd KW20

61.66

100

0.619

  comM Vibrio campbellii strain DS40M4

62.151

99.603

0.619

  comM Glaesserella parasuis strain SC1401

60.355

100

0.607

  comM Legionella pneumophila str. Paris

50.992

100

0.51

  comM Legionella pneumophila strain ERS1305867

50.992

100

0.51

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.443

100

0.466