Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   M5524_RS05495 Genome accession   NZ_CP097466
Coordinates   1296736..1298244 (-) Length   502 a.a.
NCBI ID   WP_373990447.1    Uniprot ID   -
Organism   Duganella sp. BuS-21     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 1291736..1303244
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  M5524_RS05480 (M5524_05460) - 1293943..1294284 (-) 342 WP_373990444.1 DUF1840 domain-containing protein -
  M5524_RS05485 (M5524_05465) - 1294426..1294932 (-) 507 WP_373990445.1 hypothetical protein -
  M5524_RS05490 (M5524_05470) - 1294929..1296650 (-) 1722 WP_373990446.1 M1 family metallopeptidase -
  M5524_RS05495 (M5524_05475) comM 1296736..1298244 (-) 1509 WP_373990447.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  M5524_RS05500 (M5524_05480) - 1298354..1298593 (-) 240 WP_373990448.1 accessory factor UbiK family protein -
  M5524_RS05505 (M5524_05485) - 1298845..1299591 (+) 747 WP_373990449.1 TorF family putative porin -
  M5524_RS05510 (M5524_05490) - 1299603..1299941 (+) 339 WP_008444920.1 P-II family nitrogen regulator -
  M5524_RS05515 (M5524_05495) - 1299955..1301514 (+) 1560 WP_373990450.1 ammonium transporter -
  M5524_RS05520 (M5524_05500) gshA 1301641..1302942 (+) 1302 WP_373990451.1 glutamate--cysteine ligase -

Sequence


Protein


Download         Length: 502 a.a.        Molecular weight: 54241.42 Da        Isoelectric Point: 9.3445

>NTDB_id=689574 M5524_RS05495 WP_373990447.1 1296736..1298244(-) (comM) [Duganella sp. BuS-21]
MSLAVLKSRALAGMEAQEVSVEVHLANGLPAFTIVGLADTEVKEAKDRVRAAIQNGGFEFPAQRITVNLAPADLPKESGR
FDLPIALGILAASRQIPSRRLHQYEFAGELSLSGELRPIRGALAMSLATRRDGGCLAFILPLANADEAALVSSAAIYPAH
SLLQVCHHFSGKSVDTMLSRHQAAPLAAPPEYPDFADVKGQVFVKRALEVAAAGTHSILLVGPPGSGKTMLASRFAGLLP
AMSDEEALEAAAVQSLTGSFRIEHWKQRPFRAPHHTSSGAALVGGGSVPRPGEISLAHRGVLFLDELPEFDRRVLDVLRE
PMESGRITISRAARQADFPAHFQLIAAMNPCPCGYFGHPSVGCRCAPDVRLRYINRISGPLLDRIDMQMEVGSVHPDILA
AGADGETSAVIAARVQAAANRQLQRQGKRNQYLAPREIDHYCKLDRPGKAQLKHSMEKFKWSGRAYHRILRVARTIADLA
GATTIALPHIKEAIQYRRALAE

Nucleotide


Download         Length: 1509 bp        

>NTDB_id=689574 M5524_RS05495 WP_373990447.1 1296736..1298244(-) (comM) [Duganella sp. BuS-21]
ATGTCACTCGCCGTACTCAAAAGCCGCGCCCTGGCCGGCATGGAGGCGCAGGAAGTCAGCGTCGAGGTACACCTGGCGAA
CGGCTTGCCGGCCTTCACCATCGTCGGCCTGGCCGATACGGAAGTGAAGGAAGCCAAAGACCGCGTGCGCGCCGCCATCC
AGAACGGCGGCTTCGAATTTCCCGCCCAACGCATCACCGTCAACCTGGCGCCGGCCGACCTGCCCAAGGAGTCTGGCCGC
TTCGACCTGCCGATCGCGCTCGGCATCCTGGCCGCTTCCAGACAAATCCCCTCGCGCCGCCTGCATCAGTACGAATTTGC
CGGCGAGCTGTCCCTGTCCGGCGAACTGCGGCCGATACGCGGTGCGCTGGCGATGTCGCTGGCAACGCGGCGCGACGGCG
GCTGCCTGGCCTTCATCCTGCCGCTGGCCAATGCCGACGAAGCGGCGCTGGTCTCCAGCGCCGCCATCTACCCGGCCCAC
TCGCTGCTGCAGGTGTGCCACCACTTCTCCGGCAAGTCGGTCGACACCATGCTGTCGCGCCACCAAGCGGCGCCGCTGGC
TGCGCCCCCCGAGTATCCCGACTTCGCCGACGTCAAGGGACAGGTCTTCGTCAAGCGTGCGCTGGAGGTGGCGGCGGCCG
GCACCCATTCGATCCTGCTGGTCGGTCCGCCCGGCTCCGGCAAGACCATGCTGGCCTCGCGCTTTGCCGGCCTGCTGCCG
GCAATGAGCGATGAAGAGGCGCTGGAAGCGGCGGCCGTGCAGTCGCTGACCGGCAGCTTCCGCATCGAGCACTGGAAGCA
GCGGCCGTTCCGCGCGCCGCACCACACTTCCTCCGGCGCCGCGCTGGTCGGCGGCGGCAGCGTGCCGCGTCCCGGCGAGA
TTTCCCTGGCCCATCGCGGCGTGCTGTTCCTGGACGAATTGCCGGAGTTCGACCGGCGCGTGCTCGACGTGCTGCGCGAG
CCGATGGAATCGGGCCGCATCACCATCTCGCGCGCGGCGCGCCAGGCCGACTTCCCGGCGCACTTCCAATTGATCGCCGC
CATGAACCCCTGCCCTTGCGGCTACTTCGGCCACCCGAGCGTCGGCTGCCGCTGCGCGCCGGACGTGCGGCTGCGCTACA
TCAACCGCATCTCCGGCCCGCTGCTGGACCGGATCGACATGCAAATGGAAGTCGGCTCGGTCCATCCCGACATCCTGGCC
GCCGGGGCCGATGGCGAAACGTCCGCCGTGATCGCCGCCCGCGTCCAGGCGGCCGCCAACCGCCAACTCCAGCGCCAGGG
CAAACGCAACCAGTACCTCGCTCCGCGCGAGATCGACCACTACTGCAAGCTCGACCGGCCAGGCAAGGCGCAGCTCAAGC
ACAGCATGGAGAAATTCAAATGGTCGGGCCGCGCCTATCACCGCATCCTGCGCGTGGCCCGCACCATCGCCGACCTGGCC
GGCGCCACCACCATCGCGCTGCCGCACATCAAAGAGGCAATCCAATACCGCCGCGCGCTCGCCGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

55.2

99.602

0.55

  comM Vibrio campbellii strain DS40M4

55.11

99.402

0.548

  comM Glaesserella parasuis strain SC1401

51.485

100

0.518

  comM Haemophilus influenzae Rd KW20

51.394

100

0.514

  comM Legionella pneumophila str. Paris

49.315

100

0.502

  comM Legionella pneumophila strain ERS1305867

49.315

100

0.502

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.108

99.801

0.46