Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   M0C34_RS01800 Genome accession   NZ_CP096670
Coordinates   383644..385158 (+) Length   504 a.a.
NCBI ID   WP_248713962.1    Uniprot ID   -
Organism   Agarivorans sp. TSD2052     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 378644..390158
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  M0C34_RS01775 (M0C34_01775) ilvM 379498..379770 (-) 273 WP_248713957.1 acetolactate synthase 2 small subunit -
  M0C34_RS01780 (M0C34_01780) ilvG 379770..381428 (-) 1659 WP_248713958.1 acetolactate synthase 2 catalytic subunit -
  M0C34_RS01785 (M0C34_01785) - 381926..382363 (+) 438 WP_248713959.1 DUF192 domain-containing protein -
  M0C34_RS01790 (M0C34_01790) - 382434..383024 (+) 591 WP_248713960.1 sugar O-acetyltransferase -
  M0C34_RS01795 (M0C34_01795) - 383150..383563 (+) 414 WP_248713961.1 CBS domain-containing protein -
  M0C34_RS01800 (M0C34_01800) comM 383644..385158 (+) 1515 WP_248713962.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  M0C34_RS01805 (M0C34_01805) - 385600..386001 (+) 402 WP_248713963.1 hypothetical protein -
  M0C34_RS01810 (M0C34_01810) recC 386119..389520 (+) 3402 WP_248713964.1 exodeoxyribonuclease V subunit gamma -

Sequence


Protein


Download         Length: 504 a.a.        Molecular weight: 53884.88 Da        Isoelectric Point: 8.2087

>NTDB_id=682209 M0C34_RS01800 WP_248713962.1 383644..385158(+) (comM) [Agarivorans sp. TSD2052]
MGLAIVRTCTLLGMEALNVTVEVHLANGLPAFNIVGLPETSVKEAKDRVRSAILNSGFSFPAKRITVNLAPADVPKSGGR
FDLPIAIGILAAAGDIPLACLDDLAFCGELALSGAIRPVNGAIATALSVSRQQLTLVTAEQDGLAAVRVPDAKVHASPSL
QQLSAGLNGQMAFNLLAASSVEDNLEQSWLDMGDVHGQHLAKRALELAAAGSHNLLMLGPPGTGKTMLASRLPGILPPLS
EQQAIEVAAIASVSTQQRDLPHWYVPPFRSPHHSASMVALVGGGSNPRPGEITLAHHGVLFLDELPEFSRATLDALRQPL
ESGEVHISRAALQVRFPAQFQLIAAMNPSPCGYYQGQQLRSNPDQILKYLGKLSGPFLDRFDLSVEVSGLPKGALSQHTT
GESSQQIKQRVMIARQIQLSRAAKLNSQLSGKELQLHAALSQGDSAFLESSITQLGLSARAFHRVWRLARTIADLKQQTT
IQRSDIVEALSYRAMDRLLHRLSN

Nucleotide


Download         Length: 1515 bp        

>NTDB_id=682209 M0C34_RS01800 WP_248713962.1 383644..385158(+) (comM) [Agarivorans sp. TSD2052]
ATGGGTTTAGCGATAGTTAGAACGTGTACTTTATTGGGCATGGAAGCCTTAAACGTAACGGTGGAGGTGCATTTAGCCAA
TGGTTTGCCAGCATTTAACATTGTAGGCTTACCTGAAACCTCGGTGAAAGAAGCTAAAGATAGAGTGCGTAGCGCCATTC
TCAACAGTGGTTTTTCCTTTCCTGCAAAACGTATCACGGTAAACCTTGCCCCGGCAGATGTACCCAAAAGTGGCGGTCGT
TTCGATTTACCCATTGCCATTGGCATTTTGGCGGCGGCTGGTGATATTCCCTTAGCATGTTTAGATGACCTCGCTTTTTG
TGGCGAGTTAGCCTTATCGGGGGCAATTCGACCGGTGAACGGAGCGATCGCCACTGCGTTGTCGGTAAGTCGACAGCAAT
TAACCTTAGTGACAGCTGAGCAAGATGGTCTTGCTGCGGTGCGAGTTCCTGATGCAAAGGTGCATGCCAGCCCAAGTTTG
CAGCAGCTTAGTGCTGGTTTAAATGGGCAAATGGCCTTTAATCTATTAGCCGCTAGCTCGGTTGAAGACAATCTAGAACA
AAGCTGGTTAGATATGGGAGATGTGCATGGCCAGCATTTAGCCAAACGGGCCCTAGAATTAGCCGCGGCAGGTTCGCATA
ATCTACTCATGCTGGGGCCTCCAGGTACCGGAAAAACCATGTTGGCTAGCCGTTTGCCTGGGATTTTGCCGCCTCTGAGT
GAGCAGCAAGCGATCGAAGTGGCGGCGATCGCTTCGGTCAGCACTCAGCAGCGAGATTTACCGCACTGGTATGTTCCGCC
ATTTAGAAGCCCCCATCATAGCGCTTCTATGGTGGCCTTAGTGGGCGGTGGCTCTAACCCTCGCCCAGGTGAAATCACTT
TAGCGCATCATGGGGTATTATTTTTGGACGAGCTGCCAGAGTTTTCTCGTGCTACCCTTGATGCACTGCGCCAGCCTTTA
GAGTCGGGTGAGGTGCATATTTCACGTGCCGCACTGCAAGTGCGTTTTCCTGCTCAGTTTCAATTGATTGCAGCAATGAA
CCCCTCTCCTTGTGGCTATTACCAAGGCCAACAATTACGCAGTAATCCCGACCAAATTCTTAAATATTTAGGCAAATTAT
CGGGTCCATTTTTAGACCGTTTTGACTTGAGTGTGGAAGTAAGTGGTTTACCCAAAGGGGCACTAAGTCAGCATACCACT
GGAGAGTCTAGTCAACAGATTAAGCAGCGGGTGATGATTGCGCGGCAGATCCAGTTAAGCCGTGCGGCAAAGCTAAATAG
CCAATTGTCGGGTAAAGAGTTACAGCTGCATGCCGCCCTTAGCCAAGGGGATAGCGCCTTTTTGGAATCGAGCATTACCC
AATTGGGTTTGTCGGCCCGGGCTTTTCATCGAGTATGGCGCTTAGCAAGAACCATTGCCGACCTTAAACAACAAACAACG
ATTCAACGCAGTGATATTGTTGAAGCCTTAAGTTACCGCGCCATGGACCGCTTACTGCATCGTTTATCAAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

54.96

100

0.55

  comM Vibrio campbellii strain DS40M4

54.365

100

0.544

  comM Haemophilus influenzae Rd KW20

53.529

100

0.542

  comM Glaesserella parasuis strain SC1401

53.557

100

0.538

  comM Legionella pneumophila str. Paris

46.693

99.008

0.462

  comM Legionella pneumophila strain ERS1305867

46.693

99.008

0.462

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

41.584

100

0.417