Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: ACLD71_RS03300
Genome accession: NZ_CP177241
Coordinates: 695520..697034 (-)
Length: 504 a.a.
NCBI ID: WP_411992962.1
Uniprot ID: -
Organism: Agarivorans sp. DSG3-1
Function: ssDNA binding (predicted from homology); DNA processing
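The coordinates and protein length above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the helper name `parse_coordinates` is hypothetical and not part of the database.

```python
def parse_coordinates(text):
    """Parse a string such as '695520..697034 (-)' into (start, end, strand)."""
    span, strand = text.replace("(", " ").replace(")", " ").split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand


start, end, strand = parse_coordinates("695520..697034 (-)")

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end - start + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length, strand)
```

Under these assumptions the gene spans 1515 bp, which corresponds to 505 codons and a 504-residue protein, in agreement with the lengths reported in this entry.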

Genomic Context


Location: 690520..702034
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACLD71_RS03290 (ACLD71_03290) recC 691197..694550 (-) 3354 WP_411992960.1 exodeoxyribonuclease V subunit gamma -
  ACLD71_RS03295 (ACLD71_03295) - 694669..695067 (-) 399 WP_303511691.1 hypothetical protein -
  ACLD71_RS03300 (ACLD71_03300) comM 695520..697034 (-) 1515 WP_411992962.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  ACLD71_RS03305 (ACLD71_03305) - 697277..697987 (+) 711 WP_411994822.1 substrate-binding periplasmic protein -
  ACLD71_RS03310 (ACLD71_03310) - 698133..698549 (-) 417 WP_137673395.1 CBS domain-containing protein -
  ACLD71_RS03315 (ACLD71_03315) - 698672..699154 (-) 483 WP_137673394.1 hypothetical protein -
  ACLD71_RS03320 (ACLD71_03320) - 699165..699728 (-) 564 WP_411992964.1 sugar O-acetyltransferase -
  ACLD71_RS03325 (ACLD71_03325) - 699788..700237 (-) 450 WP_411992966.1 DUF192 domain-containing protein -
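
The Location window appears to be the comM coordinates extended by 5000 bp on each side; this is an observation from the numbers on this page, not a documented rule. A minimal sketch, using only the coordinates from the table above, that checks the window and computes the intergenic distances between neighbouring genes (the `FLANK` constant and `intergenic_gaps` helper are hypothetical names):

```python
FLANK = 5000  # assumption: inferred from the numbers on this page

target = (695520, 697034)  # comM coordinates
window = (target[0] - FLANK, target[1] + FLANK)

# (name or locus tag, start, end), copied from the genomic context table
genes = [
    ("recC", 691197, 694550),
    ("ACLD71_RS03295", 694669, 695067),
    ("comM", 695520, 697034),
    ("ACLD71_RS03305", 697277, 697987),
    ("ACLD71_RS03310", 698133, 698549),
    ("ACLD71_RS03315", 698672, 699154),
    ("ACLD71_RS03320", 699165, 699728),
    ("ACLD71_RS03325", 699788, 700237),
]


def intergenic_gaps(genes):
    """Return (left gene, right gene, gap in bp) for each adjacent pair."""
    ordered = sorted(genes, key=lambda g: g[1])
    return [(a[0], b[0], b[1] - a[2] - 1) for a, b in zip(ordered, ordered[1:])]


gaps = intergenic_gaps(genes)
for left, right, gap in gaps:
    print(f"{left} -> {right}: {gap} bp")
```

With these coordinates, comM is separated from the upstream hypothetical protein ACLD71_RS03295 by 452 bp and from ACLD71_RS03305 on the opposite strand by 242 bp.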

Sequence


Protein


Length: 504 a.a.        Molecular weight: 53968.99 Da        Isoelectric point: 8.5061

>NTDB_id=1086363 ACLD71_RS03300 WP_411992962.1 695520..697034(-) (comM) [Agarivorans sp. DSG3-1]
MGLAIVKTCTLVGMEALNVTVEVHLANGLPAFSIVGLPETSVKEAKDRVRSAILNSGFSFPAKRITVNLAPADVPKSGGR
FDLPIAIGILAAAGDIPLACLNDMAFCGELALSGAIRPVSGAIATALSISQQHLTLVTSEQDADAAARVPEVKVHGSASL
QQLSAGLNGQSAFNLLAASPIEPALSKLELDMSDVQGQHLAKRALELAAAGAHHLLMLGPPGTGKTMLASRLPGILPSLS
EQQAIEVAAINSVSAQQRELEQWYIPPFRSPHHSASMVALVGGGSNPKPGEITLAHHGVLFLDELPEFARSTLDALRQPL
ESGEVHISRAALQVRFPSRFQLIAAMNPSPCGYYQGQQLRSNPDQILKYLSKLSGPFLDRFDLSVEVAELPKGSLSQHSS
GESSEAIKQRVIAARKLQMQRSGKLNSQLSGKELHKHAFLSAENSDFLESSIRQLGLSARAFHRVWRLARTIADLKQQSD
VKRSDLIEALSYRAMDRLLKQLSH
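
The FASTA header above packs several fields into one line. The sketch below shows one way to split it into named fields with a regular expression. The pattern is inferred from this single header and is an assumption, not a documented format; the field names and the `parse_header` helper are hypothetical.

```python
import re

# Assumption: the layout is inferred from the one header shown in this entry.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line):
    """Split a header line into a dict of fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (
    ">NTDB_id=1086363 ACLD71_RS03300 WP_411992962.1 "
    "695520..697034(-) (comM) [Agarivorans sp. DSG3-1]"
)
record = parse_header(header)
print(record)
```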

Nucleotide


Length: 1515 bp

>NTDB_id=1086363 ACLD71_RS03300 WP_411992962.1 695520..697034(-) (comM) [Agarivorans sp. DSG3-1]
ATGGGTTTAGCGATTGTTAAAACATGTACTTTGGTAGGCATGGAAGCCTTAAACGTTACCGTAGAAGTTCATTTGGCCAA
TGGCCTCCCGGCTTTTTCAATAGTTGGCTTGCCAGAGACCTCGGTTAAAGAAGCGAAAGACAGAGTGCGTAGTGCGATAC
TAAACAGTGGCTTTTCTTTTCCGGCTAAACGCATCACCGTAAACCTTGCCCCAGCAGACGTTCCTAAAAGCGGAGGTCGA
TTTGATTTGCCTATTGCGATAGGCATCCTCGCTGCTGCTGGCGATATTCCGCTAGCTTGTTTAAACGATATGGCTTTTTG
TGGAGAGCTGGCGCTTTCCGGTGCTATTCGCCCAGTTAGCGGTGCCATTGCCACGGCACTGTCTATAAGCCAGCAGCATC
TCACTTTAGTCACTAGTGAACAAGACGCTGACGCCGCAGCACGGGTGCCTGAGGTGAAAGTACATGGTAGCGCCAGCCTG
CAGCAGCTTAGCGCTGGTTTAAATGGTCAAAGCGCTTTTAATCTGTTGGCTGCAAGCCCCATTGAGCCGGCATTAAGCAA
GCTAGAGTTAGATATGAGTGACGTGCAGGGCCAGCACTTAGCCAAGCGTGCTTTAGAGCTGGCTGCGGCAGGGGCACATC
ACTTATTGATGTTAGGCCCGCCGGGAACGGGCAAGACTATGTTAGCCAGCCGCTTACCGGGTATTTTACCTAGCCTAAGT
GAGCAGCAGGCGATAGAAGTCGCGGCGATTAACTCGGTCAGTGCCCAGCAAAGAGAGCTCGAACAATGGTACATACCGCC
ATTTAGAAGCCCACATCATAGTGCGTCTATGGTGGCTTTAGTAGGTGGAGGATCAAACCCAAAACCGGGAGAAATTACTC
TGGCTCATCATGGAGTGTTATTTCTAGATGAGCTACCAGAGTTTGCCCGCAGTACCCTAGATGCCCTGCGTCAGCCTCTA
GAATCAGGCGAGGTACATATATCACGCGCTGCTTTACAAGTGCGTTTCCCTTCGCGCTTTCAGCTCATAGCAGCCATGAA
TCCGTCGCCCTGTGGATATTACCAAGGCCAACAATTGCGCAGTAATCCCGATCAAATTCTTAAGTATTTGAGTAAGCTTT
CGGGTCCATTTCTAGATCGCTTTGATTTAAGCGTGGAGGTGGCTGAGTTACCGAAAGGCAGTTTAAGCCAACACTCTAGT
GGTGAGTCTAGCGAGGCAATCAAACAACGGGTTATTGCAGCCCGTAAATTACAAATGCAGCGCAGCGGTAAACTCAATAG
CCAACTCTCGGGTAAAGAGTTACATAAACATGCTTTTCTTAGTGCTGAAAATAGCGACTTTTTAGAAAGCAGCATTCGCC
AGCTTGGGCTTTCTGCACGAGCATTTCACCGAGTTTGGCGCTTGGCCAGAACCATTGCCGATCTGAAACAACAATCAGAT
GTTAAACGCAGTGATCTTATAGAAGCACTAAGTTATCGTGCTATGGATCGATTGTTAAAACAATTAAGCCACTAG
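
Although the gene lies on the minus strand, the nucleotide record above starts with ATG and ends with TAG, so it reads as the coding strand and can be translated directly. The following is a minimal sketch of that translation, assuming the standard codon assignments; the `translate` helper is a hypothetical name, and only the first and last codons of the record are used here for brevity.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGGGTTTAGCGATTGTTAAAACATGTACTTTGGTAGGCATGGAAGCC"
last_codons = "AAACAATTAAGCCACTAG"

print(translate(first_codons))  # start of the protein record
print(translate(last_codons))   # end of the protein record
```

The two fragments translate to MGLAIVKTCTLVGMEA and KQLSH, which match the beginning and end of the protein sequence given in this entry.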


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552 56.436 100 0.565
  comM Vibrio campbellii strain DS40M4 54.365 100 0.544
  comM Haemophilus influenzae Rd KW20 52.953 100 0.534
  comM Glaesserella parasuis strain SC1401 52.539 100 0.534
  comM Legionella pneumophila str. Paris 48.491 98.611 0.478
  comM Legionella pneumophila strain ERS1305867 48.491 98.611 0.478
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 40.99 100 0.411


Multiple sequence alignment