Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   SBAL117_RS21300 Genome accession   NC_017579
Coordinates   4844527..4846053 (+) Length   508 a.a.
NCBI ID   WP_011848081.1    Uniprot ID   A3D9U5
Organism   Shewanella baltica OS117     
Function   ssDNA binding (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IScluster/Tn 4841398..4843584 4844527..4846053 flank 943


Gene organization within MGE regions


Location: 4841398..4846053
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SBAL117_RS21285 (Sbal117_4196) - 4841398..4842575 (+) 1178 WP_086010626.1 IS3-like element ISSba5 family transposase -
  SBAL117_RS21290 (Sbal117_4197) - 4842643..4843584 (-) 942 WP_011845808.1 IS30-like element ISSba16 family transposase -
  SBAL117_RS21295 (Sbal117_4198) - 4843583..4844041 (+) 459 WP_227257774.1 hypothetical protein -
  SBAL117_RS21300 (Sbal117_4199) comM 4844527..4846053 (+) 1527 WP_011848081.1 YifB family Mg chelatase-like AAA ATPase Machinery gene

Sequence


Protein


Download         Length: 508 a.a.        Molecular weight: 55467.79 Da        Isoelectric Point: 8.3615

>NTDB_id=48963 SBAL117_RS21300 WP_011848081.1 4844527..4846053(+) (comM) [Shewanella baltica OS117]
MAIACVNTRASCGVEAPQVTVEVHLSNGLPAFNLVGLPEASVKEARERVRSALINAGFEFPMRRITVNLAPADLPKQGGR
YDLPIAVGILAASEQIPANSLKNLEFVGELALSGHIRYCQGLLPAIIAAKRQDHTLILPLDNRHDAELVGYPNVLFGSHL
QSLAAYLQGQNQLPTLAPQLEWISQETPESHTCLSDVIGQYQAKQALEIAAAGNHNLLMLGPPGTGKTMLASRMMALLPA
LNYEEALEVAAIHSVAGLDIKPQHFLQRPFRSPHHTSSSISLVGGGSIPKPGEISLAHRGVLFLDEVAEFPRKVLDCLRE
PMETGEVVISRAAAKLTFLSRFQLIAAMNPSPSGDIDSQHRSSPEQIQRYLSRLSGPFLDRFDLTIEVPKLPAGTLTQAA
PQTETSQDIAKRVKRARELQLARSGVLNSELTGKQLKRFSGISDADLVFLEQSVVKLGLSVRSFHRIQRVARTIADLEQV
PNTERRHIAQALGYRAMDRLLARLSQQY

Nucleotide


Download         Length: 1527 bp        

>NTDB_id=48963 SBAL117_RS21300 WP_011848081.1 4844527..4846053(+) (comM) [Shewanella baltica OS117]
ATGGCGATTGCCTGCGTCAATACCCGAGCCAGTTGTGGGGTCGAAGCTCCCCAAGTCACAGTTGAAGTGCATCTCAGTAA
CGGCTTACCGGCATTTAATCTAGTGGGATTACCCGAAGCGTCGGTGAAAGAAGCCCGCGAGCGGGTGCGCAGCGCACTGA
TCAATGCAGGATTCGAGTTTCCGATGCGGCGGATAACCGTTAACCTCGCGCCAGCAGATTTACCTAAACAAGGTGGCCGC
TACGATTTACCCATAGCCGTGGGCATATTGGCCGCCTCGGAACAAATTCCAGCTAATAGTTTAAAAAATCTTGAGTTTGT
CGGTGAGCTGGCACTGTCTGGGCACATACGTTATTGCCAAGGATTATTACCCGCGATTATCGCCGCTAAGCGCCAAGACC
ATACCTTGATATTGCCGCTGGATAATCGCCACGATGCCGAGCTAGTCGGCTATCCTAATGTGTTGTTTGGCTCGCACCTT
CAGAGCTTAGCCGCTTATTTACAGGGGCAAAACCAACTACCGACGTTAGCGCCGCAATTAGAATGGATAAGCCAAGAGAC
GCCTGAGTCCCATACCTGCCTGAGCGATGTGATTGGCCAATATCAGGCCAAGCAAGCACTCGAAATTGCCGCGGCGGGGA
ACCATAACCTCCTGATGCTTGGCCCGCCTGGTACGGGCAAAACCATGCTTGCCAGCCGCATGATGGCGCTATTGCCTGCA
CTCAATTATGAGGAAGCGTTAGAAGTGGCGGCGATTCATTCCGTTGCCGGACTCGACATTAAGCCGCAGCATTTTTTGCA
GCGACCCTTTCGCTCGCCACACCACACAAGTTCATCGATATCCCTCGTGGGTGGCGGCAGTATCCCCAAACCCGGTGAAA
TTTCCCTCGCCCACCGTGGCGTGCTGTTTTTAGATGAAGTTGCCGAATTTCCTCGGAAAGTCTTAGATTGCCTGCGCGAG
CCGATGGAAACCGGCGAAGTGGTGATTTCCCGCGCCGCCGCAAAGCTGACATTTTTGAGTCGCTTCCAATTGATCGCGGC
GATGAATCCTAGCCCGAGCGGTGATATTGATAGCCAACACAGATCGAGCCCCGAACAGATTCAACGTTATCTCTCAAGGC
TTTCTGGCCCGTTTCTCGACCGTTTCGATTTAACCATTGAAGTCCCCAAGCTCCCTGCGGGCACCTTGACTCAAGCCGCG
CCGCAAACAGAAACCAGCCAAGATATCGCCAAGCGGGTCAAACGGGCGCGTGAGCTGCAACTCGCCCGTTCAGGCGTGCT
CAATAGCGAACTCACAGGGAAACAATTGAAACGCTTTAGCGGGATCAGTGATGCTGATTTAGTGTTTTTGGAACAAAGCG
TGGTAAAGCTTGGCCTCTCGGTTCGCAGCTTCCACCGTATACAAAGAGTCGCACGTACTATTGCCGATCTGGAACAAGTG
CCTAACACGGAACGCCGCCATATTGCCCAAGCACTGGGTTATCGCGCTATGGATAGATTACTCGCCCGCTTATCCCAGCA
ATATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A3D9U5

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

56.667

100

0.569

  comM Glaesserella parasuis strain SC1401

56.213

99.803

0.561

  comM Haemophilus influenzae Rd KW20

56.102

100

0.561

  comM Vibrio campbellii strain DS40M4

55.403

100

0.555

  comM Legionella pneumophila str. Paris

45.866

100

0.459

  comM Legionella pneumophila strain ERS1305867

45.866

100

0.459

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.952

97.638

0.429


Multiple sequence alignment