Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   SALB1_RS03885 Genome accession   NZ_CP029488
Coordinates   853208..854713 (-) Length   501 a.a.
NCBI ID   WP_109992657.1    Uniprot ID   -
Organism   Salinisphaera sp. LB1     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 848208..859713
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SALB1_RS03860 (SALB1_0801) - 848574..849362 (-) 789 WP_158590613.1 carbon-nitrogen hydrolase family protein -
  SALB1_RS03865 (SALB1_0802) - 849679..850275 (-) 597 WP_370453228.1 cupin domain-containing protein -
  SALB1_RS03880 (SALB1_0804) - 851110..853122 (-) 2013 WP_109992656.1 UvrD-helicase domain-containing protein -
  SALB1_RS03885 (SALB1_0805) comM 853208..854713 (-) 1506 WP_109992657.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  SALB1_RS03890 (SALB1_0806) - 854763..855005 (-) 243 WP_109995260.1 accessory factor UbiK family protein -
  SALB1_RS03895 (SALB1_0807) glnK 855236..855574 (+) 339 WP_109992658.1 P-II family nitrogen regulator -
  SALB1_RS03905 (SALB1_0809) - 855997..857304 (+) 1308 WP_109992660.1 ammonium transporter -
  SALB1_RS03910 (SALB1_0810) - 857432..858757 (+) 1326 WP_370453242.1 ammonium transporter -
  SALB1_RS03915 (SALB1_0811) - 858790..859128 (+) 339 WP_109992662.1 P-II family nitrogen regulator -

Sequence


Protein


Download         Length: 501 a.a.        Molecular weight: 52580.87 Da        Isoelectric Point: 7.0916

>NTDB_id=293124 SALB1_RS03885 WP_109992657.1 853208..854713(-) (comM) [Salinisphaera sp. LB1]
MALATVHSRAQTGLHAPPVAVEVDLAGGLPALAIVGLPETEVKESKDRVRAAISNSGYQFPTRRITVNLAPADLPKEGGR
FDLAIALGILAASGQIPSDSLNGHEFLGELSLSGALRGVRGALPATLAAARVQRRLVLPADNGSEAALAGDDAARTATHL
ADVATTLAGGETLWARPVAEPGDGEHQAVPDLADVRGQAQAKRALEIAAAGGHSLLMIGPPGAGKTMLASRLVGLLPPLE
HEQALEVAAIASISGGGFDPALWGRRPFRAPHHTASGVALVGGGSTPRPGEITLAHRGVLFLDELPEFDRRVLEVLREPL
ESGHIVISRAARQSEFPAAFQLVAAMNPCPCGYQGDASGRCHCTPERIERYRARISGPLLDRIDMHLNVAPVTKKVLTAE
APEVECSAVVQTRVHGARQRAQVRSGAPNAALTPRQVDAHCAPSAEAARLIDQAIDRLGLSARGYHRILRVARTIADLAG
HEQIQAPDIAEAIGYRQLDRD

Nucleotide


Download         Length: 1506 bp        

>NTDB_id=293124 SALB1_RS03885 WP_109992657.1 853208..854713(-) (comM) [Salinisphaera sp. LB1]
GTGGCGTTGGCGACTGTCCACAGCCGGGCCCAGACTGGTCTGCACGCGCCGCCGGTAGCGGTTGAGGTGGATCTGGCTGG
TGGCCTGCCGGCGCTGGCCATTGTCGGTCTGCCGGAGACCGAGGTGAAGGAGAGCAAGGATCGCGTGCGTGCCGCGATCA
GCAACAGTGGCTACCAGTTTCCGACGCGGCGAATCACGGTCAATCTGGCGCCGGCCGATCTACCCAAAGAGGGCGGGCGT
TTCGATCTGGCGATCGCGCTGGGTATTCTCGCGGCATCCGGCCAGATTCCGTCCGACTCGTTGAACGGGCATGAGTTTCT
GGGTGAACTGTCGCTCTCGGGAGCCCTGCGCGGCGTACGTGGTGCCTTGCCGGCCACGCTGGCGGCGGCCCGGGTGCAGC
GCCGGCTGGTCCTGCCGGCGGACAACGGCAGCGAGGCGGCACTGGCCGGCGATGACGCGGCCCGTACCGCCACGCATCTG
GCCGATGTCGCCACCACTCTGGCCGGCGGCGAGACGCTTTGGGCGCGGCCGGTGGCCGAGCCGGGGGATGGCGAACACCA
GGCCGTGCCCGATCTGGCCGATGTGCGTGGCCAGGCCCAGGCCAAGCGCGCACTGGAAATCGCCGCGGCCGGCGGCCATT
CCCTGTTGATGATCGGCCCGCCGGGCGCGGGCAAGACCATGCTGGCCTCGCGCCTGGTGGGGTTGCTGCCGCCGCTGGAG
CATGAGCAGGCGCTGGAGGTGGCTGCGATCGCCTCGATATCGGGCGGCGGGTTCGATCCCGCGCTATGGGGGCGGCGCCC
GTTCCGCGCGCCGCATCACACCGCTTCGGGCGTGGCCCTGGTCGGCGGCGGCTCCACGCCACGCCCGGGCGAGATCACGC
TCGCCCACCGCGGCGTGTTGTTTCTCGACGAGCTGCCCGAGTTCGATCGGCGGGTGCTGGAAGTCCTGCGCGAGCCGCTG
GAATCCGGCCATATCGTGATCTCCCGCGCGGCGCGGCAATCGGAGTTTCCCGCCGCCTTCCAGCTGGTGGCCGCGATGAA
TCCCTGCCCCTGCGGCTATCAGGGCGACGCATCGGGCCGTTGTCACTGCACGCCCGAGCGGATCGAGCGCTATCGCGCCC
GGATTTCCGGTCCGCTGCTCGATCGCATCGACATGCATCTTAATGTGGCGCCGGTCACCAAAAAGGTGCTCACGGCCGAA
GCGCCGGAGGTCGAGTGCTCTGCCGTGGTGCAGACGCGCGTTCATGGTGCAAGACAGCGCGCGCAGGTACGCAGTGGTGC
ACCCAACGCCGCCCTGACCCCGCGTCAGGTCGATGCGCACTGCGCGCCGAGCGCCGAGGCCGCCCGCTTGATCGACCAGG
CCATCGATCGGCTGGGTTTGTCGGCCCGGGGCTATCATCGGATTCTGCGCGTGGCGCGCACGATCGCCGATCTGGCGGGT
CATGAGCAGATACAGGCCCCCGATATCGCCGAAGCGATCGGCTATCGCCAGCTGGATCGGGATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

55.689

100

0.557

  comM Vibrio cholerae strain A1552

55.4

99.8

0.553

  comM Glaesserella parasuis strain SC1401

51.282

100

0.519

  comM Legionella pneumophila strain ERS1305867

51.205

99.401

0.509

  comM Legionella pneumophila str. Paris

51.205

99.401

0.509

  comM Haemophilus influenzae Rd KW20

50.298

100

0.505

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.098

100

0.459


Multiple sequence alignment