Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   DEIMA_RS06645 Genome accession   NC_014958
Coordinates   1417711..1419213 (-) Length   500 a.a.
NCBI ID   WP_013556463.1    Uniprot ID   E8U7B9
Organism   Deinococcus maricopensis DSM 21211     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1412711..1424213
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DEIMA_RS06625 (Deima_1303) - 1412906..1413337 (+) 432 WP_013556459.1 S-adenosylmethionine decarboxylase -
  DEIMA_RS06630 (Deima_1304) - 1413409..1415400 (-) 1992 WP_013556460.1 protein kinase -
  DEIMA_RS06635 (Deima_1305) - 1415562..1417286 (+) 1725 WP_043816525.1 hypothetical protein -
  DEIMA_RS06640 (Deima_1306) - 1417387..1417656 (+) 270 WP_013556462.1 hypothetical protein -
  DEIMA_RS06645 (Deima_1307) comM 1417711..1419213 (-) 1503 WP_013556463.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  DEIMA_RS06650 (Deima_1308) - 1419427..1419954 (+) 528 WP_013556464.1 ankyrin repeat domain-containing protein -
  DEIMA_RS06655 (Deima_1309) - 1420005..1420616 (-) 612 WP_013556465.1 VTT domain-containing protein -
  DEIMA_RS06660 (Deima_1310) tatA 1420653..1420919 (-) 267 WP_013556466.1 twin-arginine translocase TatA/TatE family subunit -
  DEIMA_RS06665 (Deima_1311) - 1421048..1422988 (-) 1941 WP_013556467.1 serine/threonine-protein kinase -
  DEIMA_RS06670 (Deima_1312) - 1423127..1423348 (-) 222 WP_013556468.1 hypothetical protein -

Sequence


Protein


Download         Length: 500 a.a.        Molecular weight: 52317.31 Da        Isoelectric Point: 9.1480

>NTDB_id=39600 DEIMA_RS06645 WP_013556463.1 1417711..1419213(-) (comM) [Deinococcus maricopensis DSM 21211]
MLARTTSVALIGVNAVPVTVEVDVSPGLPAFAIVGLPDQALSEARERVRAAVRNSGLPFPTARITVNLAPADLRKEGPLF
DLPIALGVLAAQDLLPTSALGGVLIAGELALDGSLRPVAGAVNLALRAAEVGADVLLPLANAEEAALIDGARVYGASTLR
DAVAHLSGEAPLPLTTPPTPTPPEDTLLDLLDIKGQTAGKRALEIAVAGGHNLLMVGSPGSGKTMLARRAPGLLPPLTRA
EALDVTRIHSAAGLLSGREGLLTTPPYRAPHHTVSDAGLIGGGSVPKPGEVSLAHRGVLFLDEFPEFSRKALETLRQPLE
EGTVTISRARATVQYPARFQLLSAMNPCPCGYFGDAERPCTCTPGERARYAGRLSGPLLDRIDLVVRVPRLTVDELTRAA
PGEPSAKVRGRVLRARERMLARQGERNALLQGQALQAHARLGPGPEAFVRAAARTLGLTGRSFDRVLRVARTVADLAGHP
DITEAHLAAAVSYRPRDLAT

Nucleotide


Download         Length: 1503 bp        

>NTDB_id=39600 DEIMA_RS06645 WP_013556463.1 1417711..1419213(-) (comM) [Deinococcus maricopensis DSM 21211]
ATGCTCGCGCGGACCACCAGCGTCGCCCTGATCGGCGTGAACGCCGTGCCCGTGACCGTCGAGGTGGACGTCTCCCCGGG
CCTGCCCGCCTTCGCCATTGTCGGCCTGCCGGATCAGGCGCTCAGTGAGGCACGCGAACGCGTCCGCGCCGCCGTGCGCA
ACAGCGGCCTGCCGTTCCCCACGGCCCGCATCACCGTGAACCTCGCGCCTGCGGACCTGCGCAAGGAAGGCCCACTGTTC
GACCTTCCAATCGCCCTGGGCGTCCTGGCCGCGCAGGACCTGCTGCCCACCAGCGCGCTCGGCGGCGTGCTGATCGCCGG
GGAACTCGCGCTCGACGGGTCCCTGCGGCCCGTGGCGGGCGCCGTAAACCTCGCCCTGCGCGCCGCCGAGGTCGGTGCGG
ACGTGCTGCTGCCCCTCGCGAACGCCGAGGAAGCCGCCCTGATCGACGGCGCGCGTGTGTACGGCGCGTCCACCCTGCGG
GACGCCGTGGCGCACCTCAGCGGCGAGGCGCCCCTGCCGCTCACGACGCCGCCCACGCCCACCCCGCCCGAGGACACGCT
GCTGGACCTGCTCGACATCAAGGGGCAGACGGCCGGCAAACGCGCGCTGGAAATCGCCGTGGCCGGCGGACATAACCTCC
TGATGGTCGGCTCGCCAGGCAGCGGGAAGACCATGTTGGCCCGCCGCGCCCCCGGGCTGCTGCCGCCCCTCACGCGCGCC
GAGGCGCTGGACGTCACACGCATCCACAGCGCCGCCGGGCTGCTGTCCGGCCGCGAGGGCCTGCTCACCACCCCCCCGTA
CCGCGCGCCGCACCACACCGTCAGTGACGCCGGCCTGATCGGCGGAGGCAGCGTCCCCAAGCCCGGCGAGGTGAGCCTCG
CGCACCGCGGCGTGCTGTTCCTCGACGAGTTCCCGGAGTTCAGCCGCAAAGCGCTGGAAACCCTGCGTCAGCCGCTGGAG
GAGGGGACGGTCACGATCAGCCGGGCGCGCGCGACCGTGCAGTACCCGGCGCGTTTCCAGCTGCTGAGCGCCATGAACCC
CTGCCCGTGCGGGTACTTCGGGGATGCCGAGCGGCCGTGCACCTGCACGCCGGGCGAACGCGCTCGGTACGCAGGTCGCT
TAAGCGGGCCGCTGCTGGACCGCATTGATCTGGTGGTGCGCGTGCCACGCCTGACTGTGGATGAGTTGACGCGCGCCGCG
CCGGGCGAGCCGAGCGCGAAGGTGCGTGGCCGGGTGCTGCGGGCGCGGGAGCGGATGCTGGCACGGCAGGGCGAACGCAA
CGCGCTGCTGCAGGGACAGGCGTTGCAGGCGCACGCGCGGCTGGGGCCGGGGCCGGAGGCGTTCGTGCGGGCAGCCGCGC
GCACGTTGGGGCTCACGGGACGCAGTTTCGACCGGGTGCTGCGCGTGGCGCGGACGGTTGCGGACCTGGCCGGGCACCCG
GACATCACCGAGGCGCACCTGGCCGCGGCCGTAAGTTACCGACCACGTGATCTGGCCACCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB E8U7B9

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

48

100

0.48

  comM Vibrio cholerae strain A1552

47.686

99.4

0.474

  comM Haemophilus influenzae Rd KW20

46.906

100

0.47

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.224

100

0.464

  comM Glaesserella parasuis strain SC1401

45.238

100

0.456

  comM Legionella pneumophila str. Paris

45.272

99.4

0.45

  comM Legionella pneumophila strain ERS1305867

45.272

99.4

0.45


Multiple sequence alignment