Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   DGEO_RS09670 Genome accession   NC_008025
Coordinates   1477606..1479105 (-) Length   499 a.a.
NCBI ID   WP_011530525.1    Uniprot ID   Q1IYJ5
Organism   Deinococcus geothermalis DSM 11300     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 1472606..1484105
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DGEO_RS16795 - 1473143..1473292 (+) 150 WP_165953336.1 hypothetical protein -
  DGEO_RS09650 (Dgeo_1390) - 1473302..1475047 (-) 1746 WP_011530521.1 SLC13 family permease -
  DGEO_RS09655 (Dgeo_1391) - 1475204..1476280 (-) 1077 WP_011530522.1 [LysW]-lysine hydrolase -
  DGEO_RS09660 (Dgeo_1392) - 1476264..1477172 (-) 909 WP_011530523.1 hypothetical protein -
  DGEO_RS09665 (Dgeo_1393) - 1477309..1477599 (+) 291 WP_011530524.1 hypothetical protein -
  DGEO_RS09670 (Dgeo_1394) comM 1477606..1479105 (-) 1500 WP_011530525.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  DGEO_RS09675 (Dgeo_1395) - 1479105..1479962 (-) 858 WP_232220836.1 DegV family protein -
  DGEO_RS09680 (Dgeo_1396) tatA 1480075..1480389 (-) 315 WP_011530527.1 twin-arginine translocase TatA/TatE family subunit -
  DGEO_RS09685 (Dgeo_1397) - 1480486..1482417 (-) 1932 WP_011530528.1 serine/threonine-protein kinase -
  DGEO_RS09690 (Dgeo_1398) - 1482491..1483918 (+) 1428 WP_041221699.1 NAD(P)/FAD-dependent oxidoreductase -

Sequence


Protein


Download         Length: 499 a.a.        Molecular weight: 52396.47 Da        Isoelectric Point: 8.5092

>NTDB_id=25886 DGEO_RS09670 WP_011530525.1 1477606..1479105(-) (comM) [Deinococcus geothermalis DSM 11300]
MLARARSVALIGVDAVPVEVEVDVSPGLPAFTVVGLPDQAVSEARERVRAAVRNAGLPFPAARITVNLAPADLRKEGPLY
DLPIALGLLAAQELLPAEALAGVLSAGELALDGSLRPIAGAVNLALRAAAEGCPVLIPEGNAGEAALIDGVTVHGAPSLL
SAVRHLTGQAPLGRALPPEAPPLDPTLPDLADLKGQTAAKRALEIAVAGGHNLLLVGSPGSGKTMLARRAPGLLPPLTRE
EALEVTRIHSAAGLLTARSGLMGHPPYRAPHHTVSDAGLIGGGSVPRPGEVSLAHRGILFLDEFPEFSRKALETLRQPLE
DGQVSISRARATVCYPARFQLIAAMNPCPCGHQGDPEKACTCTPSERSRYAARLSGPLLDRLDLIVRVPRLTVDELTRAP
EAEKSEAVRQRVIRARAAMLARQGVRNSDLAGQMLRQHAPLAPGPEAFLRAAARQLNLTGRGFDRVLRVARTVADLAGSP
DIREAHLAEAVTYRPRELT

Nucleotide


Download         Length: 1500 bp        

>NTDB_id=25886 DGEO_RS09670 WP_011530525.1 1477606..1479105(-) (comM) [Deinococcus geothermalis DSM 11300]
ATGTTGGCCCGTGCCCGCAGCGTCGCCCTGATCGGCGTGGACGCGGTGCCGGTCGAGGTGGAGGTGGATGTCTCTCCCGG
CCTGCCGGCCTTTACGGTGGTGGGGCTGCCTGATCAGGCCGTCAGCGAGGCCCGCGAGCGGGTGCGGGCGGCGGTGCGGA
ACGCGGGGTTGCCCTTTCCCGCCGCACGCATCACCGTGAACCTCGCGCCCGCCGACCTGCGCAAGGAGGGGCCGCTCTAC
GACCTGCCCATCGCGCTGGGCCTGCTCGCGGCGCAGGAGCTGCTGCCCGCTGAAGCCTTGGCCGGGGTGCTGAGCGCTGG
AGAACTCGCCCTCGACGGCTCGCTGCGGCCCATCGCGGGGGCAGTGAATCTTGCGCTGCGCGCCGCCGCGGAGGGGTGTC
CCGTCCTGATTCCCGAGGGCAATGCCGGCGAGGCCGCCCTGATTGACGGCGTAACCGTTCACGGTGCCCCGTCCCTGCTG
AGCGCGGTGCGCCACCTGACCGGGCAGGCACCCCTGGGCCGGGCGCTGCCGCCCGAGGCGCCGCCGCTTGATCCGACTCT
CCCCGACCTGGCGGACCTCAAGGGGCAGACTGCGGCCAAGCGGGCGCTGGAAATCGCCGTTGCGGGTGGGCACAACCTGC
TGCTGGTCGGCTCGCCCGGCAGCGGCAAGACCATGCTGGCTCGCCGCGCGCCTGGCCTGCTGCCCCCCCTCACCCGCGAG
GAGGCCCTGGAGGTGACGCGCATTCACTCCGCCGCGGGGCTACTCACGGCGCGCAGCGGTCTGATGGGACACCCTCCCTA
CCGTGCGCCGCACCACACTGTCAGCGACGCAGGGCTGATTGGCGGGGGCAGTGTGCCGCGGCCCGGCGAGGTCAGCCTCG
CGCACCGGGGCATCCTCTTCCTCGACGAGTTTCCCGAGTTCTCCCGCAAGGCGCTCGAAACCCTGCGTCAGCCGCTTGAA
GACGGGCAGGTCTCGATCAGCCGCGCCCGCGCTACCGTCTGCTATCCGGCCCGCTTTCAGCTGATCGCCGCGATGAATCC
CTGCCCGTGCGGGCACCAGGGCGACCCCGAAAAAGCCTGCACCTGCACGCCGAGCGAGCGGAGCCGCTACGCTGCCCGCC
TGAGCGGCCCGCTGCTCGACCGGCTTGATCTGATTGTGCGCGTGCCGCGCTTGACGGTGGATGAGCTGACCCGCGCCCCA
GAAGCAGAGAAGAGCGAGGCCGTTCGGCAACGGGTCATCCGTGCCCGGGCGGCCATGCTGGCTCGTCAGGGCGTGCGCAA
CAGTGACCTGGCGGGGCAGATGCTGCGGCAGCATGCACCGCTGGCGCCCGGCCCCGAGGCCTTTCTCCGCGCGGCGGCCC
GACAGCTCAACCTCACCGGGCGCGGTTTTGACCGGGTGCTGCGGGTGGCCCGAACCGTTGCGGACCTGGCGGGCAGCCCC
GATATTCGCGAGGCGCATCTGGCCGAGGCCGTCACCTACCGGCCCCGAGAGCTGACCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q1IYJ5

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

48.096

100

0.481

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.614

100

0.469

  comM Vibrio campbellii strain DS40M4

46.693

100

0.467

  comM Haemophilus influenzae Rd KW20

45.726

100

0.461

  comM Glaesserella parasuis strain SC1401

45.309

100

0.455

  comM Legionella pneumophila strain ERS1305867

45.272

99.599

0.451

  comM Legionella pneumophila str. Paris

45.272

99.599

0.451


Multiple sequence alignment