Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   TTH_RS01050 Genome accession   NC_006461
Coordinates   197532..199016 (-) Length   494 a.a.
NCBI ID   WP_011174141.1    Uniprot ID   Q72GR3
Organism   Thermus thermophilus HB8     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 192532..204016
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  TTH_RS01030 (TTHA0197) - 192816..193295 (-) 480 WP_011227770.1 DUF4384 domain-containing protein -
  TTH_RS01035 (TTHA0198) - 193335..193946 (-) 612 WP_011227771.1 ribonuclease HII -
  TTH_RS01040 (TTHA0199) - 193971..195518 (+) 1548 WP_011227772.1 polyhydroxybutyrate depolymerase -
  TTH_RS01045 (TTHA0200) - 195523..197535 (+) 2013 WP_011227773.1 molybdopterin oxidoreductase family protein -
  TTH_RS01050 (TTHA0201) comM 197532..199016 (-) 1485 WP_011174141.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  TTH_RS01055 (TTHA0202) - 199136..200197 (+) 1062 WP_224065221.1 Fic family protein -
  TTH_RS01060 (TTHA0203) - 200179..201318 (-) 1140 WP_011227775.1 nitrate/nitrite transporter -
  TTH_RS01065 (TTHA0204) - 201320..202132 (-) 813 WP_011227776.1 EamA family transporter -
  TTH_RS01070 (TTHA0205) - 202129..203016 (-) 888 WP_164926022.1 DMT family transporter -

Sequence


Protein


Download         Length: 494 a.a.        Molecular weight: 52867.22 Da        Isoelectric Point: 8.4718

>NTDB_id=23998 TTH_RS01050 WP_011174141.1 197532..199016(-) (comM) [Thermus thermophilus HB8]
MLAQVRSYALFGLDAVPVTVEVDVSPGLPSYALVGLPDKAVEESRERVRAALKNAGFPYPQARVVVNLAPAELRKEGSQF
DLPIALGLLAAQGVVPLEALSPFALAGELGLDGSLRPVPGAVNLALGALAEGKKLLLPLESAKEAALVEGVEVYGARSLQ
EAVAFLKGEEALAEARPEEAPEAVEALDLRDVKGQAKAKRALEIAAAGFHHLLMVGSPGSGKTMLARRLPFLLPPLSREE
ALEVTRIHSAAGKPVRGLVKAPPFRAPHHTVSYAGLIGGGAIPKPGEVSLAHRGVLFLDEFPEFSREALEALRQPLEDGV
VTVARARASLTFPARFLLVAAMNPCPCGWHGDPERPCTCTPAAQRRYAARISGPLLDRFDLVVEVPRLTPEELARAPEGE
GTEAVRERVLRARERMLARQGRPNGLLSGRALREHARLTPPAQALLQEAAKRMLLSARSYDRVLRVARTVADLLGEERVG
EAHVAEALAYRRAL

Nucleotide


Download         Length: 1485 bp        

>NTDB_id=23998 TTH_RS01050 WP_011174141.1 197532..199016(-) (comM) [Thermus thermophilus HB8]
ATGCTGGCCCAGGTGCGAAGCTACGCCCTCTTCGGCCTGGACGCGGTTCCCGTCACCGTGGAGGTGGACGTTAGCCCGGG
GCTTCCCAGCTACGCCCTGGTGGGCTTGCCCGACAAGGCGGTGGAGGAAAGCCGGGAGAGGGTGCGGGCGGCCCTCAAGA
ACGCGGGCTTCCCCTACCCCCAGGCCCGGGTGGTGGTGAACCTGGCCCCGGCGGAGCTTCGGAAGGAGGGGAGCCAGTTT
GACCTTCCCATCGCCCTGGGCCTCCTCGCGGCCCAGGGGGTGGTGCCCCTCGAGGCCCTCTCCCCCTTCGCCCTCGCGGG
GGAGCTTGGCTTGGACGGGAGCCTGAGGCCCGTCCCCGGGGCGGTGAACCTGGCCCTGGGGGCCCTGGCCGAGGGGAAGA
AGCTCCTCCTGCCCCTGGAGAGCGCCAAGGAGGCGGCCTTGGTGGAGGGGGTGGAGGTCTACGGGGCGAGAAGCCTCCAG
GAAGCCGTGGCCTTCCTCAAGGGCGAGGAGGCCTTGGCCGAGGCCCGGCCCGAGGAGGCCCCGGAAGCGGTGGAGGCGCT
GGACCTCAGGGACGTCAAGGGCCAGGCCAAGGCCAAACGCGCCCTGGAGATCGCCGCCGCGGGCTTCCACCACCTCCTCA
TGGTGGGAAGCCCGGGCTCGGGGAAGACCATGCTGGCGCGGCGGCTTCCCTTCCTCCTCCCGCCCCTCTCCCGGGAGGAG
GCCTTGGAGGTGACCCGGATCCACTCCGCCGCGGGGAAGCCGGTGCGGGGGCTTGTGAAGGCCCCGCCTTTCCGCGCCCC
GCACCACACGGTGAGCTACGCTGGGCTCATCGGGGGTGGGGCCATTCCCAAGCCCGGGGAGGTCTCCTTGGCCCACCGGG
GGGTGCTCTTTCTGGACGAGTTCCCGGAGTTTTCCCGGGAGGCCCTCGAGGCCCTGCGCCAGCCCCTGGAGGACGGGGTG
GTGACCGTGGCCCGGGCCCGGGCAAGCCTCACCTTTCCCGCCCGCTTCCTCCTGGTGGCGGCCATGAACCCCTGCCCCTG
CGGCTGGCACGGGGACCCGGAAAGGCCTTGCACCTGCACCCCCGCCGCCCAGAGGCGCTACGCGGCCAGGATCTCCGGGC
CCCTCCTGGACCGGTTTGACCTGGTGGTGGAGGTGCCCCGCCTCACCCCCGAGGAGCTCGCCCGCGCCCCCGAGGGGGAG
GGCACGGAGGCGGTGCGGGAGCGGGTCCTAAGGGCCCGGGAGCGGATGCTCGCCCGCCAAGGGAGGCCCAACGGCCTCCT
CTCGGGGCGGGCCCTGAGGGAGCACGCCCGCCTCACCCCACCGGCCCAGGCCCTCCTCCAGGAGGCGGCCAAGCGGATGC
TCCTTTCGGCCCGGAGCTACGACCGCGTCCTCCGGGTGGCCCGCACCGTCGCCGACCTCCTGGGCGAGGAGAGGGTGGGG
GAGGCCCACGTGGCCGAGGCCCTGGCCTACCGCAGGGCCCTCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q72GR3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

48.583

100

0.486

  comM Haemophilus influenzae Rd KW20

47.39

100

0.478

  comM Glaesserella parasuis strain SC1401

46.23

100

0.472

  comM Vibrio campbellii strain DS40M4

46.559

100

0.466

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.04

100

0.46

  comM Legionella pneumophila str. Paris

44.689

100

0.451

  comM Legionella pneumophila strain ERS1305867

44.689

100

0.451


Multiple sequence alignment