Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   THTHE16_RS01005 Genome accession   NC_017272
Coordinates   174627..176111 (+) Length   494 a.a.
NCBI ID   WP_008630589.1    Uniprot ID   H7GDK4
Organism   Thermus thermophilus SG0.5JP17-16     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 169627..181111
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  THTHE16_RS00985 (Ththe16_0186) - 170628..171515 (+) 888 WP_041444826.1 DMT family transporter -
  THTHE16_RS00990 (Ththe16_0187) - 171512..172324 (+) 813 WP_014509788.1 DMT family transporter -
  THTHE16_RS00995 (Ththe16_0188) - 172326..173464 (+) 1139 Protein_186 MFS transporter -
  THTHE16_RS01000 (Ththe16_0189) - 173446..174540 (-) 1095 WP_014509789.1 Fic family protein -
  THTHE16_RS01005 (Ththe16_0190) comM 174627..176111 (+) 1485 WP_008630589.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  THTHE16_RS01010 (Ththe16_0191) - 176108..178120 (-) 2013 WP_014509790.1 molybdopterin oxidoreductase family protein -
  THTHE16_RS01015 (Ththe16_0192) - 178125..179672 (-) 1548 WP_014509791.1 polyhydroxybutyrate depolymerase -
  THTHE16_RS01020 (Ththe16_0193) - 179697..180308 (+) 612 WP_011227771.1 ribonuclease HII -
  THTHE16_RS01025 (Ththe16_0194) - 180348..180827 (+) 480 WP_014509792.1 DUF4384 domain-containing protein -

Sequence


Protein


Download         Length: 494 a.a.        Molecular weight: 52814.17 Da        Isoelectric Point: 8.0687

>NTDB_id=45768 THTHE16_RS01005 WP_008630589.1 174627..176111(+) (comM) [Thermus thermophilus SG0.5JP17-16]
MLAQVRSYALFGLDAVPVTVEVDVSPGLPSYALVGLPDKAVEESRERVRAALKNAGFPYPQARVVVNLAPAELRKEGSQF
DLPIALGLLAAQGVVPLEALSPFALAGELGLDGSLRPVPGAVNLALGALAEGKKLLLPLESAKEAALVEGVEVYGARSLQ
EAVAFLKGEEALAEARPEEAPEAVEALDLRDVKGQAKAKRALEIAAAGFHHLLMVGSPGSGKTMLARRLPFLLPPLSREE
ALEVTRIHSAAGKPVRGLVKAPPFRAPHHTVSYAGLIGGGAIPKPGEVSLAHRGVLFLDEFPEFSREALEALRQPLEDGV
VTVARARASLTFPARFLLVAAMNPCPCGWHGDPERPCTCTPAAQRRYAARISGPLLDRFDLVVEVPRLTPEELACAPEGE
GTEAVRERVLRARERMLARQGRPNGLLSGRALREHARLTPPAQALLQEAAKRMLLSARSYDRVLRVARTVADLLGEERVG
EAHVAEALAYRRAL

Nucleotide


Download         Length: 1485 bp        

>NTDB_id=45768 THTHE16_RS01005 WP_008630589.1 174627..176111(+) (comM) [Thermus thermophilus SG0.5JP17-16]
ATGCTGGCCCAGGTGCGAAGCTACGCCCTCTTCGGCCTGGACGCGGTTCCCGTCACCGTGGAGGTGGACGTTAGCCCGGG
GCTTCCCAGCTACGCCCTGGTGGGCCTGCCCGACAAGGCGGTGGAGGAAAGCCGGGAGCGCGTCCGGGCGGCCCTCAAGA
ACGCGGGCTTCCCCTACCCCCAGGCCCGGGTGGTGGTGAACCTGGCCCCGGCGGAGCTTCGGAAGGAGGGGAGCCAGTTT
GACCTTCCCATCGCCCTGGGCCTCCTCGCGGCCCAGGGGGTGGTGCCCCTCGAGGCCCTCTCCCCCTTCGCCCTCGCGGG
GGAGCTTGGCTTAGACGGGAGCCTGAGGCCCGTTCCCGGGGCGGTGAACCTGGCCCTGGGGGCCTTGGCCGAGGGGAAGA
AGCTCCTCCTGCCCCTGGAGAGCGCCAAGGAGGCGGCCCTGGTGGAGGGGGTGGAGGTCTACGGGGCGAGAAGCCTCCAG
GAAGCCGTGGCCTTCCTCAAGGGCGAGGAGGCCTTGGCCGAGGCCCGGCCCGAGGAGGCCCCGGAAGCGGTGGAGGCGCT
GGACCTCAGGGACGTCAAGGGCCAGGCCAAGGCCAAGCGCGCCCTGGAGATCGCCGCCGCGGGCTTCCACCACCTCCTCA
TGGTGGGAAGCCCGGGCTCGGGGAAGACCATGCTGGCGCGGCGGCTTCCCTTCCTCCTCCCGCCCCTCTCCCGGGAGGAG
GCCTTGGAGGTGACCCGGATCCACTCCGCCGCGGGGAAGCCGGTGCGGGGGCTTGTGAAGGCCCCGCCCTTCCGCGCCCC
ACACCACACGGTGAGCTACGCCGGGCTCATCGGGGGTGGGGCCATCCCCAAGCCCGGGGAGGTCTCCTTGGCCCACCGGG
GCGTGCTCTTTCTGGACGAGTTCCCGGAGTTTTCCCGGGAGGCCCTCGAGGCCCTGCGCCAGCCCCTCGAGGACGGGGTG
GTGACCGTGGCCCGGGCCCGGGCAAGCCTCACCTTCCCCGCCCGCTTCCTCCTGGTGGCGGCCATGAACCCCTGCCCCTG
CGGCTGGCACGGGGACCCGGAAAGGCCTTGCACCTGCACCCCCGCCGCCCAGAGGCGCTACGCGGCCAGGATCTCCGGGC
CCCTCCTGGACCGGTTTGACCTGGTGGTGGAGGTGCCCCGCCTCACCCCCGAGGAGCTCGCCTGCGCCCCCGAGGGGGAG
GGCACGGAGGCGGTGCGGGAGCGGGTCCTAAGGGCCCGGGAGCGGATGCTCGCCCGCCAGGGGAGGCCCAACGGGCTCCT
CTCGGGGCGGGCCCTGAGGGAGCACGCCCGCCTCACCCCACCGGCCCAGGCCCTCCTCCAGGAGGCGGCCAAGCGGATGC
TCCTTTCGGCCCGGAGCTACGACCGCGTCCTCCGGGTGGCCCGCACCGTCGCCGACCTCCTGGGCGAGGAGAGGGTGGGG
GAGGCCCACGTGGCCGAGGCCCTGGCCTACCGCAGGGCCCTCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB H7GDK4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

48.583

100

0.486

  comM Haemophilus influenzae Rd KW20

47.39

100

0.478

  comM Glaesserella parasuis strain SC1401

46.23

100

0.472

  comM Vibrio campbellii strain DS40M4

46.559

100

0.466

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.04

100

0.46

  comM Legionella pneumophila str. Paris

44.8

100

0.453

  comM Legionella pneumophila strain ERS1305867

44.8

100

0.453


Multiple sequence alignment