Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   THEOS_RS10565 Genome accession   NC_019386
Coordinates   1996608..1998101 (+) Length   497 a.a.
NCBI ID   WP_016330303.1    Uniprot ID   -
Organism   Thermus oshimai JL-2     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 1991608..2003101
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  THEOS_RS10540 (Theos_2127) - 1992189..1992491 (-) 303 WP_016330298.1 proton-translocating transhydrogenase family protein -
  THEOS_RS10545 (Theos_2128) - 1992491..1993618 (-) 1128 WP_016330299.1 NAD(P) transhydrogenase subunit alpha -
  THEOS_RS10550 (Theos_2129) - 1993736..1994620 (+) 885 WP_016330300.1 DMT family transporter -
  THEOS_RS10555 (Theos_2130) - 1994617..1995429 (+) 813 WP_016330301.1 EamA family transporter -
  THEOS_RS10560 (Theos_2131) - 1995431..1996564 (+) 1134 WP_016330302.1 MFS transporter -
  THEOS_RS10565 (Theos_2132) comM 1996608..1998101 (+) 1494 WP_016330303.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  THEOS_RS10570 (Theos_2133) - 1998098..2000125 (-) 2028 WP_016330304.1 molybdopterin oxidoreductase family protein -
  THEOS_RS10575 (Theos_2134) - 2000130..2001674 (-) 1545 WP_041436584.1 carboxylesterase/lipase family protein -
  THEOS_RS10580 (Theos_2135) - 2001737..2002336 (+) 600 WP_016330306.1 ribonuclease HII -
  THEOS_RS10585 (Theos_2136) - 2002394..2002870 (+) 477 WP_016330307.1 DUF4384 domain-containing protein -

Sequence


Protein


Download         Length: 497 a.a.        Molecular weight: 53263.66 Da        Isoelectric Point: 8.4596

>NTDB_id=54369 THEOS_RS10565 WP_016330303.1 1996608..1998101(+) (comM) [Thermus oshimai JL-2]
MLAQVRSYALLGLEAVPVTVEVDVSPGLPSYALVGLPDKAVEESRERVRSALKNSGLPYPQARVVVNLAPAELRKEGSQF
DLPIALGLLAAQGVVPPEALAPFAFAGELGLDGSLRPVPGAVNLALGALGEGKKLLLPRESAKEAALVEGVEAYGVGSLG
EAVAFLRGERELAPAEGEEALWEEALLDLRDVKGQAKAKRALEIAASGRHHLLMVGSPGSGKTMLARRLPFLLPPLSQEE
ALEVTRVHSAAGLPVRGLLRTPPFRAPHHTVSYAGLIGGGAIPKPGEVSLAHRGVLFLDEFPEFSREALEALRQPLEDGV
VTVARARASLTFPARFLLVAAMNPCPCGWYGDPERPCACTPTQRLRYANRISGPLLDRFDLVVEVPRLTPLELARAPEGE
STEAVRERVLLARKRMLARQGRPNGELSGRALRAHLHLTPGAEALLQAAAKKLLLSARSYDRLLRVARTVADLMGSERVE
EAHVAEALTYRRSLYPA

Nucleotide


Download         Length: 1494 bp        

>NTDB_id=54369 THEOS_RS10565 WP_016330303.1 1996608..1998101(+) (comM) [Thermus oshimai JL-2]
ATGCTAGCTCAGGTGCGGAGCTACGCCCTTTTGGGCCTCGAGGCGGTCCCCGTCACCGTGGAGGTGGATGTCAGCCCGGG
GCTTCCCAGCTACGCCCTGGTGGGCCTGCCCGACAAGGCGGTGGAGGAGAGCCGGGAAAGGGTGCGTTCCGCCCTGAAGA
ACAGCGGCCTCCCCTACCCCCAGGCCCGGGTGGTGGTGAACCTGGCCCCGGCGGAGCTCCGCAAGGAGGGGAGCCAGTTT
GACCTGCCCATCGCCCTGGGCCTCCTGGCGGCGCAAGGGGTGGTGCCCCCTGAGGCCCTGGCCCCCTTCGCCTTTGCCGG
GGAGCTGGGCCTGGACGGAAGCTTAAGGCCCGTCCCCGGCGCGGTGAACCTGGCCCTGGGGGCCCTAGGGGAGGGGAAAA
AGCTCCTCCTCCCCAGGGAAAGCGCCAAGGAAGCCGCCCTGGTGGAGGGGGTGGAGGCCTACGGGGTGGGCTCCCTGGGG
GAGGCCGTGGCCTTCCTGAGGGGGGAGCGGGAGCTTGCGCCGGCGGAAGGGGAGGAGGCCCTCTGGGAAGAGGCCCTTCT
GGACCTCCGGGACGTGAAGGGCCAGGCCAAGGCCAAGCGGGCCCTGGAGATCGCCGCCAGCGGCCGCCACCACCTCCTCA
TGGTGGGGAGCCCGGGCTCGGGGAAGACCATGCTGGCCCGCCGCCTGCCCTTCCTCCTCCCGCCCCTAAGCCAAGAGGAG
GCCTTGGAGGTGACCCGGGTCCACTCCGCCGCGGGGCTCCCCGTGCGGGGCCTTCTCCGCACCCCCCCTTTCCGCGCCCC
CCACCACACGGTGAGCTACGCCGGGCTCATCGGGGGCGGGGCCATCCCCAAGCCGGGGGAGGTCTCCCTGGCCCACCGCG
GGGTCCTCTTCCTGGACGAGTTCCCGGAGTTTTCCCGGGAGGCCCTCGAGGCCCTGCGCCAGCCCCTGGAGGACGGGGTG
GTCACCGTGGCCCGGGCCCGGGCCAGCCTCACCTTCCCCGCCCGCTTCCTCCTGGTGGCGGCCATGAACCCCTGCCCCTG
CGGCTGGTACGGGGACCCGGAGCGCCCCTGCGCCTGCACCCCCACCCAACGCCTGCGCTACGCCAACCGCATCTCGGGGC
CCCTCCTGGACCGGTTTGACCTGGTGGTGGAGGTGCCCCGCCTCACCCCCCTGGAGCTCGCCCGGGCCCCCGAAGGGGAA
AGCACGGAGGCGGTGCGGGAAAGGGTGCTCCTGGCGCGCAAAAGGATGCTCGCCCGCCAGGGGCGGCCCAACGGGGAGCT
TTCGGGAAGGGCCTTGAGGGCCCACCTCCACCTCACCCCAGGGGCGGAGGCCCTCCTCCAAGCCGCGGCCAAGAAGCTCC
TCCTCTCCGCCCGGAGCTACGACCGCCTCCTGAGGGTGGCCCGCACGGTGGCCGACCTCATGGGCTCGGAGCGGGTGGAA
GAGGCCCACGTGGCCGAGGCCCTCACCTACCGGAGGAGCCTGTACCCCGCTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

48.283

99.598

0.481

  comM Haemophilus influenzae Rd KW20

47.4

100

0.477

  comM Vibrio campbellii strain DS40M4

47.284

100

0.473

  comM Glaesserella parasuis strain SC1401

47.189

100

0.473

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.841

100

0.455

  comM Legionella pneumophila str. Paris

44.378

100

0.445

  comM Legionella pneumophila strain ERS1305867

44.378

100

0.445


Multiple sequence alignment