Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   QYS58_RS00740 Genome accession   NZ_CP130143
Coordinates   162505..164007 (+) Length   500 a.a.
NCBI ID   WP_302140223.1    Uniprot ID   -
Organism   Halomonas alkalicola strain M2     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 157505..169007
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QYS58_RS00710 - 157775..158701 (-) 927 WP_119023101.1 LysR family transcriptional regulator -
  QYS58_RS00715 - 158862..159638 (+) 777 WP_119023100.1 ferredoxin--NADP reductase -
  QYS58_RS00720 glnK 159732..160070 (-) 339 WP_110069051.1 P-II family nitrogen regulator -
  QYS58_RS00725 - 160218..161468 (-) 1251 WP_169956021.1 ammonium transporter -
  QYS58_RS00730 - 161496..161834 (-) 339 WP_119023403.1 P-II family nitrogen regulator -
  QYS58_RS00735 - 162112..162438 (+) 327 WP_302140220.1 accessory factor UbiK family protein -
  QYS58_RS00740 comM 162505..164007 (+) 1503 WP_302140223.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  QYS58_RS00745 - 164215..164712 (-) 498 WP_302140224.1 hypothetical protein -
  QYS58_RS00750 - 164709..165695 (-) 987 WP_302140226.1 alpha/beta hydrolase -
  QYS58_RS00755 - 165931..168105 (+) 2175 WP_302140228.1 malate synthase G -
  QYS58_RS00760 - 168245..168658 (+) 414 WP_302140230.1 PaaI family thioesterase -

Sequence


Protein


Download         Length: 500 a.a.        Molecular weight: 53504.84 Da        Isoelectric Point: 8.1354

>NTDB_id=857314 QYS58_RS00740 WP_302140223.1 162505..164007(+) (comM) [Halomonas alkalicola strain M2]
MTLAIVHTRAGLGLEAPEVQVEVHLANGLPGMTLVGLPEAAVKESRERVRSALVNAGFDYPLRRITLNLAPADLPKEGGR
FDLPIALGLLLASGQLPPEALADIESVGELALDGGLRPITGVLPLALATRRARRRLIVPRANAEEAALAGDLEVLPADHL
LDVVAHLLGQAPIAPHRLASPPEAATSLPDLAEVRGQYQARRALEVAAAGSHNLLFAGPPGTGKTMLASRLPGILPPLTE
EEALEVAAVRSVSGLPLAAEWGRRPFRAPHHTASAVALVGGGSKPRPGEISLAHHGVLFLDELPEFSRHVLEVMREPMES
GQIHIARASHQRRFPARFQLVAAMNPCPCGHLGDPRQACHCTAAQIQRYQARLSGPLLDRIDLQVEVPALPPEQLTAATT
GESSAIVRRRVMAARERQLARGALNAHLAGRELEAACDLASADRAWLAQVLERLRLSARAYHRVLRVALTLADLAGEPKP
AREQLIEAIGYRQLDRLLKG

Nucleotide


Download         Length: 1503 bp        

>NTDB_id=857314 QYS58_RS00740 WP_302140223.1 162505..164007(+) (comM) [Halomonas alkalicola strain M2]
ATGACGCTGGCGATCGTCCATACCCGTGCGGGCCTCGGCCTCGAGGCCCCCGAGGTGCAGGTGGAGGTGCATCTGGCCAA
TGGCCTGCCGGGGATGACCCTGGTGGGGCTGCCCGAGGCGGCGGTGAAGGAGAGCCGCGAGCGGGTGCGCAGCGCCCTGG
TCAACGCCGGCTTCGACTACCCCCTGCGCCGCATCACCCTCAACCTGGCTCCCGCCGACCTGCCCAAGGAGGGTGGACGC
TTCGACCTGCCCATCGCCCTGGGACTGCTGCTGGCCTCAGGGCAGCTGCCCCCGGAGGCCCTGGCGGACATCGAGAGCGT
CGGCGAGCTGGCCCTGGACGGTGGCCTGCGCCCGATCACCGGTGTGCTGCCCCTGGCGCTCGCCACCCGCCGTGCCAGGC
GGCGGCTGATCGTGCCCCGGGCCAACGCCGAGGAGGCGGCGCTCGCCGGCGACCTCGAGGTGCTGCCCGCCGACCATCTG
CTGGACGTGGTCGCCCATCTGCTGGGCCAGGCTCCCATCGCGCCGCACCGGCTGGCGTCTCCGCCGGAGGCGGCGACATC
GCTGCCCGATCTCGCCGAGGTGCGCGGCCAGTACCAGGCGCGCCGCGCCCTGGAAGTGGCCGCCGCGGGGTCCCACAACC
TGCTCTTCGCCGGCCCGCCGGGCACCGGCAAGACCATGCTGGCCAGCCGCCTGCCGGGCATCCTGCCGCCGCTGACCGAG
GAGGAGGCGCTGGAGGTGGCGGCGGTGCGCTCGGTGAGCGGCCTGCCGCTGGCCGCCGAGTGGGGTCGCCGCCCCTTTCG
GGCGCCGCACCATACCGCCAGCGCCGTGGCCCTGGTAGGCGGCGGCTCGAAGCCGCGACCCGGAGAGATCTCGCTGGCCC
ACCACGGGGTGCTGTTCCTCGACGAGCTGCCGGAGTTCTCGCGCCACGTGCTGGAGGTGATGCGCGAGCCCATGGAATCG
GGCCAGATCCATATCGCACGGGCCAGCCACCAGCGCCGCTTTCCGGCTCGCTTCCAGCTGGTGGCGGCCATGAATCCTTG
CCCTTGCGGCCATCTGGGGGACCCGCGGCAGGCCTGCCACTGCACCGCGGCCCAGATCCAGCGCTACCAGGCGCGGCTCT
CCGGGCCGCTGCTCGATCGCATCGACCTGCAGGTGGAGGTGCCGGCGCTGCCGCCGGAGCAGCTCACGGCGGCGACGACG
GGGGAGTCCTCGGCGATCGTCCGCCGGCGGGTGATGGCCGCGCGGGAACGGCAGCTGGCGCGGGGGGCGCTGAATGCCCA
CCTGGCGGGGCGGGAGCTGGAGGCGGCCTGCGACCTCGCCAGCGCCGACCGGGCCTGGCTGGCGCAGGTGCTGGAGCGGC
TCAGGCTCTCGGCGCGGGCCTACCACCGGGTGCTGCGGGTGGCGCTGACCCTGGCCGACCTGGCCGGCGAGCCGAAGCCG
GCCCGGGAGCAGCTGATCGAGGCGATCGGCTATCGGCAGCTGGATCGGCTGCTCAAGGGGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

56.175

100

0.564

  comM Vibrio cholerae strain A1552

55.976

100

0.562

  comM Haemophilus influenzae Rd KW20

54.635

100

0.554

  comM Glaesserella parasuis strain SC1401

53.937

100

0.548

  comM Legionella pneumophila str. Paris

50.398

100

0.506

  comM Legionella pneumophila strain ERS1305867

50.398

100

0.506

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

47.244

100

0.48