Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   UN84_RS15955 Genome accession   NZ_CP022134
Coordinates   3365231..3366736 (-) Length   501 a.a.
NCBI ID   WP_046078811.1    Uniprot ID   -
Organism   Halomonas sp. HG01     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3360231..3371736
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  UN84_RS15930 - 3360387..3360779 (+) 393 WP_046078808.1 DUF423 domain-containing protein -
  UN84_RS15935 - 3360755..3361168 (-) 414 WP_035595012.1 PaaI family thioesterase -
  UN84_RS15940 - 3361327..3363501 (-) 2175 WP_046078809.1 malate synthase G -
  UN84_RS15945 - 3363731..3364714 (+) 984 WP_046080322.1 alpha/beta hydrolase -
  UN84_RS15950 - 3364711..3365211 (+) 501 WP_046078810.1 hypothetical protein -
  UN84_RS15955 comM 3365231..3366736 (-) 1506 WP_046078811.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  UN84_RS15960 - 3366822..3367181 (-) 360 WP_046078812.1 accessory factor UbiK family protein -
  UN84_RS15965 - 3367498..3367836 (+) 339 WP_016855902.1 P-II family nitrogen regulator -
  UN84_RS15970 - 3367876..3369198 (+) 1323 WP_035594997.1 ammonium transporter -
  UN84_RS15975 glnK 3369345..3369683 (+) 339 WP_035594994.1 P-II family nitrogen regulator -
  UN84_RS15980 - 3369941..3370717 (-) 777 WP_016855905.1 ferredoxin--NADP reductase -

Sequence


Protein


Download         Length: 501 a.a.        Molecular weight: 53594.66 Da        Isoelectric Point: 7.3500

>NTDB_id=237157 UN84_RS15955 WP_046078811.1 3365231..3366736(-) (comM) [Halomonas sp. HG01]
MTLSIVRTRAGLGLEAPEVLVEVHLANGLPGITLVGLPETAVRESRERVRSALVNAGFDFPMRRITLNLAPADLPKEGGR
FDLPIALGLLVASGQIAAEALAEIECAGELALDGALRPVPGMLPLALATRGAGRRLIVPRANADEAALAGDLEVLPADHL
LEVVAHLLGQTPIEPHRASRPAEALTTGADLAEVRGQHQARRALEVAASGGHNLLLAGPPGTGKTMLASRLPGILPPLTE
NEALEVAAVRSVCGLPLDEDWGRRPFRSPHHTASAVALVGGGSRPRPGEISLAHRGVLFLDELPEFPRQVLEVMREPMES
GRIHIARASHERRYPASFQLVAAMNPCPCGHLGDPRRACHCTAAQIQRYQARLSGPLLDRIDLQVEVPALAPEQLTARES
GEPSAAVRERVLAARERQWARGALNARLGSRELEAACALSGTDRAWLAGVLERLNLSARAYHRVLRVALTLADLAGEPHP
GREQLVEAIGYRQLDRLLQGR

Nucleotide


Download         Length: 1506 bp        

>NTDB_id=237157 UN84_RS15955 WP_046078811.1 3365231..3366736(-) (comM) [Halomonas sp. HG01]
ATGACCCTGTCGATCGTTCGCACCCGGGCCGGCCTCGGCCTCGAGGCTCCCGAGGTGCTGGTGGAAGTGCATCTGGCCAA
CGGCCTGCCCGGCATCACCCTGGTCGGGCTGCCGGAGACCGCCGTTCGGGAGAGCCGCGAGCGGGTCCGCAGCGCGCTGG
TCAACGCCGGCTTCGACTTCCCGATGCGTCGCATCACCCTCAACCTGGCGCCCGCCGACCTGCCCAAGGAAGGCGGGCGC
TTCGATCTGCCCATCGCCCTGGGGTTGCTGGTGGCCTCGGGCCAGATCGCCGCCGAGGCGCTGGCCGAGATTGAATGCGC
GGGCGAGCTCGCCCTGGACGGCGCGCTGCGTCCGGTGCCCGGGATGCTGCCGCTGGCGCTGGCCACCCGCGGGGCCGGGC
GGCGTCTGATCGTGCCCCGGGCCAACGCCGACGAGGCGGCGCTCGCGGGCGATCTCGAGGTGCTGCCCGCCGATCACCTG
CTCGAGGTGGTCGCACACCTGCTGGGCCAGACGCCGATCGAGCCCCATCGGGCGTCGCGCCCCGCCGAGGCGCTGACGAC
GGGTGCCGATCTCGCCGAGGTACGTGGCCAGCACCAGGCGCGGCGCGCCCTGGAGGTCGCCGCCAGCGGTGGCCACAATC
TCCTGCTGGCCGGTCCTCCAGGCACCGGCAAGACCATGCTGGCCAGCCGCCTGCCCGGGATCCTGCCGCCGCTGACCGAG
AACGAGGCCCTGGAGGTGGCGGCGGTGCGTTCGGTGTGTGGCCTGCCGCTCGACGAGGACTGGGGGCGGCGCCCCTTCCG
CTCGCCCCACCACACCGCCAGCGCGGTGGCGCTGGTCGGCGGTGGCTCGCGGCCCAGGCCTGGCGAGATCTCGCTGGCCC
ATCGGGGCGTGCTGTTCCTCGACGAGTTGCCCGAGTTTCCCCGCCAGGTGCTCGAGGTGATGCGCGAGCCCATGGAATCC
GGGCGCATCCACATCGCCCGCGCCAGCCACGAGCGGCGCTATCCGGCGAGCTTCCAGCTGGTGGCGGCGATGAATCCCTG
CCCCTGCGGCCATCTCGGTGACCCTCGCCGGGCCTGCCACTGCACCGCCGCCCAGATCCAGCGCTATCAGGCGCGGCTTT
CCGGACCGCTGCTCGACCGTATCGACCTGCAGGTGGAGGTGCCGGCGCTGGCGCCGGAGCAGCTGACCGCCCGGGAGAGC
GGAGAGCCCTCGGCGGCGGTACGCGAGCGGGTACTGGCGGCCCGGGAGCGGCAGTGGGCGCGCGGGGCGCTCAATGCCCG
GCTGGGCAGTCGCGAGCTGGAGGCCGCCTGCGCACTGAGCGGGACGGATCGGGCCTGGCTGGCCGGGGTGCTGGAGCGGC
TCAATCTCTCGGCGCGGGCCTATCATCGGGTGCTGCGGGTGGCGCTGACCCTGGCCGACCTGGCCGGCGAGCCGCACCCC
GGGCGCGAGCAACTGGTCGAGGCGATCGGCTATCGTCAGCTCGATCGGCTGCTGCAGGGGCGTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

55.666

100

0.559

  comM Vibrio cholerae strain A1552

54.871

100

0.551

  comM Glaesserella parasuis strain SC1401

53.937

100

0.547

  comM Haemophilus influenzae Rd KW20

53.452

100

0.541

  comM Legionella pneumophila str. Paris

50.402

99.401

0.501

  comM Legionella pneumophila strain ERS1305867

50.402

99.401

0.501

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

48.692

99.202

0.483


Multiple sequence alignment