Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   I4484_RS20045 Genome accession   NZ_CP065435
Coordinates   4278134..4279639 (-) Length   501 a.a.
NCBI ID   WP_197448908.1    Uniprot ID   -
Organism   Halomonas sp. SS10-MC5     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 4273134..4284639
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I4484_RS20025 (I4484_20025) - 4273163..4274101 (+) 939 WP_197448904.1 substrate-binding domain-containing protein -
  I4484_RS20030 (I4484_20030) - 4274192..4276366 (-) 2175 WP_197448905.1 malate synthase G -
  I4484_RS20035 (I4484_20035) - 4276671..4277654 (+) 984 WP_197448906.1 alpha/beta hydrolase -
  I4484_RS20040 (I4484_20040) - 4277648..4278127 (+) 480 WP_197448907.1 hypothetical protein -
  I4484_RS20045 (I4484_20045) comM 4278134..4279639 (-) 1506 WP_197448908.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  I4484_RS20050 (I4484_20050) - 4279715..4280071 (-) 357 WP_197448909.1 accessory factor UbiK family protein -
  I4484_RS20055 (I4484_20055) - 4280359..4280697 (+) 339 WP_197448910.1 P-II family nitrogen regulator -
  I4484_RS20060 (I4484_20060) - 4280743..4282068 (+) 1326 WP_197448911.1 ammonium transporter -
  I4484_RS20065 (I4484_20065) glnK 4282207..4282545 (+) 339 WP_086509163.1 P-II family nitrogen regulator -
  I4484_RS21010 - 4282572..4282673 (+) 102 Protein_3941 P-II family nitrogen regulator -
  I4484_RS20070 (I4484_20070) - 4282821..4283597 (-) 777 WP_197448912.1 ferredoxin--NADP reductase -

Sequence


Protein


Download         Length: 501 a.a.        Molecular weight: 53948.12 Da        Isoelectric Point: 7.2371

>NTDB_id=509653 I4484_RS20045 WP_197448908.1 4278134..4279639(-) (comM) [Halomonas sp. SS10-MC5]
MLAIVATRAGLGLEAPEVQVEVHLANGLPGMTLVGLPEAAVKESRERVRSALTNAGFDFPNTRRIVLNLAPADLPKVGGR
FDLPIALGILVASGQLPGEALEGIESVGELALDGRLRPINGVLPLALAARKAGRKLIVPRENADEAALAGDLQVLPAEHL
LEVMAHLLGQEPIAAHRLEAPPLADTREGDLAEVRGQHQARRALEIAAAGSHNLLFAGPPGTGKTMLASRLPGILPELTE
EEALEVAAIRSVCGLPLAEQWGRRPFRAPHHTASAVALVGGGSWPRPGEISLAHHGVLFLDELPEFSRHVLEVMREPMES
GRIHIARANHERRFPARFQLVAAMNPCPCGHLGDPRQACQCTAAQIQRYQSRLSGPLLDRIDLQIEVPALPPEQLTSREV
GESSAEVRARVVAARERQYARGALNAHLGGRELEAACALSTADRAWLAGVLERLRLSARAYHRVLRVALTLADLAAKPHP
ERGELMEAIGYRQLDRQLRQA

Nucleotide


Download         Length: 1506 bp        

>NTDB_id=509653 I4484_RS20045 WP_197448908.1 4278134..4279639(-) (comM) [Halomonas sp. SS10-MC5]
ATGCTGGCGATCGTTGCCACACGGGCCGGCCTGGGGCTGGAAGCGCCCGAGGTGCAGGTCGAGGTACATCTGGCCAACGG
CCTGCCGGGGATGACCCTGGTCGGCCTGCCCGAGGCCGCCGTCAAGGAGAGCCGCGAGCGGGTGCGTAGCGCCCTTACCA
ACGCCGGCTTCGATTTCCCCAATACGCGCAGGATCGTGCTCAACCTGGCGCCCGCCGACCTGCCCAAGGTCGGCGGGCGC
TTCGATCTGCCCATTGCACTGGGCATCCTGGTGGCCTCCGGCCAGCTGCCCGGCGAGGCACTCGAAGGCATCGAGAGCGT
CGGCGAGCTGGCGCTGGACGGCCGTCTGCGCCCCATCAACGGCGTGCTGCCCCTGGCGCTCGCCGCGCGCAAGGCCGGGC
GCAAGCTGATCGTGCCCCGGGAGAATGCCGACGAGGCGGCGCTGGCCGGCGACCTTCAGGTGCTGCCGGCCGAGCACCTG
CTCGAGGTGATGGCCCACCTGCTGGGGCAGGAGCCCATCGCCGCGCACCGTCTCGAAGCGCCACCCCTCGCCGACACCCG
CGAGGGCGATCTGGCCGAGGTGCGCGGCCAGCACCAGGCCCGCCGCGCGCTGGAAATCGCCGCGGCCGGCTCGCACAACC
TGCTCTTCGCGGGGCCACCCGGCACCGGCAAGACCATGCTGGCGAGCCGCCTGCCCGGTATCCTGCCCGAACTCACCGAG
GAGGAGGCGCTGGAAGTCGCCGCCATCCGCTCGGTGTGCGGCCTGCCCCTGGCCGAGCAGTGGGGGCGCCGCCCCTTCCG
CGCGCCGCACCACACCGCCAGCGCCGTGGCGCTGGTCGGCGGCGGCTCCTGGCCGCGGCCGGGCGAGATCTCGCTGGCCC
ACCACGGCGTGCTGTTTCTCGACGAGCTGCCCGAGTTCAGCCGTCACGTGCTCGAGGTGATGCGCGAGCCGATGGAGTCG
GGCCGCATCCATATCGCTCGCGCCAACCACGAGCGCCGCTTCCCTGCCCGTTTCCAGCTGGTGGCCGCGATGAACCCGTG
CCCCTGCGGTCACCTGGGCGACCCGCGCCAGGCCTGCCAGTGCACCGCGGCGCAGATCCAGCGCTACCAGTCGCGGCTCT
CCGGGCCGCTGCTCGACCGCATCGACCTGCAGATCGAAGTGCCGGCGCTGCCGCCGGAGCAGCTCACCTCACGCGAGGTA
GGCGAGAGCTCGGCGGAGGTGCGCGCCCGGGTCGTGGCCGCCCGCGAGCGGCAGTACGCCCGCGGCGCGCTCAACGCCCA
CCTGGGCGGGCGCGAACTGGAAGCGGCCTGCGCCCTGTCGACGGCCGACCGTGCCTGGCTGGCGGGGGTGCTCGAACGCC
TGCGTTTGTCGGCGCGCGCCTACCACCGGGTGCTGCGCGTGGCGCTGACGCTGGCCGATCTGGCCGCCAAGCCGCATCCG
GAGCGCGGCGAGCTGATGGAGGCGATCGGCTACCGGCAGCTAGACCGCCAGCTCCGCCAGGCATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

56.175

100

0.563

  comM Vibrio cholerae strain A1552

55.976

100

0.561

  comM Haemophilus influenzae Rd KW20

54.348

100

0.549

  comM Glaesserella parasuis strain SC1401

52.964

100

0.535

  comM Legionella pneumophila str. Paris

49.798

99.002

0.493

  comM Legionella pneumophila strain ERS1305867

49.798

99.002

0.493

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

47.389

100

0.489