Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   SR933_RS17790 Genome accession   NZ_CP139472
Coordinates   3786162..3787667 (+) Length   501 a.a.
NCBI ID   WP_013330860.1    Uniprot ID   -
Organism   Halomonas elongata DSM 2581 strain ATCC 33173     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 3779039..3798097 3786162..3787667 within 0


Gene organization within MGE regions


Location: 3779039..3798097
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SR933_RS17755 (SR933_17755) - 3779039..3779563 (+) 525 WP_013330853.1 TRAP transporter small permease -
  SR933_RS17760 (SR933_17760) - 3779560..3780858 (+) 1299 WP_013330854.1 TRAP transporter large permease -
  SR933_RS17765 (SR933_17765) - 3780882..3783251 (+) 2370 WP_013330855.1 TIM-barrel domain-containing protein -
  SR933_RS17770 (SR933_17770) glnK 3783316..3783654 (-) 339 WP_013330856.1 P-II family nitrogen regulator -
  SR933_RS17775 (SR933_17775) - 3783810..3785051 (-) 1242 WP_013330857.1 ammonium transporter -
  SR933_RS17780 (SR933_17780) - 3785097..3785435 (-) 339 WP_013330858.1 P-II family nitrogen regulator -
  SR933_RS17785 (SR933_17785) - 3785728..3786069 (+) 342 WP_013330859.1 accessory factor UbiK family protein -
  SR933_RS17790 (SR933_17790) comM 3786162..3787667 (+) 1506 WP_013330860.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  SR933_RS17795 (SR933_17795) cas1f 3787930..3788907 (+) 978 WP_041601826.1 type I-F CRISPR-associated endonuclease Cas1f -
  SR933_RS17800 (SR933_17800) cas3f 3788904..3792245 (+) 3342 WP_013330862.1 type I-F CRISPR-associated helicase Cas3f -
  SR933_RS17805 (SR933_17805) csy1 3792638..3794014 (+) 1377 WP_013330863.1 type I-F CRISPR-associated protein Csy1 -
  SR933_RS17810 (SR933_17810) csy2 3794007..3794966 (+) 960 WP_013330864.1 type I-F CRISPR-associated protein Csy2 -
  SR933_RS17815 (SR933_17815) csy3 3794984..3796015 (+) 1032 WP_013330865.1 type I-F CRISPR-associated protein Csy3 -
  SR933_RS17820 (SR933_17820) cas6f 3796019..3796588 (+) 570 WP_109637282.1 type I-F CRISPR-associated endoribonuclease Cas6/Csy4 -

Sequence


Protein


Download         Length: 501 a.a.        Molecular weight: 53806.82 Da        Isoelectric Point: 6.9349

>NTDB_id=909928 SR933_RS17790 WP_013330860.1 3786162..3787667(+) (comM) [Halomonas elongata DSM 2581 strain ATCC 33173]
MTLAIIRTRAGLGLEAPEVLVEVHLTNGLPGITLVGLPETAVKESRERVRSALVNAGFEFPLRRITLNLAPADLPKDGGR
FDLPIALGLLVASGQIPPEALAEVECVGELALDGGLRPASGVLPLAMATRQAGRRLIVPRANADEAALAGDLEVLPAEHL
LEVVAHLLGQETIAAHRLQAPPRRDTSEPDLREVRGQHQARRALEVAAAGGHNLLFAGPPGTGKTMLASRLPGILPPLGE
DEALEVAAVRSVSGLPLAEQWGRRPFRAPHHTASAVALVGGGSRPKPGEISLAHHGVLFLDELPEFSRQVLEVMREPMES
GQIHIARANHERRYPARFQLVAAMNPCPCGHLGDPRQACHCTAAQIQRYQARLSGPLLDRIDLQVEVPALPAEQLTSRES
GEDSATVRERVLAARERQWSRGALNAYLAGPDLEAACALGADDRAWLAEVLERLQLSARAFHRVLRVALTLADLAGAPRP
TREHLIEAIGYRQLDRLLKGG

Nucleotide


Download         Length: 1506 bp        

>NTDB_id=909928 SR933_RS17790 WP_013330860.1 3786162..3787667(+) (comM) [Halomonas elongata DSM 2581 strain ATCC 33173]
ATGACGCTGGCGATCATTCGCACCCGGGCGGGCCTCGGCCTGGAGGCGCCCGAGGTGCTTGTCGAGGTACACCTGACCAA
CGGCCTGCCTGGCATCACGCTGGTCGGGCTGCCCGAAACCGCCGTCAAGGAAAGCCGGGAGAGGGTGCGCAGCGCCCTGG
TCAATGCCGGTTTCGAATTTCCGCTGCGGCGTATCACCCTGAATCTGGCGCCCGCCGATCTTCCCAAGGACGGCGGGCGC
TTTGATCTCCCCATCGCACTGGGCCTGCTGGTCGCTTCCGGACAGATTCCGCCCGAGGCCCTGGCCGAGGTGGAGTGTGT
GGGCGAACTGGCGTTGGACGGCGGCCTGCGCCCGGCGAGCGGGGTGCTACCGCTGGCCATGGCCACGCGGCAAGCGGGGC
GGCGCTTGATCGTGCCCCGAGCCAACGCCGACGAAGCGGCCCTGGCCGGTGATCTCGAGGTTCTGCCCGCCGAGCATCTG
CTGGAGGTGGTGGCCCATCTTCTCGGGCAGGAAACCATTGCCGCCCATCGGCTACAGGCGCCGCCACGTCGCGATACCTC
GGAGCCGGATTTACGCGAGGTGAGAGGGCAGCACCAGGCGCGTCGTGCCCTGGAAGTCGCGGCGGCGGGAGGCCACAACC
TGTTGTTCGCCGGCCCGCCCGGCACCGGCAAGACCATGCTGGCCAGTCGTCTGCCCGGCATCCTGCCGCCGCTCGGCGAG
GACGAGGCCCTGGAGGTCGCGGCGGTACGTTCGGTCAGTGGATTGCCGCTGGCCGAGCAGTGGGGACGTCGCCCCTTTCG
AGCCCCACATCACACTGCGAGTGCCGTAGCTCTGGTCGGCGGCGGCTCGCGTCCCAAGCCGGGGGAGATCTCCCTGGCGC
ACCACGGCGTACTGTTTCTCGACGAACTGCCGGAGTTCTCGCGGCAGGTTCTGGAAGTGATGCGCGAGCCCATGGAATCC
GGACAGATCCACATTGCCCGCGCCAACCACGAGCGTCGTTATCCGGCGCGTTTCCAACTGGTGGCGGCCATGAATCCCTG
CCCCTGCGGTCATCTTGGCGACCCGCGCCAGGCCTGTCACTGCACGGCCGCCCAGATTCAGCGCTATCAGGCGCGACTGT
CAGGCCCCTTGCTGGATCGCATCGACCTGCAGGTGGAAGTGCCAGCCCTGCCAGCAGAGCAATTGACCTCGCGGGAGTCG
GGAGAGGATTCGGCGACGGTACGCGAACGGGTTTTGGCGGCGCGTGAGCGCCAATGGTCGAGAGGAGCGCTCAACGCCTA
CCTGGCAGGCCCCGATCTGGAAGCTGCTTGCGCGCTGGGTGCCGATGACCGTGCCTGGCTCGCCGAAGTGCTGGAGCGAC
TGCAGCTTTCGGCACGAGCCTTCCATCGCGTGCTCCGGGTGGCCCTGACCCTCGCCGACCTGGCCGGTGCCCCCAGGCCG
ACCCGCGAACATCTGATCGAGGCCATCGGTTATCGCCAGCTCGACCGCTTGCTCAAGGGCGGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

55.556

100

0.559

  comM Vibrio cholerae strain A1552

54.582

100

0.547

  comM Haemophilus influenzae Rd KW20

53.557

100

0.541

  comM Glaesserella parasuis strain SC1401

53.346

100

0.541

  comM Legionella pneumophila str. Paris

51.098

100

0.511

  comM Legionella pneumophila strain ERS1305867

51.098

100

0.511

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

47.686

99.202

0.473