Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   M5M_RS02545 Genome accession   NC_018868
Coordinates   567891..568100 (+) Length   69 a.a.
NCBI ID   WP_016389179.1    Uniprot ID   -
Organism   Simiduia agarivorans SA1 = DSM 21679     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 562818..608745 567891..568100 within 0


Gene organization within MGE regions


Location: 562818..608745
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  M5M_RS02530 (M5M_02530) - 562818..565040 (+) 2223 WP_016389178.1 bifunctional prephenate dehydrogenase/3-phosphoshikimate 1-carboxyvinyltransferase -
  M5M_RS02535 (M5M_02535) - 565087..566442 (+) 1356 WP_015045899.1 phosphomannomutase/phosphoglucomutase -
  M5M_RS02540 (M5M_02540) galU 566590..567468 (+) 879 WP_015045900.1 UTP--glucose-1-phosphate uridylyltransferase GalU -
  M5M_RS02545 (M5M_02545) comEA 567891..568100 (+) 210 WP_016389179.1 ComEA family DNA-binding protein Machinery gene
  M5M_RS02550 (M5M_02550) cmk 568381..569052 (+) 672 WP_015045902.1 (d)CMP kinase -
  M5M_RS02560 (M5M_02560) rpsA 569222..570904 (+) 1683 WP_015045904.1 30S ribosomal protein S1 -
  M5M_RS02565 (M5M_02565) ihfB 571230..571526 (+) 297 WP_015045905.1 integration host factor subunit beta -
  M5M_RS02570 (M5M_02570) - 571535..571807 (+) 273 WP_015045906.1 LapA family protein -
  M5M_RS02575 (M5M_02575) - 571815..572996 (+) 1182 WP_015045907.1 tetratricopeptide repeat protein -
  M5M_RS02580 (M5M_02580) pyrF 573217..573921 (+) 705 WP_024330264.1 orotidine-5'-phosphate decarboxylase -
  M5M_RS02585 (M5M_02585) cysD 574175..575086 (+) 912 WP_015045909.1 sulfate adenylyltransferase subunit CysD -
  M5M_RS02590 (M5M_02590) cysN 575181..576599 (+) 1419 WP_015045910.1 sulfate adenylyltransferase subunit CysN -
  M5M_RS02595 (M5M_02595) - 576669..578411 (+) 1743 WP_162141154.1 glycosyltransferase family 2 protein -
  M5M_RS02600 (M5M_02600) - 578457..579338 (+) 882 WP_162141153.1 glycosyltransferase -
  M5M_RS02605 (M5M_02605) - 579341..580204 (+) 864 WP_015045913.1 glycosyltransferase family 2 protein -
  M5M_RS02610 (M5M_02610) - 580234..580755 (+) 522 WP_015045914.1 hypothetical protein -
  M5M_RS02615 (M5M_02617) - 580807..581613 (+) 807 WP_016389181.1 ABC transporter permease -
  M5M_RS02620 (M5M_02625) - 581610..582338 (+) 729 WP_015045915.1 ABC transporter ATP-binding protein -
  M5M_RS02625 (M5M_02630) - 582406..583812 (+) 1407 WP_015045916.1 hypothetical protein -
  M5M_RS02630 (M5M_02635) - 583951..585084 (+) 1134 WP_015045917.1 hypothetical protein -
  M5M_RS19280 (M5M_02640) - 585087..586355 (+) 1269 WP_015045918.1 FkbM family methyltransferase -
  M5M_RS02640 (M5M_02645) - 586425..587642 (+) 1218 WP_024330260.1 exostosin domain-containing protein -
  M5M_RS02645 (M5M_02650) - 587639..588664 (+) 1026 WP_015045920.1 hypothetical protein -
  M5M_RS02650 (M5M_02655) - 588677..589678 (+) 1002 WP_016389182.1 GSCFA domain-containing protein -
  M5M_RS02655 (M5M_02660) - 589656..590693 (+) 1038 WP_144062370.1 hypothetical protein -
  M5M_RS02660 (M5M_02665) gmd 590761..591876 (+) 1116 WP_015045923.1 GDP-mannose 4,6-dehydratase -
  M5M_RS02665 (M5M_02670) fcl 591932..592891 (+) 960 WP_016389183.1 GDP-L-fucose synthase -
  M5M_RS02670 (M5M_02675) - 592897..593352 (+) 456 WP_015045925.1 GDP-mannose mannosyl hydrolase -
  M5M_RS02675 (M5M_02680) - 593354..594763 (+) 1410 WP_015045926.1 mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase -
  M5M_RS02680 (M5M_02685) - 594776..595663 (+) 888 WP_015045927.1 hypothetical protein -
  M5M_RS02685 (M5M_02690) - 595792..596685 (+) 894 WP_015045928.1 glycosyltransferase family 10 domain-containing protein -
  M5M_RS02690 (M5M_02695) - 596733..597899 (+) 1167 WP_015045929.1 nucleotide sugar dehydrogenase -
  M5M_RS02695 (M5M_02700) rfbB 598026..599039 (+) 1014 WP_015045930.1 dTDP-glucose 4,6-dehydratase -
  M5M_RS02700 (M5M_02705) rfbA 599041..599913 (+) 873 WP_015045931.1 glucose-1-phosphate thymidylyltransferase RfbA -
  M5M_RS02705 (M5M_02710) rfbC 599925..600473 (+) 549 WP_015045932.1 dTDP-4-dehydrorhamnose 3,5-epimerase -
  M5M_RS19285 (M5M_02715) - 600445..603315 (+) 2871 WP_144062371.1 glycosyltransferase -
  M5M_RS02715 (M5M_02720) - 603328..604275 (+) 948 WP_081640091.1 sulfotransferase family 2 domain-containing protein -
  M5M_RS02720 (M5M_02725) - 604299..606251 (-) 1953 WP_081640092.1 polysaccharide biosynthesis protein -
  M5M_RS02725 (M5M_02730) - 606205..607218 (-) 1014 WP_015045936.1 MraY family glycosyltransferase -
  M5M_RS02730 (M5M_02735) - 607212..608150 (-) 939 WP_015045937.1 NAD-dependent epimerase/dehydratase family protein -
  M5M_RS02735 (M5M_02740) - 608305..608745 (+) 441 WP_015045938.1 REP-associated tyrosine transposase -

Sequence


Protein


Download         Length: 69 a.a.        Molecular weight: 7291.35 Da        Isoelectric Point: 9.8677

>NTDB_id=54078 M5M_RS02545 WP_016389179.1 567891..568100(+) (comEA) [Simiduia agarivorans SA1 = DSM 21679]
MAQTSTVNINTASAEELSASLKGIGTSKAQAIVDYREKVGKFVSIDQLAEVKGIGAATVEKNRALIRLQ

Nucleotide


Download         Length: 210 bp        

>NTDB_id=54078 M5M_RS02545 WP_016389179.1 567891..568100(+) (comEA) [Simiduia agarivorans SA1 = DSM 21679]
GTGGCACAGACCTCTACTGTAAATATCAACACCGCCAGCGCGGAGGAGTTGTCGGCCTCCCTTAAGGGGATTGGCACATC
TAAAGCGCAAGCCATTGTAGATTACCGGGAAAAGGTGGGTAAGTTTGTCTCCATCGATCAGCTTGCTGAAGTTAAGGGTA
TAGGCGCTGCCACGGTTGAAAAAAACCGCGCGCTTATCCGGCTGCAGTAA

Domains


Predicted by InterproScan.

(4-67)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Vibrio cholerae C6706

69.841

91.304

0.638

  comEA Vibrio cholerae strain A1552

69.841

91.304

0.638

  comEA/comE1 Glaesserella parasuis strain SC1401

61.905

91.304

0.565

  comEA Vibrio parahaemolyticus RIMD 2210633

60.317

91.304

0.551

  comE1/comEA Haemophilus influenzae Rd KW20

63.158

82.609

0.522

  comEA Acinetobacter baylyi ADP1

57.895

82.609

0.478

  comEA Vibrio campbellii strain DS40M4

52.381

91.304

0.478

  comEA Acinetobacter baumannii D1279779

49.231

94.203

0.464

  Cj0011c Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819

58.182

79.71

0.464

  comEA/celA/cilE Streptococcus mitis NCTC 12261

56.364

79.71

0.449

  comE Neisseria gonorrhoeae MS11

54.386

82.609

0.449

  comE Neisseria gonorrhoeae MS11

54.386

82.609

0.449

  comE Neisseria gonorrhoeae MS11

54.386

82.609

0.449

  comE Neisseria gonorrhoeae MS11

54.386

82.609

0.449

  comEA Acinetobacter baumannii strain A118

47.692

94.203

0.449

  comEA Thermus thermophilus HB27

45.455

95.652

0.435

  comEA Legionella pneumophila str. Paris

52.632

82.609

0.435

  comEA Legionella pneumophila strain ERS1305867

52.632

82.609

0.435

  comEA/celA/cilE Streptococcus pneumoniae R6

49.18

88.406

0.435

  comEA/celA/cilE Streptococcus pneumoniae Rx1

49.18

88.406

0.435

  comEA/celA/cilE Streptococcus pneumoniae D39

49.18

88.406

0.435

  comEA/celA/cilE Streptococcus pneumoniae TIGR4

54.545

79.71

0.435

  comEA/celA/cilE Streptococcus mitis SK321

54.545

79.71

0.435

  comEA Bacillus subtilis subsp. subtilis str. 168

46.032

91.304

0.42

  comEA Staphylococcus aureus N315

50

84.058

0.42

  comEA Staphylococcus aureus MW2

50

84.058

0.42

  comEA Lactococcus lactis subsp. cremoris KW2

52.727

79.71

0.42

  comA Synechocystis sp. PCC 6803

45

86.957

0.391

  comEA Latilactobacillus sakei subsp. sakei 23K

39.683

91.304

0.362

  comEA Streptococcus thermophilus LMD-9

43.86

82.609

0.362


Multiple sequence alignment