• Yassene Mohammed
  • Shayan Shahand
  • Vladimir Korkhov
  • Angela C.M. Luyf
  • Barbera D.C. Van Schaik
  • Matthan W.A. Caan
  • Antoine H.C. Van Kampen
  • Magnus Palmblad
  • Silvia D. Olabarriaga

As the focus of e-Science is moving toward the forth paradigm and data intensive science, data access remains dependent on the architecture of the used e-Science infrastructure. Such architecture is in general job-driven, i.e., a (grid) job is a sequence of commands that run on the same worker node. Making use of the infrastructure involves having a parallelized application. This is done foremost by data decomposition. In general practice of parallel programming, data decomposition depends on the programmer's experience and knowledge about the used data and the algorithm/application. On the other hand, data mining scientists have an established foundation for data decomposition, automatic decomposition methods are already in use, methodologies and patterns are defined. Our experience in porting biomedical applications to the Dutch e-Science infrastructure shows that the used data decomposition to gain parallelism fit to some degree a subgroup of the data mining decomposition patterns, i.e., object set decomposition. In this paper we discuss porting three biomedical packages to a grid computing environment, two for medical imaging and one for DNA sequencing. We show how the data access of the applications was reengineered around the executables to make use of the parallel capacity of e-Science infrastructure.

Original languageEnglish
Title of host publicationProceedings - 7th IEEE International Conference on e-Science Workshops, eScienceW 2011
Pages158-165
Number of pages8
DOIs
StatePublished - 1 Dec 2011
Event7th IEEE International Conference on e-Science Workshops, eScienceW 201 - Stockholm, Sweden
Duration: 5 Dec 20118 Dec 2011

Publication series

NameProceedings - 7th IEEE International Conference on e-Science Workshops, eScienceW 2011

Conference

Conference7th IEEE International Conference on e-Science Workshops, eScienceW 201
Country/TerritorySweden
CityStockholm
Period5/12/118/12/11

    Scopus subject areas

  • Computer Networks and Communications
  • Management Science and Operations Research

    Research areas

  • data decomposition, e-infrastructure, grid Computing, Healthgrid, legacy application, porting to grid, workflows

ID: 53076195