Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Provenance for distributed biomedical workflow execution

View through CrossRef
Scientific research has become very data and compute intensive because of the progress in data acquisition and measurement devices, which is particularly true in Life Sciences. To cope with this deluge of data, scientists use distributed computing and storage infrastructures. The use of such infrastructures introduces by itself new challenges to the scientists in terms of proper and efficient use. Scientific workflow management systems play an important role in facilitating the use of the infrastructure by hiding some of its complexity. Althought most scientific workflow management systems are provenance-aware, not all of them come with provenance functionality out of the box. In this paper we describe the improvement and integration of a provenance system into an e-infrastructure for biomedical research based on the MOTEUR workflow management system. The main contributions of the paper are: presenting an OPM implementation using relational database backend for the provenance store, providing an e-infrastructure with a comprehensive provenance system, defining a generic approach to provenance implementation, potentially suitable for other workflow systems and application domains and demonstrating the value of this system based on use cases presenting the provenance data through a user-friendly web interface.
Title: Provenance for distributed biomedical workflow execution
Description:
Scientific research has become very data and compute intensive because of the progress in data acquisition and measurement devices, which is particularly true in Life Sciences.
To cope with this deluge of data, scientists use distributed computing and storage infrastructures.
The use of such infrastructures introduces by itself new challenges to the scientists in terms of proper and efficient use.
Scientific workflow management systems play an important role in facilitating the use of the infrastructure by hiding some of its complexity.
Althought most scientific workflow management systems are provenance-aware, not all of them come with provenance functionality out of the box.
In this paper we describe the improvement and integration of a provenance system into an e-infrastructure for biomedical research based on the MOTEUR workflow management system.
The main contributions of the paper are: presenting an OPM implementation using relational database backend for the provenance store, providing an e-infrastructure with a comprehensive provenance system, defining a generic approach to provenance implementation, potentially suitable for other workflow systems and application domains and demonstrating the value of this system based on use cases presenting the provenance data through a user-friendly web interface.

Related Results

BioWorkbench: a high-performance framework for managing and analyzing bioinformatics experiments
BioWorkbench: a high-performance framework for managing and analyzing bioinformatics experiments
Advances in sequencing techniques have led to exponential growth in biological data, demanding the development of large-scale bioinformatics experiments. Because these experiments ...
Towards Next Generation Provenance Systems for E-Science
Towards Next Generation Provenance Systems for E-Science
e-Science helps scientists to automate scientific discovery processes and experiments, and promote collaboration across organizational boundaries and disciplines. These experiments...
EDQWS: an enhanced divide and conquer algorithm for workflow scheduling in cloud
EDQWS: an enhanced divide and conquer algorithm for workflow scheduling in cloud
AbstractA workflow is an effective way for modeling complex applications and serves as a means for scientists and researchers to better understand the details of applications. Clou...
Interoperability of Cross-organizational Workflows based on Process-view for Collaborative Product Development
Interoperability of Cross-organizational Workflows based on Process-view for Collaborative Product Development
Collaborative product development (CPD) has been widely accepted as an advanced collaboration paradigm that combines geographically distributed product development teams to develop...
Optimizing Emergency Department Workflow Using Radio Frequency Identification Device (RFID) Data Analytics
Optimizing Emergency Department Workflow Using Radio Frequency Identification Device (RFID) Data Analytics
Emergency Department (ED) is a complex care delivery environment in a hospital that provides time sensitive urgent and lifesaving care [1]. Emergency medicine is an unscheduled pra...
Quantification of Regression Test Suite Execution Time in Parallel Execution Setup with Weighted Test Suite Split Algorithm
Quantification of Regression Test Suite Execution Time in Parallel Execution Setup with Weighted Test Suite Split Algorithm
Regression test suite execution time study focus is essentially on two aspects. They are execution time reduction and making effective use of available hardware resources and manpo...
Quantification of Regression Test Suite Execution Time in Parallel Execution Setup with Weighted Test Suite Split Algorithm
Quantification of Regression Test Suite Execution Time in Parallel Execution Setup with Weighted Test Suite Split Algorithm
Regression test suite execution time study focus is essentially on two aspects. They are execution time reduction and making effective use of available hardware resources and manpo...
Market orientations, product innovation and organizational performance: A case study on selected beer factories found in Ethiopia
Market orientations, product innovation and organizational performance: A case study on selected beer factories found in Ethiopia
Abstract Overview was led to explore the relationship between advertise direction, creation process, item execution, authoritative execution and budgetary execution. The mo...

Back to Top