
Towards Next Generation Provenance Systems for E-Science

e-Science helps scientists automate scientific discovery processes and experiments and promotes collaboration across organizational boundaries and disciplines. These experiments involve data discovery, knowledge discovery, integration, linking, and analysis through different software tools and activities. A scientific workflow is one technique through which such activities and processes can be interlinked, automated, and ultimately shared among collaborating scientists. Workflows are realized by a workflow enactment engine, which interprets the process definition and interacts with the workflow participants. Because workflows are typically executed on shared, distributed infrastructure, information about the workflow activities, the data processed, and the results generated (collectively known as provenance) needs to be recorded so that experiments can be reproduced and reused. A range of solutions and techniques has been suggested for the provenance of data collection and analysis; however, these predominantly depend on a specific workflow enactment engine and domain. This paper presents a taxonomy of existing provenance techniques and a novel solution named VePS (The Vienna e-Science Provenance System) for e-Science provenance collection.

Related Results

Provenance for distributed biomedical workflow execution
Scientific research has become very data and compute intensive because of the progress in data acquisition and measurement devices, which is particularly true in Life Sciences. To ...
Provenance and Probabilities in Relational Databases
We review the basics of data provenance in relational databases. We describe different provenance formalisms, from Boolean provenance to provenance semirings and beyond, that can b...
The role of provenance in luxury textile brands
Purpose – The purpose of this paper is to analyse the role that provenance holds within the luxury textiles market. It defines similarities and differences in the p...
Geochemical characteristics of Neogene mudstone in Guanzhong Basin, recovery of provenance and paleo-sedimentary environment
At present, the geothermal resources developed and utilized in the Guanzhong Basin are mainly Cenozoic sandstone and glutenite pore-fissure geothermal resources, and the developme...
ANALYSIS OF THE OPERATION MODE OF THE SOLAR POWER PLANT
The article examines the load change schedule of the solar power plant in the Ukraine-Moldova energy union. The analysis of data averaged at minute and 15-minute intervals in the p...
BioWorkbench: a high-performance framework for managing and analyzing bioinformatics experiments
Advances in sequencing techniques have led to exponential growth in biological data, demanding the development of large-scale bioinformatics experiments. Because these experiments ...
Practical Extension of Provenance to Healthcare Data Based on the W3C PROV Standard
Secondary use of healthcare data is dependent on the availability of provenance data for assessing its quality, reliability or trustworthiness. Usually, instance-level data that mi...
An LLM-guided Platform for Multi-Granular Collection and Management of Data Provenance
As machine learning and AI systems become more prevalent, understanding how their decisions are made is key to maintaining their trust. To solve this problem, it i...
