Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Counterfactual Shapley Values for Explaining Reinforcement Learning

View through CrossRef
Abstract This paper introduces an approach based on Counterfactual Shapley Values, which enhances explainability in reinforcement learning by integrating counterfactual analysis with Shapley Values. The approach aims to quantify and compare the contributions of different state dimensions to various action choices. To more accurately analyze the impacts of these contributions, we introduce new characteristic value functions, the Counterfactual Difference based Characteristic Value functions and the Average Counterfactual Difference based Characteristic Value functions. These functions help to evaluate the differences in contributions between optimal and non-optimal actions. Experiments across several RL domains, such as GridWorld, FrozenLake, and Taxi, demonstrate the effectiveness of the Counterfactual Shapley Values method. The results show that this method not only improves transparency in complex RL systems but also quantifies the differences across various decisions.
Title: Counterfactual Shapley Values for Explaining Reinforcement Learning
Description:
Abstract This paper introduces an approach based on Counterfactual Shapley Values, which enhances explainability in reinforcement learning by integrating counterfactual analysis with Shapley Values.
The approach aims to quantify and compare the contributions of different state dimensions to various action choices.
To more accurately analyze the impacts of these contributions, we introduce new characteristic value functions, the Counterfactual Difference based Characteristic Value functions and the Average Counterfactual Difference based Characteristic Value functions.
These functions help to evaluate the differences in contributions between optimal and non-optimal actions.
Experiments across several RL domains, such as GridWorld, FrozenLake, and Taxi, demonstrate the effectiveness of the Counterfactual Shapley Values method.
The results show that this method not only improves transparency in complex RL systems but also quantifies the differences across various decisions.

Related Results

Data Augmentation using Counterfactuals: Proximity vs Diversity
Data Augmentation using Counterfactuals: Proximity vs Diversity
Counterfactual explanations are gaining in popularity as a way of explaining machine learning models. Counterfactual examples are generally created to help interpret the decision o...
Downward counterfactual insights into weather extremes
Downward counterfactual insights into weather extremes
<p>There are many regions where the duration of reliable scientific observations of key weather hazard variables, such as rainfall and wind speed, is of the order of ...
A Shapley-érték komplexitása és becslése
A Shapley-érték komplexitása és becslése
A kooperatív játékelmélet számos társadalmi dilemma illetve pénzügyi és gazdasági probléma modellje. Általában akkor alkalmazzuk, ha egy közösség által elérhető eredmény meghaladja...
Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach
Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach
We examine counterfactual explanations for explaining the decisions made by model-based AI systems. The counterfactual approach we consider defines an explanation as a set of the s...
The Effect of Compression Reinforcement on the Shear Behavior of Concrete Beams with Hybrid Reinforcement
The Effect of Compression Reinforcement on the Shear Behavior of Concrete Beams with Hybrid Reinforcement
Abstract This study examines the impact of steel compression reinforcement on the shear behavior of concrete beams reinforced with glass fiber reinforced polymer (GFRP) bar...
Improving the Weighting Strategy in KernelSHAP
Improving the Weighting Strategy in KernelSHAP
Abstract In Explainable AI (XAI), Shapley values are a popular model-agnostic framework for explaining predictions made by complex machine learning models. The computatio...
Study on Scheme Optimization of bridge reinforcement increasing ratio
Study on Scheme Optimization of bridge reinforcement increasing ratio
Abstract The bridge reinforcement methods, each method has its advantages and disadvantages. The load-bearing capacity of bridge members is controlled by the ultimat...
The Asymmetry of Counterfactual Dependence
The Asymmetry of Counterfactual Dependence
A certain type of counterfactual is thought to be intimately related to causation, control, and explanation. The time asymmetry of these phenomena therefore plausibly arises from a...

Back to Top