Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Counterfactual Shapley Values for Explaining Reinforcement Learning

View through CrossRef
Abstract This paper introduces an approach based on Counterfactual Shapley Values, which enhances explainability in reinforcement learning by integrating counterfactual analysis with Shapley Values. The approach aims to quantify and compare the contributions of different state dimensions to various action choices. To more accurately analyze the impacts of these contributions, we introduce new characteristic value functions, the Counterfactual Difference based Characteristic Value functions and the Average Counterfactual Difference based Characteristic Value functions. These functions help to evaluate the differences in contributions between optimal and non-optimal actions. Experiments across several RL domains, such as GridWorld, FrozenLake, and Taxi, demonstrate the effectiveness of the Counterfactual Shapley Values method. The results show that this method not only improves transparency in complex RL systems but also quantifies the differences across various decisions.
Title: Counterfactual Shapley Values for Explaining Reinforcement Learning
Description:
Abstract This paper introduces an approach based on Counterfactual Shapley Values, which enhances explainability in reinforcement learning by integrating counterfactual analysis with Shapley Values.
The approach aims to quantify and compare the contributions of different state dimensions to various action choices.
To more accurately analyze the impacts of these contributions, we introduce new characteristic value functions, the Counterfactual Difference based Characteristic Value functions and the Average Counterfactual Difference based Characteristic Value functions.
These functions help to evaluate the differences in contributions between optimal and non-optimal actions.
Experiments across several RL domains, such as GridWorld, FrozenLake, and Taxi, demonstrate the effectiveness of the Counterfactual Shapley Values method.
The results show that this method not only improves transparency in complex RL systems but also quantifies the differences across various decisions.

Related Results

STRENGTH OF BUTT WELDED BUTT JOINT OF REINFORCEMENT OF CLASS A500C
STRENGTH OF BUTT WELDED BUTT JOINT OF REINFORCEMENT OF CLASS A500C
The paper presents the results of experimental studies of the strength of cross-shaped welded joints of types К1-Кт and К3-Рр [1] of thermomechanically hardened reinforcement of cl...
Counterfactual Examples for Data Augmentation: A Case Study
Counterfactual Examples for Data Augmentation: A Case Study
Counterfactual explanations are gaining in popularity as a way of explaining machine learning models. Counterfactual examples are generally created to help interpret the decision o...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Data Augmentation using Counterfactuals: Proximity vs Diversity
Data Augmentation using Counterfactuals: Proximity vs Diversity
Counterfactual explanations are gaining in popularity as a way of explaining machine learning models. Counterfactual examples are generally created to help interpret the decision o...
Downward counterfactual insights into weather extremes
Downward counterfactual insights into weather extremes
<p>There are many regions where the duration of reliable scientific observations of key weather hazard variables, such as rainfall and wind speed, is of the order of ...
Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach
Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach
We examine counterfactual explanations for explaining the decisions made by model-based AI systems. The counterfactual approach we consider defines an explanation as a set of the s...
A Shapley-érték komplexitása és becslése
A Shapley-érték komplexitása és becslése
A kooperatív játékelmélet számos társadalmi dilemma illetve pénzügyi és gazdasági probléma modellje. Általában akkor alkalmazzuk, ha egy közösség által elérhető eredmény meghaladja...

Back to Top