Javascript must be enabled to continue!

Counterfactual Shapley Values for Explaining Reinforcement Learning

Abstract This paper introduces an approach based on Counterfactual Shapley Values, which enhances explainability in reinforcement learning by integrating counterfactual analysis with Shapley Values. The approach aims to quantify and compare the contributions of different state dimensions to various action choices. To more accurately analyze the impacts of these contributions, we introduce new characteristic value functions, the Counterfactual Difference based Characteristic Value functions and the Average Counterfactual Difference based Characteristic Value functions. These functions help to evaluate the differences in contributions between optimal and non-optimal actions. Experiments across several RL domains, such as GridWorld, FrozenLake, and Taxi, demonstrate the effectiveness of the Counterfactual Shapley Values method. The results show that this method not only improves transparency in complex RL systems but also quantifies the differences across various decisions.

Springer Nature Switzerland

Yiwei Shi Weiru Liu

Communications in Computer and Information Science

2025

Title: Counterfactual Shapley Values for Explaining Reinforcement Learning

Description:

Abstract This paper introduces an approach based on Counterfactual Shapley Values, which enhances explainability in reinforcement learning by integrating counterfactual analysis with Shapley Values.

The approach aims to quantify and compare the contributions of different state dimensions to various action choices.

To more accurately analyze the impacts of these contributions, we introduce new characteristic value functions, the Counterfactual Difference based Characteristic Value functions and the Average Counterfactual Difference based Characteristic Value functions.

These functions help to evaluate the differences in contributions between optimal and non-optimal actions.

Experiments across several RL domains, such as GridWorld, FrozenLake, and Taxi, demonstrate the effectiveness of the Counterfactual Shapley Values method.

The results show that this method not only improves transparency in complex RL systems but also quantifies the differences across various decisions.

Back

The paper presents the results of experimental studies of the strength of cross-shaped welded joints of types К1-Кт and К3-Рр [1] of thermomechanically hardened reinforcement of cl...

Counterfactual Examples for Data Augmentation: A Case Study

Counterfactual explanations are gaining in popularity as a way of explaining machine learning models. Counterfactual examples are generally created to help interpret the decision o...

CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021

The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...

Data Augmentation using Counterfactuals: Proximity vs Diversity

Counterfactual explanations are gaining in popularity as a way of explaining machine learning models. Counterfactual examples are generally created to help interpret the decision o...

Fiber reinforcement as an alternative to the compressed zone linear reinforcement and the flexible concrete elements stretched zone prestressing

Abstract The results of a numerical experiment in the framework of a theoretical study of the strength and crack resistance of the reinforced concrete beams availabl...

Downward counterfactual insights into weather extremes

<p>There are many regions where the duration of reliable scientific observations of key weather hazard variables, such as rainfall and wind speed, is of the order of ...

Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach

We examine counterfactual explanations for explaining the decisions made by model-based AI systems. The counterfactual approach we consider defines an explanation as a set of the s...

A Shapley-érték komplexitása és becslése

A kooperatív játékelmélet számos társadalmi dilemma illetve pénzügyi és gazdasági probléma modellje. Általában akkor alkalmazzuk, ha egy közösség által elérhető eredmény meghaladja...

Email:
Password:

Email:

Counterfactual Shapley Values for Explaining Reinforcement Learning

Related Results