Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Improving the Weighting Strategy in KernelSHAP

View through CrossRef
Abstract In Explainable AI (XAI), Shapley values are a popular model-agnostic framework for explaining predictions made by complex machine learning models. The computation of Shapley values requires estimating non-trivial contribution functions representing predictions with only a subset of the features present. As the number of these terms grows exponentially with the number of features, computational costs escalate rapidly, creating a pressing need for efficient and accurate approximation methods. For tabular data, the framework is considered the state-of-the-art model-agnostic approximation framework. approximates the Shapley values using a weighted sample of the contribution functions for different feature subsets. We propose a novel modification of which replaces the stochastic weights with deterministic ones to reduce the variance of the resulting Shapley value approximations. This may also be combined with our simple, yet effective modification to the variant implemented in the popular Python library . Additionally, we provide an overview of established methods. Numerical experiments demonstrate that our methods can reduce the required number of contribution function evaluations by $$5\%$$ 5 % to $$50\%$$ 50 % while preserving the same accuracy of the approximated Shapley values – essentially reducing the running time by up to $$50\%$$ 50 % . These computational advancements push the boundaries of the feature dimensionality and number of predictions that can be accurately explained with Shapley values within a feasible runtime.
Title: Improving the Weighting Strategy in KernelSHAP
Description:
Abstract In Explainable AI (XAI), Shapley values are a popular model-agnostic framework for explaining predictions made by complex machine learning models.
The computation of Shapley values requires estimating non-trivial contribution functions representing predictions with only a subset of the features present.
As the number of these terms grows exponentially with the number of features, computational costs escalate rapidly, creating a pressing need for efficient and accurate approximation methods.
For tabular data, the framework is considered the state-of-the-art model-agnostic approximation framework.
approximates the Shapley values using a weighted sample of the contribution functions for different feature subsets.
We propose a novel modification of which replaces the stochastic weights with deterministic ones to reduce the variance of the resulting Shapley value approximations.
This may also be combined with our simple, yet effective modification to the variant implemented in the popular Python library .
Additionally, we provide an overview of established methods.
Numerical experiments demonstrate that our methods can reduce the required number of contribution function evaluations by $$5\%$$ 5 % to $$50\%$$ 50 % while preserving the same accuracy of the approximated Shapley values – essentially reducing the running time by up to $$50\%$$ 50 % .
These computational advancements push the boundaries of the feature dimensionality and number of predictions that can be accurately explained with Shapley values within a feasible runtime.

Related Results

Association on Supervised Term Weighting Method for Classification on Data Twitter
Association on Supervised Term Weighting Method for Classification on Data Twitter
Term weighting is a preprocessing phase that has an important role in the text classification by giving the appropriate weight for each term in all documents. In previous research,...
The Extension of the Variable-Flux-Weighting Method to Multidimensional Problems Including Gravity and Capillarity
The Extension of the Variable-Flux-Weighting Method to Multidimensional Problems Including Gravity and Capillarity
ABSTRACT The variable-flux-weighting method has been shown to largely eliminate numerical diffusion at saturation discontinuities in two-phase flow in two-dimensiona...
Shapley Value-Based Carbon Emission Responsibility Allocation in Power Systems with Energy Storage
Shapley Value-Based Carbon Emission Responsibility Allocation in Power Systems with Energy Storage
Energy storage systems (ESS) are essential for power system decarbonization, yet their carbon emission responsibilities remain ambiguous under market operations due to their unique...
Shapley Value-Based Carbon Emission Responsibility Allocation in Power Systems with Energy Storage
Shapley Value-Based Carbon Emission Responsibility Allocation in Power Systems with Energy Storage
Energy storage systems (ESS) are essential for power system decarbonization, yet their carbon emission responsibilities remain ambiguous under market operations due to their unique...
The Evaluation of Interblock Mobility Using a Modified Midpoint Weighting Scheme
The Evaluation of Interblock Mobility Using a Modified Midpoint Weighting Scheme
ABSTRACT A modified midpoint weighting scheme is a technique which can be used for increasing the accuracy and stability of finite difference numerical simulations. ...
Shape and Weighting Optimization of a Subarray for an mm-Wave Phased Array Antenna
Shape and Weighting Optimization of a Subarray for an mm-Wave Phased Array Antenna
This paper discusses how to optimize the weighting of individual subarrays to derive the low sidelobe level (SLL) based on quadratic programming (QP) and how to derive QP parameter...

Back to Top