Javascript must be enabled to continue!
Improving the Weighting Strategy in KernelSHAP
View through CrossRef
Abstract
In Explainable AI (XAI), Shapley values are a popular model-agnostic framework for explaining predictions made by complex machine learning models. The computation of Shapley values requires estimating non-trivial contribution functions representing predictions with only a subset of the features present. As the number of these terms grows exponentially with the number of features, computational costs escalate rapidly, creating a pressing need for efficient and accurate approximation methods. For tabular data, the framework is considered the state-of-the-art model-agnostic approximation framework. approximates the Shapley values using a weighted sample of the contribution functions for different feature subsets. We propose a novel modification of which replaces the stochastic weights with deterministic ones to reduce the variance of the resulting Shapley value approximations. This may also be combined with our simple, yet effective modification to the variant implemented in the popular Python library . Additionally, we provide an overview of established methods. Numerical experiments demonstrate that our methods can reduce the required number of contribution function evaluations by
$$5\%$$
5
%
to
$$50\%$$
50
%
while preserving the same accuracy of the approximated Shapley values – essentially reducing the running time by up to
$$50\%$$
50
%
. These computational advancements push the boundaries of the feature dimensionality and number of predictions that can be accurately explained with Shapley values within a feasible runtime.
Springer Nature Switzerland
Title: Improving the Weighting Strategy in KernelSHAP
Description:
Abstract
In Explainable AI (XAI), Shapley values are a popular model-agnostic framework for explaining predictions made by complex machine learning models.
The computation of Shapley values requires estimating non-trivial contribution functions representing predictions with only a subset of the features present.
As the number of these terms grows exponentially with the number of features, computational costs escalate rapidly, creating a pressing need for efficient and accurate approximation methods.
For tabular data, the framework is considered the state-of-the-art model-agnostic approximation framework.
approximates the Shapley values using a weighted sample of the contribution functions for different feature subsets.
We propose a novel modification of which replaces the stochastic weights with deterministic ones to reduce the variance of the resulting Shapley value approximations.
This may also be combined with our simple, yet effective modification to the variant implemented in the popular Python library .
Additionally, we provide an overview of established methods.
Numerical experiments demonstrate that our methods can reduce the required number of contribution function evaluations by
$$5\%$$
5
%
to
$$50\%$$
50
%
while preserving the same accuracy of the approximated Shapley values – essentially reducing the running time by up to
$$50\%$$
50
%
.
These computational advancements push the boundaries of the feature dimensionality and number of predictions that can be accurately explained with Shapley values within a feasible runtime.
Related Results
Association on Supervised Term Weighting Method for Classification on Data Twitter
Association on Supervised Term Weighting Method for Classification on Data Twitter
Term weighting is a preprocessing phase that has an important role in the text classification by giving the appropriate weight for each term in all documents. In previous research,...
The Extension of the Variable-Flux-Weighting Method to Multidimensional Problems Including Gravity and Capillarity
The Extension of the Variable-Flux-Weighting Method to Multidimensional Problems Including Gravity and Capillarity
ABSTRACT
The variable-flux-weighting method has been shown to largely eliminate numerical diffusion at saturation discontinuities in two-phase flow in two-dimensiona...
Sistem Informasi untuk Menentukan Potensi Mahasiswa Menggunakan Metode Fuzzy Multiple Attribute Decision Making dengan Simple Additive Weighting Studi Kasus : STIKES DUTA GAMA KLATEN
Sistem Informasi untuk Menentukan Potensi Mahasiswa Menggunakan Metode Fuzzy Multiple Attribute Decision Making dengan Simple Additive Weighting Studi Kasus : STIKES DUTA GAMA KLATEN
Dalam kegiatan belajar dan mengajar lingkungan akademisi tentunya menghasilkan peserta didik yang memiliki keunggulan diatas rata – rata. Keunggulan mahasiswa dalam sebuah lingkung...
Optimal Weighting of Preclinical Alzheimer’s Cognitive Composite (PACC) Scales to Improve their Performance as Outcome Measures for Alzheimer’s Disease Clinical Trials
Optimal Weighting of Preclinical Alzheimer’s Cognitive Composite (PACC) Scales to Improve their Performance as Outcome Measures for Alzheimer’s Disease Clinical Trials
Introduction: Cognitive composite scales constructed by combining existing neuropsychometric tests are seeing wide application as endpoints for clinical trials and cohort studies o...
Shapley Value-Based Carbon Emission Responsibility Allocation in Power Systems with Energy Storage
Shapley Value-Based Carbon Emission Responsibility Allocation in Power Systems with Energy Storage
Energy storage systems (ESS) are essential for power system decarbonization, yet their carbon emission responsibilities remain ambiguous under market operations due to their unique...
Shapley Value-Based Carbon Emission Responsibility Allocation in Power Systems with Energy Storage
Shapley Value-Based Carbon Emission Responsibility Allocation in Power Systems with Energy Storage
Energy storage systems (ESS) are essential for power system decarbonization, yet their carbon emission responsibilities remain ambiguous under market operations due to their unique...
The Evaluation of Interblock Mobility Using a Modified Midpoint Weighting Scheme
The Evaluation of Interblock Mobility Using a Modified Midpoint Weighting Scheme
ABSTRACT
A modified midpoint weighting scheme is a technique which can be used for increasing the accuracy and stability of finite difference numerical simulations. ...
Shape and Weighting Optimization of a Subarray for an mm-Wave Phased Array Antenna
Shape and Weighting Optimization of a Subarray for an mm-Wave Phased Array Antenna
This paper discusses how to optimize the weighting of individual subarrays to derive the low sidelobe level (SLL) based on quadratic programming (QP) and how to derive QP parameter...

