Javascript must be enabled to continue!
Improving the Weighting Strategy in KernelSHAP
View through CrossRef
Abstract
In Explainable AI (XAI), Shapley values are a popular model-agnostic framework for explaining predictions made by complex machine learning models. The computation of Shapley values requires estimating non-trivial contribution functions representing predictions with only a subset of the features present. As the number of these terms grows exponentially with the number of features, computational costs escalate rapidly, creating a pressing need for efficient and accurate approximation methods. For tabular data, the framework is considered the state-of-the-art model-agnostic approximation framework. approximates the Shapley values using a weighted sample of the contribution functions for different feature subsets. We propose a novel modification of which replaces the stochastic weights with deterministic ones to reduce the variance of the resulting Shapley value approximations. This may also be combined with our simple, yet effective modification to the variant implemented in the popular Python library . Additionally, we provide an overview of established methods. Numerical experiments demonstrate that our methods can reduce the required number of contribution function evaluations by
$$5\%$$
5
%
to
$$50\%$$
50
%
while preserving the same accuracy of the approximated Shapley values – essentially reducing the running time by up to
$$50\%$$
50
%
. These computational advancements push the boundaries of the feature dimensionality and number of predictions that can be accurately explained with Shapley values within a feasible runtime.
Springer Nature Switzerland
Title: Improving the Weighting Strategy in KernelSHAP
Description:
Abstract
In Explainable AI (XAI), Shapley values are a popular model-agnostic framework for explaining predictions made by complex machine learning models.
The computation of Shapley values requires estimating non-trivial contribution functions representing predictions with only a subset of the features present.
As the number of these terms grows exponentially with the number of features, computational costs escalate rapidly, creating a pressing need for efficient and accurate approximation methods.
For tabular data, the framework is considered the state-of-the-art model-agnostic approximation framework.
approximates the Shapley values using a weighted sample of the contribution functions for different feature subsets.
We propose a novel modification of which replaces the stochastic weights with deterministic ones to reduce the variance of the resulting Shapley value approximations.
This may also be combined with our simple, yet effective modification to the variant implemented in the popular Python library .
Additionally, we provide an overview of established methods.
Numerical experiments demonstrate that our methods can reduce the required number of contribution function evaluations by
$$5\%$$
5
%
to
$$50\%$$
50
%
while preserving the same accuracy of the approximated Shapley values – essentially reducing the running time by up to
$$50\%$$
50
%
.
These computational advancements push the boundaries of the feature dimensionality and number of predictions that can be accurately explained with Shapley values within a feasible runtime.
Related Results
Association on Supervised Term Weighting Method for Classification on Data Twitter
Association on Supervised Term Weighting Method for Classification on Data Twitter
Term weighting is a preprocessing phase that has an important role in the text classification by giving the appropriate weight for each term in all documents. In previous research,...
The Evaluation of Interblock Mobility Using a Modified Midpoint Weighting Scheme
The Evaluation of Interblock Mobility Using a Modified Midpoint Weighting Scheme
ABSTRACT
A modified midpoint weighting scheme is a technique which can be used for increasing the accuracy and stability of finite difference numerical simulations. ...
Shape and Weighting Optimization of a Subarray for an mm-Wave Phased Array Antenna
Shape and Weighting Optimization of a Subarray for an mm-Wave Phased Array Antenna
This paper discusses how to optimize the weighting of individual subarrays to derive the low sidelobe level (SLL) based on quadratic programming (QP) and how to derive QP parameter...
Analisa Pemborosan Pada Proses Produksi Air Minum Dalam Kemasan Dengan Metode Value Stream Mapping Dan Waste Assesment Model Di CV Dafista Mulia
Analisa Pemborosan Pada Proses Produksi Air Minum Dalam Kemasan Dengan Metode Value Stream Mapping Dan Waste Assesment Model Di CV Dafista Mulia
CV Dafista Mulia is a drinking water producer company located in Lamongan Regency. The problem that is often faced by companies is the amount of waste that occurs in the process of...
Comparison of A-weighted sound pressure level (dB(A)), loudness-level weighted sound pressure level (dB(EQL)), and loudness with respect to environmental noise assessment.
Comparison of A-weighted sound pressure level (dB(A)), loudness-level weighted sound pressure level (dB(EQL)), and loudness with respect to environmental noise assessment.
A pair of related tools useful for environmental noise assessment are time-varying psychoacoustic loudness, and an expression of sound pressure level in dB[SPL] weighted dynamicall...
INDICATORS OF INFORMATION FEATURES FOR RECOGNISING THE STATE OF SOURCES AND OBJECTS OF TELECOMMUNICATION NETWORKS AND SYSTEMS
INDICATORS OF INFORMATION FEATURES FOR RECOGNISING THE STATE OF SOURCES AND OBJECTS OF TELECOMMUNICATION NETWORKS AND SYSTEMS
Background. The majority of modern procedures for the recognition of radio sources and objects are based on the use of binary and multivalued logic, which have low specific feature...
L-Factors and Adjacent Vertex-Distinguishing Edge-Weighting
L-Factors and Adjacent Vertex-Distinguishing Edge-Weighting
An edge-weighting problem of a graph G is an assignment of an integer
weight to each edge e. Based on an edge-weighting problem, several types of vertex-coloring
problems are put f...
TEKNIK SUBSTRING WEIGHTING UNTUK KONVERSI MATA KULIAH
TEKNIK SUBSTRING WEIGHTING UNTUK KONVERSI MATA KULIAH
[Id]
Konversi mata kuliah adalah suatu pekerjaan yang mungkin membutuhkan waktu lebih bila dilakukan secara manual. Pemindahan data transkrip menjadi data digital membutuhkan...

