Javascript must be enabled to continue!
Data Augmentation using Counterfactuals: Proximity vs Diversity
View through CrossRef
Counterfactual explanations are gaining in popularity as a way of explaining machine learning models. Counterfactual examples are generally created to help interpret the decision of a model. In that case, if a model makes a certain decision for an instance, the counterfactual examples of that instance reverse the decision of the model. Counterfactual examples can be created by craftily changing particular feature values of the instance. Though counterfactual examples are generated to explain the decision of machine learning models, we have already explored that counterfactual examples can be used for effective data augmentation. In this work, we want to explore what kind of counterfactual examples work best for data augmentation. In particular, we want to generate counterfactual examples from two perspectives: proximity and diversity. We want to observewhich perspective works best in this regard. We demonstrate the efficacy of these approaches on the widely used “Adult-Income” dataset. We consider several scenarios where we do not have enough data and use each of these approaches to augment the dataset. We compare these two approaches and discuss the implications of the results.
University of Florida George A Smathers Libraries
Title: Data Augmentation using Counterfactuals: Proximity vs Diversity
Description:
Counterfactual explanations are gaining in popularity as a way of explaining machine learning models.
Counterfactual examples are generally created to help interpret the decision of a model.
In that case, if a model makes a certain decision for an instance, the counterfactual examples of that instance reverse the decision of the model.
Counterfactual examples can be created by craftily changing particular feature values of the instance.
Though counterfactual examples are generated to explain the decision of machine learning models, we have already explored that counterfactual examples can be used for effective data augmentation.
In this work, we want to explore what kind of counterfactual examples work best for data augmentation.
In particular, we want to generate counterfactual examples from two perspectives: proximity and diversity.
We want to observewhich perspective works best in this regard.
We demonstrate the efficacy of these approaches on the widely used “Adult-Income” dataset.
We consider several scenarios where we do not have enough data and use each of these approaches to augment the dataset.
We compare these two approaches and discuss the implications of the results.
Related Results
University-Industry Collaborations (UICs): A Matter of Proximity Dimensions?
University-Industry Collaborations (UICs): A Matter of Proximity Dimensions?
Firms and universities interact with each other despite several barriers hindering their collaboration, such as distances in their worldviews, organizational structures and cogniti...
The Uncertainty of Counterfactuals in Deep Learning
The Uncertainty of Counterfactuals in Deep Learning
Counterfactuals have become a useful tool for explainable Artificial Intelligence (XAI). Counterfactuals provide various perturbations to a data instance to yield an alternate clas...
The Effectiveness of Data Augmentation for Bone Suppression in Chest Radiograph using Convolutional Neural Network
The Effectiveness of Data Augmentation for Bone Suppression in Chest Radiograph using Convolutional Neural Network
Objective: Bone suppression of chest radiograph holds great promise to improve the localization accuracy in Image-Guided Radiation Therapy (IGRT). However, data scarcity has long b...
Diffusion Counterfactuals for Image Regressors
Diffusion Counterfactuals for Image Regressors
Abstract
Counterfactual explanations have been successfully applied to create human interpretable explanations for various black-box models. They are handy for tasks in t...
Text Data Augmentation for Deep Learning
Text Data Augmentation for Deep Learning
Abstract
Natural Language Processing (NLP) is one of the most captivating applications of Deep Learning. In this survey, we consider how the Data Augmentation training stra...
Enhancing Non-Formal Learning Certificate Classification with Text Augmentation: A Comparison of Character, Token, and Semantic Approaches
Enhancing Non-Formal Learning Certificate Classification with Text Augmentation: A Comparison of Character, Token, and Semantic Approaches
Aim/Purpose: The purpose of this paper is to address the gap in the recognition of prior learning (RPL) by automating the classification of non-formal learning certificates using d...
Back to the Present: How Not to Use Counterfactuals to Explain Causal Asymmetry
Back to the Present: How Not to Use Counterfactuals to Explain Causal Asymmetry
A plausible thought is that we should evaluate counterfactuals in the actual world by holding the present ‘fixed’; the state of the counterfactual world at the time of the antecede...
Silicone Implant Versus Silicone Implant Assisted by Stromal Enriched Lipograft Breast Augmentation: A Prospective Comparative Study
Silicone Implant Versus Silicone Implant Assisted by Stromal Enriched Lipograft Breast Augmentation: A Prospective Comparative Study
Background: Implant-assisted breast augmentation is among the most performed surgeries performed by plastic surgeons today. This prospective study evaluated the patient satisfactio...

