Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Utilizing data sampling techniques on algorithmic fairness for customer churn prediction with data imbalance problems

View through CrossRef
Background: Customer churn prediction (CCP) refers to detecting which customers are likely to cancel the services provided by a service provider, for example, internet services. The class imbalance problem (CIP) in machine learning occurs when there is a huge difference in the samples of positive class compared to the negative class. It is one of the major obstacles in CCP as it deteriorates performance in the classification process. Utilizing data sampling techniques (DSTs) helps to resolve the CIP to some extent. Methods: In this paper, we review the effect of using DSTs on algorithmic fairness, i.e., to investigate whether the results pose any discrimination between male and female groups and compare the results before and after using DSTs. Three real-world datasets with unequal balancing rates were prepared and four ubiquitous DSTs were applied to them. Six popular classification techniques were utilized in the classification process. Both classifier’s performance and algorithmic fairness are evaluated with notable metrics. Results: The results indicated that Random Forest classifier outperforms other classifiers in all three datasets and, using SMOTE and ADASYN techniques cause more discrimination in the female group. The rate of unintentional discrimination seems to be higher in the original data of extremely unbalanced datasets under the following classifiers: Logistics Regression, LightGBM, and XGBoost. Conclusions: Algorithmic fairness has become a broadly studied area in recent years, yet there is a very little systematic study on the effect of using DSTs on algorithmic fairness. This study presents important findings to further the use of algorithmic fairness in CCP research.
Title: Utilizing data sampling techniques on algorithmic fairness for customer churn prediction with data imbalance problems
Description:
Background: Customer churn prediction (CCP) refers to detecting which customers are likely to cancel the services provided by a service provider, for example, internet services.
The class imbalance problem (CIP) in machine learning occurs when there is a huge difference in the samples of positive class compared to the negative class.
It is one of the major obstacles in CCP as it deteriorates performance in the classification process.
Utilizing data sampling techniques (DSTs) helps to resolve the CIP to some extent.
Methods: In this paper, we review the effect of using DSTs on algorithmic fairness, i.
e.
, to investigate whether the results pose any discrimination between male and female groups and compare the results before and after using DSTs.
Three real-world datasets with unequal balancing rates were prepared and four ubiquitous DSTs were applied to them.
Six popular classification techniques were utilized in the classification process.
Both classifier’s performance and algorithmic fairness are evaluated with notable metrics.
Results: The results indicated that Random Forest classifier outperforms other classifiers in all three datasets and, using SMOTE and ADASYN techniques cause more discrimination in the female group.
The rate of unintentional discrimination seems to be higher in the original data of extremely unbalanced datasets under the following classifiers: Logistics Regression, LightGBM, and XGBoost.
Conclusions: Algorithmic fairness has become a broadly studied area in recent years, yet there is a very little systematic study on the effect of using DSTs on algorithmic fairness.
This study presents important findings to further the use of algorithmic fairness in CCP research.

Related Results

Algorithmic Individual Fairness and Healthcare: A Scoping Review
Algorithmic Individual Fairness and Healthcare: A Scoping Review
AbstractObjectiveStatistical and artificial intelligence algorithms are increasingly being developed for use in healthcare. These algorithms may reflect biases that magnify dispari...
Utilizing data sampling techniques on algorithmic fairness for customer churn prediction with data imbalance problems
Utilizing data sampling techniques on algorithmic fairness for customer churn prediction with data imbalance problems
Background: Customer churn prediction (CCP) refers to detecting which customers are likely to cancel the services provided by a service provider, for example, internet services. Th...
Application of Machine Learning Techniques for Customer Churn Prediction in the Banking Sector
Application of Machine Learning Techniques for Customer Churn Prediction in the Banking Sector
Aim/Purpose: Previous studies have primarily focused on comparing predictive models without considering the impact of data preprocessing on model performance. Therefore, this study...
Churn prediction using machine learning: A coupon optimization technique
Churn prediction using machine learning: A coupon optimization technique
Customer retention has been identified as one of the most crucial difficulties in every Business particularly in the grocery retail industry. In this context, an accurate forecast ...
Yayak Kartika Sari Prediksi Customer Churn Berbasis Adaptive Neuro Fuzzy Inference System
Yayak Kartika Sari Prediksi Customer Churn Berbasis Adaptive Neuro Fuzzy Inference System
Abstrak – Customer Churn adalah pelanggan yang berhenti berlangganan dan pindahpada perusahaan lain, karena berbagai faktor. Customer churn merupakan masalah yang sangatpenting yan...
Customer Churn Prediction Model Based on Adaptive Clustering Mixed-Sampling
Customer Churn Prediction Model Based on Adaptive Clustering Mixed-Sampling
Predicting the probability of customer churn is an important reference for formulating and implementing customer retention strategies. Compared with single classification method, e...
Identifying customer churn in Telecom sector: A Machine Learning Approach
Identifying customer churn in Telecom sector: A Machine Learning Approach
Nowadays, there is no shortage of options for customers when choosing where to put their money. As a result, customer churn and engagement have become one of the top issues. With t...
The Impact of Customer Service Quality on Customer Satisfaction: A study on Bangladeshi Banks
The Impact of Customer Service Quality on Customer Satisfaction: A study on Bangladeshi Banks
Abstract This research study examines the impact of customer service quality on customer satisfaction at Bangladeshi Banks. The study aimed to fill existing gaps in underst...

Back to Top