Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Utilizing data sampling techniques on algorithmic fairness for customer churn prediction with data imbalance problems

View through CrossRef
Background: Customer churn prediction (CCP) refers to detecting which customers are likely to cancel the services provided by a service provider, for example, internet services. The class imbalance problem (CIP) in machine learning occurs when there is a huge difference in the samples of the positive class compared to the negative class. It is one of the major obstacles in CCP as it deteriorates performance in the classification process. Utilizing data sampling techniques (DSTs) helps to resolve the CIP to some extent. Methods: In this paper, we review the effect of using DSTs on algorithmic fairness, i.e., to investigate whether the results pose any discrimination between male and female groups and compare the results before and after using DSTs. Three real-world datasets with unequal balancing rates were prepared and four ubiquitous DSTs were applied to them. Six popular classification techniques were utilized in the classification process. Both classifier’s performance and algorithmic fairness are evaluated with notable metrics. Results: The results indicated that the Random Forest classifier outperforms other classifiers in all three datasets and, that using SMOTE and ADASYN techniques causes more discrimination in the female group. The rate of unintentional discrimination seems to be higher in the original data of extremely unbalanced datasets under the following classifiers: Logistics Regression, LightGBM, and XGBoost. Conclusions: Algorithmic fairness has become a broadly studied area in recent years, yet there is very little systematic study on the effect of using DSTs on algorithmic fairness. This study presents important findings to further the use of algorithmic fairness in CCP research.
Title: Utilizing data sampling techniques on algorithmic fairness for customer churn prediction with data imbalance problems
Description:
Background: Customer churn prediction (CCP) refers to detecting which customers are likely to cancel the services provided by a service provider, for example, internet services.
The class imbalance problem (CIP) in machine learning occurs when there is a huge difference in the samples of the positive class compared to the negative class.
It is one of the major obstacles in CCP as it deteriorates performance in the classification process.
Utilizing data sampling techniques (DSTs) helps to resolve the CIP to some extent.
Methods: In this paper, we review the effect of using DSTs on algorithmic fairness, i.
e.
, to investigate whether the results pose any discrimination between male and female groups and compare the results before and after using DSTs.
Three real-world datasets with unequal balancing rates were prepared and four ubiquitous DSTs were applied to them.
Six popular classification techniques were utilized in the classification process.
Both classifier’s performance and algorithmic fairness are evaluated with notable metrics.
Results: The results indicated that the Random Forest classifier outperforms other classifiers in all three datasets and, that using SMOTE and ADASYN techniques causes more discrimination in the female group.
The rate of unintentional discrimination seems to be higher in the original data of extremely unbalanced datasets under the following classifiers: Logistics Regression, LightGBM, and XGBoost.
Conclusions: Algorithmic fairness has become a broadly studied area in recent years, yet there is very little systematic study on the effect of using DSTs on algorithmic fairness.
This study presents important findings to further the use of algorithmic fairness in CCP research.

Related Results

Algorithmic Individual Fairness and Healthcare: A Scoping Review
Algorithmic Individual Fairness and Healthcare: A Scoping Review
AbstractObjectiveStatistical and artificial intelligence algorithms are increasingly being developed for use in healthcare. These algorithms may reflect biases that magnify dispari...
A Novel Model for Partial and Total Churn Prediction in E-Commerce
A Novel Model for Partial and Total Churn Prediction in E-Commerce
Abstract The e-commerce market is a rapidly growing industry, with many companies entering the market to provide customers with easy access to a variety of products and ser...
Utilizing data sampling techniques on algorithmic fairness for customer churn prediction with data imbalance problems
Utilizing data sampling techniques on algorithmic fairness for customer churn prediction with data imbalance problems
Background: Customer churn prediction (CCP) refers to detecting which customers are likely to cancel the services provided by a service provider, for example, internet services. Th...
Privacy-Preserving Based Technique for Customer Churn Prediction in Telecom Industry
Privacy-Preserving Based Technique for Customer Churn Prediction in Telecom Industry
In recent years, customer churn has been one of the most prominent topics, especially in the telecom industry. The telecommunications industry is producing massive amounts of data ...
Customer churn prediction model: a case of the telecommunication market
Customer churn prediction model: a case of the telecommunication market
AbstractThe telecommunications market is well developed but is characterized by oversaturation and high levels of competition. Based on this, the urgent problem is to retain custom...
Customer Churn Prediction System Using Machine Learning: A Case Study ROSHAN Telecom-Afghanistan
Customer Churn Prediction System Using Machine Learning: A Case Study ROSHAN Telecom-Afghanistan
The success of any business relies on its customers, so it is crucial for firms to prioritize customer satisfaction. Customer churn is a significant concern for companies due to in...
Application of Machine Learning Techniques for Customer Churn Prediction in the Banking Sector
Application of Machine Learning Techniques for Customer Churn Prediction in the Banking Sector
Aim/Purpose: Previous studies have primarily focused on comparing predictive models without considering the impact of data preprocessing on model performance. Therefore, this study...
Customer churn prediction using composite deep learning technique
Customer churn prediction using composite deep learning technique
AbstractCustomer churn, a phenomenon that causes large financial losses when customers leave a business, makes it difficult for modern organizations to retain customers. When dissa...

Back to Top