Javascript must be enabled to continue!
Application of Machine Learning Techniques for Customer Churn Prediction in the Banking Sector
View through CrossRef
Aim/Purpose: Previous studies have primarily focused on comparing predictive models without considering the impact of data preprocessing on model performance. Therefore, this study sets two main objectives. The first objective is to investigate the effect of resampling methods for handling imbalanced data on model effectiveness. The second objective is to compare and evaluate machine learning methods to identify the optimal model for each resampling technique, thereby determining the model that achieves the highest performance.
Background: In the highly competitive banking industry, attrition of customers is a major challenge for banks trying to improve customer retention. While many studies have focused on building and evaluating models to predict customer churn, they often miss addressing the problem of imbalanced data, which can significantly affect the model’s accuracy.
Methodology: In this study, following exploratory data analysis (EDA), we apply various techniques to address data imbalance and use a range of machine learning models, including Naïve Bayes, Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, Gradient Boosting, XGBoost, and LightGBM, to predict customer churn using the dataset.
Contribution: The contribution of this research lies in its comprehensive evaluation and comparison of various techniques for handling imbalanced data in churn prediction models. The study identifies SMOTE-ENN as the most effective method for resampling imbalanced data. Among the models tested, LightGBM (accuracy = 0.979) achieves the highest performance based on evaluation metrics. Additionally, the research highlights that tree-based machine learning models generally perform better when trained on imbalanced datasets.
Findings: Tree-based and ensemble models perform better than regression and probability-based methods when dealing with imbalanced data. SMOTE-ENN has been shown to improve the performance of machine learning models greatly.
Recommendations for Practitioners: Practitioners can deploy high-performance models, such as XGBoost and LightGBM, combined with effective resampling methods like SMOTE-ENN to predict customer churn in banking, marketing, and human resources.
Recommendation for Researchers: To optimize the predictive model in the study, researchers can focus on feature selection, dimensionality reduction, or hyperparameter tuning.
Impact on Society: Customer churn reduces revenue and threatens competitive advantage, so businesses need effective retention strategies to maintain sustainable growth. High-performance customer churn prediction models can be an effective solution to address this issue.
Future Research: Deploy the model on real-world datasets while further optimizing the feature selection process and hyperparameter tuning, combined with SHAP values analysis to identify key features that significantly influence the model’s predictions.
Title: Application of Machine Learning Techniques for Customer Churn Prediction in the Banking Sector
Description:
Aim/Purpose: Previous studies have primarily focused on comparing predictive models without considering the impact of data preprocessing on model performance.
Therefore, this study sets two main objectives.
The first objective is to investigate the effect of resampling methods for handling imbalanced data on model effectiveness.
The second objective is to compare and evaluate machine learning methods to identify the optimal model for each resampling technique, thereby determining the model that achieves the highest performance.
Background: In the highly competitive banking industry, attrition of customers is a major challenge for banks trying to improve customer retention.
While many studies have focused on building and evaluating models to predict customer churn, they often miss addressing the problem of imbalanced data, which can significantly affect the model’s accuracy.
Methodology: In this study, following exploratory data analysis (EDA), we apply various techniques to address data imbalance and use a range of machine learning models, including Naïve Bayes, Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, Gradient Boosting, XGBoost, and LightGBM, to predict customer churn using the dataset.
Contribution: The contribution of this research lies in its comprehensive evaluation and comparison of various techniques for handling imbalanced data in churn prediction models.
The study identifies SMOTE-ENN as the most effective method for resampling imbalanced data.
Among the models tested, LightGBM (accuracy = 0.
979) achieves the highest performance based on evaluation metrics.
Additionally, the research highlights that tree-based machine learning models generally perform better when trained on imbalanced datasets.
Findings: Tree-based and ensemble models perform better than regression and probability-based methods when dealing with imbalanced data.
SMOTE-ENN has been shown to improve the performance of machine learning models greatly.
Recommendations for Practitioners: Practitioners can deploy high-performance models, such as XGBoost and LightGBM, combined with effective resampling methods like SMOTE-ENN to predict customer churn in banking, marketing, and human resources.
Recommendation for Researchers: To optimize the predictive model in the study, researchers can focus on feature selection, dimensionality reduction, or hyperparameter tuning.
Impact on Society: Customer churn reduces revenue and threatens competitive advantage, so businesses need effective retention strategies to maintain sustainable growth.
High-performance customer churn prediction models can be an effective solution to address this issue.
Future Research: Deploy the model on real-world datasets while further optimizing the feature selection process and hyperparameter tuning, combined with SHAP values analysis to identify key features that significantly influence the model’s predictions.
Related Results
A Novel Model for Partial and Total Churn Prediction in E-Commerce
A Novel Model for Partial and Total Churn Prediction in E-Commerce
Abstract
The e-commerce market is a rapidly growing industry, with many companies entering the market to provide customers with easy access to a variety of products and ser...
Customer Churn Prediction System Using Machine Learning: A Case Study ROSHAN Telecom-Afghanistan
Customer Churn Prediction System Using Machine Learning: A Case Study ROSHAN Telecom-Afghanistan
The success of any business relies on its customers, so it is crucial for firms to prioritize customer satisfaction. Customer churn is a significant concern for companies due to in...
Privacy-Preserving Based Technique for Customer Churn Prediction in Telecom Industry
Privacy-Preserving Based Technique for Customer Churn Prediction in Telecom Industry
In recent years, customer churn has been one of the most prominent topics, especially in the telecom industry. The telecommunications industry is producing massive amounts of data ...
Customer churn prediction model: a case of the telecommunication market
Customer churn prediction model: a case of the telecommunication market
AbstractThe telecommunications market is well developed but is characterized by oversaturation and high levels of competition. Based on this, the urgent problem is to retain custom...
Customer churn prediction using composite deep learning technique
Customer churn prediction using composite deep learning technique
AbstractCustomer churn, a phenomenon that causes large financial losses when customers leave a business, makes it difficult for modern organizations to retain customers. When dissa...
Identifying customer churn in Telecom sector: A Machine Learning Approach
Identifying customer churn in Telecom sector: A Machine Learning Approach
Nowadays, there is no shortage of options for customers when choosing where to put their money. As a result, customer churn and engagement have become one of the top issues. With t...
The Impact of Customer Service Quality on Customer Satisfaction: A study on Bangladeshi Banks
The Impact of Customer Service Quality on Customer Satisfaction: A study on Bangladeshi Banks
Abstract
This research study examines the impact of customer service quality on customer satisfaction at Bangladeshi Banks. The study aimed to fill existing gaps in underst...
Churn prediction using machine learning: A coupon optimization technique
Churn prediction using machine learning: A coupon optimization technique
Customer retention has been identified as one of the most crucial difficulties in every Business particularly in the grocery retail industry. In this context, an accurate forecast ...

