Enhancing Customer Churn Prediction With Resampling: A Comparative Study

Authors
Ong, Jia-Xuan [1]
Tong, Gee-Kok [1]
Khor, Kok-Chin [2]
Haw, Su-Cheng [1]
Affiliations
[1] Multimedia Univ, Fac Comp & Informat, Persiaran Multimedia, Cyberjaya 63100, Selangor, Malaysia
[2] Univ Tunku Abdul Rahman, Lee Kong Chian Fac Engn & Sci, Jalan Sungai Long, Bandar Sungai Long 43000, Kajang, Malaysia
Keywords
Customer churn prediction; imbalance datasets; resampling; oversampling
DOI
10.18421/TEM133-20
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In today's competitive business environment, accurately predicting customer churn is crucial to retaining customers and preventing revenue loss. However, because customer churn data are imbalanced, traditional machine learning algorithms often fail to identify churned customers accurately. This has prompted the exploration of resampling techniques, which have proven effective in addressing the issue. Yet current studies in customer churn prediction rarely investigate and compare resampling techniques comprehensively; many rely predominantly on a single method, such as SMOTE. Hence, this paper compares and evaluates the effectiveness of several resampling methods, including oversampling, undersampling, and hybrid techniques. We used the IBM Watson telecommunication customer churn benchmark dataset, in which approximately 26.5% of the customers have churned, indicating that the data are imbalanced. Our results show that random forest combined with the hybrid sampling method SMOTE-ENN performed best, yielding an F1 score of 95.3% and an accuracy of 96.0% and surpassing previous studies that used the same dataset. This highlights the benefit of comparing resampling techniques when predicting customer churn, particularly on imbalanced datasets.
Pages: 1927-1936
Number of pages: 10
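
The SMOTE-ENN hybrid named in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the choice of k = 3, and the majority-vote cleaning rule are assumptions made for illustration, and the sketch assumes binary labels with at least two minority samples. SMOTE first adds synthetic minority points by interpolating between a minority sample and one of its nearest minority neighbours; ENN then removes any sample whose label disagrees with most of its nearest neighbours.

```python
import random
from collections import Counter


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(idx, points, k):
    """Indices of the k points closest to points[idx], excluding idx itself."""
    others = [j for j in range(len(points)) if j != idx]
    others.sort(key=lambda j: sq_dist(points[idx], points[j]))
    return others[:k]


def smote(X, y, minority, k=3, rng=None):
    """Add synthetic minority points until both classes have equal size."""
    rng = rng or random.Random(0)
    minority_pts = [x for x, label in zip(X, y) if label == minority]
    # Binary case: every non-minority sample counts as majority.
    n_new = (len(X) - len(minority_pts)) - len(minority_pts)
    X_out, y_out = list(X), list(y)
    for _ in range(max(n_new, 0)):
        i = rng.randrange(len(minority_pts))
        j = rng.choice(nearest(i, minority_pts, k))
        gap = rng.random()  # position along the segment base -> neighbour
        base, neigh = minority_pts[i], minority_pts[j]
        X_out.append([b + gap * (n - b) for b, n in zip(base, neigh)])
        y_out.append(minority)
    return X_out, y_out


def enn(X, y, k=3):
    """Keep only samples whose label matches the majority of k neighbours."""
    keep = []
    for i in range(len(X)):
        votes = Counter(y[j] for j in nearest(i, X, k))
        if votes.most_common(1)[0][0] == y[i]:
            keep.append(i)
    return [X[i] for i in keep], [y[i] for i in keep]


def smote_enn(X, y, minority, k=3, seed=0):
    """Hybrid resampling: SMOTE oversampling followed by ENN cleaning."""
    X_os, y_os = smote(X, y, minority, k, random.Random(seed))
    return enn(X_os, y_os, k)


if __name__ == "__main__":
    # Hypothetical toy data: 30 majority and 10 minority points in 2-D.
    rng = random.Random(1)
    X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(30)]
    X += [[rng.gauss(6, 1), rng.gauss(6, 1)] for _ in range(10)]
    y = [0] * 30 + [1] * 10
    X_res, y_res = smote_enn(X, y, minority=1)
    print("before:", Counter(y), "after:", Counter(y_res))
```

The brute-force neighbour search keeps the sketch dependency-free but scales quadratically, so it is only suitable for small examples. For real experiments, the imbalanced-learn library offers a `SMOTEENN` class in `imblearn.combine`.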