Enhancing Customer Churn Prediction With Resampling: A Comparative Study

Authors
Ong, Jia-Xuan [1]
Tong, Gee-Kok [1]
Khor, Kok-Chin [2]
Haw, Su-Cheng [1]
Affiliations
[1] Multimedia Univ, Fac Comp & Informat, Persiaran Multimedia, Cyberjaya 63100, Selangor, Malaysia
[2] Univ Tunku Abdul Rahman, Lee Kong Chian Fac Engn & Sci, Jalan Sungai Long, Bandar Sungai Long 43000, Kajang, Malaysia
Keywords
Customer churn prediction; imbalanced datasets; resampling; oversampling
DOI
10.18421/TEM133-20
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In a competitive business environment, accurately predicting customer churn is crucial to retaining customers and preventing revenue loss. However, because customer churn data are imbalanced, traditional machine learning algorithms often fail to identify churned customers accurately. This has led researchers to explore resampling techniques, which have proven effective in addressing the issue. Even so, current studies in customer churn prediction rarely investigate and compare resampling techniques comprehensively; many rely on a single method, such as SMOTE. This paper therefore compares and evaluates several resampling methods, including oversampling, undersampling, and hybrid techniques. We used the benchmark telecommunication customer churn dataset from IBM Watson, in which approximately 26.5% of the customers have churned, so the data are imbalanced. Our results show that random forest combined with the hybrid sampling method SMOTE-ENN obtained the best result. The combination yields an F1 score of 95.3% and an accuracy of 96.0%, surpassing the studies that used the same dataset. This highlights the benefit of comparing resampling techniques when predicting customer churn, specifically on imbalanced datasets.
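To illustrate what oversampling and undersampling do to class counts, here is a minimal sketch using only the Python standard library. It is not the authors' pipeline: the 1,000-row toy dataset, the two random features, and the helper names `random_oversample` and `random_undersample` are assumptions made for illustration, and only the 26.5% churn rate is taken from the abstract. Hybrid methods such as SMOTE-ENN are more involved (they synthesize new minority points and then remove noisy ones) and are not reproduced here.

```python
import random
from collections import Counter


def random_oversample(X, y, minority, rng):
    """Duplicate randomly chosen minority rows until both classes match in size.

    Assumes a binary problem in which `minority` is the smaller class.
    """
    minority_idx = [i for i, label in enumerate(y) if label == minority]
    majority_count = len(y) - len(minority_idx)
    n_extra = majority_count - len(minority_idx)
    extra = [rng.choice(minority_idx) for _ in range(n_extra)]
    keep = list(range(len(y))) + extra
    return [X[i] for i in keep], [y[i] for i in keep]


def random_undersample(X, y, minority, rng):
    """Drop randomly chosen majority rows until both classes match in size.

    Assumes a binary problem in which `minority` is the smaller class.
    """
    minority_idx = [i for i, label in enumerate(y) if label == minority]
    majority_idx = [i for i, label in enumerate(y) if label != minority]
    kept_majority = rng.sample(majority_idx, len(minority_idx))
    keep = sorted(minority_idx + kept_majority)
    return [X[i] for i in keep], [y[i] for i in keep]


# Hypothetical toy data: 265 churned (label 1) and 735 retained (label 0),
# mirroring only the 26.5% churn rate reported in the abstract.
rng = random.Random(0)
y = [1] * 265 + [0] * 735
X = [[rng.random(), rng.random()] for _ in y]

X_over, y_over = random_oversample(X, y, minority=1, rng=rng)
X_under, y_under = random_undersample(X, y, minority=1, rng=rng)

over_counts = Counter(y_over)
under_counts = Counter(y_under)
print("original:    ", Counter(y))
print("oversampled: ", over_counts)
print("undersampled:", under_counts)
```

In practice, resampling of this kind is normally applied to the training split only, so that evaluation metrics such as F1 are computed on data with the original class distribution.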
Pages: 1927-1936
Page count: 10