Customer churn prediction in telecommunication industry using data certainty

被引:111
|
作者
Amin, Adnan [1 ]
Al-Obeidat, Feras [2 ]
Shah, Babar [2 ]
Adnan, Awais [1 ]
Loo, Jonathan [3 ]
Anwar, Sajid [1 ]
机构
[1] Inst Management Sci, Ctr Excellence Informat Technol, Peshawar 25000, Pakistan
[2] Zayed Univ, Coll Technol Innovat, Abu Dhabi 144534, U Arab Emirates
[3] Univ West London, Comp & Commun Engn, London, England
关键词
Churn prediction; Uncertain samples; Classification; Telecommunication; Customer churn; SUPPORT VECTOR MACHINES; CLASS IMBALANCE PROBLEM; ALGORITHM;
D O I
10.1016/j.jbusres.2018.03.003
中图分类号
F [经济];
学科分类号
02 ;
摘要
Customer Churn Prediction (CCP) is a challenging activity for decision makers and machine learning community because most of the time, churn and non-churn customers have resembling features. From different experiments on customer churn and related data, it can be seen that a classifier shows different accuracy levels for different zones of a dataset. In such situations, a correlation can easily be observed in the level of classifier's accuracy and certainty of its prediction. If a mechanism can be defined to estimate the classifier's certainty for different zones within the data, then the expected classifier's accuracy can be estimated even before the classification. In this paper, a novel CCP approach is presented based on the above concept of classifier's certainty estimation using distance factor. The dataset is grouped into different zones based on the distance factor which are then divided into two categories as; (i) data with high certainty, and (ii) data with low certainty, for predicting customers exhibiting Churn and Non-churn behavior. Using different state-of-the-art evaluation measures (e.g., accuracy, f-measure, precision and recall) on different publicly available the Telecommunication Industry (TCI) datasets show that (i) the distance factor is strongly co-related with the certainty of the classifier, and (ii) the classifier obtained high accuracy in the zone with greater distance factor's value (i.e., customer churn and non-churn with high certainty) than those placed in the zone with smaller distance factor's value (i.e., customer chum and non-churn with low certainty).
引用
收藏
页码:290 / 301
页数:12
相关论文
共 50 条
  • [11] Improved churn prediction in telecommunication industry using data mining techniques
    Keramati, A.
    Jafari-Marandi, R.
    Aliannejadi, M.
    Ahmadian, I.
    Mozaffari, M.
    Abbasi, U.
    APPLIED SOFT COMPUTING, 2014, 24 : 994 - 1012
  • [12] Supervised Massive Data Analysis for Telecommunication Customer Churn Prediction
    Li, Hui
    Yang, Deliang
    Yang, Lingling
    Lu, Yao
    Lin, Xiaola
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 163 - 169
  • [13] Customer Churn Prediction in Telecommunication Industry: With and without Counter-Example
    Amin, Adnan
    Khan, Changez
    Ali, Imtiaz
    Anwar, Sajid
    NATURE-INSPIRED COMPUTATION AND MACHINE LEARNING, PT II, 2014, 8857 : 206 - 218
  • [14] ChurnNet: Deep Learning Enhanced Customer Churn Prediction in Telecommunication Industry
    Saha, Somak
    Saha, Chamak
    Haque, Md. Mahidul
    Alam, Md. Golam Rabiul
    Talukder, Ashis
    IEEE ACCESS, 2024, 12 : 4471 - 4484
  • [15] Customer Churn Prediction in telecommunication Industry: with and without Counter-Example
    Amin, Adnan
    Khan, Changez
    Ali, Imtiaz
    Anwar, Sajid
    2014 EUROPEAN NETWORK INTELLIGENCE CONFERENCE (ENIC), 2014, : 134 - 137
  • [16] A comparative analysis of data preparation algorithms for customer churn prediction: A case study in the telecommunication industry
    Coussement, Kristof
    Lessmann, Stefan
    Verstraeten, Geert
    DECISION SUPPORT SYSTEMS, 2017, 95 : 27 - 36
  • [17] Enhanced Prediction Model for Customer Churn in Telecommunication Using EMOTE
    Babu, S.
    Ananthanarayanan, N. R.
    INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND APPLICATIONS, ICICA 2016, 2018, 632 : 465 - 475
  • [18] Customer retention and churn prediction in the telecommunication industry: a case study on a Danish university
    Saleh, Sarkaft
    Saha, Subrata
    SN APPLIED SCIENCES, 2023, 5 (07):
  • [19] Customer retention and churn prediction in the telecommunication industry: a case study on a Danish university
    Sarkaft Saleh
    Subrata Saha
    SN Applied Sciences, 2023, 5
  • [20] Predicting Telecommunication Customer Churn Using Data Mining Techniques
    AlOmari, Diana
    Hassan, Mohammad Mehedi
    INTERNET AND DISTRIBUTED COMPUTING SYSTEMS, IDCS 2016, 2016, 9864 : 167 - 178