Customer churn prediction in telecommunication industry using data certainty

被引:111
|
作者
Amin, Adnan [1 ]
Al-Obeidat, Feras [2 ]
Shah, Babar [2 ]
Adnan, Awais [1 ]
Loo, Jonathan [3 ]
Anwar, Sajid [1 ]
机构
[1] Inst Management Sci, Ctr Excellence Informat Technol, Peshawar 25000, Pakistan
[2] Zayed Univ, Coll Technol Innovat, Abu Dhabi 144534, U Arab Emirates
[3] Univ West London, Comp & Commun Engn, London, England
关键词
Churn prediction; Uncertain samples; Classification; Telecommunication; Customer churn; SUPPORT VECTOR MACHINES; CLASS IMBALANCE PROBLEM; ALGORITHM;
D O I
10.1016/j.jbusres.2018.03.003
中图分类号
F [经济];
学科分类号
02 ;
摘要
Customer Churn Prediction (CCP) is a challenging activity for decision makers and machine learning community because most of the time, churn and non-churn customers have resembling features. From different experiments on customer churn and related data, it can be seen that a classifier shows different accuracy levels for different zones of a dataset. In such situations, a correlation can easily be observed in the level of classifier's accuracy and certainty of its prediction. If a mechanism can be defined to estimate the classifier's certainty for different zones within the data, then the expected classifier's accuracy can be estimated even before the classification. In this paper, a novel CCP approach is presented based on the above concept of classifier's certainty estimation using distance factor. The dataset is grouped into different zones based on the distance factor which are then divided into two categories as; (i) data with high certainty, and (ii) data with low certainty, for predicting customers exhibiting Churn and Non-churn behavior. Using different state-of-the-art evaluation measures (e.g., accuracy, f-measure, precision and recall) on different publicly available the Telecommunication Industry (TCI) datasets show that (i) the distance factor is strongly co-related with the certainty of the classifier, and (ii) the classifier obtained high accuracy in the zone with greater distance factor's value (i.e., customer churn and non-churn with high certainty) than those placed in the zone with smaller distance factor's value (i.e., customer chum and non-churn with low certainty).
引用
收藏
页码:290 / 301
页数:12
相关论文
共 50 条
  • [31] Customer Churn Analysis : A Case Study on the Telecommunication Industry of Thailand
    Wanchai, Paweena
    2017 12TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2017, : 325 - 331
  • [32] Prediction of customer plan using churn analysis for telecom industry
    Ajitha P.
    Sivasangari A.
    Gomathi R.M.
    Indira K.
    Recent Advances in Computer Science and Communications, 2020, 13 (05): : 926 - 929
  • [33] Cross-company customer churn prediction in telecommunication: A comparison of data transformation methods
    Amin, Adnan
    Shah, Babar
    Khattak, Asad Masood
    Lopes Moreira, Fernando Joaquim
    Ali, Gohar
    Rocha, Alvaro
    Anwar, Sajid
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2019, 46 : 304 - 319
  • [34] Customer churn prediction using data mining approach
    Qaisi, Laila M.
    Rodan, Ali
    Qaddoum, Kefaya
    Al-Sayyed, Rizik
    2018 FIFTH HCT INFORMATION TECHNOLOGY TRENDS (ITT): EMERGING TECHNOLOGIES FOR ARTIFICIAL INTELLIGENCE, 2018, : 348 - 352
  • [35] Churn prediction in telecommunication industry using kernel Support Vector Machines
    Nhu, Nguyen Y.
    Tran Van Lyid
    Dao Vu Truong Son
    PLOS ONE, 2022, 17 (05):
  • [36] A Customer Churn Prediction Model in Telecom Industry Using Boosting
    Lu, Ning
    Lin, Hua
    Lu, Jie
    Zhang, Guangquan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2014, 10 (02) : 1659 - 1665
  • [37] The application of social network analysis on the telecommunication customer churn prediction
    Wu, Jianlin
    Zhang, Zhansheng
    PROCEEDINGS OF JOURNAL PUBLICATION MEETING (2007), 2007, : 119 - 123
  • [38] Just-in-time customer churn prediction in the telecommunication sector
    Amin, Adnan
    Al-Obeidat, Feras
    Shah, Babar
    Al Tae, May
    Khan, Changez
    Durrani, Hamood Ur Rehman
    Anwar, Sajid
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (06): : 3924 - 3948
  • [39] Just-in-time customer churn prediction in the telecommunication sector
    Adnan Amin
    Feras Al-Obeidat
    Babar Shah
    May Al Tae
    Changez Khan
    Hamood Ur Rehman Durrani
    Sajid Anwar
    The Journal of Supercomputing, 2020, 76 : 3924 - 3948
  • [40] Using PCA to Predict Customer Churn in Telecommunication Dataset
    Sato, T.
    Huang, B. Q.
    Huang, Y.
    Kechadi, M-T
    Buckley, B.
    ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 326 - 335