A novel evolutionary data mining algorithm with applications to churn prediction

被引:191
|
作者
Au, WH [1 ]
Chan, KCC
Yao, X
机构
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[2] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
关键词
churn prediction; customer retention; data mining; evolutionary computation; genetic algorithms;
D O I
10.1109/TEVC.2003.819264
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification is an important topic in data mining research. Given a set of data records; each of which belongs to one of a number of predefined classes, the classification problem is concerned with the discovery of classification rules that can allow records with unknown class membership to be correctly classified. Many algorithms have been developed to mine large data sets for classification models and they have been shown to be very effective. However, when it comes to determining the likelihood of each classification made, many of them are not designed with, such purpose in mind. For this, they are not readily applicable to such problem as churn prediction. For such an application, the goal is not only to predict whether or not a subscriber would switch from one carrier to another, it is. also important that the likelihood of the subscriber's doing so be predicted. The reason for this is that a carrier can then choose to provide special personalized offer and services to those subscribers who are predicted with higher likelihood to churn. Given its importance, we propose a new data mining algorithm, called data mining by evolutionary learning (DMEL), to handle classification problems of which the accuracy of each predictions made has to be estimated. In performing its tasks, DMEL searches through the possible rule space using an evolutionary approach that has the following characteristics: 1) the evolutionary process begins with the generation of an initial set of first-order rules (i.e., rules with one conjunct/condition) using a probabilistic induction technique and based on these rules, rules of higher order (two or more conjuncts) are obtained iteratively; 2) when identifying interesting rules, an objective interestingness measure is used; 3) the fitness of a chromosome is defined in terms of the probability that the attribute values of a record can be correctly determined using the rules it encodes; and 4) the likelihood of predictions (or classifications) made are estimated so that subscribers can be ranked according to their likelihood to churn. Experiments with different data sets showed that DMEL is able to effectively discover interesting classification rules. In particular; it is able to predict churn accurately under different churn rates when applied to real telecom subscriber data.
引用
收藏
页码:532 / 545
页数:14
相关论文
共 50 条
  • [41] Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm
    Taherkhani, Leila
    Daneshvar, Amir
    Khalili, Hossein Amoozad
    Sanaei, Mohamad Reza
    ADVANCES IN CIVIL ENGINEERING, 2023, 2023
  • [42] A novel quantum swarm evolutionary algorithm and its applications
    Wang, Yan
    Feng, Xiao-Yue
    Huang, Yan-Xin
    Pu, Dong-Bing
    Zhou, Wen-Gang
    Liang, Yan-Chun
    Zhou, Chun-Guang
    NEUROCOMPUTING, 2007, 70 (4-6) : 633 - 640
  • [43] Modelling Customer Churn Using Segmentation and Data Mining
    Hiziroglu, Abdulkadir
    Seymen, Omer Faruk
    DATABASES AND INFORMATION SYSTEMS VIII, 2014, 270 : 259 - 271
  • [44] An Approach for Predicting Employee Churn by Using Data Mining
    Yigit, Ibrahim Onuralp
    Shourabizadeh, Hamed
    2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,
  • [45] A predictive model of churn in telecommunications based on data mining
    Mo Zan
    Zhao Shan
    Li Li
    Liu Ai-Jun
    2007 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-7, 2007, : 1382 - 1386
  • [46] Applying Fuzzy Data Mining to Telecom Churn Management
    Liao, Kuo-Hsiung
    Chueh, Hao-En
    INTELLIGENT COMPUTING AND INFORMATION SCIENCE, PT I, 2011, 134 (0I): : 259 - 264
  • [47] Customer Churn Prediction in Superannuation: A Sequential Pattern Mining Approach
    Culbert, Ben
    Fu, Bin
    Brownlow, James
    Chu, Charles
    Meng, Qinxue
    Xu, Guandong
    DATABASES THEORY AND APPLICATIONS, ADC 2018, 2018, 10837 : 123 - 134
  • [48] Applications of Harmony Search Algorithm in Data Mining: A Survey
    Assad, Assif
    Deep, Kusum
    PROCEEDINGS OF FIFTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2015), VOL 2, 2016, 437 : 863 - 874
  • [49] A Novel Approach to Customer Churn Prediction in Telecom
    Senthilselvi, A.
    Kanishk, V
    Vineesh, K.
    Raj, Praveen A.
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [50] Dimensionality and data reduction in telecom churn prediction
    Lin, Wei-Chao
    Tsai, Chih-Fong
    Ke, Shih-Wen
    KYBERNETES, 2014, 43 (05) : 737 - 749