A Model-Selection Framework for Concept-Drifting Data Streams

Citations: 0
Authors
Chen, Bo-Heng [1 ]
Chuang, Kun-Ta [1 ]
Affiliations
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 701, Taiwan
DOI
Not available
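Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;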
Abstract
Research interest in classification for data streams has been increasing. Because data streams evolve, it is highly challenging to detect concept drifts, which invalidate the current classification model as time passes. Most stream classification solutions so far use an incremental learning process to continuously track deviations in prediction accuracy. Unfortunately, to detect concept drift promptly, such strategies usually rely on an infeasible assumption: that data instances with true labels are available. In this paper we propose a new framework, called Inference of Concept Evolution (abbreviated as ICE), to minimize the need for real-time acquisition of true labels. Specifically, the ICE framework is built on the idea of model reuse. A dictionary learning technique is used to determine whether a concept drift has appeared, without requiring label acquisition. When a drift happens, the ICE framework selects the best model maintained in the model pool, reducing the need for model re-training and its costly label acquisition. As our experimental results demonstrate, the ICE framework can track the best model correctly and efficiently, showing its feasibility in real cases.
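The abstract gives no algorithmic detail, so the following is only an illustrative sketch of the general idea of label-free model reuse, not the authors' ICE algorithm. It assumes that each pooled model is stored together with a compact representation of the unlabeled data it was trained on, and that a new batch is matched to the representation that reconstructs it best. For brevity a low-rank SVD subspace stands in for dictionary learning; the names `fit_basis`, `reconstruction_error`, `ModelPool`, `k`, and `threshold`, as well as the synthetic data, are all hypothetical.

```python
import numpy as np


def fit_basis(X, k):
    """Fit a k-dimensional linear basis to the unlabeled rows of X.

    Assumption: an SVD subspace is used here as a simple stand-in for a
    learned dictionary.
    """
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:k]


def reconstruction_error(X, basis):
    """Mean squared residual left after projecting X onto the basis."""
    mean, components = basis
    centered = X - mean
    residual = centered - centered @ components.T @ components
    return float(np.mean(np.sum(residual ** 2, axis=1)))


class ModelPool:
    """Hypothetical pool: each entry pairs a model name with a basis."""

    def __init__(self, k=1, threshold=0.5):
        self.k = k
        self.threshold = threshold  # assumed cut-off for "no stored concept fits"
        self.entries = []

    def add(self, name, X):
        self.entries.append((name, fit_basis(X, self.k)))

    def select(self, X):
        """Return (name, error) of the best match, or (None, error) if none fits."""
        if not self.entries:
            raise ValueError("model pool is empty")
        err, name = min(
            (reconstruction_error(X, basis), name) for name, basis in self.entries
        )
        return (name, err) if err <= self.threshold else (None, err)


# Synthetic demo: each "concept" is a noisy line along one axis in 3-D.
rng = np.random.default_rng(0)


def sample(direction, n=200):
    t = 3.0 * rng.normal(size=(n, 1))
    return t * direction + 0.05 * rng.normal(size=(n, 3))


axis_a = np.array([1.0, 0.0, 0.0])
axis_b = np.array([0.0, 1.0, 0.0])
axis_new = np.array([0.0, 0.0, 1.0])

pool = ModelPool(k=1, threshold=0.5)
pool.add("model_A", sample(axis_a))
pool.add("model_B", sample(axis_b))

chosen_a, _ = pool.select(sample(axis_a))      # unlabeled batch from concept A
chosen_b, _ = pool.select(sample(axis_b))      # unlabeled batch from concept B
chosen_new, _ = pool.select(sample(axis_new))  # batch from an unseen concept
print(chosen_a, chosen_b, chosen_new)
```

In this sketch a return value of `None` signals that no pooled model matches the incoming batch, which is the only situation in which labels would have to be acquired to train a new model.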
Pages: 290-296
Page count: 7
Related papers
50 in total
  • [31] A Multi-partition Multi-chunk Ensemble Technique to Classify Concept-Drifting Data Streams
    Masud, Mohammad M.
    Gao, Jing
    Khan, Latifur
    Han, Jiawei
    Thuraisingham, Bhavani
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 363 - +
  • [32] An Efficient Continuous Attributes Handling Method for Mining Concept-Drifting Data Streams Based on Skip List
    Ouyang, Zhenzheng
    Gao, Yuhai
    Li, Mingjun
    Luo, Jianshu
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2011, 7002 : 364 - +
  • [33] Pyramid Stack Data Stream Mining for Handling Concept-drifting
    Xu, Zhuoran
    Hou, Cuiqin
    Xia, Yingju
    Sun, Jun
    Inakoshi, Hiroya
    Yugami, Nobuhiro
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 33 - 37
  • [34] An Algorithm for Anticipating Future Decision Trees from Concept-Drifting Data
    Boettcher, Mirko
    Spott, Martin
    Kruse, Rudolf
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXV, 2009, : 293 - +
  • [35] Knowledge maintenance on data streams with concept drifting
    Natwichai, J
    Li, X
    COMPUTATIONAL AND INFORMATION SCIENCE, PROCEEDINGS, 2004, 3314 : 705 - 710
  • [36] An Improving Fuzzy C-means Algorithm for Concept-Drifting Data Stream
    Zhang, Baoju
    Xue, Lei
    Wang, Wei
    Qin, Shan
    Wang, Dan
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2018, 423 : 439 - 450
  • [37] Learning from Concept Drifting Data Streams with Unlabeled Data
    Li, Peipei
    Wu, Xindong
    Hu, Xuegang
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1945 - 1946
  • [38] Learning from concept drifting data streams with unlabeled data
    Wu, Xindong
    Li, Peipei
    Hu, Xuegang
    NEUROCOMPUTING, 2012, 92 : 145 - 155
  • [39] Better algorithm for classifying data streams with concept drifting
    Department of Computer Science and Engineering, Northwestern Polytechnical University, Xi'an 710072, China
    Unknown
    Xibei Gongye Daxue Xuebao, 2007, 4 (603-607):
  • [40] GP Boosting Classification on Concept Drifting Data Streams
    Kumar, Dirisala J. Nagendra
    Murthy, J. V. R.
    Satapathy, Suresh Chandra
    Pullela, S. V. V. S. R. Kumar
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 265 - +