Learning model trees from evolving data streams

被引:193
|
作者
Ikonomovska, Elena [1 ,4 ]
Gama, Joao [2 ,3 ]
Dzeroski, Saso [1 ]
机构
[1] Jozef Stefan Inst, Ljubljana 1000, Slovenia
[2] Univ Porto, LIAAD INESC, P-4050190 Oporto, Portugal
[3] Univ Porto, Fac Econ, P-4200 Oporto, Portugal
[4] Ss Cyril & Methodius Univ, Fac Elect Engn & Informat Technol, Skopje 1000, Macedonia
关键词
Non-stationary data streams; Stream data mining; Regression trees; Model trees; Incremental algorithms; On-line learning; Concept drift; On-line change detection; REGRESSION TREES; DRIFT;
D O I
10.1007/s10618-010-0201-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of real-time extraction of meaningful patterns from time-changing data streams is of increasing importance for the machine learning and data mining communities. Regression in time-changing data streams is a relatively unexplored topic, despite the apparent applications. This paper proposes an efficient and incremental stream mining algorithm which is able to learn regression and model trees from possibly unbounded, high-speed and time-changing data streams. The algorithm is evaluated extensively in a variety of settings involving artificial and real data. To the best of our knowledge there is no other general purpose algorithm for incremental learning regression/model trees able to perform explicit change detection and informed adaptation. The algorithm performs online and in real-time, observes each example only once at the speed of arrival, and maintains at any-time a ready-to-use model tree. The tree leaves contain linear models induced online from the examples assigned to them, a process with low complexity. The algorithm has mechanisms for drift detection and model adaptation, which enable it to maintain accurate and updated regression models at any time. The drift detection mechanism exploits the structure of the tree in the process of local change detection. As a response to local drift, the algorithm is able to update the tree structure only locally. This approach improves the any-time performance and greatly reduces the costs of adaptation.
引用
收藏
页码:128 / 168
页数:41
相关论文
共 50 条
  • [21] Clustering Based Active Learning for Evolving Data Streams
    Ienco, Dino
    Bifet, Albert
    Zliobaite, Indre
    Pfahringer, Bernhard
    DISCOVERY SCIENCE, 2013, 8140 : 79 - 93
  • [22] Adaptive online incremental learning for evolving data streams
    Zhang, Si -si
    Liu, Jian-wei
    Zuo, Xin
    APPLIED SOFT COMPUTING, 2021, 105
  • [23] TECNO-STREAMS: Tracking evolving clusters in noisy data streams with a scalable immune system learning model
    Nasraoui, F
    Uribe, CC
    Coronel, CR
    Gonzalez, F
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 235 - 242
  • [24] Learning accurate very fast decision trees from uncertain data streams
    Liang, Chunquan
    Zhang, Yang
    Shi, Peng
    Hu, Zhengguo
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2015, 46 (16) : 3032 - 3050
  • [25] Learning higher accuracy decision trees from concept drifting data streams
    Nishimura, Satoru
    Terabe, Masahiro
    Hashimoto, Kazuo
    Mihara, Koichiro
    NEW FRONTIERS IN APPLIED ARTIFICIAL INTELLIGENCE, 2008, 5027 : 179 - 188
  • [26] Learning decision trees from time-changing uncertain data streams
    Liang, Chunquan
    Zhang, Yang
    Hu, Shaojun
    Information Technology Journal, 2013, 12 (24) : 8469 - 8475
  • [27] Semi-supervised federated learning on evolving data streams
    Mawuli, Cobbinah B.
    Kumar, Jay
    Nanor, Ebenezer
    Fu, Shangxuan
    Pan, Liangxu
    Yang, Qinli
    Zhang, Wei
    Shao, Junming
    INFORMATION SCIENCES, 2023, 643
  • [28] AN IMPROVING ONLINE ACCURACY UPDATED ENSEMBLE METHOD IN LEARNING FROM EVOLVING DATA STREAMS
    Gu, Xiao-Feng
    Xu, Jia-Wen
    Huang, Shi-Jing
    Wang, Liao-Ming
    2014 11TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2014, : 430 - 433
  • [29] Recurring concept meta-learning for evolving data streams
    Anderson, Robert
    Koh, Yun Sing
    Dobbie, Gillian
    Bifet, Albert
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 138
  • [30] Meta Expert Learning and Efficient Pruning for Evolving Data Streams
    Azarafrooz, Mahdi
    Daneshmand, Mahmoud
    IEEE INTERNET OF THINGS JOURNAL, 2015, 2 (04): : 268 - 273