Incremental Optimization Mechanism for Constructing a Decision Tree in Data Stream Mining

被引:16
|
作者
Yang, Hang [1 ]
Fong, Simon [1 ]
机构
[1] Univ Macau, Fac Sci & Technol, Dept Comp & Informat Sci, Taipa, Peoples R China
关键词
D O I
10.1155/2013/580397
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Imperfect data stream leads to tree size explosion and detrimental accuracy problems. Overfitting problem and the imbalanced class distribution reduce the performance of the original decision-tree algorithm for stream mining. In this paper, we propose an incremental optimization mechanism to solve these problems. The mechanism is called Optimized Very Fast Decision Tree (OVFDT) that possesses an optimized node-splitting control mechanism. Accuracy, tree size, and the learning time are the significant factors influencing the algorithm's performance. Naturally a bigger tree size takes longer computation time. OVFDT is a pioneer model equipped with an incremental optimization mechanism that seeks for a balance between accuracy and tree size for data stream mining. It operates incrementally by a test-then-train approach. Three types of functional tree leaves improve the accuracy with which the tree model makes a prediction for a new data stream in the testing phase. The optimized node-splitting mechanism controls the tree model growth in the training phase. The experiment shows that OVFDT obtains an optimal tree structure in both numeric and nominal datasets.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] The CART decision tree for mining data streams
    Rutkowski, Leszek
    Jaworski, Maciej
    Pietruczuk, Lena
    Duda, Piotr
    INFORMATION SCIENCES, 2014, 266 : 1 - 15
  • [22] Optimization and Data Mining for Decision Making
    Abu Haris, Norhaidah
    Abdullah, Munaisyah
    Othman, Abu Talib
    Rahman, Fauziah Abdul
    2014 WORLD CONGRESS ON COMPUTER APPLICATIONS AND INFORMATION SYSTEMS (WCCAIS), 2014,
  • [23] Constructing complete FP-tree for incremental mining of frequent patterns in dynamic databases
    Adnan, Muhaimenul
    Alhajj, Reda
    Barker, Ken
    ADVANCES IN APPLIED ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4031 : 363 - 372
  • [24] Fuzzy Hoeffding Decision Tree for Data Stream Classification
    Ducange, Pietro
    Marcelloni, Francesco
    Pecori, Riccardo
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 946 - 964
  • [25] A Statistical Decision Tree Algorithm for Data Stream Classification
    Cazzolato, Mirela Teixeira
    Ribeiro, Marcela Xavier
    Yaguinuma, Cristiane
    Prado Santos, Marilde Terezinha
    ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2013, : 217 - 223
  • [26] Decision Tree Induction from Numeric Data Stream
    Nishimura, Satoru
    Terabe, Masahiro
    Hashimoto, Kazuo
    AI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5360 : 311 - 317
  • [27] Decision tree incremental learning algorithm oriented intelligence data
    Wang H.
    Chu C.
    Xie X.
    Wang N.
    Sun J.
    International Journal of Performability Engineering, 2018, 14 (05) : 849 - 856
  • [28] Constructing a decision tree from data with hierarchical class labels
    Chen, Yen-Liang
    Hu, Hsiao-Wei
    Tang, Kwei
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 4838 - 4847
  • [29] Decision tree models for data mining in hit discovery
    Hammann, Felix
    Drewe, Juergen
    EXPERT OPINION ON DRUG DISCOVERY, 2012, 7 (04) : 341 - 352
  • [30] Decision tree construction for data mining on grid computing
    Tsai, ST
    Yang, CT
    2004 IEEE INTERNATIONAL CONFERNECE ON E-TECHNOLOGY, E-COMMERE AND E-SERVICE, PROCEEDINGS, 2004, : 441 - 447