Incremental Optimization Mechanism for Constructing a Decision Tree in Data Stream Mining

被引:16
|
作者
Yang, Hang [1 ]
Fong, Simon [1 ]
机构
[1] Univ Macau, Fac Sci & Technol, Dept Comp & Informat Sci, Taipa, Peoples R China
关键词
D O I
10.1155/2013/580397
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Imperfect data stream leads to tree size explosion and detrimental accuracy problems. Overfitting problem and the imbalanced class distribution reduce the performance of the original decision-tree algorithm for stream mining. In this paper, we propose an incremental optimization mechanism to solve these problems. The mechanism is called Optimized Very Fast Decision Tree (OVFDT) that possesses an optimized node-splitting control mechanism. Accuracy, tree size, and the learning time are the significant factors influencing the algorithm's performance. Naturally a bigger tree size takes longer computation time. OVFDT is a pioneer model equipped with an incremental optimization mechanism that seeks for a balance between accuracy and tree size for data stream mining. It operates incrementally by a test-then-train approach. Three types of functional tree leaves improve the accuracy with which the tree model makes a prediction for a new data stream in the testing phase. The optimized node-splitting mechanism controls the tree model growth in the training phase. The experiment shows that OVFDT obtains an optimal tree structure in both numeric and nominal datasets.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] IMPROVING ADAPTABILITY OF DECISION TREE FOR MINING BIG DATA
    Yang, Hang
    Fong, Simon
    NEW MATHEMATICS AND NATURAL COMPUTATION, 2013, 9 (01) : 77 - 95
  • [32] MHFlexDT: A Multivariate Branch Fuzzy Decision Tree Data Stream Mining Strategy Based on Hybrid Partitioning Standard
    Song, Xin
    Wang, Han
    He, Huiyuan
    Meng, Yakun
    ADVANCES IN NEURAL NETWORKS - ISNN 2018, 2018, 10878 : 310 - 317
  • [34] Elegant decision tree algorithm for classification in data mining
    Chandra, B
    Mazumdar, S
    Arena, V
    Parimi, N
    WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING (WORKSHOPS), 2002, : 160 - 169
  • [35] Variable precision rough set optimization algorithm for constructing decision tree
    Song, Xudong
    Mu, Jianwei
    Feng, Ruifang
    Qiu, Zhanzhi
    ADVANCED MATERIALS SCIENCE AND TECHNOLOGY, PTS 1-2, 2011, 181-182 : 43 - +
  • [36] Hybrid Splitting Criterion in Decision Trees for Data Stream Mining
    Jaworski, Maciej
    Rutkowski, Leszek
    Pawlak, Miroslaw
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, (ICAISC 2016), PT II, 2016, 9693 : 60 - 72
  • [37] Underwater Sonar Signals Recognition by Incremental Data Stream Mining with Conflict Analysis
    Fong, Simon
    Deb, Suash
    Wong, Raymond
    Sun, Guangmin
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2014,
  • [38] Incremental multi-dimension scaling visualization mining method for data stream
    Ni, Ping
    Liao, Jian-Xin
    Zhu, Xiao-Min
    Wan, Li
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2011, 41 (03): : 817 - 821
  • [39] Constructing Decision Trees for Mining High-speed Data Streams
    Xu Wenhua
    Qin Zheng
    CHINESE JOURNAL OF ELECTRONICS, 2012, 21 (02): : 215 - 220
  • [40] Real time decision making forecasting using Data mining and Decision tree
    Asaduzzaman, Md
    Shahjahan, Md
    Murase, Kazuyuki
    2014 JOINT 7TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 15TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2014, : 1029 - 1033