A comparative analysis of methods for pruning decision trees

Cited by: 303
Authors
Esposito, F
Malerba, D
Semeraro, G
Affiliation
[1] Dipartimento di Informatica, Università degli Studi di Bari, 70126 Bari
Keywords
decision trees; top-down induction of decision trees; simplification of decision trees; pruning and grafting operators; optimal pruning; comparative studies;
DOI
10.1109/34.589207
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we address the problem of retrospectively pruning decision trees induced from data, according to a top-down approach. This problem has received considerable attention in the areas of pattern recognition and machine learning, and many distinct methods have been proposed in the literature. We make a comparative study of six well-known pruning methods with the aim of understanding their theoretical foundations, their computational complexity, and the strengths and weaknesses of their formulation. Comments on the characteristics of each method are empirically supported. In particular, extensive experimentation on several data sets leads us to conclusions on the predictive accuracy of simplified trees that are opposite to some drawn in the literature. We attribute this divergence to differences in experimental designs. Finally, we prove and make use of a property of the reduced error pruning method to obtain an objective evaluation of the tendency to overprune/underprune observed in each method.
Pages: 476 - 491
Page count: 16