A comparative analysis of methods for pruning decision trees

Cited by: 303
Authors
Esposito, F
Malerba, D
Semeraro, G
Affiliation
[1] Dipartimento di Informatica, Università Degli Studi di Bari, 70126 Bari
Keywords
decision trees; top-down induction of decision trees; simplification of decision trees; pruning and grafting operators; optimal pruning; comparative studies;
DOI
10.1109/34.589207
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we address the problem of retrospectively pruning decision trees induced from data, according to a top-down approach. This problem has received considerable attention in the areas of pattern recognition and machine learning, and many distinct methods have been proposed in the literature. We make a comparative study of six well-known pruning methods with the aim of understanding their theoretical foundations, their computational complexity, and the strengths and weaknesses of their formulation. Comments on the characteristics of each method are empirically supported. In particular, extensive experiments performed on several data sets lead us to conclusions on the predictive accuracy of simplified trees that are opposite to some drawn in the literature. We attribute this divergence to differences in experimental designs. Finally, we prove and make use of a property of the reduced error pruning method to obtain an objective evaluation of the tendency to overprune/underprune observed in each method.
Pages: 476-491
Page count: 16