A comparison of pruning criteria for probability trees

被引:8
|
作者
Fierens, Daan [1 ]
Ramon, Jan [1 ]
Blockeel, Hendrik [1 ]
Bruynooghe, Maurice [1 ]
机构
[1] Katholieke Univ Leuven, Dept Comp Sci, B-3001 Louvain, Belgium
关键词
Decision trees; Pruning; Probability estimation; Randomization tests; INDUCTION;
D O I
10.1007/s10994-009-5147-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Probability trees are decision trees that predict class probabilities rather than the most likely class. The pruning criterion used to learn a probability tree strongly influences the size of the tree and thereby also the quality of its probability estimates. While the effect of pruning criteria on classification accuracy is well-studied, only recently has there been more interest in the effect on probability estimates. Hence, it is currently unclear which pruning criteria for probability trees are preferable under which circumstances. In this paper we survey six of the most important pruning criteria for probability trees, and discuss their theoretical advantages and disadvantages. We also perform an extensive experimental study of the relative performance of these pruning criteria. The main conclusion is that overall a pruning criterion based on randomization tests performs best because it is most robust to extreme data characteristics (such as class skew or a high number of classes). We also identify and explain several shortcomings of the other pruning criteria.
引用
收藏
页码:251 / 285
页数:35
相关论文
共 50 条
  • [41] A Note on Probability Trees
    Hurley, W. J.
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2007, 6 (02) : 645 - 648
  • [42] Effect of tree pruning and pruning application to trees on nitrogen fixation by Leucaena and Gliricidia
    Kadiata, BD
    Mulongoy, K
    Isirimah, NO
    AGROFORESTRY SYSTEMS, 1997, 39 (02) : 117 - 128
  • [43] Measurement criteria for neural network pruning
    Erdogan, SS
    Ng, GS
    Patrick, KHC
    1996 IEEE TENCON - DIGITAL SIGNAL PROCESSING APPLICATIONS PROCEEDINGS, VOLS 1 AND 2, 1996, : 83 - 89
  • [44] Effect of tree pruning and pruning application to trees on nitrogen fixation by Leucaena and Gliricidia
    B. D. Kadiata
    K. Mulongoy
    N. O. Isirimah
    Agroforestry Systems, 1997, 39 : 117 - 128
  • [45] BENDING BY PRUNING IN FRUIT PRODUCTION (SECTORIAL DOUBLE PRUNING OF FRUIT-TREES)
    BRUNNER, T
    BOTANIKAI KOZLEMENYEK-BOTANICAL PUBLICATIONS, 1979, 66 (2-4): : 305 - 311
  • [46] Inheritance Patterns: Probability Rules & Probability Trees
    Garimella, Umadevi
    Sahin, Nesrin
    AMERICAN BIOLOGY TEACHER, 2022, 84 (01): : 22 - 27
  • [47] An Optimal Constrained Pruning Strategy for Decision Trees
    Sherali, Hanif D.
    Hobeika, Antoine G.
    Jeenanunta, Chawalit
    INFORMS JOURNAL ON COMPUTING, 2009, 21 (01) : 49 - 61
  • [48] Utilization of woody pruning residues of apple trees
    Gilanipoor, Najibeh
    Spinelli, Rafaele
    Naghdi, Ramin
    Najafi, Akbar
    FOREST SCIENCE AND TECHNOLOGY, 2020, 16 (04) : 216 - 223
  • [49] FUZZY DECISIONS TOOLS FOR PRUNING OF FOREST TREES
    KAHN, M
    VONGADOW, K
    OR SPEKTRUM, 1995, 17 (01) : 37 - 40
  • [50] ESSENTIAL PRUNING TECHNIQUES: Trees, Shrubs, and Conifers
    Browning, Dominique
    NEW YORK TIMES BOOK REVIEW, 2017, 122 (23): : 33 - 33