A comparison of pruning criteria for probability trees

被引:8
|
作者
Fierens, Daan [1 ]
Ramon, Jan [1 ]
Blockeel, Hendrik [1 ]
Bruynooghe, Maurice [1 ]
机构
[1] Katholieke Univ Leuven, Dept Comp Sci, B-3001 Louvain, Belgium
关键词
Decision trees; Pruning; Probability estimation; Randomization tests; INDUCTION;
D O I
10.1007/s10994-009-5147-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Probability trees are decision trees that predict class probabilities rather than the most likely class. The pruning criterion used to learn a probability tree strongly influences the size of the tree and thereby also the quality of its probability estimates. While the effect of pruning criteria on classification accuracy is well-studied, only recently has there been more interest in the effect on probability estimates. Hence, it is currently unclear which pruning criteria for probability trees are preferable under which circumstances. In this paper we survey six of the most important pruning criteria for probability trees, and discuss their theoretical advantages and disadvantages. We also perform an extensive experimental study of the relative performance of these pruning criteria. The main conclusion is that overall a pruning criterion based on randomization tests performs best because it is most robust to extreme data characteristics (such as class skew or a high number of classes). We also identify and explain several shortcomings of the other pruning criteria.
引用
收藏
页码:251 / 285
页数:35
相关论文
共 50 条
  • [22] Pruning regression trees with MDL
    Robnik-Sikonja, M
    Kononenko, I
    ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 455 - 459
  • [23] Controlling size of plum trees by summer pruning, root pruning, and growing trees in polypropylene containers
    Morgas, H
    Mika, A
    Konopacka, D
    Gawalkiewicz, H
    VI INTERNATIONAL SYMPOSIUM ON PLUM AND PRUNE GENETICS, BREEDING AND POMOLOGY, 1998, (478): : 249 - 253
  • [24] Probability trees
    McCool, MD
    Harwood, PK
    GRAPHICS INTERFACE '97 - PROCEEDINGS, 1997, : 37 - 46
  • [25] EVALUATION OF GRADUAL PRUNING OF PECAN TREES TO 2 HEIGHTS AND NO PRUNING
    WORLEY, RE
    HORTSCIENCE, 1987, 22 (05) : 721 - 721
  • [26] ScoringNet: A Neural Network Based Pruning Criteria for Structured Pruning
    Wang S.
    Zhang Z.
    Scientific Programming, 2023, 2023
  • [27] A comparison between Possibility and Probability in multiple criteria decision making
    Iglesias, A.
    Del Castillo, M. D.
    Santos, M.
    Serrano, J. I.
    Oliva, J.
    COMPUTATIONAL INTELLIGENCE IN DECISION AND CONTROL, 2008, 1 : 307 - 312
  • [28] OVERSIZE MOWER CUTTERBAR FOR PRUNING TREES
    MONROE, GE
    PETERSON, DL
    TRANSACTIONS OF THE ASAE, 1977, 20 (04): : 606 - 609
  • [29] Pruning Method of Belief Decision Trees
    Trabelsi, Salsabil
    Elouedi, Zied
    Mellouli, Khaled
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 14, 2006, 14 : 424 - 429
  • [30] ARE PRUNING AND SHEARING EXPENSES OF TREES DEDUCTIBLE
    不详
    JOURNAL OF TAXATION, 1971, 34 (06): : 382 - 382