Loss-optimal classification trees: a generalized framework and the logistic case

被引:0
|
作者
Aldinucci, Tommaso [1 ]
Lapucci, Matteo [1 ]
机构
[1] Univ Florence, Dipartimento Ingn Informaz, Via Santa Marta 3, I-50139 Florence, Italy
关键词
Optimal classification trees; Logistic regression; Interpretability; Mixed-integer programming; DECISION TREES; REGRESSION; SELECTION; OPTIMIZATION; INDUCTION; MODELS;
D O I
10.1007/s11750-024-00674-y
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Classification trees are one of the most common models in interpretable machine learning. Although such models are usually built with greedy strategies, in recent years, thanks to remarkable advances in mixed-integer programming (MIP) solvers, several exact formulations of the learning problem have been developed. In this paper, we argue that some of the most relevant ones among these training models can be encapsulated within a general framework, whose instances are shaped by the specification of loss functions and regularizers. Next, we introduce a novel realization of this framework: specifically, we consider the logistic loss, handled in the MIP setting by a piece-wise linear approximation, and couple it with & ell; 1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\ell _1$$\end{document} -regularization terms. The resulting optimal logistic classification tree model numerically proves to be able to induce trees with enhanced interpretability properties and competitive generalization capabilities, compared to the state-of-the-art MIP-based approaches.
引用
收藏
页码:323 / 350
页数:28
相关论文
共 50 条
  • [21] Classification trees with optimal multivariate decision nodes
    Brown, DE
    Pittard, CL
    Park, H
    PATTERN RECOGNITION LETTERS, 1996, 17 (07) : 699 - 703
  • [22] Integrating classification trees with local logistic regression in Intensive Care prognosis
    Abu-Hanna, A
    de Keizer, N
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2003, 29 (1-2) : 5 - 23
  • [23] Robust optimal classification trees under noisy labels
    Victor Blanco
    Alberto Japón
    Justo Puerto
    Advances in Data Analysis and Classification, 2022, 16 : 155 - 179
  • [24] Hadoop framework for efficient sentiment classification using trees
    Sridharan, K.
    Komarasamy, G.
    Daniel Madan Raja, S.
    IET NETWORKS, 2020, 9 (05) : 223 - 228
  • [25] Multiclass optimal classification trees with SVM-splits
    Blanco, Victor
    Japon, Alberto
    Puerto, Justo
    MACHINE LEARNING, 2023, 112 (12) : 4905 - 4928
  • [26] Multiclass optimal classification trees with SVM-splits
    Víctor Blanco
    Alberto Japón
    Justo Puerto
    Machine Learning, 2023, 112 : 4905 - 4928
  • [27] Robust Optimal Classification Trees against Adversarial Examples
    Vos, Daniel
    Verwer, Sicco
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8520 - 8528
  • [28] Robust optimal classification trees under noisy labels
    Blanco, Victor
    Japon, Alberto
    Puerto, Justo
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (01) : 155 - 179
  • [29] Globally optimal fuzzy decision trees for classification and regression
    Suárez, A
    Lutsko, JF
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (12) : 1297 - 1311
  • [30] Minimax-optimal classification with dyadic decision trees
    Scott, C
    Nowak, RD
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (04) : 1335 - 1353