Robust optimal classification trees under noisy labels

被引:0
|
作者
Victor Blanco
Alberto Japón
Justo Puerto
机构
[1] Universidad de Granada,Institute of Mathematics (IMAG)
[2] Universidad de Sevilla,Institute of Mathematics (IMUS)
关键词
Multiclass classification; Optimal classification trees; Support vector machines; Mixed integer non linear programming; Classification; Hyperplanes; 62H30; 90C11; 68T05; 32S22;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper we propose a novel methodology to construct Optimal Classification Trees that takes into account that noisy labels may occur in the training sample. The motivation of this new methodology is based on the superaditive effect of combining together margin based classifiers and outlier detection techniques. Our approach rests on two main elements: (1) the splitting rules for the classification trees are designed to maximize the separation margin between classes applying the paradigm of SVM; and (2) some of the labels of the training sample are allowed to be changed during the construction of the tree trying to detect the label noise. Both features are considered and integrated together to design the resulting Optimal Classification Tree. We present a Mixed Integer Non Linear Programming formulation for the problem, suitable to be solved using any of the available off-the-shelf solvers. The model is analyzed and tested on a battery of standard datasets taken from UCI Machine Learning repository, showing the effectiveness of our approach. Our computational results show that in most cases the new methodology outperforms both in accuracy and AUC the results of the benchmarks provided by OCT and OCT-H.
引用
收藏
页码:155 / 179
页数:24
相关论文
共 50 条
  • [1] Robust optimal classification trees under noisy labels
    Blanco, Victor
    Japon, Alberto
    Puerto, Justo
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (01) : 155 - 179
  • [2] Correction to: Robust optimal classification trees under noisy labels
    Victor Blanco
    Alberto Japón
    Justo Puerto
    Advances in Data Analysis and Classification, 2022, 16 (4) : 1095 - 1095
  • [3] Robust optimal classification trees under noisy labels (Oct, 10.1007/s11634-021-00467-2, 2021)
    Blanco, Victor
    Japon, Alberto
    Puerto, Justo
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (04) : 1095 - 1095
  • [4] Robust Loss Functions for Training Decision Trees with Noisy Labels
    Wilton, Jonathan
    Ye, Nan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 15859 - 15867
  • [5] Robust Semisupervised Classification for PolSAR Image With Noisy Labels
    Hou, Biao
    Wu, Qian
    Wen, Zaidao
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (11): : 6440 - 6455
  • [6] Robust Classification of Incomplete Time Series with Noisy Labels
    Qin, Xin
    Yao, Pengshuai
    Liu, Mengna
    Cheng, Xu
    Shi, Fan
    Guo, Lili
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 2620 - 2625
  • [7] Coastal Image Classification under Noisy Labels
    Tai, Xiaoxiao
    Wang, Guangxing
    Grecos, Christos
    Ren, Peng
    JOURNAL OF COASTAL RESEARCH, 2020, : 151 - 156
  • [8] Classification of calcareous algae under noisy labels
    Bento, Vitor
    Kohler, Manoela
    Pacheco, Marco Aurelio
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (06): : 3197 - 3214
  • [9] Classification of calcareous algae under noisy labels
    Vitor Bento
    Manoela Kohler
    Marco Aurelio Pacheco
    Neural Computing and Applications, 2024, 36 : 3197 - 3214
  • [10] Towards Robust Learning with Noisy and Pseudo Labels for Text Classification
    Wen, Murtadha Ahmeda Bo
    Ao, Luo
    Pan, Shengfeng
    Su, Jianlin
    Cao, Xinxin
    Liu, Yunfeng
    INFORMATION SCIENCES, 2024, 661