Parameter-free classification in multi-class imbalanced data sets

被引:20
|
作者
Cerf, Loic [1 ]
Gay, Dominique [2 ]
Selmaoui-Folcher, Nazha [3 ]
Cremilleux, Bruno [4 ]
Boulicaut, Jean-Francois [5 ]
机构
[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil
[2] Orange Labs, F-22307 Lannion, France
[3] Univ New Caledonia, PPME EA3325, Noumea, New Caledonia
[4] Univ Caen, GREYC CNRS UMR6072, F-14032 Caen, France
[5] Univ Lyon, CNRS, INRIA, INSA Lyon,LIRIS,UMR5205, F-69621 Villeurbanne, France
关键词
Classification; Association rules; Multi-class context; Imbalanced data set; One-Versus-Each framework; DISCOVERY; PATTERNS; SMOTE;
D O I
10.1016/j.datak.2013.06.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many applications deal with classification in multi-class imbalanced contexts. In such difficult situations, classical CBA-like approaches (Classification Based on Association rules) show their limits. Most CBA-like methods actually are One-Vs-All approaches (OVA), i.e., the selected classification rules are relevant for one class and irrelevant for the union of the other classes. In this paper, we point out recurrent problems encountered by OVA approaches applied to multi-class imbalanced data sets (e.g., improper bias towards majority classes, conflicting rules). That is why we propose a new One-Versus-Each (OVE) framework. In this framework, a rule has to be relevant for one class and irrelevant for every other class taken separately. Our approach, called fitcare, is empirically validated on various benchmark data sets and our theoretical findings are confirmed. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:109 / 129
页数:21
相关论文
共 50 条
  • [1] PFSC: Parameter-free sphere classifier for imbalanced data classification
    Park, Yeontark
    Lee, Jong-Seok
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [2] A survey of multi-class imbalanced data classification methods
    Han, Meng
    Li, Ang
    Gao, Zhihui
    Mu, Dongliang
    Liu, Shujuan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 2471 - 2501
  • [3] Multi-class imbalanced big data classification on Spark
    Sleeman, William C.
    Krawczyk, Bartosz
    KNOWLEDGE-BASED SYSTEMS, 2021, 212
  • [4] A Combination Method for Multi-Class Imbalanced Data Classification
    Li, Hu
    Zou, Peng
    Han, Weihong
    Xia, Rongze
    2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 365 - 368
  • [5] Parameter-Free Loss for Class-Imbalanced Deep Learning in Image Classification
    Du, Jie
    Zhou, Yanhong
    Liu, Peng
    Vong, Chi-Man
    Wang, Tianfu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (06) : 3234 - 3240
  • [6] Selecting local ensembles for multi-class imbalanced data classification
    Krawczyk, Bartosz
    Cano, Alberto
    Wozniak, Michal
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [7] Undersampling with Support Vectors for Multi-Class Imbalanced Data Classification
    Krawczyk, Bartosz
    Bellinger, Colin
    Corizzo, Roberto
    Japkowicz, Nathalie
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [8] Parameter-Free Extreme Learning Machine for Imbalanced Classification
    Li, Li
    Zhao, Kaiyi
    Sun, Ruizhi
    Gan, Jiangzhang
    Yuan, Gang
    Liu, Tong
    NEURAL PROCESSING LETTERS, 2020, 52 (03) : 1927 - 1944
  • [9] A Parameter-Free Cleaning Method for SMOTE in Imbalanced Classification
    Yan, Yuanting
    Liu, Ruiqing
    Ding, Zihan
    Du, Xiuquan
    Chen, Jie
    Zhang, Yanping
    IEEE ACCESS, 2019, 7 : 23537 - 23548
  • [10] Parameter-Free Extreme Learning Machine for Imbalanced Classification
    Li Li
    Kaiyi Zhao
    Ruizhi Sun
    Jiangzhang Gan
    Gang Yuan
    Tong Liu
    Neural Processing Letters, 2020, 52 : 1927 - 1944