Addressing Class Imbalance in Non-Binary Classification Problems

被引:2
|
作者
Seliya, Naeem [1 ]
Xu, Zhiwei [1 ]
Khoshgoftaar, Taghi M. [2 ]
机构
[1] Univ Michigan, Comp Informat Sci, 4901 Evergreen Rd, Dearborn, MI 48128 USA
[2] Florida Atlantic Univ, Comp Sci Engn, Boca Raton, FL 33431 USA
关键词
Machine learning; class imbalance; non-binary classifiers; data sampling; artificial intelligence;
D O I
10.1109/ICTAI.2008.120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of class imbalance in machine learning is quite real and cumbersome when it comes to building a useful and practical classification model. We present a unique insight into addressing class imbalance for classification problems that involve three or more categories, i.e. non-binary. This study is different than related works in the literature because most works focus on addressing class imbalance only for binary classification problems, even if it means transforming a non-binary dataset into a binary classification problem. We propose an effective, yet simple approach to alleviating class imbalance issues when the classification problem involves more than two classes. The process, with four different methods, is based on applying random undersampling and random oversampling to different parts of the dataset for achieving better classification performance. The proposed data sampling methods are evaluated in the context of two real-world datasets obtained from the UCI Repository for Machine Learning Databases, and two commonly used classification algorithms: C4.5 and RIPPER. Our results demonstrate that the multi-group classification accuracy increases significantly in most cases after the proposed data sampling methods are applied. The positive outcome of this study motivates us to further our research on class imbalance and non-binary classification problems.
引用
收藏
页码:460 / +
页数:2
相关论文
共 50 条
  • [1] Non-binary classification trees
    Keprta, S
    STATISTICS AND COMPUTING, 1996, 6 (03) : 231 - 243
  • [2] Seeking non-binary solutions to binary problems
    Ellis Montalban, Paloma B.
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2023, 58 : 530 - 530
  • [3] A class of non-binary matroids with many binary miners
    Mills, AD
    Oxley, JG
    DISCRETE MATHEMATICS, 1999, 207 (1-3) : 173 - 187
  • [4] A hybrid tractable class for non-binary CSPs
    El Mouelhi, Achref
    Jegou, Philippe
    Terrioux, Cyril
    CONSTRAINTS, 2015, 20 (04) : 383 - 413
  • [5] A hybrid tractable class for non-binary CSPs
    Achref El Mouelhi
    Philippe Jégou
    Cyril Terrioux
    Constraints, 2015, 20 : 383 - 413
  • [6] A Hybrid Tractable Class for Non-Binary CSPs
    El Mouelhi, Achref
    Jegou, Philippe
    Terrioux, Cyril
    2013 IEEE 25TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2013, : 947 - 954
  • [7] Encodings of non-binary constraint satisfaction problems
    Stergiou, K
    Walsh, T
    SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), 1999, : 163 - 168
  • [8] On the conversion between non-binary and binary constraint satisfaction problems
    Bacchus, F
    van Beek, P
    FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 311 - 318
  • [9] Addressing class imbalance in deep learning for acoustic target classification
    Pala, Ahmet
    Oleynik, Anna
    Utseth, Ingrid
    Handegard, Nils Olav
    ICES JOURNAL OF MARINE SCIENCE, 2023, 80 (10) : 2530 - 2544
  • [10] A New Class of Explanations for Classifiers with Non-binary Features
    Ji, Chunxi
    Darwiche, Adnan
    LOGICS IN ARTIFICIAL INTELLIGENCE, JELIA 2023, 2023, 14281 : 106 - 122