Learning classifiers from imbalanced data based on biased minimax probability machine

被引:0
|
作者
Huang, KZ [1 ]
Yang, HQ [1 ]
King, I [1 ]
Lyu, MR [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Shatin, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of the binary classification on imbalanced data, in which nearly all the instances are labelled as one class, while far fewer instances are labelled as the other class, usually the more important class. Traditional machine learning methods seeking an accurate performance over a full range of instances are not suitable to deal with this problem, since they tend to classify all the data into the majority, usually the less important class. Moreover, some current methods have tried to utilize some intermediate factors, e.g., the distribution of the training set, the decision thresholds or the cost matrices, to influence the bias of the classification. However, it remains uncertain whether these methods can improve the performance in a systematic way. In this paper, we propose a novel model named Biased Minimax Probability Machine. Different from previous methods, this model directly controls the worst-case real accuracy of classification of the future data to build up biased classifiers. Hence, it provides a rigorous treatment on imbalanced data. The experimental results on the novel model comparing with those of three competitive methods, i.e., the Naive Bayesian classifier, the k-Nearest Neighbor method, and the decision tree method C4.5, demonstrate the superiority of our novel model.
引用
收藏
页码:558 / 563
页数:6
相关论文
共 50 条
  • [21] Parallel classifiers ensemble with hierarchical machine learning for imbalanced classes
    Zhang, Yun
    Luo, Bing
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 94 - 99
  • [22] Types of minority class examples and their influence on learning classifiers from imbalanced data
    Napierala, Krystyna
    Stefanowski, Jerzy
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2016, 46 (03) : 563 - 597
  • [23] Types of minority class examples and their influence on learning classifiers from imbalanced data
    Krystyna Napierala
    Jerzy Stefanowski
    Journal of Intelligent Information Systems, 2016, 46 : 563 - 597
  • [24] Fair Graph Representation Learning with Imbalanced and Biased Data
    Wang, Yu
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1557 - 1558
  • [25] IMBALANCED DATA CLASSIFICATION BASED ON EXTREME LEARNING MACHINE AUTOENCODER
    Shen, Chu
    Zhang, Su-Fang
    Zhai, Jun-Hal
    Luo, Ding-Sheng
    Chen, Jun-Fen
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2018, : 399 - 404
  • [26] Machine learning for mining imbalanced data
    Arafat, Md. Yasir
    Hoque, Sabera
    Xu, Shuxiang
    Farid, Dewan Md
    IAENG International Journal of Computer Science, 2019, 46 (02) : 332 - 348
  • [27] Customer purchase prediction from the perspective of imbalanced data: A machine learning framework based on factorization machine
    Chen, Shui-xia
    Wang, Xiao-kang
    Zhang, Hong-yu
    Wang, Jian-qiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 173
  • [28] Active Learning with Abstaining Classifiers for Imbalanced Drifting Data Streams
    Korycki, Lukasz
    Cano, Alberto
    Krawczyk, Bartosz
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2334 - 2343
  • [29] Multi-label learning via minimax probability machine
    Rastogi, Reshma
    Jain, Sambhav
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2022, 145 : 1 - 17
  • [30] Twin minimax probability extreme learning machine for pattern recognition
    Ma, Jun
    Yang, Liming
    Wen, Yakun
    Sun, Qun
    KNOWLEDGE-BASED SYSTEMS, 2020, 187