Analysis of imbalanced data using cost-sensitive learning

被引:0
|
作者
Kim, Sojin [1 ]
Song, Jongwoo [1 ]
机构
[1] Ewha Womans Univ, Dept Stat, Seoul, South Korea
关键词
Imbalanced classification; cost-sensitive learning; classification performance; hybrid classification; SMOTE;
D O I
10.1080/03610926.2025.2472792
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Typically, classification algorithms strive to maximize the accuracy. However, when dealing with significantly imbalanced data, accuracy may not be the most suitable metric. We believe that the most effective approach for handling imbalanced cases is to minimize the total costs. Unfortunately, precise costs for misclassification are often unavailable in real-world scenarios. To address this problem, we offer a simple and efficient search algorithm for cost-sensitive learning. We also introduce a new performance metric, imbalanced data classification performance (IDCP), which combines the F-score and the area under the curve (AUC). By utilizing the imbalance ratio (IR) as a crucial factor, we use IDCP to determine optimal weights in cost-sensitive learning. Through extensive experiments, we show that our method can find the optimal decision boundary in imbalanced datasets. Our code is available at https://github.com/sssojin/Imbalanced_Classification
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Cost-sensitive Hybrid Neural Networks for Heterogeneous and Imbalanced Data
    Jiang, Xinxin
    Pan, Shirui
    Long, Guodong
    Chang, Jiang
    Jiang, Jing
    Zhang, Chengqi
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [32] Ensemble cost-sensitive hypernetwork models for imbalanced data classification
    Sun, Kaiwei, 1600, Binary Information Press (10):
  • [33] Using Cost-Sensitive Learning and Feature Selection Algorithms to Improve the Performance of Imbalanced Classification
    Feng, Fang
    Li, Kuan-Ching
    Shen, Jun
    Zhou, Qingguo
    Yang, Xuhui
    IEEE ACCESS, 2020, 8 : 69979 - 69996
  • [34] LW-ELM: A Fast and Flexible Cost-Sensitive Learning Framework for Classifying Imbalanced Data
    Yu, Hualong
    Sun, Changyin
    Yang, Xibei
    Zheng, Shang
    Wang, Qi
    Xi, Xiaoyan
    IEEE ACCESS, 2018, 6 : 28488 - 28500
  • [35] Adaptive cost-sensitive learning: Improving the convergence of intelligent diagnosis models under imbalanced data
    Ren, Zhijun
    Zhu, Yongsheng
    Kang, Wei
    Fu, Hong
    Niu, Qingbo
    Gao, Dawei
    Yan, Ke
    Hong, Jun
    KNOWLEDGE-BASED SYSTEMS, 2022, 241
  • [36] Cost-Sensitive Large margin Distribution Machine for classification of imbalanced data
    Cheng, Fanyong
    Zhang, Jing
    Wen, Cuihong
    PATTERN RECOGNITION LETTERS, 2016, 80 : 107 - 112
  • [37] Large cost-sensitive margin distribution machine for imbalanced data classification
    Cheng, Fanyong
    Zhang, Jing
    Wen, Cuihong
    Liu, Zhaohua
    Li, Zuoyong
    NEUROCOMPUTING, 2017, 224 : 45 - 57
  • [38] Cost-Sensitive Learning for Imbalanced Bad Debt Datasets in Healthcare Industry
    Shi, Donghui
    Guan, Jian
    Zurada, Jozef
    2015 ASIA-PACIFIC CONFERENCE ON COMPUTER-AIDED SYSTEM ENGINEERING - APCASE 2015, 2015, : 30 - 35
  • [39] Cost-Sensitive Latent Space Learning for Imbalanced PolSAR Image Classification
    Wu, Qian
    Hou, Biao
    Wen, Zaidao
    Ren, Zhongle
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06): : 4802 - 4817
  • [40] Resampling and Cost-Sensitive Methods for Imbalanced Multi-instance Learning
    Wang, Xiaoguang
    Liu, Xuan
    Japkowicz, Nathalie
    Matwin, Stan
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 808 - 816