Cost-Sensitive Variational Autoencoding Classifier for Imbalanced Data Classification

Cited by: 3
Authors
Liu, Fen [1]
Qian, Quan [1, 2, 3]
Affiliations
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Mat Genome Inst, Shanghai 200444, Peoples R China
[3] Zhejiang Lab, Hangzhou 311100, Peoples R China
Keywords
variational autoencoder; imbalanced data classification; cost-sensitive learning; MACHINE; SMOTE
DOI
10.3390/a15050139
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Classification is among the core tasks in machine learning. Existing classification algorithms typically assume that the classes are at least roughly balanced. When applied to imbalanced data, such classifiers tend to ignore the minority class in favor of overall accuracy. This is inadequate because minority-class samples are often the more important ones, such as positive samples in disease diagnosis. In this study, we propose a cost-sensitive variational autoencoding classifier that combines data-level and algorithm-level methods to address imbalanced data classification. Cost-sensitive factors assign a high cost to the misclassification of minority data, which biases the classifier toward the minority class. We also design task-specific misclassification costs by embedding domain knowledge. Experimental results show that the proposed method performs well on the classification of bulk amorphous materials.
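The idea of assigning a higher cost to minority-class errors can be illustrated with a generic cost-weighted cross-entropy. The following is a minimal sketch only, not the loss used in the paper: the function names, the inverse-frequency cost heuristic, and the toy probabilities are all assumptions made for illustration, and the variational autoencoder component and domain-knowledge costs described in the abstract are not modeled here.

```python
import math


def inverse_frequency_costs(labels, n_classes):
    """Hypothetical cost heuristic: rarer classes receive larger costs.

    cost[c] = n_samples / (n_classes * count[c])
    """
    counts = [0] * n_classes
    for y in labels:
        counts[y] += 1
    n = len(labels)
    return [n / (n_classes * c) if c else 0.0 for c in counts]


def cost_sensitive_cross_entropy(probs, labels, costs):
    """Cross-entropy in which each sample is weighted by the cost of its true class.

    probs  : per-sample lists of predicted class probabilities
    labels : true class indices
    costs  : per-class misclassification costs
    The result is normalized by the sum of the weights.
    """
    total = 0.0
    weight_sum = 0.0
    for p, y in zip(probs, labels):
        w = costs[y]
        total += -w * math.log(max(p[y], 1e-12))  # clamp to avoid log(0)
        weight_sum += w
    return total / weight_sum


# Toy data (assumed): 9 majority samples (class 0), 1 minority sample (class 1).
labels = [0] * 9 + [1]
# A classifier that favors the majority class: confident on class 0,
# and wrong on the single minority sample.
probs = [[0.9, 0.1]] * 9 + [[0.8, 0.2]]

costs = inverse_frequency_costs(labels, n_classes=2)
unweighted = cost_sensitive_cross_entropy(probs, labels, [1.0, 1.0])
weighted = cost_sensitive_cross_entropy(probs, labels, costs)
print(costs, unweighted, weighted)
```

With uniform costs the single minority error contributes little to the loss, whereas with the cost-weighted version it dominates, so a model trained on the weighted loss is pushed toward the minority class.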
Pages: 22