Cost-Sensitive Variational Autoencoding Classifier for Imbalanced Data Classification

被引:3
|
作者
Liu, Fen [1 ]
Qian, Quan [1 ,2 ,3 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Mat Genome Inst, Shanghai 200444, Peoples R China
[3] Zhejiang Lab, Hangzhou 311100, Peoples R China
关键词
variational autoencoder; imbalanced data classification; cost-sensitive learning; MACHINE; SMOTE;
D O I
10.3390/a15050139
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification is among the core tasks in machine learning. Existing classification algorithms are typically based on the assumption of at least roughly balanced data classes. When performing tasks involving imbalanced data, such classifiers ignore the minority data in consideration of the overall accuracy. The performance of traditional classification algorithms based on the assumption of balanced data distribution is insufficient because the minority-class samples are often more important than others, such as positive samples, in disease diagnosis. In this study, we propose a cost-sensitive variational autoencoding classifier that combines data-level and algorithm-level methods to solve the problem of imbalanced data classification. Cost-sensitive factors are introduced to assign a high cost to the misclassification of minority data, which biases the classifier toward minority data. We also designed misclassification costs closely related to tasks by embedding domain knowledge. Experimental results show that the proposed method performed the classification of bulk amorphous materials well.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] Classification cost: An empirical comparison among traditional classifier, Cost-Sensitive Classifier, and MetaCost
    Kim, Jungeun
    Choi, Keunho
    Kim, Gunwoo
    Suh, Yongmoo
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (04) : 4013 - 4019
  • [32] Machine learning based novel cost-sensitive seizure detection classifier for imbalanced EEG data sets
    Mohammad Khubeb Siddiqui
    Xiaodi Huang
    Ruben Morales-Menendez
    Nasir Hussain
    Khudeja Khatoon
    International Journal on Interactive Design and Manufacturing (IJIDeM), 2020, 14 : 1491 - 1509
  • [33] Machine learning based novel cost-sensitive seizure detection classifier for imbalanced EEG data sets
    Siddiqui, Mohammad Khubeb
    Huang, Xiaodi
    Morales-Menendez, Ruben
    Hussain, Nasir
    Khatoon, Khudeja
    INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2020, 14 (04): : 1491 - 1509
  • [34] An Adaptive Cost-sensitive Classifier
    Chen, Xiaolin
    Song, Enming
    Ma, Guangzhi
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 1, 2010, : 699 - 701
  • [35] Cost-sensitive Hybrid Neural Networks for Heterogeneous and Imbalanced Data
    Jiang, Xinxin
    Pan, Shirui
    Long, Guodong
    Chang, Jiang
    Jiang, Jing
    Zhang, Chengqi
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [36] Cost-Sensitive Learning based on Performance Metric for Imbalanced Data
    Aurelio, Yuri Sousa
    de Almeida, Gustavo Matheus
    de Castro, Cristiano Leite
    Braga, Antonio Padua
    NEURAL PROCESSING LETTERS, 2022, 54 (04) : 3097 - 3114
  • [37] Improving Imbalanced Dialogue Act Classification Using Cost-Sensitive Learning
    Miyagi, Takaaki
    Endo, Satoshi
    2022 JOINT 12TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 23RD INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS&ISIS), 2022,
  • [38] Cost-Sensitive Dual-Stream Residual Networks for Imbalanced Classification
    Ma, Congcong
    Mi, Jiaqi
    Gao, Wanlin
    Tao, Sha
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (03): : 4243 - 4261
  • [39] Cost-sensitive design of quadratic discriminant analysis for imbalanced data
    Bejaoui, Amine
    Elkhalil, Khalil
    Kammoun, Abla
    Alouini, Mohamed-Slim
    Al-Naffouri, Tareq
    PATTERN RECOGNITION LETTERS, 2021, 149 : 24 - 29
  • [40] Cost-Sensitive Learning based on Performance Metric for Imbalanced Data
    Yuri Sousa Aurelio
    Gustavo Matheus de Almeida
    Cristiano Leite de Castro
    Antonio Padua Braga
    Neural Processing Letters, 2022, 54 : 3097 - 3114