Cost-Sensitive Variational Autoencoding Classifier for Imbalanced Data Classification

被引:3
|
作者
Liu, Fen [1 ]
Qian, Quan [1 ,2 ,3 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Mat Genome Inst, Shanghai 200444, Peoples R China
[3] Zhejiang Lab, Hangzhou 311100, Peoples R China
关键词
variational autoencoder; imbalanced data classification; cost-sensitive learning; MACHINE; SMOTE;
D O I
10.3390/a15050139
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification is among the core tasks in machine learning. Existing classification algorithms are typically based on the assumption of at least roughly balanced data classes. When performing tasks involving imbalanced data, such classifiers ignore the minority data in consideration of the overall accuracy. The performance of traditional classification algorithms based on the assumption of balanced data distribution is insufficient because the minority-class samples are often more important than others, such as positive samples, in disease diagnosis. In this study, we propose a cost-sensitive variational autoencoding classifier that combines data-level and algorithm-level methods to solve the problem of imbalanced data classification. Cost-sensitive factors are introduced to assign a high cost to the misclassification of minority data, which biases the classifier toward minority data. We also designed misclassification costs closely related to tasks by embedding domain knowledge. Experimental results show that the proposed method performed the classification of bulk amorphous materials well.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Cost-sensitive incremental Classification under the MapReduce framework for Mining Imbalanced Massive Data Streams
    Huang Yuwen
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2015, 18 (1-2): : 177 - 194
  • [42] A Cost-Sensitive Based Approach for Improving Associative Classification on Imbalanced Datasets
    Waiyamai, Kitsana
    Suwannarattaphoom, Phoonperm
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2014, 2014, 8556 : 31 - 42
  • [43] Cost-sensitive classification with inadequate labeled data
    Wang, Tao
    Qin, Zhenxing
    Zhang, Shichao
    Zhang, Chengqi
    INFORMATION SYSTEMS, 2012, 37 (05) : 508 - 516
  • [44] Cost-sensitive convolutional neural networks for imbalanced time series classification
    Geng, Yue
    Luo, Xinyu
    INTELLIGENT DATA ANALYSIS, 2019, 23 (02) : 357 - 370
  • [45] Cost-Sensitive Latent Space Learning for Imbalanced PolSAR Image Classification
    Wu, Qian
    Hou, Biao
    Wen, Zaidao
    Ren, Zhongle
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06): : 4802 - 4817
  • [46] Assessment and classification of grid stability with cost-sensitive stacked ensemble classifier
    Ramasamy, Karthikeyan
    Sundaramurthy, Arivoli
    Velusamy, Durgadevi
    AUTOMATIKA, 2023, 64 (04) : 783 - 797
  • [47] Swarm-based Cost-sensitive Decision Tree Using Optimized Rules for Imbalanced Data Classification
    Mansouri, Mehdi
    Nadimi-Shahraki, Mohammad H.
    Beheshti, Zahra
    JOURNAL OF BIONIC ENGINEERING, 2025,
  • [48] Cost-sensitive hierarchical classification via multi-scale information entropy for data with an imbalanced distribution
    Weijie Zheng
    Hong Zhao
    Applied Intelligence, 2021, 51 : 5940 - 5952
  • [49] Cost-Sensitive Learning of Deep Feature Representations From Imbalanced Data
    Khan, Salman H.
    Hayat, Munawar
    Bennamoun, Mohammed
    Sohel, Ferdous A.
    Togneri, Roberto
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (08) : 3573 - 3587
  • [50] Cost-sensitive hierarchical classification via multi-scale information entropy for data with an imbalanced distribution
    Zheng, Weijie
    Zhao, Hong
    APPLIED INTELLIGENCE, 2021, 51 (08) : 5940 - 5952