Variational Autoencoder Based Synthetic Data Generation for Imbalanced Learning

Citations: 0
Authors
Wan, Zhiqiang [1]
Zhang, Yazhou [1]
He, Haibo [1]
Affiliations
[1] University of Rhode Island, Department of Electrical, Computer and Biomedical Engineering, Kingston, RI 02881, USA
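Funding
National Science Foundation (USA)
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract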
Discovering patterns in imbalanced data plays an important role in numerous applications, such as health services, cyber security, and financial engineering. However, imbalanced data greatly compromise the performance of most learning algorithms. Recently, various synthetic sampling methods have been proposed to balance the dataset. Although these methods have achieved great success on many datasets, they are less effective for high-dimensional data, such as images. In this paper, we propose a variational autoencoder (VAE) based synthetic data generation method for imbalanced learning. A VAE can produce new samples that are similar to, but not exactly the same as, those in the original dataset. We evaluate our proposed method and compare it with traditional synthetic sampling methods on various datasets under five evaluation metrics. The experimental results demonstrate the effectiveness of the proposed method.
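The abstract does not spell out how the synthetic samples are drawn, so the following is only a minimal sketch of the generic VAE generation step, assuming a Gaussian latent space with a standard normal prior. `toy_decode` is a hypothetical stand-in for a trained decoder network, and padding the minority class up to the majority class size is an assumption, not something this record states.

```python
import math
import random


def kl_to_standard_normal(mu, log_var):
    """Closed-form KL(N(mu, diag(exp(log_var))) || N(0, I)), the usual VAE regularizer."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, log_var))


def reparameterize(mu, log_var, rng):
    """Draw z = mu + sigma * eps with eps ~ N(0, I) (reparameterization trick)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, log_var)]


def toy_decode(z):
    """Hypothetical stand-in for a trained decoder: a fixed linear map to 3 features."""
    weights = [[0.5, -1.0], [1.5, 0.2], [-0.3, 0.8]]
    return [sum(w * zi for w, zi in zip(row, z)) for row in weights]


def oversample_minority(minority, n_majority, decode, latent_dim, seed=0):
    """Pad the minority class with decoded prior samples until it has n_majority items."""
    rng = random.Random(seed)
    n_new = max(0, n_majority - len(minority))
    synthetic = []
    for _ in range(n_new):
        # Sample a latent code from the prior N(0, I), then map it to data space.
        z = [rng.gauss(0.0, 1.0) for _ in range(latent_dim)]
        synthetic.append(decode(z))
    return minority + synthetic


# Assumed toy data: two minority samples with three features each.
minority = [[0.1, 0.2, 0.3], [0.0, -0.1, 0.4]]
balanced = oversample_minority(minority, n_majority=10, decode=toy_decode, latent_dim=2)
print(len(balanced))
```

In a real setting the decoder would be a neural network trained on the minority class together with an encoder, using a reconstruction term plus the KL term above; `reparameterize` is the step that keeps that training differentiable.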
Pages: 1500-1506
Number of pages: 7
Related papers
50 in total
  • [31] Generative Data Augmentation for Learning-based Electrical Impedance Tomography via Variational Autoencoder
    Zhan, Yangen
    Guan, Ru
    Ren, Shangjie
    Dong, Feng
    2021 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC 2021), 2021
  • [32] Semi-supervised learning of speech recognizers based on variational autoencoder and unsupervised data augmentation
    Ho, Hyeon
    Kang, Byung Ok
    Kwon, Oh-Wook
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (06): 578-586
  • [33] AR-ADASYN: angle radius-adaptive synthetic data generation approach for imbalanced learning
    Park, Hyejoon
    Kim, Hyunjoong
    STATISTICS AND COMPUTING, 2024, 34 (05)
  • [34] Unsupervised feature learning for electrocardiogram data using the convolutional variational autoencoder
    Jang, Jong-Hwan
    Kim, Tae Young
    Lim, Hong-Seok
    Yoon, Dukyong
    PLOS ONE, 2021, 16 (12)
  • [35] A class imbalanced wafer defect classification framework based on variational autoencoder generative adversarial network
    Wang, Yitian
    Wei, Yuxiang
    Wang, Huan
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (02)
  • [36] An Active Learning Method Based on Variational Autoencoder and DBSCAN Clustering
    Chen, Fang
    Zhang, Tao
    Liu, Ruilin
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [37] Learning from Synthetic Data Using a Stacked Multichannel Autoencoder
    Zhang, Xi
    Fu, Yanwei
    Jiang, Shanshan
    Sigal, Leonid
    Agam, Gady
    2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015: 461-464
  • [38] An Iterated Greedy Algorithm for Improving the Generation of Synthetic Patterns in Imbalanced Learning
    Javier Maestre-Garcia, Francisco
    Garcia-Martinez, Carlos
    Perez-Ortiz, Maria
    Antonio Gutierrez, Pedro
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2017, PT II, 2017, 10306: 513-524
  • [39] A Method for Generating Sea Clutter Data Based on Variational Autoencoder
    Deng, Xingyu
    Hui, Bingwei
    Han, Xing
    Gao, Fei
    Duan, Dawei
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024: 54-61
  • [40] MLSMOTE: Approaching imbalanced multilabel learning through synthetic instance generation
    Charte, Francisco
    Rivera, Antonio J.
    del Jesus, Maria J.
    Herrera, Francisco
    KNOWLEDGE-BASED SYSTEMS, 2015, 89: 385-397