Variational Autoencoder Based Synthetic Data Generation for Imbalanced Learning

Cited by: 0

Authors:

Wan, Zhiqiang [1]

Zhang, Yazhou [1]

He, Haibo [1]

Affiliations:

[1] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA

Source:

2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI) | 2017

Funding:

National Science Foundation (USA);

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Discovering patterns in imbalanced data plays an important role in numerous applications, such as health services, cybersecurity, and financial engineering. However, imbalanced data greatly compromise the performance of most learning algorithms. Recently, various synthetic sampling methods have been proposed to balance the dataset. Although these methods have achieved great success on many datasets, they are less effective for high-dimensional data, such as images. In this paper, we propose a variational autoencoder (VAE) based synthetic data generation method for imbalanced learning. A VAE can produce new samples that are similar to, but not exactly the same as, those in the original dataset. We evaluate our proposed method and compare it with traditional synthetic sampling methods on various datasets under five evaluation metrics. The experimental results demonstrate the effectiveness of the proposed method.

Pages: 1500 - 1506

Page count: 7
