Data augmentation for handwritten digit recognition using generative adversarial networks

被引:14
|
作者
Jha, Ganesh [1 ]
Cecotti, Hubert [1 ]
机构
[1] Calif State Univ Fresno Fresno State, Coll Sci & Math, Dept Comp Sci, 2576 E San Ramon MS ST 109, Fresno, CA 93740 USA
关键词
Machine learning; Neural networks; Classification; Generative adversarial networks; CHARACTER-RECOGNITION;
D O I
10.1007/s11042-020-08883-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Supervised learning techniques require labeled examples that can be time consuming to obtain. In particular, deep learning approaches, where all the feature extraction stages are learned within the artificial neural network, require a large number of labeled examples to train the model. Various data augmentation techniques can be performed to overcome this issue by taking advantage of known variations that have no impact on the label of an example. Typical solutions in computer vision and document analysis and recognition are based on geometric transformations (e.g. shift and rotation) and random elastic deformations of the original training examples. In this paper, we consider Generative Adversarial Networks (GAN), a technique that does not require prior knowledge of the possible variabilities that exist across examples to create novel artificial examples. In the case of a training dataset with a low number of labeled examples, which are described in a high dimensional space, the classifier may generalize poorly. Therefore, we aim at enriching databases of images or signals for improving the classifier performance by designing a GAN for creating artificial images. While adding more images through a GAN can help, the extent to which it will help is unknown, and it may degrade the performance if too many artificial images are added. The approach is tested on four datasets on handwritten digits (Latin, Bangla, Devanagri, and Oriya). The accuracy for each dataset shows that the addition of GAN generated images in the training dataset provides an improvement of the accuracy. However, the results suggest that the addition of too many GAN generated images deteriorates the performance.
引用
收藏
页码:35055 / 35068
页数:14
相关论文
共 50 条
  • [11] MCMC Based Generative Adversarial Networks for Handwritten Numeral Augmentation
    Zhang, He
    Luo, Chunbo
    Yu, Xingrui
    Ren, Peng
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 : 2702 - 2710
  • [12] Data Augmentation for EEG-Based Emotion Recognition Using Generative Adversarial Networks
    Bao, Guangcheng
    Yan, Bin
    Tong, Li
    Shu, Jun
    Wang, Linyuan
    Yang, Kai
    Zeng, Ying
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2021, 15
  • [13] Data Augmentation with Generative Adversarial Networks for Grocery Product Image Recognition
    Wei, Yuchen
    Xu, Shuxiang
    Son Tran
    Kang, Byeong
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 963 - 968
  • [14] Biomedical Data Augmentation Using Generative Adversarial Neural Networks
    Calimeri, Francesco
    Marzullo, Aldo
    Stamile, Claudio
    Terracina, Giorgio
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 626 - 634
  • [15] SEQUENTIAL IOT DATA AUGMENTATION USING GENERATIVE ADVERSARIAL NETWORKS
    Tschuchnig, Maximilian Ernst
    Ferner, Cornelia
    Wegenkittl, Stefan
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4212 - 4216
  • [16] Efficient Approaches for Data Augmentation by Using Generative Adversarial Networks
    Saha, Pretom Kumar
    Logofatu, Doina
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EAAAI/EANN 2022, 2022, 1600 : 386 - 399
  • [17] Speech emotion recognition using data augmentation method by cycle-generative adversarial networks
    Shilandari, Arash
    Marvi, Hossein
    Khosravi, Hossein
    Wang, Wenwu
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (07) : 1955 - 1962
  • [18] Speech emotion recognition using data augmentation method by cycle-generative adversarial networks
    Arash Shilandari
    Hossein Marvi
    Hossein Khosravi
    Wenwu Wang
    Signal, Image and Video Processing, 2022, 16 : 1955 - 1962
  • [19] Generative Adversarial Networks for Bitcoin Data Augmentation
    Zola, Francesco
    Lukas Bruse, Jan
    Etxeberria Barrio, Xabier
    Galar, Mikel
    Orduna Urrutia, Raul
    2020 2ND CONFERENCE ON BLOCKCHAIN RESEARCH & APPLICATIONS FOR INNOVATIVE NETWORKS AND SERVICES (BRAINS), 2020, : 136 - 143
  • [20] Data Augmentation with Improved Generative Adversarial Networks
    Shi, Hongjiang
    Wang, Lu
    Ding, Guangtai
    Yang, Fenglei
    Li, Xiaoqiang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 73 - 78