Data augmentation for handwritten digit recognition using generative adversarial networks

被引:13
|
作者
Jha, Ganesh [1 ]
Cecotti, Hubert [1 ]
机构
[1] Calif State Univ Fresno Fresno State, Coll Sci & Math, Dept Comp Sci, 2576 E San Ramon MS ST 109, Fresno, CA 93740 USA
关键词
Machine learning; Neural networks; Classification; Generative adversarial networks; CHARACTER-RECOGNITION;
D O I
10.1007/s11042-020-08883-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Supervised learning techniques require labeled examples that can be time consuming to obtain. In particular, deep learning approaches, where all the feature extraction stages are learned within the artificial neural network, require a large number of labeled examples to train the model. Various data augmentation techniques can be performed to overcome this issue by taking advantage of known variations that have no impact on the label of an example. Typical solutions in computer vision and document analysis and recognition are based on geometric transformations (e.g. shift and rotation) and random elastic deformations of the original training examples. In this paper, we consider Generative Adversarial Networks (GAN), a technique that does not require prior knowledge of the possible variabilities that exist across examples to create novel artificial examples. In the case of a training dataset with a low number of labeled examples, which are described in a high dimensional space, the classifier may generalize poorly. Therefore, we aim at enriching databases of images or signals for improving the classifier performance by designing a GAN for creating artificial images. While adding more images through a GAN can help, the extent to which it will help is unknown, and it may degrade the performance if too many artificial images are added. The approach is tested on four datasets on handwritten digits (Latin, Bangla, Devanagri, and Oriya). The accuracy for each dataset shows that the addition of GAN generated images in the training dataset provides an improvement of the accuracy. However, the results suggest that the addition of too many GAN generated images deteriorates the performance.
引用
收藏
页码:35055 / 35068
页数:14
相关论文
共 50 条
  • [41] Tamil Language Handwritten Document Digitization and Analysis of the Impact of Data Augmentation Using Generative Adversarial Networks (GANs) on the Accuracy of CNN Model
    Murugesh, Venkatesh
    Parthasarathy, Aditya
    Gopinath, Gokul P.
    Khade, Anindita
    MACHINE LEARNING AND AUTONOMOUS SYSTEMS, 2022, 269 : 159 - 177
  • [42] Generative Adversarial Networks as an Advanced Data Augmentation Technique for MRI Data
    Konidaris, Filippos
    Tagaris, Thanos
    Sdraka, Maria
    Stafylopatis, Andreas
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 48 - 59
  • [43] Data Augmentation for Imbalanced HRRP Recognition Using Deep Convolutional Generative Adversarial Network
    Song, Yiheng
    Li, Yang
    Wang, Yanhua
    Hu, Cheng
    IEEE ACCESS, 2020, 8 : 201686 - 201695
  • [44] Cancer classification with data augmentation based on generative adversarial networks
    Wei, Kaimin
    Li, Tianqi
    Huang, Feiran
    Chen, Jinpeng
    He, Zefan
    FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (02)
  • [45] A deep data augmentation framework based on generative adversarial networks
    Wang, Qiping
    Luo, Ling
    Xie, Haoran
    Rao, Yanghui
    Lau, Raymond Y. K.
    Zhang, Detian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42871 - 42887
  • [46] Explainable evaluation of generative adversarial networks for wearables data augmentation
    Narteni, Sara
    Orani, Vanessa
    Ferrari, Enrico
    Verda, Damiano
    Cambiaso, Enrico
    Mongelli, Maurizio
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 145
  • [47] A deep data augmentation framework based on generative adversarial networks
    Qiping Wang
    Ling Luo
    Haoran Xie
    Yanghui Rao
    Raymond Y.K. Lau
    Detian Zhang
    Multimedia Tools and Applications, 2022, 81 : 42871 - 42887
  • [48] Seismic Data Augmentation Based on Conditional Generative Adversarial Networks
    Li, Yuanming
    Ku, Bonhwa
    Zhang, Shou
    Ahn, Jae-Kwang
    Ko, Hanseok
    SENSORS, 2020, 20 (23) : 1 - 13
  • [49] Generative Adversarial Networks for Data Augmentation in Structural Adhesive Inspection
    Peres, Ricardo Silva
    Azevedo, Miguel
    Araujo, Sara Oleiro
    Guedes, Magno
    Miranda, Fabio
    Barata, Jose
    APPLIED SCIENCES-BASEL, 2021, 11 (07):
  • [50] Generative adversarial networks for data augmentation in machine fault diagnosis
    Shao, Siyu
    Wang, Pu
    Yan, Ruqiang
    COMPUTERS IN INDUSTRY, 2019, 106 : 85 - 93