DATA AUGMENTATION FOR LOW RESOURCE SENTIMENT ANALYSIS USING GENERATIVE ADVERSARIAL NETWORKS

被引:0
|
作者
Gupta, Rahul [1 ]
机构
[1] Amazon Com, Seattle, WA 98109 USA
关键词
Generative Adversarial Networks; sentiment analysis;
D O I
10.1109/icassp.2019.8682544
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Sentiment analysis is a task that may suffer from a lack of data in certain cases, as the datasets are often generated and annotated by humans. In cases where data is inadequate for training discriminative models, generate models may aid training via data augmentation. Generative Adversarial Networks (GANs) are one such model that has advanced the state of the art in several tasks, including as image and text generation. In this paper, I train GAN models on low resource datasets, then use them for the purpose of data augmentation towards improving sentiment classifier generalization. Given the constraints of limited data, I explore various techniques to train the GAN models. I also present an analysis of the quality of generated GAN data as more training data for the GAN is made available. In this analysis, the generated data is evaluated as a test set (against a model trained on real data points) as well as a training set to train classification models. Finally, I also conduct a visual analysis by projecting the generated and the real data into a two-dimensional space using the t-Distributed Stochastic Neighbor Embedding (t-SNE) method.
引用
收藏
页码:7380 / 7384
页数:5
相关论文
共 50 条
  • [1] Generative-Adversarial Networks for Low-Resource Language Data Augmentation in Machine Translation
    Zeng, Linda
    2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 11 - 18
  • [2] Biomedical Data Augmentation Using Generative Adversarial Neural Networks
    Calimeri, Francesco
    Marzullo, Aldo
    Stamile, Claudio
    Terracina, Giorgio
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 626 - 634
  • [3] SEQUENTIAL IOT DATA AUGMENTATION USING GENERATIVE ADVERSARIAL NETWORKS
    Tschuchnig, Maximilian Ernst
    Ferner, Cornelia
    Wegenkittl, Stefan
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4212 - 4216
  • [4] Efficient Approaches for Data Augmentation by Using Generative Adversarial Networks
    Saha, Pretom Kumar
    Logofatu, Doina
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EAAAI/EANN 2022, 2022, 1600 : 386 - 399
  • [5] Data Augmentation for Voiceprint Recognition Using Generative Adversarial Networks
    Lin, Yao-San
    Chen, Hung-Yu
    Huang, Mei-Ling
    Hsieh, Tsung-Yu
    ALGORITHMS, 2024, 17 (12)
  • [6] Generative Adversarial Networks for Bitcoin Data Augmentation
    Zola, Francesco
    Lukas Bruse, Jan
    Etxeberria Barrio, Xabier
    Galar, Mikel
    Orduna Urrutia, Raul
    2020 2ND CONFERENCE ON BLOCKCHAIN RESEARCH & APPLICATIONS FOR INNOVATIVE NETWORKS AND SERVICES (BRAINS), 2020, : 136 - 143
  • [7] Data Augmentation with Improved Generative Adversarial Networks
    Shi, Hongjiang
    Wang, Lu
    Ding, Guangtai
    Yang, Fenglei
    Li, Xiaoqiang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 73 - 78
  • [8] Data Augmentation Powered by Generative Adversarial Networks
    Poka, Karoly Bence
    Szemenyei, Marton
    2020 23RD IEEE INTERNATIONAL SYMPOSIUM ON MEASUREMENT AND CONTROL IN ROBOTICS (ISMCR), 2020,
  • [9] Conditional Generative Adversarial Networks with Adversarial Attack and Defense for Generative Data Augmentation
    Baek, Francis
    Kim, Daeho
    Park, Somin
    Kim, Hyoungkwan
    Lee, SangHyun
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2022, 36 (03)
  • [10] Adaptive Traffic Data Augmentation Using Generative Adversarial Networks for Optical Networks
    Li, Shuai
    Li, Jin
    Zhang, Min
    Wang, Danshi
    Song, Chuang
    Zhen, Xinghua
    2019 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2019,