Variational model for low-resource natural language generation in spoken dialogue systems

被引:4
|
作者
Van-Khanh Tran [1 ,2 ]
Le-Minh Nguyen [1 ]
机构
[1] Japan Adv Inst Sci & Technol, JAIST 1-1 Asahidai, Nomi, Ishikawa 9231292, Japan
[2] ICTU Thai Nguyen Univ, Univ Informat & Commun Technol, Thai Nguyen, Vietnam
来源
关键词
Neural language generation; Domain adaptation; Low-resource data; Variational autoencoder; Deconvolutional neural network; CNN; RNN; LSTM; DOMAIN ADAPTATION;
D O I
10.1016/j.csl.2020.101120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural Language Generation (NLG) plays a critical role in Spoken Dialogue Systems (SDSs), aims at converting a meaning representation into natural language utterances. Recent deep learning-based generators have shown improving results irrespective of providing sufficient annotated data. Nevertheless, how to build a generator that can effectively utilize as much of knowledge from a low-resource setting data is a crucial issue for NLG in SDSs. This paper presents a variational-based NLG framework to tackle the NLG problem of having limited annotated data in two scenarios, domain adaptation and low-resource in-domain training data. Based on this framework, we propose a novel adversarial domain adaptation NLG taclking the former issue, while the latter issue is also handled by a second proposed dual variational model. We extensively conducted the experiments on four different domains in a variety of training scenarios, in which the experimental results show that the proposed methods not only outperform previous methods when having sufficient training dataset but also show its ability to work acceptably well when there is a small amount of in-domain data or adapt quickly to a new domain with only a low-resource target domain data. (C) 2020 Published by Elsevier Ltd.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios
    Hedderich, Michael A.
    Lange, Lukas
    Adel, Heike
    Strotgen, Jannik
    Klakow, Dietrich
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 2545 - 2568
  • [32] Flexible guidance generation using user model in spoken dialogue systems
    Komatani, K
    Ueno, S
    Kawahara, T
    Okuno, HG
    41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2003, : 256 - 263
  • [33] Language Model Prior for Low-Resource Neural Machine Translation
    Baziotis, Christos
    Haddow, Barry
    Birch, Alexandra
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 7622 - 7634
  • [34] Reusing MT Components in Natural Language Generation for Dialogue Systems
    Amores, Gabriel
    Perez, Guillermo
    Manchon, Pilar
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2006, (37): : 215 - 221
  • [35] Towards Low-Resource Semi-Supervised Dialogue Generation with Meta-Learning
    Huang, Yi
    Feng, Junlan
    Ma, Shuo
    Du, Xiaoyu
    Wu, Xiaoting
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4123 - 4128
  • [36] Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation
    Lin, Shuai
    Zhou, Pan
    Liang, Xiaodan
    Tang, Jianheng
    Zhao, Ruihui
    Chen, Ziliang
    Lin, Liang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13362 - 13370
  • [37] Natural spoken dialogue systems for telephony applications
    Boyce, SJ
    COMMUNICATIONS OF THE ACM, 2000, 43 (09) : 29 - 34
  • [38] A Unified Data Augmentation Framework for Low-Resource Multi-domain Dialogue Generation
    Liu, Yongkang
    Nie, Ercong
    Feng, Shi
    Hua, Zheng
    Ding, Zifeng
    Wang, Daling
    Zhang, Yifei
    Schuetze, Hinrich
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT II, ECML PKDD 2024, 2024, 14942 : 162 - 177
  • [39] Prior latent distribution comparison for the RNN Variational Autoencoder in low-resource language modeling
    Kostiuk, Yevhen
    Lukashchuk, Mykola
    Gelbukh, Alexander
    Sidorov, Grigori
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4541 - 4549
  • [40] Shifting the design philosophy of spoken natural language dialogue: From invisible to transparent systems
    Karsenty L.
    International Journal of Speech Technology, 2002, 5 (02) : 147 - 157