Variational Hierarchical Dialog Autoencoder for Dialog State Tracking Data Augmentation

被引:0
|
作者
Yoo, Kang Min [1 ]
Lee, Hanbit [1 ]
Dernoncourt, Franck [2 ]
Bui, Trung [2 ]
Chang, Walter [2 ]
Lee, Sang-Goo [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Adobe Res, San Jose, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent works have shown that generative data augmentation, where synthetic samples generated from deep generative models complement the training dataset, benefit NLP tasks. In this work, we extend this approach to the task of dialog state tracking for goal-oriented dialogs. Due to the inherent hierarchical structure of goal-oriented dialogs over utterances and related annotations, the deep generative model must be capable of capturing the coherence among different hierarchies and types of dialog features. We propose the Variational Hierarchical Dialog Autoencoder (VHDA) for modeling the complete aspects of goal-oriented dialogs, including linguistic features and underlying structured annotations, namely speaker information, dialog acts, and goals. The proposed architecture is designed to model each aspect of goal-oriented dialogs using inter-connected latent variables and learns to generate coherent goal-oriented dialogs from the latent spaces. To overcome training issues that arise from training complex variational models, we propose appropriate training strategies. Experiments on various dialog datasets show that our model improves the downstream dialog trackers' robustness via generative data augmentation. We also discover additional benefits of our unified approach to modeling goal-oriented dialogs dialog response generation and user simulation, where our model outperforms previous strong baselines.
引用
收藏
页码:3406 / 3425
页数:20
相关论文
共 50 条
  • [41] THE DIALOG OF THE CHRISTIAN CHURCHES AND THE STATE
    VISCHER, L
    UNIVERSITAS-STUTTGART, 1983, 25 (03): : 179 - 182
  • [42] Data Augmentation and Feature Extraction using Variational Autoencoder for Acoustic Modeling
    Nishizaki, Hiromitsu
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1263 - 1268
  • [43] Data Augmentation using Variational Autoencoder for Embedding based Speaker Verification
    Wu, Zhanghao
    Wang, Shuai
    Qian, Yanmin
    Yu, Kai
    INTERSPEECH 2019, 2019, : 1163 - 1167
  • [44] TripPy: A Triple Copy Strategy for Value Independent Neural Dialog State Tracking
    Heck, Michael
    van Niekerk, Carel
    Lubis, Nurul
    Geishauser, Christian
    Lin, Hsien-Chin
    Moresi, Marco
    Gasic, Milica
    SIGDIAL 2020: 21ST ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2020), 2020, : 35 - 44
  • [45] Exploiting the ASR N-Best by tracking multiple dialog state hypotheses
    Williams, Jason D.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 191 - 194
  • [46] Hierarchical Transformer for Task Oriented Dialog Systems
    Santra, Bishal
    Anusha, Potnuru
    Goyal, Pawan
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5649 - 5658
  • [47] DIALOG STATE TRACKING WITH ATTENTION-BASED SEQUENCE-TO-SEQUENCE LEARNING
    Hori, Takaaki
    Wang, Hai
    Hori, Chiori
    Watanabe, Shinji
    Harsham, Bret
    Le Roux, Jonathan
    Hershey, John R.
    Koji, Yusuke
    Jing, Yi
    Zhu, Zhaocheng
    Aikawa, Takeyuki
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 552 - 558
  • [48] Dialog State Tracking for Interview Coaching Using Two-Level LSTM
    Su, Ming-Hsiang
    Wu, Chung-Hsien
    Huang, Kun-Yi
    Yang, Tsung-Hsien
    Huang, Tsui-Ching
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [49] STO-CVAE: state transition-oriented conditional variational autoencoder for data augmentation in disability classification
    Seong Jin Bang
    Min Jung Kang
    Min-Goo Lee
    Sang Min Lee
    Complex & Intelligent Systems, 2024, 10 : 4201 - 4222
  • [50] A MULTICHANNEL CONVOLUTIONAL NEURAL NETWORK FOR CROSS-LANGUAGE DIALOG STATE TRACKING
    Shi, Hongjie
    Ushio, Takashi
    Endo, Mitsuru
    Yamagami, Katsuyoshi
    Horii, Noriaki
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 559 - 564