Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System

被引:0
|
作者
Zhang, Jianguo [1 ]
Roller, Stephen [2 ]
Qian, Kun [3 ]
Liu, Zhiwei [1 ]
Meng, Rui [1 ]
Heinecke, Shelby [1 ]
Wang, Huan [1 ]
Savarese, Silvio [1 ]
Xiong, Caiming [1 ]
机构
[1] Salesforce AI, Atlanta, GA 30326 USA
[2] Character AI, Menlo Pk, CA USA
[3] Columbia Univ, New York, NY 10027 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
End-to-end task-oriented dialogue (TOD) systems have achieved promising performance by leveraging sophisticated natural language understanding and natural language generation capabilities of pre-trained models. This work enables the TOD systems with more flexibility through a simple cache. The cache provides the flexibility to dynamically update the TOD systems and handle both existing and unseen dialogue scenarios. Towards this end, we first fine-tune a retrieval module to effectively retrieve the most relevant information entries from the cache. We then train end-to-end TOD models that can refer to and ground on both dialogue history and retrieved information during TOD generation. The introduced cache is straightforward to construct, and the backbone models of TOD systems are compatible with existing pre-trained generative models. Extensive experiments demonstrate the superior performance of our framework, with a notable improvement in non-empty joint goal accuracy by 6.7% compared to strong baselines.
引用
收藏
页码:509 / 518
页数:10
相关论文
共 50 条
  • [41] ITERATIVE POLICY LEARNING IN END-TO-END TRAINABLE TASK-ORIENTED NEURAL DIALOG MODELS
    Liu, Bing
    Lane, Ian
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 482 - 489
  • [42] KB-Transformer: Incorporating Knowledge into End-to-End Task-Oriented Dialog Systems
    E, Haihong
    Zhang, Wenjing
    Song, Meina
    2019 15TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG 2019), 2019, : 44 - 48
  • [43] An End-to-End Trainable Neural Network Model with Belief Tracking for Task-Oriented Dialog
    Liu, Bing
    Lane, Ian
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2506 - 2510
  • [44] An End-to-End Human Simulator for Task-Oriented Multimodal Human-Robot Collaboration
    Shervedani, Afagh Mehri
    Li, Siyu
    Monaikul, Natawut
    Abbasi, Bahareh
    Di Eugenio, Barbara
    Zefran, Milos
    2023 32ND IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, RO-MAN, 2023, : 614 - 620
  • [45] End-to-End Task-Oriented Dialog Modeling with Semi-Structured Knowledge Management
    Gao, Silin
    Takanobu, Ryuichi
    Bosselut, Antoine
    Huang, Minlie
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2022, 30 : 2173 - 2187
  • [46] SUMBT plus LaRL: Effective Multi-Domain End-to-End Neural Task-Oriented Dialog System
    Lee, Hwaran
    Jo, Seokhwan
    Kim, Hyungjun
    Jung, Sangkeun
    Kim, Tae-Yoon
    IEEE ACCESS, 2021, 9 : 116133 - 116146
  • [47] FG2SEQ: EFFECTIVELY ENCODING KNOWLEDGE FOR END-TO-END TASK-ORIENTED DIALOG
    He, Zhenhao
    He, Yuhong
    Wu, Qingyao
    Chen, Jian
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8029 - 8033
  • [48] Mars: Modeling Context & State Representations with Contrastive Learning for End-to-End Task-Oriented Dialog
    Sun, Haipeng
    Bao, Junwei
    Wu, Youzheng
    He, Xiaodong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 11139 - 11160
  • [49] END-TO-END DYNAMIC QUERY MEMORY NETWORK FOR ENTITY-VALUE INDEPENDENT TASK-ORIENTED DIALOG
    Wu, Chien-Sheng
    Madotto, Andrea
    Winata, Genta Indra
    Fung, Pascale
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6154 - 6158
  • [50] Hierarchical Attention and Knowledge Matching Networks With Information Enhancement for End-to-End Task-Oriented Dialog Systems
    He, Junqing
    Wang, Bing
    Fu, Mingming
    Yang, Tianqi
    Zhao, Xuemin
    IEEE ACCESS, 2019, 7 : 18871 - 18883