Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue System

被引:6
|
作者
Hu, Zhiyuan [1 ]
Feng, Yue [2 ]
Luu, Anh Tuan [3 ]
Hooi, Bryan [1 ]
Lipani, Aldo [2 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] UCL, London, England
[3] Nanyang Technol Univ, Singapore, Singapore
关键词
Dialogue system; Large Language Model; User Simulation;
D O I
10.1145/3583780.3615220
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dialogue systems and large language models (LLMs) have gained considerable attention. However, the direct utilization of LLMs as task-oriented dialogue (TOD) models has been found to underperform compared to smaller task-specific models. Nonetheless, it is crucial to acknowledge the significant potential of LLMs and explore improved approaches for leveraging their impressive abilities. Motivated by the goal of leveraging LLMs, we propose an alternative approach called User-Guided Response Optimization (UGRO) to combine it with a smaller TOD model. This approach uses LLM as an annotation-free user simulator to assess dialogue responses, combining them with smaller fine-tuned end-to-end TOD models. By utilizing the satisfaction feedback generated by LLMs, UGRO further optimizes the supervised fine-tuned TOD model. Specifically, the TOD model takes the dialogue history as input and, with the assistance of the user simulator's feedback, generates high-satisfaction responses that meet the user's requirements. Through empirical experiments on two TOD benchmarks, we validate the effectiveness of our method. The results demonstrate that our approach outperforms previous state-of-the-art (SOTA) results.
引用
收藏
页码:3953 / 3957
页数:5
相关论文
共 44 条
  • [41] Navigating Technological Shifts: An Examination of User Inertia and Technology Prestige in Large-Language-Model AI Chatbot Transition
    Xi, Yipeng
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2024,
  • [42] Potential applications of innovative AI-based tools in hydrogen energy development: Leveraging large language model technologies
    Shahin, Matin
    Simjoo, Mohammad
    INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2025, 102 : 918 - 936
  • [43] Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-World Multi-Turn Dialogue
    Yang, Songhua
    Zhao, Hanjie
    Zhu, Senbin
    Zhou, Guangyu
    Xu, Hongfei
    Jia, Yuxiang
    Zan, Hongying
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19368 - 19376
  • [44] Conversational Recommender System and Large Language Model Are Made for Each Other in E-commerce Pre-sales Dialogue
    Liu, Yuanxing
    Zhang, Wei-Nan
    Chen, Yifan
    Zhang, Yuchi
    Bai, Haopeng
    Feng, Fan
    Cui, Hengbin
    Lie, Yongbin
    Che, Wanxiang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 9587 - 9605