Evaluation of a hierarchical reinforcement learning spoken dialogue system

被引:33
|
作者
Cuayahuitl, Heriberto [1 ]
Renals, Steve [1 ]
Lemon, Oliver [1 ]
Shimodaira, Hiroshi [1 ]
机构
[1] Univ Edinburgh, Inst Communicating & Collaborat Syst, Sch Informat, Edinburgh EH8 9AB, Midlothian, Scotland
来源
COMPUTER SPEECH AND LANGUAGE | 2010年 / 24卷 / 02期
关键词
Spoken dialogue systems; Hierarchical reinforcement learning; Human-machine dialogue simulation; Dialogue strategies; System evaluation; MODEL;
D O I
10.1016/j.csl.2009.07.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe an evaluation of spoken dialogue strategies designed using hierarchical reinforcement learning agents. The dialogue strategies were learnt in a simulated environment and tested in a laboratory setting with 32 users. These dialogues were used to evaluate three types of machine dialogue behaviour: hand-coded, fully-learnt and semi-learnt. These experiments also served to evaluate the realism of simulated dialogues using two proposed metrics contrasted with 'Precision-Recall'. The learnt dialogue behaviours used the Semi-Markov Decision Process (SMDP) model, and we report the first evaluation of this model in a realistic conversational environment. Experimental results in the travel planning domain provide evidence to support the following claims: (a) hierarchical semi-learnt dialogue agents are a better alternative (with higher overall performance) than deterministic or fully-learnt behaviour; (b) spoken dialogue strategies learnt with highly coherent user behaviour and conservative recognition error rates (keyword error rate of 20%) can outperform a reasonable hand-coded strategy; and (c) hierarchical reinforcement learning dialogue agents are feasible and promising for the (semi) automatic design of optimized dialogue behaviours in larger-scale systems. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:395 / 429
页数:35
相关论文
共 50 条
  • [41] Autonomous Robotic Dialogue System with Reinforcement Learning for Elderlies with Dementia
    Magyar, Jan
    Kobayashi, Masahiko
    Nishio, Shuichi
    Sincak, Peter
    Ishiguro, Hiroshi
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3416 - 3421
  • [42] Design and Implementation of Spoken Dialogue Based Multiplication Table Learning Support System
    Okazaki, Yasuhisa
    Egashira, Hiroyuki
    Watanabe, Kenzi
    Kondo, Hiroki
    TOWARDS SUSTAINABLE AND SCALABLE EDUCATIONAL INNOVATIONS INFORMED BY LEARNING SCIENCES, 2005, 133 : 844 - 847
  • [43] Contrastive Learning Based ASR Robust Knowledge Selection For Spoken Dialogue System
    Zhu, Zhiyuan
    Liao, Yusheng
    Wang, Yu
    Guan, Yunfeng
    INTERSPEECH 2023, 2023, : 725 - 729
  • [44] A reinforcement learning parser for spoken language
    Jeong, H
    Hwang, KI
    Seo, MW
    IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 1285 - 1291
  • [45] Reward-Balancing for Statistical Spoken Dialogue Systems using Multi-objective Reinforcement Learning
    Ultes, Stefan
    Budzianowski, Pawel
    Casanueva, Inigo
    Mrksic, Nikola
    Rojas-Barahona, Lina
    Su, Pei-Hao
    Wen, Tsung-Hsien
    Gasic, Milica
    Young, Steve
    18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), 2017, : 65 - 70
  • [46] Metacognition and Learning in Spoken Dialogue Computer Tutoring
    Forbes-Riley, Kate
    Litman, Diane
    INTELLIGENT TUTORING SYSTEMS, PT 1, PROCEEDINGS, 2010, 6094 : 379 - 388
  • [47] LEARNING USER INTENTIONS IN SPOKEN DIALOGUE SYSTEMS
    Chinaei, Hamid R.
    Chaib-draa, Brahim
    Lamontagne, Luc
    ICAART 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, 2009, : 107 - +
  • [48] Towards integrated dialogue policy learning for multiple domains and intents using Hierarchical Deep Reinforcement Learning
    Saha, Tulika
    Gupta, Dhawal
    Saha, Sriparna
    Bhattacharyya, Pushpak
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162 (162)
  • [49] Learning Dialogue History for Spoken Language Understanding
    Zhang, Xiaodong
    Ma, Dehong
    Wang, Houfeng
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT I, 2018, 11108 : 120 - 132
  • [50] Automatically training a problematic dialogue predictor for a spoken dialogue system
    Walker, M.A. (WALKER@RESEARCH.ATT.COM), 1600, AI Access Foundation (16):