Evaluation of a hierarchical reinforcement learning spoken dialogue system

被引:33
|
作者
Cuayahuitl, Heriberto [1 ]
Renals, Steve [1 ]
Lemon, Oliver [1 ]
Shimodaira, Hiroshi [1 ]
机构
[1] Univ Edinburgh, Inst Communicating & Collaborat Syst, Sch Informat, Edinburgh EH8 9AB, Midlothian, Scotland
来源
COMPUTER SPEECH AND LANGUAGE | 2010年 / 24卷 / 02期
关键词
Spoken dialogue systems; Hierarchical reinforcement learning; Human-machine dialogue simulation; Dialogue strategies; System evaluation; MODEL;
D O I
10.1016/j.csl.2009.07.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe an evaluation of spoken dialogue strategies designed using hierarchical reinforcement learning agents. The dialogue strategies were learnt in a simulated environment and tested in a laboratory setting with 32 users. These dialogues were used to evaluate three types of machine dialogue behaviour: hand-coded, fully-learnt and semi-learnt. These experiments also served to evaluate the realism of simulated dialogues using two proposed metrics contrasted with 'Precision-Recall'. The learnt dialogue behaviours used the Semi-Markov Decision Process (SMDP) model, and we report the first evaluation of this model in a realistic conversational environment. Experimental results in the travel planning domain provide evidence to support the following claims: (a) hierarchical semi-learnt dialogue agents are a better alternative (with higher overall performance) than deterministic or fully-learnt behaviour; (b) spoken dialogue strategies learnt with highly coherent user behaviour and conservative recognition error rates (keyword error rate of 20%) can outperform a reasonable hand-coded strategy; and (c) hierarchical reinforcement learning dialogue agents are feasible and promising for the (semi) automatic design of optimized dialogue behaviours in larger-scale systems. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:395 / 429
页数:35
相关论文
共 50 条
  • [1] Empirical evaluation of a reinforcement learning spoken dialogue system
    Singh, S
    Kearns, M
    Litman, DJ
    Walker, MA
    SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), 2000, : 645 - 651
  • [2] Reinforcement learning for spoken dialogue systems
    Singh, S
    Kearns, M
    Litman, D
    Walker, M
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 956 - 962
  • [3] An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email
    Walker, Marilyn A.
    Journal of Artificial Intelligence Research, 2001, 12 (00):
  • [4] An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email
    Walker, MA
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 12 : 387 - 416
  • [5] A Multi-Agent Reinforcement Learning Algorithm for Disambiguation in a Spoken Dialogue System
    Wang, Fangju
    INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 116 - 123
  • [6] Spoken dialogue system for learning Braille
    Araki, Masahiro
    Shibahara, Kana
    Mizukami, Yuko
    2011 35TH IEEE ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2011, : 152 - 156
  • [7] Recent research advances in Reinforcement Learning in Spoken Dialogue Systems
    Frampton, Matthew
    Lemon, Oliver
    KNOWLEDGE ENGINEERING REVIEW, 2009, 24 (04): : 375 - 408
  • [8] Reinforcement learning for parameter estimation in statistical spoken dialogue systems
    Jurcicek, Filip
    Thomson, Blaise
    Young, Steve
    COMPUTER SPEECH AND LANGUAGE, 2012, 26 (03): : 168 - 192
  • [9] Reinforcement learning of dialogue strategies with hierarchical abstract machines
    Cuayahuitl, Heriberto
    Renals, Steve
    Lemon, Oliver
    Shimodaira, Hiroshi
    2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 182 - +
  • [10] User modeling for spoken dialogue system evaluation
    Eckert, W
    Levin, E
    Pieraccini, R
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 80 - 87