Automatic Learning and Evaluation of User-Centered Objective Functions for Dialogue System Optimisation

Authors
Rieser, Verena [1]
Lemon, Oliver [1]
Affiliations
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
Source
SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008 | 2008
Keywords
DOI
Not available
Chinese Library Classification number
H0 [Linguistics];
Discipline classification codes
030303 ; 0501 ; 050102 ;
Abstract
The ultimate goal when building dialogue systems is to satisfy the needs of real users, but quality assurance for dialogue strategies is a non-trivial problem. The applied evaluation metrics and resulting design principles are often obscure, emerge by trial-and-error, and are highly context dependent. This paper introduces data-driven methods for obtaining reliable objective functions for system design. In particular, we test whether an objective function obtained from Wizard-of-Oz (WOZ) data is a valid estimate of real users' preferences. We test this in a test-retest comparison between the model obtained from the WOZ study and the models obtained when testing with real users. We can show that, despite a low fit to the initial data, the objective function obtained from WOZ data makes accurate predictions for automatic dialogue evaluation, and, when automatically optimising a policy using these predictions, the improvement over a strategy simply mimicking the data becomes clear from an error analysis.
Pages: 2356 - 2361
Number of pages: 6
Related papers
50 in total
  • [21] User-centered design and evaluation of virtual environments
    Gabbard, Joseph L.
    Hix, Deborah
    Swan II, J. Edward
    IEEE Computer Graphics and Applications, 19 (06): 51 - 59
  • [22] Adaptive predictions in a user-centered recommender system
    Boyer, Anne
    Castagnos, Sylvain
    WEBIST 2007: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL WIA: WEB INTERFACES AND APPLICATIONS, 2007, : 51 - +
  • [23] Design and user-centered evaluation of recommender systems for mobile devices Methodology for user-centered evaluation of context-aware recommender systems
    Arana-Llanes, Julia Y.
    Rendon-Miranda, Juan C.
    Gonzalez-Serna, Juan G.
    Alejandres-Sanchez, Hugo O.
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 2, 2014, : 277 - 280
  • [24] A User-Centered Active Learning Approach for Appliance Recognition
    Shin, Eura
    Khamesi, Atieh R.
    Bahr, Zachary
    Silvestri, Simone
    Baker, D. A.
    2020 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP), 2020, : 208 - 213
  • [25] The user-centered privacy-aware control system PRICON: An interdisciplinary evaluation
    Walter, J.
    Abendroth, B.
    von Pape, T.
    Plappert, C.
    Zelle, D.
    Krauss, C.
    Gagzow, G.
    Decke, H.
    13TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY (ARES 2018), 2019,
  • [26] User-Centered Development of an Information System in Patient's Motor Capacity Evaluation
    Coton, Justine
    Vincent-Genod, D.
    Thomann, Guillaume
    Vuillerot, Carole
    Villeneuve, Francois
    HEALTH CARE SYSTEMS ENGINEERING, 2017, 210 : 121 - 131
  • [27] Evaluation metrics and methodologies for user-centered evaluation of intelligent systems
    Scholtz, Jean
    Morse, Emile
    Steves, Michelle Potts
    INTERACTING WITH COMPUTERS, 2006, 18 (06) : 1186 - 1214
  • [28] User-Centered Design and Evaluation of an Upper Limb Rehabilitation System with a Virtual Environment
    Rios-Hernandez, Monserrat
    Manuel Jacinto-Villegas, Juan
    Portillo-Rodriguez, Otniel
    Herlinda Vilchis-Gonzalez, Adriana
    APPLIED SCIENCES-BASEL, 2021, 11 (20):
  • [29] User-centered evaluation of Arizona BioPathway:: An information extraction, integration, and visualization system
    Quinones, Karin D.
    Su, Hua
    Marshall, Byron
    Eggers, Shauna
    Chen, Hsinchun
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2007, 11 (05): 527 - 536
  • [30] User-Centered Design and Evaluation of a Mobile Shopping Robot
    Doering, Nicola
    Poeschl, Sandra
    Gross, Horst-Michael
    Bley, Andreas
    Martin, Christian
    Boehme, Hans-Joachim
    INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2015, 7 (02) : 203 - 225