Automatic Learning and Evaluation of User-Centered Objective Functions for Dialogue System Optimisation

被引:0
|
作者
Rieser, Verena [1 ]
Lemon, Oliver [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
来源
SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008 | 2008年
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
The ultimate goal when building dialogue systems is to satisfy the needs of real users, but quality assurance for dialogue strategies is a non-trivial problem. The applied evaluation metrics and resulting design principles are often obscure, emerge by trial-and-error, and are highly context dependent. This paper introduces data-driven methods for obtaining reliable objective functions for system design. In particular, we test whether an objective function obtained from Wizard-of-Oz (WOZ) data is a valid estimate of real users' preferences. We test this in a test-retest comparison between the model obtained from the WOZ study and the models obtained when testing with real users. We can show that, despite a low fit to the initial data, the objective function obtained from WOZ data makes accurate predictions for automatic dialogue evaluation, and, when automatically optimising a policy using these predictions, the improvement over a strategy simply mimicking the data becomes clear from an error analysis.
引用
收藏
页码:2356 / 2361
页数:6
相关论文
共 50 条
  • [31] User-Centered Evaluation Model for Medical Digital Libraries
    Kostkova, Patty
    Madle, Gemma
    KNOWLEDGE MANAGEMENT FOR HEALTH CARE PROCEDURES, 2009, 5626 : 92 - 103
  • [32] User-centered design and evaluation of multimodal tourist maps
    Mulazimoglu, Emre
    Basaraner, Melih
    INTERNATIONAL JOURNAL OF ENGINEERING AND GEOSCIENCES, 2019, 4 (03): : 115 - 128
  • [33] Development and Evaluation of a User-Centered Mobile Telestroke Platform
    Smith, Sherita N. Chapman
    Brown, Pamela C.
    Waits, Kaitlynne H.
    Wong, Jason S.
    Bhatti, Muhammad S.
    Toqeer, Qaiser
    Ricks, Jamie V.
    Stockner, Michelle L.
    Habtamu, Tsion
    Seelam, Joshnamaithili
    Britt, Rashon C.
    Giovia, Jacob M.
    Blankson, Baaba K.
    Bennam, Poanna
    Gormley, Mirinda A.
    Lu, Juan
    Ornato, Joseph P.
    TELEMEDICINE AND E-HEALTH, 2019, 25 (07) : 638 - 648
  • [34] A User-Centered Evaluation Study of a Mobile Arm Support
    Lund, Katarina
    Brandt, Richard
    Gelderblom, Gert-Jan
    Herder, Just L.
    2009 IEEE 11TH INTERNATIONAL CONFERENCE ON REHABILITATION ROBOTICS, VOLS 1 AND 2, 2009, : 678 - +
  • [35] User-Centered Library Websites: Usability Evaluation Methods
    Joint, Nicholas
    LIBRARY REVIEW, 2010, 59 (01) : 69 - +
  • [36] User-centered evaluation for machine translation of spoken language
    Palmer, DD
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1013 - 1016
  • [37] User-Centered Design and Evaluation of a Mobile Shopping Robot
    Nicola Doering
    Sandra Poeschl
    Horst-Michael Gross
    Andreas Bley
    Christian Martin
    Hans-Joachim Boehme
    International Journal of Social Robotics, 2015, 7 : 203 - 225
  • [38] Automatic Code Generation of User-centered Serious Games: A Decade in Review
    P. O. Silva-Vásquez
    V. Y. Rosales-Morales
    E. Benítez-Guerrero
    Programming and Computer Software, 2022, 48 : 685 - 701
  • [39] Development of a User-Centered Radiology Teaching File System
    dos Santos, Marcelo
    Fujino, Asa
    MEDICAL IMAGING 2011: ADVANCED PACS-BASED IMAGING INFORMATICS AND THERAPEUTIC APPLICATIONS, 2011, 7967
  • [40] Automatic Code Generation of User-centered Serious Games: A Decade in Review
    Silva-Vasquez, P. O.
    Rosales-Morales, V. Y.
    Benitez-Guerrero, E.
    PROGRAMMING AND COMPUTER SOFTWARE, 2022, 48 (08) : 685 - 701