Automatic Learning and Evaluation of User-Centered Objective Functions for Dialogue System Optimisation

被引:0
|
作者
Rieser, Verena [1 ]
Lemon, Oliver [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
来源
SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008 | 2008年
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
The ultimate goal when building dialogue systems is to satisfy the needs of real users, but quality assurance for dialogue strategies is a non-trivial problem. The applied evaluation metrics and resulting design principles are often obscure, emerge by trial-and-error, and are highly context dependent. This paper introduces data-driven methods for obtaining reliable objective functions for system design. In particular, we test whether an objective function obtained from Wizard-of-Oz (WOZ) data is a valid estimate of real users' preferences. We test this in a test-retest comparison between the model obtained from the WOZ study and the models obtained when testing with real users. We can show that, despite a low fit to the initial data, the objective function obtained from WOZ data makes accurate predictions for automatic dialogue evaluation, and, when automatically optimising a policy using these predictions, the improvement over a strategy simply mimicking the data becomes clear from an error analysis.
引用
收藏
页码:2356 / 2361
页数:6
相关论文
共 50 条
  • [41] User-Centered Development of a Pedestrian Assistance System Using End-to-End Learning
    Qureshi, Hasham Shahid
    Glasmachers, Tobias
    Wiczorek, Rebecca
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 808 - 813
  • [42] A User-Centered Approach Towards Attention Visualization for Learning Activities
    Andujar, Marvin
    Gilbert, Juan E.
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2017 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (UBICOMP/ISWC '17 ADJUNCT), 2017, : 871 - 876
  • [43] A user-centered, learning asthma smartphone application for patients and providers
    Gaynor, Mark
    Schneider, David
    Seltzer, Margo
    Crannage, Erica
    Barron, Mary Lee
    Waterman, Jason
    Oberle, Andrew
    LEARNING HEALTH SYSTEMS, 2020, 4 (03):
  • [44] Incorporating Technology into Braille Learning Through a User-Centered Methodology
    Moreno Rocha, Mario Alberto
    Garcia Lopez, Eneas Kevin
    Quintero Sanchez, Angel
    Cruz Gomez, Nancy Lizbeth
    CLIHC'17: PROCEEDINGS OF THE 8TH LATIN AMERICAN CONFERENCE ON HUMAN-COMPUTER INTERACTION, 2015,
  • [45] Applying Human Learning Principles to User-Centered IoT Systems
    Lee, Sang Wan
    Prenzel, Oliver
    Bien, Zeungnam
    COMPUTER, 2013, 46 (02) : 46 - 52
  • [46] The impact of user-centered design concepts in virtual learning environments
    Klett, F
    ITHET 2004: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY BASED HIGHER EDUCATION AND TRAINING, 2004, : 222 - 226
  • [47] Assessing user perspectives on clinical pharmacogenomics consultation documentation: a user-centered evaluation
    Desai, Nina
    Ravindra, Namratha
    Hall, Bradley
    Al Alshaykh, Hana
    Lemke, Lauren
    Eken, Eda
    Cicali, Emily J.
    Wiisanen, Kristin
    Cavallari, Larisa H.
    Nguyen, Khoa A.
    FRONTIERS IN PHARMACOLOGY, 2024, 15
  • [48] User Perspectives on Blockchain Technology: User-Centered Evaluation and Design Strategies for DApps
    Jang, Hyeji
    Han, Sung H.
    Kim, Ju Hwan
    IEEE ACCESS, 2020, 8 : 226213 - 226223
  • [49] A user-centered functional metadata evaluation of Moving Image Collections
    Zhang, Ying
    Li, Yuelin
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2008, 59 (08): : 1331 - 1346
  • [50] Evaluation Methods for User-Centered Child-Robot Interaction
    Charisi, Vicky
    Davison, Daniel
    Reidsma, Dennis
    Evers, Vanessa
    2016 25TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2016, : 545 - 550