Effect of Errors on the Evaluation of Machine Learning Systems

Cited by: 1
Authors
Bracamonte, Vanessa [1]
Hidano, Seira [1]
Kiyomoto, Shinsaku [1]
Affiliations
[1] KDDI Res Inc, Saitama, Japan
Keywords
User Perception; Errors; Machine Learning Model Evaluation; User Study; AUTOMATION; TRUST
DOI
10.5220/0010839300003124
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Information such as accuracy and outcome explanations can be useful for evaluating machine learning systems, but it can also lead to over-trust. An evaluator may then not suspect that a machine learning system could contain errors and may overlook problems in the explanations of that system. Research has shown that errors not only decrease trust but can also promote curiosity about the performance of the system. Presenting errors to evaluators may therefore be a way to induce suspicion during the evaluation of a machine learning system. In this paper, we evaluate this possibility by conducting three experiments in which we asked participants to evaluate text classification systems. We presented two types of errors: incorrect predictions and errors in the explanation. The results show that patterns of errors in the explanation negatively influenced willingness to recommend a system, and that fewer participants chose the system with higher accuracy when there was an error pattern than when the errors were random. Moreover, more participants cited evidence from the explanations as a reason for their evaluation of the systems, suggesting that they were able to detect error patterns.
Pages: 48-57
Number of pages: 10