Conversational Explanations of Machine Learning Predictions Through Class-contrastive Counterfactual Statements

被引:0
|
作者
Sokol, Kacper [1 ]
Flach, Peter [1 ]
机构
[1] Univ Bristol, Dept Comp Sci, Bristol, Avon, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning models have become pervasive in our everyday life; they decide on important matters influencing our education, employment and judicial system. Many of these predictive systems are commercial products protected by trade secrets, hence their decision-making is opaque. Therefore, in our research we address interpretability and explainability of predictions made by machine learning models. Our work draws heavily on human explanation research in social sciences: contrastive and exemplar explanations provided through a dialogue. This user-centric design, focusing on a lay audience rather than domain experts, applied to machine learning allows explainees to drive the explanation to suit their needs instead of being served a precooked template.
引用
收藏
页码:5785 / 5786
页数:2
相关论文
共 50 条
  • [1] Explaining Machine Learning Classifiers through Diverse Counterfactual Explanations
    Mothilal, Ramaravind K.
    Sharma, Amit
    Tan, Chenhao
    FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2020, : 607 - 617
  • [2] Contrastive Learning with Counterfactual Explanations for Radiology Report Generation
    Li, Mingjie
    Lin, Haokun
    Qiu, Liang
    Liang, Xiaodan
    Chen, Ling
    Elsaddik, Abdulmotaleb
    Chang, Xiaojun
    COMPUTER VISION-ECCV 2024, PT XLIII, 2025, 15101 : 162 - 180
  • [3] "Why Not Other Classes?": Towards Class-Contrastive Back-Propagation Explanations
    Wang, Yipei
    Wang, Xiaoqian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] Human performance effects of combining counterfactual explanations with normative and contrastive explanations in supervised machine learning for automated decision assistance
    Gentile, Davide
    Donmez, Birsen
    Jamieson, Greg A.
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2025, 196
  • [5] Contrastive Visual Explanations for Reinforcement Learning via Counterfactual Rewards
    Liu, Xiaowei
    McAreavey, Kevin
    Liu, Weiru
    EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2023, PT II, 2023, 1902 : 72 - 87
  • [6] A class-contrastive human-interpretable machine learning approach to predict mortality in severe mental illness
    Banerjee, Soumya
    Lio, Pietro
    Jones, Peter B.
    Cardinal, Rudolf N.
    NPJ SCHIZOPHRENIA, 2021, 7 (01):
  • [7] Robust Counterfactual Explanations in Machine Learning: A Survey
    Jiang, Junqi
    Leofante, Francesco
    Rago, Antonio
    Toni, Francesca
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8086 - 8094
  • [8] A class-contrastive human-interpretable machine learning approach to predict mortality in severe mental illness
    Soumya Banerjee
    Pietro Lio
    Peter B. Jones
    Rudolf N. Cardinal
    npj Schizophrenia, 7
  • [9] Strategic Predictions and Explanations By Machine Learning
    Wu, Caesar
    Li, Jian
    Xu, Jingjing
    Bouvry, Pascal
    38TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN 2024, 2024, : 268 - 273
  • [10] Counterfactual Explanations and Algorithmic Recourses for Machine Learning: A Review
    Verma, Sahil
    Boonsanong, Varich
    Hoang, Minh
    Hines, Keegan
    Dickerson, John
    Shah, Chirag
    ACM COMPUTING SURVEYS, 2024, 56 (12)