Learning Model-Agnostic Counterfactual Explanations for Tabular Data

被引:84
|
作者
Pawelczyk, Martin [1 ]
Broelemann, Klaus [2 ]
Kasneci, Gjergji [1 ]
机构
[1] Univ Tubingen, Tubingen, Germany
[2] Schufa Holding AG, Wiesbaden, Germany
关键词
Transparency; Counterfactual explanations; Interpretability;
D O I
10.1145/3366423.3380087
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Counterfactual explanations can be obtained by identifying the smallest change made to an input vector to influence a prediction in a positive way from a user's viewpoint; for example, from 'loan rejected' to 'awarded' or from 'high risk of cardiovascular disease' to 'low risk'. Previous approaches would not ensure that the produced counterfactuals be proximate (i.e., not local outliers) and connected to regions with substantial data density (i.e., close to correctly classified observations), two requirements known as counterfactual faithfulness. Our contribution is twofold. First, drawing ideas from the manifold learning literature, we develop a framework, called C-CHVAE, that generates faithful counter-factuals. Second, we suggest to complement the catalog of counterfactual quality measures using a criterion to quantify the degree of difficulty for a certain counterfactual suggestion. Our real world experiments suggest that faithful counterfactuals come at the cost of higher degrees of difficulty.
引用
收藏
页码:3126 / 3132
页数:7
相关论文
共 50 条
  • [41] Stable local interpretable model-agnostic explanations based on a variational autoencoder
    Xu Xiang
    Hong Yu
    Ye Wang
    Guoyin Wang
    Applied Intelligence, 2023, 53 : 28226 - 28240
  • [42] Bayesian Model-Agnostic Meta-Learning
    Yoon, Jaesik
    Kim, Taesup
    Dia, Ousmane
    Kim, Sungwoong
    Bengio, Yoshua
    Ahn, Sungjin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [43] Probabilistic Model-Agnostic Meta-Learning
    Finn, Chelsea
    Xu, Kelvin
    Levine, Sergey
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [44] A comparative study of methods for estimating model-agnostic Shapley value explanations
    Olsen, Lars Henry Berge
    Glad, Ingrid Kristine
    Jullum, Martin
    Aas, Kjersti
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (04) : 1782 - 1829
  • [45] Interpretable heartbeat classification using local model-agnostic explanations on ECGs
    Neves, Ines
    Folgado, Duarte
    Santos, Sara
    Barandas, Marilia
    Campagner, Andrea
    Ronzio, Luca
    Cabitza, Federico
    Gamboa, Hugo
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 133
  • [46] Model-agnostic counterfactual reasoning for identifying and mitigating answer bias in knowledge tracing
    Cui, Chaoran
    Ma, Hebo
    Dong, Xiaolin
    Zhang, Chen
    Zhang, Chunyun
    Yao, Yumo
    Chen, Meng
    Ma, Yuling
    NEURAL NETWORKS, 2024, 178
  • [47] AcME-Accelerated model-agnostic explanations: Fast whitening of the machine-learning black box
    Dandolo, David
    Masiero, Chiara
    Carletti, Mattia
    Pezze, Davide Dalle
    Susto, Gian Antonio
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 214
  • [48] MCCE: Monte Carlo sampling of valid and realistic counterfactual explanations for tabular data
    Redelmeier, Annabelle
    Jullum, Martin
    Aas, Kjersti
    Loland, Anders
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (04) : 1830 - 1861
  • [49] Plug-and-Play Model-Agnostic Counterfactual Policy Synthesis for Deep Reinforcement Learning-Based Recommendation
    Wang, Siyu
    Chen, Xiaocong
    McAuley, Julian
    Cripps, Sally
    Yao, Lina
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1044 - 1055
  • [50] A novel dataset and local interpretable model-agnostic explanations (LIME) for monkeypox prediction
    Sharma, Nonita
    Mohanty, Sachi Nandan
    Mahato, Shalini
    Pattanaik, Chinmaya Ranjan
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2023, 17 (04): : 1297 - 1308