Learning Model-Agnostic Counterfactual Explanations for Tabular Data

被引:84
|
作者
Pawelczyk, Martin [1 ]
Broelemann, Klaus [2 ]
Kasneci, Gjergji [1 ]
机构
[1] Univ Tubingen, Tubingen, Germany
[2] Schufa Holding AG, Wiesbaden, Germany
关键词
Transparency; Counterfactual explanations; Interpretability;
D O I
10.1145/3366423.3380087
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Counterfactual explanations can be obtained by identifying the smallest change made to an input vector to influence a prediction in a positive way from a user's viewpoint; for example, from 'loan rejected' to 'awarded' or from 'high risk of cardiovascular disease' to 'low risk'. Previous approaches would not ensure that the produced counterfactuals be proximate (i.e., not local outliers) and connected to regions with substantial data density (i.e., close to correctly classified observations), two requirements known as counterfactual faithfulness. Our contribution is twofold. First, drawing ideas from the manifold learning literature, we develop a framework, called C-CHVAE, that generates faithful counter-factuals. Second, we suggest to complement the catalog of counterfactual quality measures using a criterion to quantify the degree of difficulty for a certain counterfactual suggestion. Our real world experiments suggest that faithful counterfactuals come at the cost of higher degrees of difficulty.
引用
收藏
页码:3126 / 3132
页数:7
相关论文
共 50 条
  • [21] Model-Agnostic Explanations using Minimal Forcing Subsets
    Han, Xing
    Ghosh, Joydeep
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [22] Model-Agnostic Explanations for Decisions Using Minimal Patterns
    Asano, Kohei
    Chun, Jinhee
    Koike, Atsushi
    Tokuyama, Takeshi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 241 - 252
  • [23] LIVE: A Local Interpretable model-agnostic Visualizations and Explanations
    Shi, Peichang
    Gangopadhyay, Aryya
    Yu, Ping
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022), 2022, : 245 - 254
  • [24] Is Bayesian Model-Agnostic Meta Learning Better than Model-Agnostic Meta Learning, Provably?
    Chen, Lisha
    Chen, Tianyi
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [25] Toward Learning Model-Agnostic Explanations for Deep Learning-Based Signal Modulation Classifiers
    Tian, Yunzhe
    Xu, Dongyue
    Tong, Endong
    Sun, Rui
    Chen, Kang
    Li, Yike
    Baker, Thar
    Niu, Wenjia
    Liu, Jiqiang
    IEEE TRANSACTIONS ON RELIABILITY, 2024, 73 (03) : 1529 - 1543
  • [26] Evaluating Local Interpretable Model-Agnostic Explanations on Clinical Machine Learning Classification Models
    Kumarakulasinghe, Nesaretnam Barr
    Blomberg, Tobias
    Lin, Jintai
    Leao, Alexandra Saraiva
    Papapetrou, Panagiotis
    2020 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS(CBMS 2020), 2020, : 7 - 12
  • [27] Deterministic Local Interpretable Model-Agnostic Explanations for Stable Explainability
    Zafar, Muhammad Rehman
    Khan, Naimul
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2021, 3 (03): : 525 - 541
  • [28] MODEL-AGNOSTIC VISUAL EXPLANATIONS VIA APPROXIMATE BILINEAR MODELS
    Joukovsky, Boris
    Sammani, Fawaz
    Deligiannis, Nikos
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1770 - 1774
  • [29] Unsupervised Anomaly Detection for Financial Auditing with Model-Agnostic Explanations
    Kiefer, Sebastian
    Pesch, Gunter
    ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2021, 2021, 12873 : 291 - 308
  • [30] Model-Agnostic Knowledge Graph Embedding Explanations for Recommender Systems
    Zanon, Andre Levi
    Dutra da Rocha, Leonardo Chaves
    Manzato, Marcelo Garcia
    EXPLAINABLE ARTIFICIAL INTELLIGENCE, PT II, XAI 2024, 2024, 2154 : 3 - 27