Learning Model-Agnostic Counterfactual Explanations for Tabular Data

被引：84

作者：

Pawelczyk, Martin ^{[1
]}

Broelemann, Klaus ^{[2
]}

Kasneci, Gjergji ^{[1
]}

机构：

[1] Univ Tubingen, Tubingen, Germany

[2] Schufa Holding AG, Wiesbaden, Germany

来源：

WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020) | 2020年

关键词：

Transparency; Counterfactual explanations; Interpretability;

D O I：

10.1145/3366423.3380087

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Counterfactual explanations can be obtained by identifying the smallest change made to an input vector to influence a prediction in a positive way from a user's viewpoint; for example, from 'loan rejected' to 'awarded' or from 'high risk of cardiovascular disease' to 'low risk'. Previous approaches would not ensure that the produced counterfactuals be proximate (i.e., not local outliers) and connected to regions with substantial data density (i.e., close to correctly classified observations), two requirements known as counterfactual faithfulness. Our contribution is twofold. First, drawing ideas from the manifold learning literature, we develop a framework, called C-CHVAE, that generates faithful counter-factuals. Second, we suggest to complement the catalog of counterfactual quality measures using a criterion to quantify the degree of difficulty for a certain counterfactual suggestion. Our real world experiments suggest that faithful counterfactuals come at the cost of higher degrees of difficulty.

引用

页码：3126 / 3132

页数：7

共 50 条

[21] Model-Agnostic Explanations using Minimal Forcing Subsets
Han, Xing
Ghosh, Joydeep
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[22] Model-Agnostic Explanations for Decisions Using Minimal Patterns
Asano, Kohei
Chun, Jinhee
Koike, Atsushi
Tokuyama, Takeshi
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 241 - 252
[23] LIVE: A Local Interpretable model-agnostic Visualizations and Explanations
Shi, Peichang
Gangopadhyay, Aryya
Yu, Ping
2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022), 2022, : 245 - 254
[24] Is Bayesian Model-Agnostic Meta Learning Better than Model-Agnostic Meta Learning, Provably?
Chen, Lisha
Chen, Tianyi
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
[25] Toward Learning Model-Agnostic Explanations for Deep Learning-Based Signal Modulation Classifiers
Tian, Yunzhe
Xu, Dongyue
Tong, Endong
Sun, Rui
Chen, Kang
Li, Yike
Baker, Thar
Niu, Wenjia
Liu, Jiqiang
IEEE TRANSACTIONS ON RELIABILITY, 2024, 73 (03) : 1529 - 1543
[26] Evaluating Local Interpretable Model-Agnostic Explanations on Clinical Machine Learning Classification Models
Kumarakulasinghe, Nesaretnam Barr
Blomberg, Tobias
Lin, Jintai
Leao, Alexandra Saraiva
Papapetrou, Panagiotis
2020 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS(CBMS 2020), 2020, : 7 - 12
[27] Deterministic Local Interpretable Model-Agnostic Explanations for Stable Explainability
Zafar, Muhammad Rehman
Khan, Naimul
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2021, 3 (03): : 525 - 541
[28] MODEL-AGNOSTIC VISUAL EXPLANATIONS VIA APPROXIMATE BILINEAR MODELS
Joukovsky, Boris
Sammani, Fawaz
Deligiannis, Nikos
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1770 - 1774
[29] Unsupervised Anomaly Detection for Financial Auditing with Model-Agnostic Explanations
Kiefer, Sebastian
Pesch, Gunter
ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2021, 2021, 12873 : 291 - 308
[30] Model-Agnostic Knowledge Graph Embedding Explanations for Recommender Systems
Zanon, Andre Levi
Dutra da Rocha, Leonardo Chaves
Manzato, Marcelo Garcia
EXPLAINABLE ARTIFICIAL INTELLIGENCE, PT II, XAI 2024, 2024, 2154 : 3 - 27

← 1 2 3 4 5 →