EXPLAINING DATA-DRIVEN DECISIONS MADE BY AI SYSTEMS: THE COUNTERFACTUAL APPROACH

被引:11
|
作者
Fernandez-Loria, Carlos [1 ]
Provost, Foster [2 ]
Han, Xintian [3 ]
机构
[1] Hong Kong Univ Sci & Technol, HKUST Business Sch, Clear Water Bay, Hong Kong, Peoples R China
[2] NYU, Stern Sch Business, New York, NY USA
[3] NYU, Ctr Data Sci, New York, NY USA
关键词
Explanations; system decisions; interpretable machine learning; explainable artificial intelligence; RULE EXTRACTION; EXPLANATIONS; CLASSIFICATIONS; ANALYTICS; TAXONOMY;
D O I
10.25300/MISQ/2022/16749
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We examine counterfactual explanations for explaining the decisions made by model-based AI systems. The counterfactual approach we consider defines an explanation as a set of the system's data inputs that causally drives the decision (i.e., changing the inputs in the set changes the decision) and is irreducible (i.e., changing any subset of the inputs does not change the decision). We (1) demonstrate how this framework may be used to provide explanations for decisions made by general data-driven AI systems that can incorporate features with arbitrary data types and multiple predictive models, and (2) propose a heuristic procedure to find the most useful explanations depending on the context. We then contrast counterfactual explanations with methods that explain model predictions by weighting features according to their importance (e.g., Shapley additive explanations [SHAP], local interpretable model-agnostic explanations [LIME]) and present two fundamental reasons why we should carefully consider whether importance-weight explanations are well suited to explain system decisions. Specifically, we show that (1) features with a large importance weight for a model prediction may not affect the corresponding decision, and (2) importance weights are insufficient to communicate whether and how features influence decisions. We demonstrate this with several concise examples and three detailed case studies that compare the counterfactual approach with SHAP to illustrate conditions under which counterfactual explanations explain data-driven decisions better than importance weights.
引用
收藏
页码:1635 / 1660
页数:26
相关论文
共 50 条
  • [41] A data-driven approach to model calibration for nonlinear dynamical systems
    Greve, C. M.
    Hara, K.
    Martin, R. S.
    Eckhardt, D. Q.
    Koo, J. W.
    JOURNAL OF APPLIED PHYSICS, 2019, 125 (24)
  • [42] A data-driven complex systems approach to early prediction of landslides
    Tordesillas, Antoinette
    Zhou, Zongzheng
    Batterham, Robin
    MECHANICS RESEARCH COMMUNICATIONS, 2018, 92 : 137 - 141
  • [43] Data-Driven Development, A Complementing Approach for Automotive Systems Engineering
    Bach, Johannes
    Langner, Jacob
    Otten, Stefan
    Holzaepfel, Marc
    Sax, Eric
    2017 IEEE INTERNATIONAL SYMPOSIUM ON SYSTEMS ENGINEERING (ISSE 2017), 2017, : 283 - 288
  • [44] Data-driven control of nonlinear systems: An online sequential approach
    Vu, Minh
    Huang, Yunshen
    Zeng, Shen
    Systems and Control Letters, 2024, 193
  • [45] Innovation: A data-driven approach
    Kusiak, Andrew
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2009, 122 (01) : 440 - 448
  • [46] AN APPROACH TO DATA-DRIVEN LEARNING
    MARKOV, Z
    LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, 1991, 535 : 127 - 140
  • [47] Approach to data-driven learning
    Markov, Z.
    International Workshop on Fundamentals of Artificial Intelligence Research, 1991,
  • [48] DATA-DRIVEN FEEDFORWARD TUNING APPROACH FOR LPV MOTION SYSTEMS
    Huang, Weicai
    Yang, Kaiming
    Zhu, Yu
    Lu, Sen
    PROCEEDINGS OF THE ASME INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, 2019, VOL 4, 2020,
  • [49] Stability Monitoring of Rotorcraft Systems: A Dynamic Data-Driven Approach
    Sonti, Siddharth
    Keller, Eric
    Horn, Joseph
    Ray, Asok
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2014, 136 (02):
  • [50] A data-driven approach for predicting failure scenarios in nuclear systems
    Zio, Enrico
    Di Maio, Francesco
    Stasi, Marco
    ANNALS OF NUCLEAR ENERGY, 2010, 37 (04) : 482 - 491