EXPLAINING DATA-DRIVEN DECISIONS MADE BY AI SYSTEMS: THE COUNTERFACTUAL APPROACH

Cited by: 11
Authors
Fernandez-Loria, Carlos [1]
Provost, Foster [2]
Han, Xintian [3]
Affiliations
[1] Hong Kong Univ Sci & Technol, HKUST Business Sch, Clear Water Bay, Hong Kong, Peoples R China
[2] NYU, Stern Sch Business, New York, NY USA
[3] NYU, Ctr Data Sci, New York, NY USA
Keywords
Explanations; system decisions; interpretable machine learning; explainable artificial intelligence; RULE EXTRACTION; EXPLANATIONS; CLASSIFICATIONS; ANALYTICS; TAXONOMY
DOI
10.25300/MISQ/2022/16749
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We examine counterfactual explanations for explaining the decisions made by model-based AI systems. The counterfactual approach we consider defines an explanation as a set of the system's data inputs that causally drives the decision (i.e., changing the inputs in the set changes the decision) and is irreducible (i.e., changing any subset of the inputs does not change the decision). We (1) demonstrate how this framework may be used to provide explanations for decisions made by general data-driven AI systems that can incorporate features with arbitrary data types and multiple predictive models, and (2) propose a heuristic procedure to find the most useful explanations depending on the context. We then contrast counterfactual explanations with methods that explain model predictions by weighting features according to their importance (e.g., Shapley additive explanations [SHAP], local interpretable model-agnostic explanations [LIME]) and present two fundamental reasons why we should carefully consider whether importance-weight explanations are well suited to explain system decisions. Specifically, we show that (1) features with a large importance weight for a model prediction may not affect the corresponding decision, and (2) importance weights are insufficient to communicate whether and how features influence decisions. We demonstrate this with several concise examples and three detailed case studies that compare the counterfactual approach with SHAP to illustrate conditions under which counterfactual explanations explain data-driven decisions better than importance weights.
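The definition of a counterfactual explanation in the abstract can be illustrated with a minimal sketch. It is not the authors' heuristic procedure: it is a brute-force search over feature subsets, and the decision rule `decide`, its weights and threshold, the feature names, and the choice to "change" a feature by replacing it with a reference value are all assumptions made for illustration. Irreducibility is read here as "no proper subset of the set changes the decision".

```python
from itertools import combinations


def decide(x):
    """Hypothetical decision system: act when a linear score exceeds a threshold."""
    score = 5 * x["income"] + 3 * x["tenure"] + 1 * x["age"]
    return score > 3.5


def counterfactual_explanations(x, reference, decide_fn):
    """Return every irreducible feature set whose replacement changes the decision.

    Assumption: "changing" a feature means replacing it with its reference value.
    """
    original = decide_fn(x)
    features = sorted(x)
    found = []
    # Smaller sets are tried first, so any set containing an explanation
    # that was already found is reducible and can be skipped.
    for size in range(1, len(features) + 1):
        for subset in combinations(features, size):
            candidate = set(subset)
            if any(e <= candidate for e in found):
                continue
            changed = {**x, **{f: reference[f] for f in subset}}
            if decide_fn(changed) != original:
                found.append(candidate)
    return found


# Hypothetical instance and reference values (not taken from the paper).
x = {"income": 1, "tenure": 1, "age": 1}
reference = {"income": 0, "tenure": 0, "age": 0}

explanations = counterfactual_explanations(x, reference, decide)
print([sorted(e) for e in explanations])
# prints [['age', 'income'], ['income', 'tenure']]
```

In this toy setup `income` carries the largest weight in the score, yet changing it alone does not change the decision; it only appears in explanations together with another feature. This is in the spirit of the abstract's first point about importance weights, although the example says nothing about SHAP or LIME specifically. The exhaustive search is exponential in the number of features, so it is only suitable for small illustrations.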
Pages: 1635-1660
Page count: 26