In-Training Explainability Frameworks: A Method to Make Black-Box Machine Learning Models More Explainable

被引:0
|
作者
Acun, Cagla [1 ]
Nasraoui, Olfa [1 ]
机构
[1] Univ Louisville, Web Min & Knowledge Discovery Lab, Louisville, KY 40292 USA
关键词
Explainability in Artificial Intelligence; XAI;
D O I
10.1109/WI-IAT59888.2023.00036
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite ongoing efforts to make black-box machine learning models more explainable, transparent, and trustworthy, there is a growing advocacy for using only inherently interpretable models for high-stake decision making. For instance, post-hoc explanations have recently been criticized because they learn surrogate white-box (explainer) models that, while optimized to approximate the original predictive model, remain different from the latter. Moreover, the post-hoc models necessitate a post-hoc training phase at prediction time, that adds to the computational burden. In this paper, we propose two novel explainability approaches that make black-box models more explainable, which we call pre-hoc explainability and co-hoc explainability. Our goal is to maintain the black-box model's prediction accuracy while benefiting from the explanations that come with an inherently interpretable white-box model, and without the need for a post-hoc training phase at prediction time. In contrast to post-hoc methods, the black-box model training phase is guided by explanations that are used as a regularizer. Our experiments demonstrate the advantages of our proposed technique on three real-life datasets, in terms of fidelity, without compromising accuracy.
引用
收藏
页码:230 / 237
页数:8
相关论文
共 50 条
  • [1] Explainable Debugger for Black-box Machine Learning Models
    Rasouli, Peyman
    Yu, Ingrid Chieh
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [2] Shedding Light on the Black Box: Integrating Prediction Models and Explainability Using Explainable Machine Learning
    Zhang, Yucheng
    Zheng, Yuyan
    Wang, Dan
    Gu, Xiaowei
    Zyphur, Michael J.
    Xiao, Lin
    Liao, Shudi
    Deng, Yangyang
    ORGANIZATIONAL RESEARCH METHODS, 2025,
  • [3] What Lies Beneath: A Note on the Explainability of Black-box Machine Learning Models for Road Traffic Forecasting
    Barredo-Arrieta, Alejandro
    Lana, Ibai
    Del Ser, Javier
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 2232 - 2237
  • [4] Semantics of the Black-Box: Can Knowledge Graphs Help Make Deep Learning Systems More Interpretable and Explainable?
    Gaur, Manas
    Faldu, Keyur
    Sheth, Amit
    IEEE INTERNET COMPUTING, 2021, 25 (01) : 51 - 59
  • [5] Reproducibility and explainability in digital pathology: The need to make black-box artificial intelligence systems more transparent
    Faa, Gavino
    Fraschini, Matteo
    Barberini, Luigi
    JOURNAL OF PUBLIC HEALTH RESEARCH, 2024, 13 (04)
  • [6] Explainable AI: To Reveal the Logic of Black-Box Models
    Chinu, Urvashi
    Bansal, Urvashi
    NEW GENERATION COMPUTING, 2024, 42 (01) : 53 - 87
  • [7] Explainable AI: To Reveal the Logic of Black-Box Models
    Chinu
    Bansal, Urvashi
    New Generation Computing, 42 (01): : 53 - 87
  • [8] Identifying the Machine Learning Family from Black-Box Models
    Fabra-Boluda, Raul
    Ferri, Cesar
    Hernandez-Orallo, Jose
    Martinez-Plumed, Fernando
    Jose Ramirez-Quintana, Maria
    ADVANCES IN ARTIFICIAL INTELLIGENCE, CAEPIA 2018, 2018, 11160 : 55 - 65
  • [9] Data Synthesis for Testing Black-Box Machine Learning Models
    Saha, Diptikalyan
    Aggarwal, Aniya
    Hans, Sandeep
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 110 - 114
  • [10] Explainable AI: To Reveal the Logic of Black-Box Models
    Urvashi Chinu
    New Generation Computing, 2024, 42 : 53 - 87