In-Training Explainability Frameworks: A Method to Make Black-Box Machine Learning Models More Explainable

被引:0
|
作者
Acun, Cagla [1 ]
Nasraoui, Olfa [1 ]
机构
[1] Univ Louisville, Web Min & Knowledge Discovery Lab, Louisville, KY 40292 USA
关键词
Explainability in Artificial Intelligence; XAI;
D O I
10.1109/WI-IAT59888.2023.00036
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite ongoing efforts to make black-box machine learning models more explainable, transparent, and trustworthy, there is a growing advocacy for using only inherently interpretable models for high-stake decision making. For instance, post-hoc explanations have recently been criticized because they learn surrogate white-box (explainer) models that, while optimized to approximate the original predictive model, remain different from the latter. Moreover, the post-hoc models necessitate a post-hoc training phase at prediction time, that adds to the computational burden. In this paper, we propose two novel explainability approaches that make black-box models more explainable, which we call pre-hoc explainability and co-hoc explainability. Our goal is to maintain the black-box model's prediction accuracy while benefiting from the explanations that come with an inherently interpretable white-box model, and without the need for a post-hoc training phase at prediction time. In contrast to post-hoc methods, the black-box model training phase is guided by explanations that are used as a regularizer. Our experiments demonstrate the advantages of our proposed technique on three real-life datasets, in terms of fidelity, without compromising accuracy.
引用
收藏
页码:230 / 237
页数:8
相关论文
共 50 条
  • [41] E-XAI: Evaluating Black-Box Explainable AI Frameworks for Network Intrusion Detection
    Arreche, Osvaldo
    Guntur, Tanish R.
    Roberts, Jack W.
    Abdallah, Mustafa
    IEEE ACCESS, 2024, 12 : 23954 - 23988
  • [42] Explaining Artificial Intelligence with Care Analyzing the Explainability of Black Box Multiclass Machine Learning Models in Forensics
    Szepannek, Gero
    Luebke, Karsten
    KUNSTLICHE INTELLIGENZ, 2022, 36 (02): : 125 - 134
  • [43] Inferring the dynamics of black-box systems using a learning machine
    Zhao, Hong
    Zhao, Hong (zhaoh@xmu.edu.cn), 1600, Science in China Press (64):
  • [44] Inferring the dynamics of "black-box" systems using a learning machine
    Zhao, Hong
    SCIENCE CHINA-PHYSICS MECHANICS & ASTRONOMY, 2021, 64 (07)
  • [45] MACHINE-LEARNING IN OPTIMIZATION OF EXPENSIVE BLACK-BOX FUNCTIONS
    Tenne, Yoel
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2017, 27 (01) : 105 - 118
  • [46] Inferring the dynamics of “black-box” systems using a learning machine
    Hong Zhao
    Science China Physics, Mechanics & Astronomy, 2021, 64
  • [47] Inferring the dynamics of “black-box” systems using a learning machine
    Hong Zhao
    Science China(Physics,Mechanics & Astronomy), 2021, Mechanics & Astronomy)2021 (07) : 76 - 85
  • [48] IHCP: interpretable hepatitis C prediction system based on black-box machine learning models
    Fan, Yongxian
    Lu, Xiqian
    Sun, Guicong
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [49] IHCP: interpretable hepatitis C prediction system based on black-box machine learning models
    Yongxian Fan
    Xiqian Lu
    Guicong Sun
    BMC Bioinformatics, 24
  • [50] ComplAI: Framework for Multi-factor Assessment of Black-Box Supervised Machine Learning Models
    De, Arkadipta
    Gudipudi, Satya Swaroop
    Panchanan, Sourab
    Desarkar, Maunendra Sankar
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1096 - 1099