In-Training Explainability Frameworks: A Method to Make Black-Box Machine Learning Models More Explainable

被引：0

作者：

Acun, Cagla ^{[1
]}

Nasraoui, Olfa ^{[1
]}

机构：

[1] Univ Louisville, Web Min & Knowledge Discovery Lab, Louisville, KY 40292 USA

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT | 2023年

关键词：

Explainability in Artificial Intelligence; XAI;

D O I：

10.1109/WI-IAT59888.2023.00036

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Despite ongoing efforts to make black-box machine learning models more explainable, transparent, and trustworthy, there is a growing advocacy for using only inherently interpretable models for high-stake decision making. For instance, post-hoc explanations have recently been criticized because they learn surrogate white-box (explainer) models that, while optimized to approximate the original predictive model, remain different from the latter. Moreover, the post-hoc models necessitate a post-hoc training phase at prediction time, that adds to the computational burden. In this paper, we propose two novel explainability approaches that make black-box models more explainable, which we call pre-hoc explainability and co-hoc explainability. Our goal is to maintain the black-box model's prediction accuracy while benefiting from the explanations that come with an inherently interpretable white-box model, and without the need for a post-hoc training phase at prediction time. In contrast to post-hoc methods, the black-box model training phase is guided by explanations that are used as a regularizer. Our experiments demonstrate the advantages of our proposed technique on three real-life datasets, in terms of fidelity, without compromising accuracy.

引用

页码：230 / 237

页数：8

共 50 条

[41] E-XAI: Evaluating Black-Box Explainable AI Frameworks for Network Intrusion Detection
Arreche, Osvaldo
Guntur, Tanish R.
Roberts, Jack W.
Abdallah, Mustafa
IEEE ACCESS, 2024, 12 : 23954 - 23988
[42] Explaining Artificial Intelligence with Care Analyzing the Explainability of Black Box Multiclass Machine Learning Models in Forensics
Szepannek, Gero
Luebke, Karsten
KUNSTLICHE INTELLIGENZ, 2022, 36 (02): : 125 - 134
[43] Inferring the dynamics of black-box systems using a learning machine
Zhao, Hong
Zhao, Hong (zhaoh@xmu.edu.cn), 1600, Science in China Press (64):
[44] Inferring the dynamics of "black-box" systems using a learning machine
Zhao, Hong
SCIENCE CHINA-PHYSICS MECHANICS & ASTRONOMY, 2021, 64 (07)
[45] MACHINE-LEARNING IN OPTIMIZATION OF EXPENSIVE BLACK-BOX FUNCTIONS
Tenne, Yoel
INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2017, 27 (01) : 105 - 118
[46] Inferring the dynamics of “black-box” systems using a learning machine
Hong Zhao
Science China Physics, Mechanics & Astronomy, 2021, 64
[47] Inferring the dynamics of “black-box” systems using a learning machine
Hong Zhao
Science China(Physics,Mechanics & Astronomy), 2021, Mechanics & Astronomy)2021 (07) : 76 - 85
[48] IHCP: interpretable hepatitis C prediction system based on black-box machine learning models
Fan, Yongxian
Lu, Xiqian
Sun, Guicong
BMC BIOINFORMATICS, 2023, 24 (01)
[49] IHCP: interpretable hepatitis C prediction system based on black-box machine learning models
Yongxian Fan
Xiqian Lu
Guicong Sun
BMC Bioinformatics, 24
[50] ComplAI: Framework for Multi-factor Assessment of Black-Box Supervised Machine Learning Models
De, Arkadipta
Gudipudi, Satya Swaroop
Panchanan, Sourab
Desarkar, Maunendra Sankar
38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1096 - 1099

← 1 2 3 4 5 →