In-Training Explainability Frameworks: A Method to Make Black-Box Machine Learning Models More Explainable

Citations: 0
Authors
Acun, Cagla [1]
Nasraoui, Olfa [1]
Affiliations
[1] Univ Louisville, Web Min & Knowledge Discovery Lab, Louisville, KY 40292 USA
Keywords
Explainability in Artificial Intelligence; XAI;
DOI
10.1109/WI-IAT59888.2023.00036
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Despite ongoing efforts to make black-box machine learning models more explainable, transparent, and trustworthy, there is growing advocacy for using only inherently interpretable models for high-stakes decision making. For instance, post-hoc explanations have recently been criticized because they learn surrogate white-box (explainer) models that, while optimized to approximate the original predictive model, remain different from it. Moreover, post-hoc models require a post-hoc training phase at prediction time, which adds to the computational burden. In this paper, we propose two novel approaches that make black-box models more explainable, which we call pre-hoc explainability and co-hoc explainability. Our goal is to maintain the black-box model's prediction accuracy while benefiting from the explanations that come with an inherently interpretable white-box model, without the need for a post-hoc training phase at prediction time. In contrast to post-hoc methods, the black-box model's training phase is guided by explanations that are used as a regularizer. Our experiments on three real-life datasets demonstrate the advantages of the proposed techniques in terms of fidelity, without compromising accuracy.
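The idea of using explanations as a regularizer during training can be illustrated with a minimal sketch. This is not the authors' formulation, which the abstract does not spell out. It assumes a binary classification task, a cross-entropy prediction loss, a mean-squared "fidelity" penalty between the black-box and white-box predicted probabilities, and a hypothetical weight `lam`; all function names below are illustrative.

```python
import numpy as np


def binary_cross_entropy(y_true, p):
    """Mean binary cross-entropy between labels and predicted probabilities."""
    p = np.clip(p, 1e-12, 1.0 - 1e-12)  # avoid log(0)
    return float(-np.mean(y_true * np.log(p) + (1.0 - y_true) * np.log(1.0 - p)))


def fidelity_penalty(p_black, p_white):
    """Assumed fidelity term: mean squared gap between the two models' outputs."""
    return float(np.mean((p_black - p_white) ** 2))


def regularized_loss(y_true, p_black, p_white, lam=0.5):
    """Prediction loss of the black-box model plus a weighted fidelity penalty.

    lam is a hypothetical trade-off weight; lam = 0 recovers plain training.
    """
    return binary_cross_entropy(y_true, p_black) + lam * fidelity_penalty(p_black, p_white)


# Toy values: labels, black-box probabilities, white-box (explainer) probabilities.
y = np.array([1.0, 0.0, 1.0, 0.0])
p_bb = np.array([0.9, 0.2, 0.7, 0.1])
p_wb = np.array([0.8, 0.3, 0.4, 0.1])

plain = regularized_loss(y, p_bb, p_wb, lam=0.0)
guided = regularized_loss(y, p_bb, p_wb, lam=0.5)
```

Under these assumptions, the white-box predictions would be held fixed in a pre-hoc style setup and updated together with the black-box model in a co-hoc style setup; the sketch only evaluates the combined objective and leaves the optimization loop out.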
Pages: 230 - 237
Number of pages: 8
Related papers
50 in total
  • [21] Using Machine Learning for Black-Box Autoscaling
    Wajahat, Muhammad
    Gandhi, Anshul
    Karve, Alexei
    Kochut, Andrzej
    2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC), 2016,
  • [22] Removing the Black-Box from Machine Learning
    Fernando Kuri-Morales, Angel
    PATTERN RECOGNITION, MCPR 2023, 2023, 13902 : 36 - 46
  • [23] On the black-box explainability of object detection models for safe and trustworthy industrial applications
    Andres, Alain
    Martinez-Seras, Aitor
    Lana, Ibai
    Del Ser, Javier
    RESULTS IN ENGINEERING, 2024, 24
  • [24] Comparing Explanations from Glass-Box and Black-Box Machine-Learning Models
    Kuk, Michal
    Bobek, Szymon
    Nalepa, Grzegorz J.
    COMPUTATIONAL SCIENCE - ICCS 2022, PT III, 2022, 13352 : 668 - 675
  • [25] Learning Groupwise Explanations for Black-Box Models
    Gao, Jingyue
    Wang, Xiting
    Wang, Yasha
    Yan, Yulan
    Xie, Xing
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021 : 2396 - 2402
  • [26] What's inside the black-box? A genetic programming method for interpreting complex machine learning models
    Evans, Benjamin P.
    Xue, Bing
    Zhang, Mengjie
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19), 2019 : 1012 - 1020
  • [27] Demystifying the black box: an overview of explainability methods in machine learning
    Kinger S.
    Kulkarni V.
    International Journal of Computers and Applications, 2024, 46 (02) : 90 - 100
  • [28] CoSP: co-selection pick for a global explainability of black box machine learning models
    Mansouri, Dou El Kefel
    Benkabou, Seif-Eddine
    Meddahi, Khaoula
    Hadjali, Allel
    Mesmoudi, Amin
    Benabdeslem, Khalid
    Chaib, Souleyman
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (06) : 3965 - 3981
  • [29] Towards a Co-selection Approach for a Global Explainability of Black Box Machine Learning Models
    Meddahi, Khoula
    Benkabou, Seif-Eddine
    Hadjali, Allel
    Mesmoudi, Amin
    Mansouri, Dou El Kefel
    Benabdeslem, Khalid
    Chaib, Souleyman
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2022, 2022, 13724 : 97 - 109
  • [30] CoSP: co-selection pick for a global explainability of black box machine learning models
    Dou El Kefel Mansouri
    Seif-Eddine Benkabou
    Khaoula Meddahi
    Allel Hadjali
    Amin Mesmoudi
    Khalid Benabdeslem
    Souleyman Chaib
    World Wide Web, 2023, 26 : 3965 - 3981