Explaining deep neural networks: A survey on the global interpretation methods

被引:52
|
作者
Saleem, Rabia [1 ]
Yuan, Bo [2 ]
Kurugollu, Fatih [1 ,3 ]
Anjum, Ashiq [2 ]
Liu, Lu [2 ]
机构
[1] Univ Derby, Sch Comp & Engn, Kedleston Rd, Derby DE22 1GB, England
[2] Univ Leicester, Sch Comp & Math Sci, Univ Rd, Leicester LE1 7RH, England
[3] Univ Sharjah, Dept Comp Sci, Sharjah, U Arab Emirates
关键词
Artificial intelligence; Deep neural networks; Black box Models; Explainable artificial intelligence; Global interpretation; BLACK-BOX; CLASSIFIERS; RULES; MODEL;
D O I
10.1016/j.neucom.2022.09.129
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A substantial amount of research has been carried out in Explainable Artificial Intelligence (XAI) models, especially in those which explain the deep architectures of neural networks. A number of XAI approaches have been proposed to achieve trust in Artificial Intelligence (AI) models as well as provide explainability of specific decisions made within these models. Among these approaches, global interpretation methods have emerged as the prominent methods of explainability because they have the strength to explain every feature and the structure of the model. This survey attempts to provide a comprehensive review of global interpretation methods that completely explain the behaviour of the AI models. We present a taxonomy of the available global interpretations models and systematically highlight the critical features and algorithms that differentiate them from local as well as hybrid models of explainability. Through examples and case studies from the literature, we evaluate the strengths and weaknesses of the global interpretation models and assess challenges when these methods are put into practice. We conclude the paper by providing the future directions of research in how the existing challenges in global interpre-tation methods could be addressed and what values and opportunities could be realized by the resolution of these challenges.(c) 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:165 / 180
页数:16
相关论文
共 50 条
  • [1] Explaining the black-box model: A survey of local interpretation methods for deep neural networks
    Liang, Yu
    Li, Siguang
    Yan, Chungang
    Li, Maozhen
    Jiang, Changjun
    NEUROCOMPUTING, 2021, 419 : 168 - 182
  • [2] Perturbation-based methods for explaining deep neural networks: A survey
    Ivanovs, Maksims
    Kadikis, Roberts
    Ozols, Kaspars
    PATTERN RECOGNITION LETTERS, 2021, 150 : 228 - 234
  • [3] Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications
    Samek, Wojciech
    Montavon, Gregoire
    Lapuschkin, Sebastian
    Anders, Christopher J.
    Mueller, Klaus-Robert
    PROCEEDINGS OF THE IEEE, 2021, 109 (03) : 247 - 278
  • [4] A survey of quantization methods for deep neural networks
    Yang C.
    Zhang R.
    Huang L.
    Ti S.
    Lin J.
    Dong Z.
    Chen S.
    Liu Y.
    Yin X.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2023, 45 (10): : 1613 - 1629
  • [5] Semantics for Global and Local Interpretation of Deep Convolutional Neural Networks
    Gu, Jindong
    Zhao, Rui
    Tresp, Volker
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [6] Explaining Neural Networks - Deep and Shallow
    Hammer, Barbara
    ADVANCES IN SELF-ORGANIZING MAPS, LEARNING VECTOR QUANTIZATION, INTERPRETABLE MACHINE LEARNING, AND BEYOND, WSOM PLUS 2024, 2024, 1087 : 139 - 140
  • [7] A Survey of Image Stylization Methods Based on Deep Neural Networks
    Tu P.
    Gao C.
    Sang N.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (04): : 333 - 347
  • [8] A Survey of Sparse-learning Methods for Deep Neural Networks
    Ma, Rongrong
    Niu, Lingfeng
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 647 - 650
  • [9] Representer Point Selection for Explaining Deep Neural Networks
    Yeh, Chih-Kuan
    Kim, Joon Sik
    Yen, Ian E. H.
    Ravikumar, Pradeep
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [10] Explaining the Behavior of Neuron Activations in Deep Neural Networks
    Wang, Longwei
    Wang, Chengfei
    Li, Yupeng
    Wang, Rui
    AD HOC NETWORKS, 2021, 111