An attribution graph-based interpretable method for CNNs

Cited by: 6
Authors
Zheng, Xiangwei [1, 3]
Zhang, Lifeng [1, 3]
Xu, Chunyan [2, 3]
Chen, Xuanchi [1, 3]
Cui, Zhen
Affiliations
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Shandong, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
[3] State Key Lab High End Server & Storage Technol, Jinan 250300, Shandong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Interpretable CNN; Attribution graph; GCN; Kernel importance
DOI
10.1016/j.neunet.2024.106597
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Convolutional Neural Networks (CNNs) have demonstrated outstanding performance in various domains, such as face recognition, object detection, and image segmentation. However, the lack of transparency and limited interpretability inherent in CNNs pose challenges in fields such as medical diagnosis, autonomous driving, finance, and military applications. Several studies have explored the interpretability of CNNs and proposed various post-hoc interpretability methods. The majority of these methods are feature-based, focusing on the influence of input variables on outputs. Few methods analyze the parameters of CNNs and their overall structure. To explore the structure of CNNs and intuitively comprehend the role of their internal parameters, we propose an Attribution Graph-based Interpretable method for CNNs (AGIC), which models the overall structure of CNNs as graphs and provides interpretability from global and local perspectives. The runtime parameters of CNNs and the feature maps of each image sample are used to construct attribution graphs (At-GCs), in which the convolutional kernels are represented as nodes and the SHAP values between kernel outputs are assigned as edges. These At-GCs are then employed to pretrain a newly designed heterogeneous graph encoder based on Deep Graph Infomax (DGI). To examine the overall structure of CNNs, the pretrained encoder is used for two types of interpretability tasks: (1) a classifier is attached to the pretrained encoder for the classification of At-GCs, revealing the dependency of the topological characteristics of At-GCs on the image sample categories, and (2) a scoring aggregation (SA) network is constructed to assess the importance of each node in At-GCs, thus reflecting the relative importance of kernels in CNNs.
The experimental results indicate that the topological characteristics of an At-GC depend on the category of the sample used to construct it, which reveals that kernels in CNNs show distinct combined activation patterns when processing different image categories. In addition, the kernels that receive high scores from the SA network are crucial for feature extraction, whereas low-scoring kernels can be pruned without affecting model performance. Together, these findings enhance the interpretability of CNNs.
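The graph construction described in the abstract (kernels as nodes, attribution values between kernel outputs as edges) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the function names `build_attribution_graph` and `node_strength` are hypothetical, the attribution values are hand-written placeholders standing in for SHAP values, and the node score is a plain sum of absolute edge weights rather than the paper's SA network.

```python
def build_attribution_graph(layer_sizes, attributions, threshold=0.0):
    """Build a layered graph whose nodes are (layer, kernel) pairs.

    layer_sizes: number of kernels in each convolutional layer.
    attributions: attributions[l][i][j] is a precomputed attribution of
        kernel i in layer l to kernel j in layer l + 1 (placeholder values
        here; the abstract says SHAP values are used for this).
    threshold: edges with absolute weight at or below this are dropped.
    """
    nodes = [(l, k) for l, n in enumerate(layer_sizes) for k in range(n)]
    edges = {}
    for l, matrix in enumerate(attributions):
        for i, row in enumerate(matrix):
            for j, weight in enumerate(row):
                if abs(weight) > threshold:
                    edges[((l, i), (l + 1, j))] = weight
    return nodes, edges


def node_strength(nodes, edges):
    """Sum of absolute incident edge weights per node.

    This is a simple stand-in score, NOT the scoring aggregation network.
    """
    strength = {node: 0.0 for node in nodes}
    for (src, dst), weight in edges.items():
        strength[src] += abs(weight)
        strength[dst] += abs(weight)
    return strength


# Toy example: 2 kernels in the first layer, 3 in the second.
layer_sizes = [2, 3]
attributions = [[[0.5, 0.0, -0.2],
                 [0.0, 0.1, 0.0]]]

nodes, edges = build_attribution_graph(layer_sizes, attributions)
strength = node_strength(nodes, edges)
# Kernels ordered from highest to lowest stand-in score.
ranked = sorted(nodes, key=lambda node: strength[node], reverse=True)
```

In this toy setting, low-ranked nodes would be the analogue of the low-scoring kernels that the abstract identifies as pruning candidates; the actual method scores nodes with a learned network on top of a DGI-pretrained encoder.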
Pages: 17