An attribution graph-based interpretable method for CNNs

被引:6
|
作者
Zheng, Xiangwei [1 ,3 ]
Zhang, Lifeng [1 ,3 ]
Xu, Chunyan [2 ,3 ]
Chen, Xuanchi [1 ,3 ]
Cui, Zhen
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Shandong, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
[3] State Key Lab High End Server & Storage Technol, Jinan 250300, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Interpretable CNN; Attribution graph; GCN; Kernel importance;
D O I
10.1016/j.neunet.2024.106597
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Networks (CNNs) have demonstrated outstanding performance in various domains, such as face recognition, object detection, and image segmentation. However, the lack of transparency and limited interpretability inherent in CNNs pose challenges in fields such as medical diagnosis, autonomous driving, finance, and military applications. Several studies have explored the interpretability of CNNs and proposed various post-hoc interpretable methods. The majority of these methods are feature-based, focusing on the influence of input variables on outputs. Few methods undertake the analysis of parameters in CNNs and their overall structure. To explore the structure of CNNs and intuitively comprehend the role of their internal parameters, we propose an Attribution Graph-based Interpretable method for CNNs (AGIC) which models the overall structure of CNNs as graphs and provides interpretability from global and local perspectives. The runtime parameters of CNNs and feature maps of each image sample are applied to construct attribution graphs (At-GCs), where the convolutional kernels are represented as nodes and the SHAP values between kernel outputs are assigned as edges. These At-GCs are then employed to pretrain a newly designed heterogeneous graph encoder based on Deep Graph Infomax (DGI). To comprehensively delve into the overall structure of CNNs, the pretrained encoder is used for two types of interpretable tasks: (1) a classifier is attached to the pretrained encoder for the classification of At-GCs, revealing the dependency of At-GC's topological characteristics on the image sample categories, and (2) a scoring aggregation (SA) network is constructed to assess the importance of each node in At-GCs, thus reflecting the relative importance of kernels in CNNs. The experimental results indicate that the topological characteristics of At-GC exhibit a dependency on the sample category used in its construction, which reveals that kernels in CNNs show distinct combined activation patterns for processing different image categories, meanwhile, the kernels that receive high scores from SA network are crucial for feature extraction, whereas low-scoring kernels can be pruned without affecting model performance, thereby enhancing the interpretability of CNNs.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] An Interpretable Graph-based Image Classifier
    Bianchi, Filippo M.
    Scardapane, Simone
    Livi, Lorenzo
    Uncini, Aurelio
    Rizzi, Antonello
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2339 - 2346
  • [2] Temporal Graph-Based CNNs (TG-CNNs) for Online Course Dropout Prediction
    Hancox, Zoe
    Relton, Samuel D.
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2022), 2022, 13515 : 357 - 367
  • [3] An Interpretable Graph-Based Mapping of Trustworthy Machine Learning Research
    Derzsy, Noemi
    Majumdar, Subhabrata
    Malik, Rajat
    COMPLEX NETWORKS XII, 2021, : 73 - 85
  • [4] Multi-kernel Diffusion CNNs for Graph-Based Learning on Point Clouds
    Hansen, Lasse
    Diesel, Jasper
    Heinrich, Mattias P.
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III, 2019, 11131 : 456 - 469
  • [5] Naturally Interpretable Control Policies via Graph-Based Genetic Programming
    Nadizar, Giorgia
    Medvet, Eric
    Wilson, Dennis G.
    GENETIC PROGRAMMING, EUROGP 2024, 2024, 14631 : 73 - 89
  • [6] Interpretable Graph-Based Semi-Supervised Learning via Flows
    Rustamov, Raif M.
    Klosowski, James T.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3976 - 3983
  • [7] Dynamic Spatio-Temporal Graph-Based CNNs for Traffic Flow Prediction
    Chen, Ken
    Chen, Fei
    Lai, Baisheng
    Jin, Zhongming
    Liu, Yong
    Li, Kai
    Wei, Long
    Wang, Pengfei
    Tang, Yandong
    Huang, Jianqiang
    Hua, Xian-Sheng
    IEEE ACCESS, 2020, 8 : 185136 - 185145
  • [8] A graph-based method for improving GSAT
    Kask, K
    Dechter, R
    PROCEEDINGS OF THE THIRTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE, VOLS 1 AND 2, 1996, : 350 - 355
  • [9] Strengthening of a Graph-Based Cryptanalysis Method
    Ghasemzadeh, Mohammad
    Shirmohammadi, Mahsa
    2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 2, 2009, : 118 - 121
  • [10] Graph-Based Relation Validation Method
    Rahman, Rashedur
    Grau, Brigitte
    Rosset, Sophie
    KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, 2017, 10180 : 167 - 171