An attribution graph-based interpretable method for CNNs

Cited by: 6

Authors:

Zheng, Xiangwei [1,3]

Zhang, Lifeng [1,3]

Xu, Chunyan [2,3]

Chen, Xuanchi [1,3]

Cui, Zhen

Affiliations:

[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Shandong, Peoples R China

[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China

[3] State Key Lab High End Server & Storage Technol, Jinan 250300, Shandong, Peoples R China

Source:

NEURAL NETWORKS | 2024 / Vol. 179

Funding:

National Natural Science Foundation of China;

Keywords:

Interpretable CNN; Attribution graph; GCN; Kernel importance;

DOI:

10.1016/j.neunet.2024.106597

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Convolutional Neural Networks (CNNs) have demonstrated outstanding performance in various domains, such as face recognition, object detection, and image segmentation. However, the lack of transparency and limited interpretability inherent in CNNs pose challenges in fields such as medical diagnosis, autonomous driving, finance, and military applications. Several studies have explored the interpretability of CNNs and proposed various post-hoc interpretable methods. The majority of these methods are feature-based, focusing on the influence of input variables on outputs. Few methods analyze the parameters of CNNs and their overall structure. To explore the structure of CNNs and intuitively comprehend the role of their internal parameters, we propose an Attribution Graph-based Interpretable method for CNNs (AGIC), which models the overall structure of CNNs as graphs and provides interpretability from global and local perspectives. The runtime parameters of CNNs and the feature maps of each image sample are used to construct attribution graphs (At-GCs), where the convolutional kernels are represented as nodes and the SHAP values between kernel outputs are assigned as edges. These At-GCs are then employed to pretrain a newly designed heterogeneous graph encoder based on Deep Graph Infomax (DGI). To comprehensively examine the overall structure of CNNs, the pretrained encoder is used for two types of interpretable tasks: (1) a classifier is attached to the pretrained encoder for the classification of At-GCs, revealing the dependency of the topological characteristics of At-GCs on the image sample categories, and (2) a scoring aggregation (SA) network is constructed to assess the importance of each node in At-GCs, thus reflecting the relative importance of kernels in CNNs. The experimental results indicate that the topological characteristics of an At-GC depend on the sample category used in its construction, which reveals that kernels in CNNs show distinct combined activation patterns when processing different image categories. Meanwhile, the kernels that receive high scores from the SA network are crucial for feature extraction, whereas low-scoring kernels can be pruned without affecting model performance, thereby enhancing the interpretability of CNNs.
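
The graph construction described in the abstract (kernels as nodes, attribution values between kernel outputs as edges) can be illustrated with a minimal sketch. Everything named here is an assumption for illustration: the functions build_attribution_graph and naive_node_scores are hypothetical, the edge weights are made-up placeholders rather than real SHAP values, and the scoring rule (summed absolute weight of incident edges) is a simple stand-in, not the paper's SA network or DGI-pretrained encoder.

```python
def build_attribution_graph(layer_sizes, edge_weights):
    """Build a layered graph whose nodes are (layer, kernel) pairs.

    layer_sizes: number of kernels per convolutional layer, e.g. [2, 3].
    edge_weights: edge_weights[l][i][j] is a placeholder attribution value
        from kernel i in layer l to kernel j in layer l + 1. The abstract
        says SHAP values are used; computing them is outside this sketch.
    """
    nodes = [(l, k) for l, n in enumerate(layer_sizes) for k in range(n)]
    edges = {}
    for l, matrix in enumerate(edge_weights):
        for i, row in enumerate(matrix):
            for j, w in enumerate(row):
                if w != 0.0:  # keep only non-zero attributions as edges
                    edges[((l, i), (l + 1, j))] = w
    return nodes, edges


def naive_node_scores(nodes, edges):
    """Score each node by the summed absolute weight of its incident edges.

    Assumption: this is a hand-written heuristic used only to show what
    "node importance" could look like; it is NOT the SA network.
    """
    scores = {n: 0.0 for n in nodes}
    for (src, dst), w in edges.items():
        scores[src] += abs(w)
        scores[dst] += abs(w)
    return scores


# Toy network: 2 kernels in layer 0, 3 kernels in layer 1 (made-up values).
layer_sizes = [2, 3]
edge_weights = [[[0.5, 0.0, -0.25],
                 [0.125, 0.0, 0.0]]]

nodes, edges = build_attribution_graph(layer_sizes, edge_weights)
scores = naive_node_scores(nodes, edges)

# Kernels sorted from most to least important under the naive heuristic;
# the tail of this list would be the first pruning candidates.
ranked = sorted(nodes, key=lambda n: scores[n], reverse=True)
```

In this toy graph, kernel (1, 1) has no incident edges and lands at the bottom of the ranking, which mirrors the abstract's observation that low-scoring kernels are the natural candidates for pruning.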


Pages: 17
