BR-NPA: A non-parametric high-resolution attention model to improve the interpretability of attention

被引：0

作者：

Gomez T. ^{[1
,3
]}

Ling S. ^{[1
,3
]}

Fréour T. ^{[2
,4
,5
,6
]}

Mouchère H. ^{[1
,3
]}

机构：

[1] Christian Pauc Street, Nantes

[2] 63 Magellan Quai, Nantes

[3] Nantes University, CNRS, LS2N, CNRS UMR 6004, Nantes

[4] Nantes University Hospital, Department of Reproductive Medicine and Biology, Nantes

[5] Nantes University, Nantes University Hospital, Inserm, CRTI, Inserm UMR 1064, Nantes

[6] Nantes University, Nantes University Hospital, Inserm, CNRS, SFR Santé, Inserm UMS 016, CNRS UMS, 3556, Nantes

来源：

Pattern Recognition | 2022年 / 132卷

关键词：

Deep learning; Interpretability; Non-parametric; Resolution; Spatial attention;

D O I：

10.1016/j.patcog.2022.108927

中图分类号：

TB18 [人体工程学]; Q98 [人类学];

学科分类号：

030303 ; 1201 ;

摘要：

The prevalence of employing attention mechanisms has brought along concerns about the interpretability of attention distributions. Although it provides insights into how a model is operating, utilizing attention as the explanation of model predictions is still highly dubious. The community is still seeking more interpretable strategies for better identifying local active regions that contribute the most to the final decision. To improve the interpretability of existing attention models, we propose a novel Bilinear Representative Non-Parametric Attention (BR-NPA) strategy that captures the task-relevant human-interpretable information. The target model is first distilled to have higher-resolution intermediate feature maps. From which, representative features are then grouped based on local pairwise feature similarity, to produce finer-grained, more precise attention maps highlighting task-relevant parts of the input. The obtained attention maps are ranked according to the activity level of the compound feature, which provides information regarding the important level of the highlighted regions. The proposed model can be easily adapted in a wide variety of modern deep models, where classification is involved. Extensive quantitative and qualitative experiments showcase more comprehensive and accurate visual explanations compared to state-of-the-art attention models and visualization methods across multiple tasks including fine-grained image classification, few-shot classification, and person re-identification, without compromising the classification accuracy. The proposed visualization model sheds imperative light on how neural networks ‘pay their attention’ differently in different tasks. © 2022

引用

共 50 条

[21] Ensemble model with cascade attention mechanism for high-resolution remote sensing image scene classification
Li, Fengpeng
Feng, Ruyi
Han, Wei
Wang, Lizhe
OPTICS EXPRESS, 2020, 28 (15) : 22358 - 22387
[22] NLKFill: high-resolution image inpainting with a novel large kernel attention
Wang, Ting
Xiang, Dong
Yang, Chuan
Liang, Jiaying
Shi, Canghong
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 4921 - 4938
[23] Microdisplays' performance, high-resolution draw industry attention at SID '99
Ajluni, C
ELECTRONIC DESIGN, 1999, 47 (13) : 29 - 29
[24] Multiple Attention Siamese Network for High-Resolution Image Change Detection
Huang, Jiru
Shen, Qian
Wang, Min
Yang, Mengyuan
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[25] Lightweight Attention Network for Very High-Resolution Image Semantic Segmentation
Guan, Renchu
Wang, Mingming
Bruzzone, Lorenzo
Zhao, Haishi
Yang, Chen
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[26] Integrating Gate and Attention Modules for High-Resolution Image Semantic Segmentation
Zheng, Zixian
Zhang, Xueliang
Xiao, Pengfeng
Li, Zhenshi
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 4530 - 4546
[27] High-Resolution Remote Sensing Image Captioning Based on Structured Attention
Zhao, Rui
Shi, Zhenwei
Zou, Zhengxia
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[28] A High-Resolution Network Based on Feature Redundancy Reduction and Attention Mechanism
Pan, Yuqing
Lan, Weiming
Xu, Feng
Ren, Qinghua
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 510 - 521
[29] Visual attention model based mining area recognition on massive high-resolution remote sensing images
Song, Xiaolu
He, Guojin
Zhang, Zhaoming
Long, Tengfei
Peng, Yan
Wang, Zhihua
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 541 - 548
[30] Visual attention model based mining area recognition on massive high-resolution remote sensing images
Xiaolu Song
Guojin He
Zhaoming Zhang
Tengfei Long
Yan Peng
Zhihua Wang
Cluster Computing, 2015, 18 : 541 - 548

← 1 2 3 4 5 →