BR-NPA: A non-parametric high-resolution attention model to improve the interpretability of attention

被引:0
|
作者
Gomez T. [1 ,3 ]
Ling S. [1 ,3 ]
Fréour T. [2 ,4 ,5 ,6 ]
Mouchère H. [1 ,3 ]
机构
[1] Christian Pauc Street, Nantes
[2] 63 Magellan Quai, Nantes
[3] Nantes University, CNRS, LS2N, CNRS UMR 6004, Nantes
[4] Nantes University Hospital, Department of Reproductive Medicine and Biology, Nantes
[5] Nantes University, Nantes University Hospital, Inserm, CRTI, Inserm UMR 1064, Nantes
[6] Nantes University, Nantes University Hospital, Inserm, CNRS, SFR Santé, Inserm UMS 016, CNRS UMS, 3556, Nantes
关键词
Deep learning; Interpretability; Non-parametric; Resolution; Spatial attention;
D O I
10.1016/j.patcog.2022.108927
中图分类号
TB18 [人体工程学]; Q98 [人类学];
学科分类号
030303 ; 1201 ;
摘要
The prevalence of employing attention mechanisms has brought along concerns about the interpretability of attention distributions. Although it provides insights into how a model is operating, utilizing attention as the explanation of model predictions is still highly dubious. The community is still seeking more interpretable strategies for better identifying local active regions that contribute the most to the final decision. To improve the interpretability of existing attention models, we propose a novel Bilinear Representative Non-Parametric Attention (BR-NPA) strategy that captures the task-relevant human-interpretable information. The target model is first distilled to have higher-resolution intermediate feature maps. From which, representative features are then grouped based on local pairwise feature similarity, to produce finer-grained, more precise attention maps highlighting task-relevant parts of the input. The obtained attention maps are ranked according to the activity level of the compound feature, which provides information regarding the important level of the highlighted regions. The proposed model can be easily adapted in a wide variety of modern deep models, where classification is involved. Extensive quantitative and qualitative experiments showcase more comprehensive and accurate visual explanations compared to state-of-the-art attention models and visualization methods across multiple tasks including fine-grained image classification, few-shot classification, and person re-identification, without compromising the classification accuracy. The proposed visualization model sheds imperative light on how neural networks ‘pay their attention’ differently in different tasks. © 2022
引用
收藏
相关论文
共 50 条
  • [21] Ensemble model with cascade attention mechanism for high-resolution remote sensing image scene classification
    Li, Fengpeng
    Feng, Ruyi
    Han, Wei
    Wang, Lizhe
    OPTICS EXPRESS, 2020, 28 (15) : 22358 - 22387
  • [22] NLKFill: high-resolution image inpainting with a novel large kernel attention
    Wang, Ting
    Xiang, Dong
    Yang, Chuan
    Liang, Jiaying
    Shi, Canghong
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 4921 - 4938
  • [24] Multiple Attention Siamese Network for High-Resolution Image Change Detection
    Huang, Jiru
    Shen, Qian
    Wang, Min
    Yang, Mengyuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [25] Lightweight Attention Network for Very High-Resolution Image Semantic Segmentation
    Guan, Renchu
    Wang, Mingming
    Bruzzone, Lorenzo
    Zhao, Haishi
    Yang, Chen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [26] Integrating Gate and Attention Modules for High-Resolution Image Semantic Segmentation
    Zheng, Zixian
    Zhang, Xueliang
    Xiao, Pengfeng
    Li, Zhenshi
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 4530 - 4546
  • [27] High-Resolution Remote Sensing Image Captioning Based on Structured Attention
    Zhao, Rui
    Shi, Zhenwei
    Zou, Zhengxia
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [28] A High-Resolution Network Based on Feature Redundancy Reduction and Attention Mechanism
    Pan, Yuqing
    Lan, Weiming
    Xu, Feng
    Ren, Qinghua
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 510 - 521
  • [29] Visual attention model based mining area recognition on massive high-resolution remote sensing images
    Song, Xiaolu
    He, Guojin
    Zhang, Zhaoming
    Long, Tengfei
    Peng, Yan
    Wang, Zhihua
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 541 - 548
  • [30] Visual attention model based mining area recognition on massive high-resolution remote sensing images
    Xiaolu Song
    Guojin He
    Zhaoming Zhang
    Tengfei Long
    Yan Peng
    Zhihua Wang
    Cluster Computing, 2015, 18 : 541 - 548