MEDA: Multi-output Encoder-Decoder for Spatial Attention in Convolutional Neural Networks

被引:0
|
作者
Li, Huayu [1 ]
Razi, Abolfazl [1 ]
机构
[1] No Arizona Univ, Sch Informat Comp & Cyber Syst, Flagstaff, AZ 86011 USA
关键词
Attention Mechanism; Deep Learning; Encoder-Decoder Architecture; Convolutional Networks;
D O I
10.1109/ieeeconf44664.2019.9048981
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Utilizing channel-wise spatial attention mechanisms to emphasize special parts of an input image is an effective method to improve the performance of convolutional neural networks (CNNs). There are multiple effective implementations of attention mechanism. One is adding squeeze-and-excitation (SE) blocks to the CNN structure that selectively emphasize the most informative channels and suppress the relatively less informative channels by taking advantage of channel dependence. Another method is adding convolutional block attention module (CBAM) to implement both channel-wise and spatial attention mechanisms to select important pixels of the feature maps while emphasizing informative channels. In this paper, we propose an encoder-decoder architecture based on the idea of letting the channel-wise and spatial attention blocks share the same latent space representation. Instead of separating the channel-wise and spatial attention modules into two independent parts in CBAM, we combine them into one encoder-decoder architecture with two outputs. To evaluate the performance of the proposed algorithm, we apply it to different CNN architectures and test it on image classification and semantic segmentation. Through comparing the resulting structure equipped with MEDA blocks against other attention module, we show that the proposed method achieves better performance across different test scenarios.
引用
收藏
页码:2087 / 2091
页数:5
相关论文
共 50 条
  • [41] Street-view Change Detection via Siamese Encoder-decoder Structured Convolutional Neural Networks
    Zhao, Xinwei
    Li, Haichang
    Wang, Rui
    Zheng, Changwen
    Shi, Song
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 525 - 532
  • [42] Pedestrian Detection at Night in Infrared Images Using an Attention-Guided Encoder-Decoder Convolutional Neural Network
    Chen, Yunfan
    Shin, Hyunchul
    APPLIED SCIENCES-BASEL, 2020, 10 (03):
  • [43] Empowering Regional Rainfall-Runoff Modeling Through Encoder-Decoder Based on Convolutional Neural Networks
    Jiang, Wei
    Dang, Xupeng
    Zhang, Rui
    WATER, 2025, 17 (03)
  • [44] Interpretable Transformations with Encoder-Decoder Networks
    Worrall, Daniel E.
    Garbin, Stephan J.
    Turmukhambetov, Daniyar
    Brostow, Gabriel J.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5737 - 5746
  • [45] Encoder-Decoder Convolutional Neural Network based Iris-Sclera Segmentation
    Sahin, Gurkan
    Susuz, Orkun
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [46] CEDRNN: A Convolutional Encoder-Decoder Residual Neural Network for Liver Tumour Segmentation
    Arivazhagan Selvaraj
    Emerson Nithiyaraj
    Neural Processing Letters, 2023, 55 : 1605 - 1624
  • [47] Convolutional neural network based encoder-decoder architectures for semantic segmentation of plants
    Kolhar, Shrikrishna
    Jagtap, Jayant
    ECOLOGICAL INFORMATICS, 2021, 64
  • [48] Fusing Spatial Attention with Spectral-Channel Attention Mechanism for Hyperspectral Image Classification via Encoder-Decoder Networks
    Sun, Jun
    Zhang, Junbo
    Gao, Xuesong
    Wang, Mantao
    Ou, Dinghua
    Wu, Xiaobo
    Zhang, Dejun
    REMOTE SENSING, 2022, 14 (09)
  • [49] CEDRNN: A Convolutional Encoder-Decoder Residual Neural Network for Liver Tumour Segmentation
    Selvaraj, Arivazhagan
    Nithiyaraj, Emerson
    NEURAL PROCESSING LETTERS, 2023, 55 (02) : 1605 - 1624
  • [50] Low-Dose CT With a Residual Encoder-Decoder Convolutional Neural Network
    Chen, Hu
    Zhang, Yi
    Kalra, Mannudeep K.
    Lin, Feng
    Chen, Yang
    Liao, Peixi
    Zhou, Jiliu
    Wang, Ge
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (12) : 2524 - 2535