SA-NET: SHUFFLE ATTENTION FOR DEEP CONVOLUTIONAL NEURAL NETWORKS

被引:597
|
作者
Zhang, Qing-Long [1 ]
Yang, Yu-Bin [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
关键词
spatial attention; channel attention; channel shuffle; grouped features;
D O I
10.1109/ICASSP39728.2021.9414568
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Attention mechanisms, which enable a neural network to accurately focus on all the relevant elements of the input, have become an essential component to improve the performance of deep neural networks. There are mainly two attention mechanisms widely used in computer vision studies, spatial attention and channel attention, which aim to capture the pixel-level pairwise relationship and channel dependency, respectively. Although fusing them together may achieve better performance than their individual implementations, it will inevitably increase the computational overhead. In this paper, we propose an efficient Shuffle Attention (SA) module to address this issue, which adopts Shuffle Units to combine two types of attention mechanisms effectively. Specifically, SA first groups channel dimensions into multiple sub-features before processing them in parallel. Then, for each sub-feature, SA utilizes a Shuffle Unit to depict feature dependencies in both spatial and channel dimensions. After that, all sub-features are aggregated and a "channel shuffle" operator is adopted to enable information communication between different sub-features. The proposed SA module is efficient yet effective, e.g., the parameters and computations of SA against the backbone ResNet50 are 300 vs. 25.56M and 2.76e-3 GFLOPs vs. 4.12 GFLOPs, respectively, and the performance boost is more than 1.34% in terms of Top-1 accuracy. Extensive experimental results on commonused benchmarks, including ImageNet-1k for classification, MS COCO for object detection, and instance segmentation, demonstrate that the proposed SA outperforms the current SOTA methods significantly by achieving higher accuracy while having lower model complexity.
引用
收藏
页码:2235 / 2239
页数:5
相关论文
共 50 条
  • [41] Guiding visual attention in deep convolutional neural networks based on human eye movements
    van Dyck, Leonard Elia
    Denzler, Sebastian Jochen
    Gruber, Walter Roland
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [42] SA-Net: Robust State-Action Recognition for Learning from Observations
    Soans, Nihal
    Asali, Ehsan
    Hong, Yi
    Doshi, Prashant
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2153 - 2159
  • [43] QCA-Net: Quantum-based Channel Attention for Deep Neural Networks
    Zhang, Juntao
    Cheng, Peng
    Li, Zehan
    Wu, Hao
    An, Wenbo
    Zhou, Jun
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [44] Glaucoma disease detection using stacked attention U-Net and deep convolutional neural network
    Murugesan, Malathi
    Laseetha, T. S. Jeyali
    Sundaram, Senthilkumar
    Kandasamy, Hariprasath
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (01) : 1603 - 1616
  • [45] Visual Attention with Deep Neural Networks
    Canziani, Alfredo
    Culurciello, Eugenio
    2015 49TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2015,
  • [46] Street-Frontage-Net: urban image classification using deep convolutional neural networks
    Law, Stephen
    Seresinhe, Chanuki Illushka
    Shen, Yao
    Gutierrez-Roig, Mario
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2020, 34 (04) : 681 - 707
  • [47] Breast Cancer Detection in Thermography Using Convolutional Neural Networks (CNNs) with Deep Attention Mechanisms
    Alshehri, Alia
    AlSaeed, Duaa
    APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [48] Deep Convolutional Symmetric Encoder-Decoder Neural Networks to Predict Students' Visual Attention
    Hachaj, Tomasz
    Stolinska, Anna
    Andrzejewska, Magdalena
    Czerski, Piotr
    SYMMETRY-BASEL, 2021, 13 (12):
  • [49] Fpar: filter pruning via attention and rank enhancement for deep convolutional neural networks acceleration
    Chen, Yanming
    Wu, Gang
    Shuai, Mingrui
    Lou, Shubin
    Zhang, Yiwen
    An, Zhulin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (07) : 2973 - 2985
  • [50] Automatic Food Recognition Using Deep Convolutional Neural Networks with Self-attention Mechanism
    Rahib Abiyev
    Joseph Adepoju
    Human-Centric Intelligent Systems, 2024, 4 (1): : 171 - 186