Pyramidal Attention for Saliency Detection

被引:12
|
作者
Hussain, Tanveer [1 ]
Anwar, Abbas [2 ]
Anwar, Saeed [3 ,4 ,5 ,6 ]
Petersson, Lars [4 ]
Baik, Sung Wook [1 ]
机构
[1] Sejong Univ, Seoul, South Korea
[2] Abdul Wali Khan Univ, Mardan, Khyber Pakhtunk, Pakistan
[3] Australian Natl Univ, Canberra, ACT, Australia
[4] Data61 CSIRO, Canberra, ACT, Australia
[5] Univ Technol Sydney, Sydney, NSW, Australia
[6] Univ Canberra, Canberra, ACT, Australia
关键词
D O I
10.1109/CVPRW56347.2022.00325
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Salient object detection (SOD) extracts meaningful contents from an input image. RGB-based SOD methods lack the complementary depth clues; hence, providing limited performance for complex scenarios. Similarly, RGB-D models process RGB and depth inputs, but the depth data availability during testing may hinder the model's practical applicability. This paper exploits only RGB images, estimates depth from RGB, and leverages the intermediate depth features. We employ a pyramidal attention structure to extract multi-level convolutional-transformer features to process initial stage representations and further enhance the subsequent ones. At each stage, the backbone transformer model produces global receptive fields and computing in parallel to attain fine-grained global predictions refined by our residual convolutional attention decoder for optimal saliency prediction. We report significantly improved performance against 21 and 40 state-of-the-art SOD methods on eight RGB and RGB-D datasets, respectively. Consequently, we present a new SOD perspective of generating RGB-D SOD without acquiring depth data during training and testing and assist RGB methods with depth clues for improved performance. The code and trained models are available at https://github.com/tanveer-hussain/EfficientSOD2
引用
收藏
页码:2877 / 2887
页数:11
相关论文
共 50 条
  • [31] Wheat Spikes Detection Method Based on Pyramidal Network of Attention Mechanism
    Zhang Q.
    Hu S.
    Shu W.
    Cheng H.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2021, 52 (11): : 253 - 262
  • [32] Synthetic aperture radar image change detection using saliency detection and attention capsule network
    Wang, Shaona
    Wang, Di
    Shi, Jia
    Zhang, Zhenghua
    Li, Xiang
    Guo, Yanmiao
    JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (01)
  • [33] Novelty competes with saliency for attention
    Ernst, Daniel
    Becker, Stefanie
    Horstmann, Gernot
    VISION RESEARCH, 2020, 168 : 42 - 52
  • [34] PiCANet: Pixel-Wise Contextual Attention Learning for Accurate Saliency Detection
    Liu, Nian
    Han, Junwei
    Yang, Ming-Hsuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 6438 - 6451
  • [35] Attention enhanced machine instinctive vision with human-inspired saliency detection
    Khan, Habib
    Usman, Muhammad Talha
    Rida, Imad
    Koo, Jakeoung
    IMAGE AND VISION COMPUTING, 2024, 152
  • [36] Multispectral Pedestrian Detection Based on Prior-Saliency Attention and Image Fusion
    Guo, Jiaren
    Huang, Zihao
    Tao, Yanyun
    ELECTRONICS, 2024, 13 (09)
  • [37] Robust Deep Co-Saliency Detection With Group Semantic and Pyramid Attention
    Zha, Zheng-Jun
    Wang, Chong
    Liu, Dong
    Xie, Hongtao
    Zhang, Yongdong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2398 - 2408
  • [38] Learning Selective Mutual Attention and Contrast for RGB-D Saliency Detection
    Liu, Nian
    Zhang, Ni
    Shao, Ling
    Han, Junwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 9026 - 9042
  • [39] A Deep Model of Visual Attention for Saliency Detection on 3D Objects
    Rouhafzay, Ghazal
    Cretu, Ana-Maria
    Payeur, Pierre
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 8847 - 8867
  • [40] Evaluating the Effect of Saliency Detection and Attention Manipulation in Human-Robot Interaction
    Guido Schillaci
    Saša Bodiroža
    Verena Vanessa Hafner
    International Journal of Social Robotics, 2013, 5 : 139 - 152