Pyramidal Attention for Saliency Detection

被引:12
|
作者
Hussain, Tanveer [1 ]
Anwar, Abbas [2 ]
Anwar, Saeed [3 ,4 ,5 ,6 ]
Petersson, Lars [4 ]
Baik, Sung Wook [1 ]
机构
[1] Sejong Univ, Seoul, South Korea
[2] Abdul Wali Khan Univ, Mardan, Khyber Pakhtunk, Pakistan
[3] Australian Natl Univ, Canberra, ACT, Australia
[4] Data61 CSIRO, Canberra, ACT, Australia
[5] Univ Technol Sydney, Sydney, NSW, Australia
[6] Univ Canberra, Canberra, ACT, Australia
关键词
D O I
10.1109/CVPRW56347.2022.00325
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Salient object detection (SOD) extracts meaningful contents from an input image. RGB-based SOD methods lack the complementary depth clues; hence, providing limited performance for complex scenarios. Similarly, RGB-D models process RGB and depth inputs, but the depth data availability during testing may hinder the model's practical applicability. This paper exploits only RGB images, estimates depth from RGB, and leverages the intermediate depth features. We employ a pyramidal attention structure to extract multi-level convolutional-transformer features to process initial stage representations and further enhance the subsequent ones. At each stage, the backbone transformer model produces global receptive fields and computing in parallel to attain fine-grained global predictions refined by our residual convolutional attention decoder for optimal saliency prediction. We report significantly improved performance against 21 and 40 state-of-the-art SOD methods on eight RGB and RGB-D datasets, respectively. Consequently, we present a new SOD perspective of generating RGB-D SOD without acquiring depth data during training and testing and assist RGB methods with depth clues for improved performance. The code and trained models are available at https://github.com/tanveer-hussain/EfficientSOD2
引用
收藏
页码:2877 / 2887
页数:11
相关论文
共 50 条
  • [1] Diffuse visual attention for saliency detection
    Liu, Risheng
    Zhong, Guangyu
    Cao, Junjie
    Su, Zhixun
    JOURNAL OF ELECTRONIC IMAGING, 2015, 24 (01)
  • [2] Saliency Detection with Deformable Convolution and Feature Attention
    Zhang, Zhe
    Ma, Junhui
    Xu, Panpan
    Wang, Wencheng
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2800 - 2807
  • [3] Pyramid Feature Attention Network for Saliency detection
    Zhao, Ting
    Wu, Xiangqian
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3080 - 3089
  • [4] Global attention network for collaborative saliency detection
    Li, Ce
    Xuan, Shuxing
    Liu, Fenghua
    Chang, Enbing
    Wu, Hailei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (02) : 407 - 417
  • [5] Global attention network for collaborative saliency detection
    Ce Li
    Shuxing Xuan
    Fenghua Liu
    Enbing Chang
    Hailei Wu
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 407 - 417
  • [6] Visual Attention Modeling in Compressed Domain:From Image Saliency Detection to Video Saliency Detection
    FANG Yuming
    ZHANG Xiaoqiang
    ZTECommunications, 2019, 17 (01) : 31 - 37
  • [7] Self-attention recurrent network for saliency detection
    Sun, Fengdong
    Li, Wenhui
    Guan, Yuanyuan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (21) : 30793 - 30807
  • [8] Self-attention recurrent network for saliency detection
    Fengdong Sun
    Wenhui Li
    Yuanyuan Guan
    Multimedia Tools and Applications, 2019, 78 : 30793 - 30807
  • [9] Saliency Attention Based Abnormal Event Detection in Video
    Huan, Wang
    Guo, Huiwen
    Wu, Xinyu
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS IEEE-ROBIO 2014, 2014, : 1039 - 1043
  • [10] Visual-Patch-Attention-Aware Saliency Detection
    Jian, Muwei
    Lam, Kin-Man
    Dong, Junyu
    Shen, Linlin
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (08) : 1575 - 1586