TENet: Accurate light-field salient object detection with a transformer embedding network

被引:7
|
作者
Wang, Xingzheng [1 ]
Chen, Songwei [1 ]
Wei, Guoyao [1 ]
Liu, Jiehao [1 ]
机构
[1] Shenzhen Univ, Coll Mechatron & Control Engn, Shenzhen 518060, Peoples R China
关键词
Light-field; Salient object detection; Transformer;
D O I
10.1016/j.imavis.2022.104595
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current light-field salient object detection methods have difficulty in accurately distinguishing objects from com-plex backgrounds. In this paper, we believe that this problem can be mitigated by optimizing feature fusion and enlarging receptive field, and thus propose a novel transformer embedding network named TENet. The main idea of the network is to (1) selectively aggregate multi-features for fuller feature fusion; (2) integrate the Trans-former for larger receptive field, so as to accurately identify salient objects. For the former, firstly, a multi -modal feature fusion module (MMFF) is designed to mine the different contributions of multi-modal features (i.e., all-in-focus image features and focal stack features). Then, a multi-level feature fusion module (MLFF) is de-veloped to iteratively select and fuse complementary cues from multi-level features. For the latter, we integrate the Transformer for the first time and propose a transformer-based feature enhancement module (TFE), to pro-vide a wider receptive field for each pixel of high-level features. To validate our idea, we comprehensively eval-uate the performance of our TENet on three challenging datasets. Experimental results show that our method outperforms the state-of-the-art method, e.g., the detection accuracy is improved by 28.1%, 20.3%, and 14.9% in MAE metric, respectively.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Recursive Contour-Saliency Blending Network for Accurate Salient Object Detection
    Ke, Yun Yi
    Tsubono, Takahiro
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1360 - 1370
  • [42] Spatial Attention-Guided Light Field Salient Object Detection Network With Implicit Neural Representation
    Zheng, Xin
    Li, Zhengqu
    Liu, Deyang
    Zhou, Xiaofei
    Shan, Caifeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12437 - 12449
  • [43] Occlusion-aware Bi-directional Guided Network for Light Field Salient Object Detection
    Jing, Dong
    Zhang, Shuo
    Cong, Runmin
    Lin, Youfang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1692 - 1701
  • [44] LRNet: lightweight attention-oriented residual fusion network for light field salient object detection
    Ma, Shuai
    Zhu, Xusheng
    Xu, Long
    Zhou, Li
    Chen, Daixin
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [45] LFSamba: Marry SAM With Mamba for Light Field Salient Object Detection
    Liu, Zhengyi
    Wang, Longzhen
    Fang, Xianyong
    Tu, Zhengzheng
    Wang, Linbo
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 3144 - 3148
  • [46] Memory-oriented Decoder for Light Field Salient Object Detection
    Zhang, Miao
    Li, Jingjing
    Ji, Wei
    Piao, Yongri
    Lu, Huchuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [47] MEANet: Multi-modal edge-aware network for light field salient object detection
    Jiang, Yao
    Zhang, Wenbo
    Fu, Keren
    Zhao, Qijun
    NEUROCOMPUTING, 2022, 491 : 78 - 90
  • [48] CATNet: A Cascaded and Aggregated Transformer Network for RGB-D Salient Object Detection
    Sun, Fuming
    Ren, Peng
    Yin, Bowen
    Wang, Fasheng
    Li, Haojie
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2249 - 2262
  • [49] Superpixel attention guided network for accurate and real-time salient object detection
    Zhou, Zhiheng
    Guo, Yongfan
    Huang, Junchu
    Dai, Ming
    Deng, Ming
    Yu, Qingjun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (27) : 38921 - 38944
  • [50] Superpixel attention guided network for accurate and real-time salient object detection
    Zhiheng Zhou
    Yongfan Guo
    Junchu Huang
    Ming Dai
    Ming Deng
    Qingjun Yu
    Multimedia Tools and Applications, 2022, 81 : 38921 - 38944