TENet: Accurate light-field salient object detection with a transformer embedding network

被引:7
|
作者
Wang, Xingzheng [1 ]
Chen, Songwei [1 ]
Wei, Guoyao [1 ]
Liu, Jiehao [1 ]
机构
[1] Shenzhen Univ, Coll Mechatron & Control Engn, Shenzhen 518060, Peoples R China
关键词
Light-field; Salient object detection; Transformer;
D O I
10.1016/j.imavis.2022.104595
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current light-field salient object detection methods have difficulty in accurately distinguishing objects from com-plex backgrounds. In this paper, we believe that this problem can be mitigated by optimizing feature fusion and enlarging receptive field, and thus propose a novel transformer embedding network named TENet. The main idea of the network is to (1) selectively aggregate multi-features for fuller feature fusion; (2) integrate the Trans-former for larger receptive field, so as to accurately identify salient objects. For the former, firstly, a multi -modal feature fusion module (MMFF) is designed to mine the different contributions of multi-modal features (i.e., all-in-focus image features and focal stack features). Then, a multi-level feature fusion module (MLFF) is de-veloped to iteratively select and fuse complementary cues from multi-level features. For the latter, we integrate the Transformer for the first time and propose a transformer-based feature enhancement module (TFE), to pro-vide a wider receptive field for each pixel of high-level features. To validate our idea, we comprehensively eval-uate the performance of our TENet on three challenging datasets. Experimental results show that our method outperforms the state-of-the-art method, e.g., the detection accuracy is improved by 28.1%, 20.3%, and 14.9% in MAE metric, respectively.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Light Field Salient Object Detection via Hybrid Priors
    Zhang, Junlin
    Wang, Xu
    MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 361 - 372
  • [32] Gated Multi-Modal Edge Refinement Network for Light Field Salient Object Detection
    Li, Yefan
    Duan, Fuqing
    Lu, Ke
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (10)
  • [33] LFBCNet: Light Field Boundary-aware and Cascaded Interaction Network for Salient Object Detection
    Wang, Mianzhao
    Shi, Fan
    Cheng, Xu
    Zhao, Meng
    Zhang, Yao
    Jia, Chen
    Tian, Weiwei
    Chen, Shengyong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3430 - 3439
  • [34] Light Field Salient Object Detection With Sparse Views via Complementary and Discriminative Interaction Network
    Chen, Yilei
    Li, Gongyang
    An, Ping
    Liu, Zhi
    Huang, Xinpeng
    Wu, Qiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (02) : 1070 - 1085
  • [35] ARFNet: Attention-Oriented Refinement and Fusion Network for Light Field Salient Object Detection
    Ma, Shuai
    Zhu, Licheng
    Chen, Xianfeng
    Yan, Xu
    Wang, Shuai
    Yang, Ping
    Xu, Bing
    IEEE SYSTEMS JOURNAL, 2022, 16 (04): : 5950 - 5961
  • [36] A Simple Yet Effective Network Based on Vision Transformer for Camouflaged Object and Salient Object Detection
    Hao, Chao
    Yu, Zitong
    Liu, Xin
    Xu, Jun
    Yue, Huanjing
    Yang, Jingyu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 608 - 622
  • [37] Transformer-based Cross Reference Network for video salient object detection
    Huang, Kan
    Tian, Chunwei
    Su, Jingyong
    Lin, Jerry Chun-Wei
    PATTERN RECOGNITION LETTERS, 2022, 160 : 122 - 127
  • [38] GroupTransNet: Group transformer network for RGB-D salient object detection
    Fang, Xian
    Jiang, Mingfeng
    Zhu, Jinchao
    Shao, Xiuli
    Wang, Hongpeng
    NEUROCOMPUTING, 2024, 594
  • [39] Mirror complementary transformer network for RGB-thermal salient object detection
    Jiang, Xiurong
    Hou, Yifan
    Tian, Hui
    Zhu, Lin
    IET COMPUTER VISION, 2024, 18 (01) : 15 - 32
  • [40] Salient object detection based on Pyramid Vision Transformer-gated network
    Zhou, Xiaoli
    Huo, Lina
    Wang, Wei
    Hao, Peng
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)