CSNet: a ConvNeXt-based Siamese network for RGB-D salient object detection

被引:0
|
作者
Yunhua Zhang
Hangxu Wang
Gang Yang
Jianhao Zhang
Congjin Gong
Yutao Wang
机构
[1] Northeastern University,
[2] DUT Artificial Intelligence Institute,undefined
来源
The Visual Computer | 2024年 / 40卷
关键词
Salient object detection; Siamese network; ConvNeXt; RGB-D SOD; Multi-modality;
D O I
暂无
中图分类号
学科分类号
摘要
Global contexts are critical to locating salient objects for salient object detection (SOD). However, the convolution operation in CNNs has a local receptive field, which cannot capture long-distance global information. Recent studies have shown that modernized CNN models with large kernel convolution, such as ConvNeXt, can effectively extend the receptive fields. Based on it, this paper explores the potential of large kernel CNN for SOD task. Inspired by the common information between RGB and depth images in salient objects, we propose a ConvNeXt-based Siamese network with shared weight parameters. This structural design can effectively reduce the number of parameters without sacrificing performance. Furthermore, a depth information preprocessing module is proposed to minimize the impact of low-quality depth images on predicted saliency maps. For cross-modal feature interaction, a dynamic fusion module is designed to enhance cross-modal complementarity dynamically. Extensive experiments and evaluation results on six benchmark datasets demonstrate the outstanding performance of the proposed method against 14 state-of-the-art RGB-D methods. Our code will be released at https://github.com/zyh5119232/CSNet.
引用
收藏
页码:1805 / 1823
页数:18
相关论文
共 50 条
  • [21] Hybrid-Attention Network for RGB-D Salient Object Detection
    Chen, Yuzhen
    Zhou, Wujie
    APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [22] Feature Calibrating and Fusing Network for RGB-D Salient Object Detection
    Zhang, Qiang
    Qin, Qi
    Yang, Yang
    Jiao, Qiang
    Han, Jungong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1493 - 1507
  • [23] GroupTransNet: Group transformer network for RGB-D salient object detection
    Fang, Xian
    Jiang, Mingfeng
    Zhu, Jinchao
    Shao, Xiuli
    Wang, Hongpeng
    NEUROCOMPUTING, 2024, 594
  • [24] Asymmetric deep interaction network for RGB-D salient object detection
    Wang, Feifei
    Li, Yongming
    Wang, Liejun
    Zheng, Panpan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 266
  • [25] Triple-Complementary Network for RGB-D Salient Object Detection
    Huang, Rui
    Xing, Yan
    Zou, Yaobin
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (27) : 775 - 779
  • [26] DMNet: Dynamic Memory Network for RGB-D Salient Object Detection
    Du, Haishun
    Zhang, Zhen
    Zhang, Minghao
    Qiao, Kangyi
    DIGITAL SIGNAL PROCESSING, 2023, 142
  • [27] Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection
    Li, Gongyang
    Liu, Zhi
    Chen, Minyu
    Bai, Zhen
    Lin, Weisi
    Ling, Haibin
    IEEE Transactions on Image Processing, 2021, 30 : 3528 - 3542
  • [28] An adaptive guidance fusion network for RGB-D salient object detection
    Sun, Haodong
    Wang, Yu
    Ma, Xinpeng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (02) : 1683 - 1693
  • [29] Context-aware network for RGB-D salient object detection
    Liang, Fangfang
    Duan, Lijuan
    Ma, Wei
    Qiao, Yuanhua
    Miao, Jun
    Ye, Qixiang
    PATTERN RECOGNITION, 2021, 111
  • [30] CDNet: Complementary Depth Network for RGB-D Salient Object Detection
    Jin, Wen-Da
    Xu, Jun
    Han, Qi
    Zhang, Yi
    Cheng, Ming-Ming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3376 - 3390