CSNet: a ConvNeXt-based Siamese network for RGB-D salient object detection

Cited by: 0

Authors:

Yunhua Zhang

Hangxu Wang

Gang Yang

Jianhao Zhang

Congjin Gong

Yutao Wang

Affiliations:

[1] Northeastern University

[2] DUT Artificial Intelligence Institute

Source:

The Visual Computer | 2024 / Vol. 40

Keywords:

Salient object detection; Siamese network; ConvNeXt; RGB-D SOD; Multi-modality;

DOI:

Not available

Abstract:

Global context is critical for locating salient objects in salient object detection (SOD). However, the convolution operation in CNNs has a local receptive field and therefore cannot capture long-range global information. Recent studies have shown that modernized CNN models with large-kernel convolutions, such as ConvNeXt, can effectively extend the receptive field. Building on this, this paper explores the potential of large-kernel CNNs for the SOD task. Inspired by the information shared between RGB and depth images of salient objects, we propose a ConvNeXt-based Siamese network with shared weight parameters. This structural design can effectively reduce the number of parameters without sacrificing performance. Furthermore, a depth information preprocessing module is proposed to minimize the impact of low-quality depth images on the predicted saliency maps. For cross-modal feature interaction, a dynamic fusion module is designed to dynamically enhance cross-modal complementarity. Extensive experiments on six benchmark datasets demonstrate the outstanding performance of the proposed method against 14 state-of-the-art RGB-D methods. Our code will be released at https://github.com/zyh5119232/CSNet.
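
The two structural claims in the abstract (larger kernels widen the receptive field, and a shared-weight Siamese encoder needs fewer parameters than two separate streams) can be illustrated with simple arithmetic. The sketch below is not the authors' code: the helper names, the 96-channel width, and the four-layer encoder are hypothetical values chosen only for illustration. It relies on the standard receptive-field recurrence for stacked convolutions and on the usual parameter count of a grouped 2D convolution.

```python
def receptive_field(layers):
    """Receptive field, in input pixels, of a stack of conv layers.

    layers: list of (kernel_size, stride) tuples applied in order.
    """
    rf, jump = 1, 1
    for kernel, stride in layers:
        rf += (kernel - 1) * jump  # each layer widens the field by (k - 1) * current jump
        jump *= stride             # strides compound the spacing between outputs
    return rf


def conv_params(in_ch, out_ch, kernel, groups=1):
    """Number of weights plus biases in a 2D convolution."""
    return (in_ch // groups) * out_ch * kernel * kernel + out_ch


# Three stacked stride-1 convolutions: 3x3 kernels versus 7x7 kernels.
small_kernel_rf = receptive_field([(3, 1)] * 3)
large_kernel_rf = receptive_field([(7, 1)] * 3)

# Hypothetical toy encoder: four 7x7 depthwise convolutions on 96 channels.
encoder_params = sum(conv_params(96, 96, 7, groups=96) for _ in range(4))

# A two-stream design keeps one encoder per modality (RGB and depth);
# a Siamese design reuses a single set of weights for both inputs.
two_stream_params = 2 * encoder_params
siamese_params = encoder_params

print(small_kernel_rf, large_kernel_rf)   # receptive fields of the two stacks
print(two_stream_params, siamese_params)  # encoder parameters under each design
```

With these assumed sizes, the shared-weight encoder carries half the encoder parameters of the two-stream variant. Any modules that stay modality-specific, such as the depth preprocessing and fusion modules the abstract mentions, would add to that count and are not modeled here.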

Pages: 1805-1823

Page count: 18