Depth Enhanced Cross-Modal Cascaded Network for RGB-D Salient Object Detection

被引：4

作者：

Zhao, Zhengyun ^{[1
]}

Huang, Ziqing ^{[1
]}

Chai, Xiuli ^{[1
]}

Wang, Jun ^{[1
]}

机构：

[1] Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China

来源：

NEURAL PROCESSING LETTERS | 2023年 / 55卷 / 01期

基金：

中国国家自然科学基金;

关键词：

RGB-D salient object detection; Convolutional neural network; Cross-modal fusion; Depth modal enhancement; FUSION; CONSISTENT; IMAGE;

D O I：

10.1007/s11063-022-10886-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep modal can provide supplementary features for RGB images, which deeply improves the performance of salient object detection (SOD). However, depth images are disturbed by external factors during the acquisition process, resulting in low-quality acquisitions. Moreover, there are differences between the RGB and depth modals, so simply fusing the two modals cannot fully complement the depth information into the RGB modal. To enhance the quality of the depth image and integrate the cross-modal information effectively, we propose a depth enhanced cross-modal cascaded network (DCCNet) for RGB-D SOD. The entire cascaded network includes a depth cascaded branch, a RGB cascaded branch and a cross-modal fusion strategy. In the depth cascaded branch, we design a depth preprocessing algorithm to enhance the quality of the depth image. And in the process of depth feature extraction, we adopt four cascaded cross-modal guided modules to guide the RGB feature extraction process. In the RGB cascaded branch, we design five cascaded residual adaptive selection modules to output the RGB image feature extraction in each stage. In the cross-modal fusion strategy, a cross-modal channel-wise refinement is adopted to fuse the top-level features of the different modal feature branches. Finally, the multiscale loss is adopted to optimize the network training. Experimental results on six common RGB-D SOD datasets show that the performance of the proposed DCCNet is comparable to that of the state-of-the-art RGB-D SOD methods.

引用

页码：361 / 384

页数：24

共 50 条

[41] ECW-EGNet: Exploring Cross-Modal Weighting and Edge-Guided Decoder Network for RGB-D Salient Object Detection
Xia, Chenxing
Yang, Feng
Duan, Songsong
Gao, Xiuju
Ge, Bin
Li, Kuan-Ching
Fang, Xianjin
Zhang, Yan
Yang, Ke
COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2024, 21 (03)
[42] RGB depth salient object detection via cross-modal attention and boundary feature guidance
Meng, Lingbing
Yuan, Mengya
Shi, Xuehan
Zhang, Le
Liu, Qingqing
Ping, Dai
Wu, Jinhua
Cheng, Fei
IET COMPUTER VISION, 2024, 18 (02) : 273 - 288
[43] Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for RGB-D salient object detection
Chen, Hao
Li, Youfu
Su, Dan
PATTERN RECOGNITION, 2019, 86 : 376 - 385
[44] Discriminative Cross-Modal Transfer Learning and Densely Cross-Level Feedback Fusion for RGB-D Salient Object Detection
Chen, Hao
Li, Youfu
Su, Dan
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (11) : 4808 - 4820
[45] Modal-Adaptive Gated Recoding Network for RGB-D Salient Object Detection
Zhu, Jinchao
Zhang, Xiaoyu
Fang, Xian
Dong, Feng
Qiu, Yu
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 359 - 363
[46] Three-stream RGB-D salient object detection network based on cross-level and cross-modal dual-attention fusion
Meng, Lingbing
Yuan, Mengya
Shi, Xuehan
Liu, Qingqing
Cheng, Fei
Li, Lingli
IET IMAGE PROCESSING, 2023, 17 (11) : 3292 - 3308
[47] Absolute and Relative Depth-Induced Network for RGB-D Salient Object Detection
Kong, Yuqiu
Wang, He
Kong, Lingwei
Liu, Yang
Yao, Cuili
Yin, Baocai
SENSORS, 2023, 23 (07)
[48] Depth-aware inverted refinement network for RGB-D salient object detection
Gao, Lina
Liu, Bing
Fu, Ping
Xu, Mingzhu
NEUROCOMPUTING, 2023, 518 : 507 - 522
[49] MULTI-MODAL TRANSFORMER FOR RGB-D SALIENT OBJECT DETECTION
Song, Peipei
Zhang, Jing
Koniusz, Piotr
Barnes, Nick
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2466 - 2470
[50] RGB-D Grasp Detection via Depth Guided Learning with Cross-modal Attention
Qin, Ran
Ma, Haoxiang
Ciao, Boyang
Huang, Di
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 8003 - 8009

← 1 2 3 4 5 →