RGB-D Image Saliency Detection Based on Multi-modal Feature-fused Supervision

Cited by: 5

Authors:

Liu Zhengyi [1]

Duan Quntao [1]

Shi Song [1]

Zhao Peng [1]

Affiliations:

[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China

Source:

JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY | 2020, Vol. 42, No. 04

Funding:

National Natural Science Foundation of China;

Keywords:

RGB-D saliency detection; Convolutional Neural Network (CNN); Multi-modal; Supervision; OBJECT DETECTION; NETWORK;

DOI:

10.11999/JEIT190297

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

RGB-D saliency detection identifies the most visually salient target regions in a paired RGB and depth image. Existing two-stream networks treat RGB and depth data equally and extract features from both in almost the same way. However, lower-layer depth features contain considerable noise, so image features are not well characterized. This paper therefore proposes an RGB-D saliency detection network based on multi-modal feature-fused supervision. RGB and depth data are learned independently by two streams, and a double-side supervision module is applied to each stream to obtain a saliency map at each layer. A multi-modal feature-fused module then fuses the higher-dimensional RGB and depth information of the last three layers to generate a saliency prediction. Finally, lower-layer information is fused to generate the final saliency map. Experiments on three public datasets show that the proposed network achieves better performance and stronger robustness than current RGB-D saliency detection models.
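The fusion order described in the abstract (side outputs per layer, multi-modal fusion of the last three layers, then fusion of lower-layer information) can be illustrated with a toy sketch. This is not the paper's network: the abstract does not specify the fusion operators, so a plain element-wise mean stands in for the learned modules, and using only RGB side outputs in the lower layers is an assumption prompted by the remark that lower-layer depth features are noisy. The names `average_maps`, `fuse_saliency`, and `n_high` are hypothetical.

```python
from typing import List

Map = List[List[float]]  # toy 2-D saliency map with values in [0, 1]


def average_maps(maps: List[Map]) -> Map:
    """Element-wise mean of equally sized maps (stand-in for a learned fusion)."""
    rows, cols = len(maps[0]), len(maps[0][0])
    return [
        [sum(m[r][c] for m in maps) / len(maps) for c in range(cols)]
        for r in range(rows)
    ]


def fuse_saliency(rgb_side: List[Map], depth_side: List[Map], n_high: int = 3) -> Map:
    """Combine per-layer side outputs of the two streams.

    rgb_side / depth_side: side-output saliency maps ordered low -> high layer.
    Assumption: depth contributes only to the top `n_high` layers.
    """
    if len(rgb_side) != len(depth_side):
        raise ValueError("both streams must have the same number of layers")
    if not 1 <= n_high <= len(rgb_side):
        raise ValueError("n_high must be between 1 and the number of layers")
    # Step 1: multi-modal fusion of the higher layers (RGB + depth).
    high = average_maps(rgb_side[-n_high:] + depth_side[-n_high:])
    # Step 2: fold in lower-layer information (RGB only, by assumption).
    low = rgb_side[:-n_high]
    return average_maps([high] + low) if low else high
```

Calling `fuse_saliency(rgb_maps, depth_maps)` with five side outputs per stream fuses layers 3 to 5 across both modalities first and then combines the result with the RGB side outputs of layers 1 and 2. In a real model each mean would be replaced by trainable layers with their own supervision.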

Pages: 997-1004

Page count: 8
