Bidirectional feature learning network for RGB-D salient object detection
被引:1
|
作者:
Niu, Ye
论文数: 0引用数: 0
h-index: 0
机构:Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
Niu, Ye
Zhou, Sanping
论文数: 0引用数: 0
h-index: 0
机构:
Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R ChinaXi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
Zhou, Sanping
[1
]
Dong, Yonghao
论文数: 0引用数: 0
h-index: 0
机构:Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
Dong, Yonghao
Wang, Le
论文数: 0引用数: 0
h-index: 0
机构:Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
Wang, Le
Wang, Jinjun
论文数: 0引用数: 0
h-index: 0
机构:Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
Wang, Jinjun
Zheng, Nanning
论文数: 0引用数: 0
h-index: 0
机构:Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
Zheng, Nanning
机构:
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
RGB-D salient object detection aims to perform the pixel-wise localization of salient objects from both RGB and depth images, whose challenge mainly comes from how to learn complementary features from each modality. Existing works often use increasingly large models for performance enhancement, which need large memory and time consumption in practice. In this paper, we propose a simple yet effective Bidirectional Feature Learning Network (BFLNet) for RGB-D salient object detection under limited memory and time conditions. To achieve accurate performance with lightweight backbone networks, an effective Bidirectional Feature Fusion (BFF) module is designed to merge features from both RGB and depth streams, in which the crossmodal fusions and cross-scale fusions are jointly conducted to fuse the immediate features in multiple scales and multiple modals. What is more, a simple Dual Consistency Loss (DCL) function is designed to prompt cross -modal fusion by keeping the consistency between cross -modal target predictions. Extensive experiments on four benchmark datasets demonstrate that our method has achieved the state-of-the-art performance with high efficiency in RGB-D salient object detection. Code will be available at https://github.com/nightskynostar/BFLNet.
机构:
Sichuan Univ, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China
Sichuan Univ, Natl Key Lab Fundamental Sci Synthet Vis, Chengdu 610017, Sichuan, Peoples R ChinaSichuan Univ, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China
Fu, Keren
Fan, Deng-Ping
论文数: 0引用数: 0
h-index: 0
机构:
Nankai Univ, Coll Comp Sci, Tianjin 300350, Peoples R ChinaSichuan Univ, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China
Fan, Deng-Ping
Ji, Ge-Peng
论文数: 0引用数: 0
h-index: 0
机构:
Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R ChinaSichuan Univ, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China
Ji, Ge-Peng
Zhao, Qijun
论文数: 0引用数: 0
h-index: 0
机构:
Sichuan Univ, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China
Sichuan Univ, Natl Key Lab Fundamental Sci Synthet Vis, Chengdu 610017, Sichuan, Peoples R ChinaSichuan Univ, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China
Zhao, Qijun
Shen, Jianbing
论文数: 0引用数: 0
h-index: 0
机构:
Univ Macau, Dept Comp & Informat Sci, State Key Lab Internet Things Smart City, Macau, Peoples R ChinaSichuan Univ, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China
Shen, Jianbing
Zhu, Ce
论文数: 0引用数: 0
h-index: 0
机构:
Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Sichuan, Peoples R ChinaSichuan Univ, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China