Learning content-aware feature fusion for guided depth map super-resolution

被引:0
|
作者
Zuo, Yifan [1 ]
Wang, Hao [1 ]
Xu, Yaping [1 ]
Huang, Huimin [1 ]
Huang, Xiaoshui [2 ]
Xia, Xue [1 ]
Fang, Yuming [1 ]
机构
[1] Jiangxi Univ Finance & Econ, 665 Yuping West St, Nanchang 330013, Jiangxi, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Yunjing Rd 701, Shanghai 200232, Peoples R China
关键词
Convolutional neural network; Joint trilateral filter; Guided depth map super-resolution; Content-dependent network;
D O I
10.1016/j.image.2024.117140
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
RGB-D data including paired RGB color images and depth maps is widely used in downstream computer vision tasks. However, compared with the acquisition of high -resolution color images, the depth maps captured by consumer-level sensors are always in low resolution. Within decades of research, the most state -of -the -art (SOTA) methods of depth map super -resolution cannot adaptively tune the guidance fusion for all feature positions by channel-wise feature concatenation with spatially sharing convolutional kernels. This paper proposes JTFNet to resolve this issue, which simulates the traditional Joint Trilateral Filter (JTF). Specifically, a novel JTF block is introduced to adaptively tune the fusion pattern between the color features and the depth features for all feature positions. Moreover, based on the variant of JTF block whose target features and guidance features are in the cross-scale shape, the fusion for depth features is performed in a bi-directional way. Therefore, the error accumulation along scales can be effectively mitigated by iteratively HR feature guidance. Compared with the SOTA methods, the sufficient experiment is conducted on the mainstream synthetic datasets and real datasets, i.e., Middlebury, NYU and ToF-Mark, which shows remarkable improvement of our JTFNet.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Content-aware frame interpolation (CAFI): deep learning-based temporal super-resolution for fast bioimaging
    Priessner, Martin
    Gaboriau, David C. A.
    Sheridan, Arlo
    Lenn, Tchern
    Garzon-Coral, Carlos
    Dunn, Alexander R.
    Chubb, Jonathan R.
    Tousley, Aidan M.
    Majzner, Robbie G.
    Manor, Uri
    Vilar, Ramon
    Laine, Romain F.
    NATURE METHODS, 2024, 21 (02) : 322 - 330
  • [22] Content-aware frame interpolation (CAFI): deep learning-based temporal super-resolution for fast bioimaging
    Martin Priessner
    David C. A. Gaboriau
    Arlo Sheridan
    Tchern Lenn
    Carlos Garzon-Coral
    Alexander R. Dunn
    Jonathan R. Chubb
    Aidan M. Tousley
    Robbie G. Majzner
    Uri Manor
    Ramon Vilar
    Romain F. Laine
    Nature Methods, 2024, 21 : 322 - 330
  • [23] Depth-guided learning light field angular super-resolution with edge-aware inpainting
    Xia Liu
    Minghui Wang
    Anzhi Wang
    Xiyao Hua
    Shanshan Liu
    The Visual Computer, 2022, 38 : 2839 - 2851
  • [24] Depth-guided learning light field angular super-resolution with edge-aware inpainting
    Liu, Xia
    Wang, Minghui
    Wang, Anzhi
    Hua, Xiyao
    Liu, Shanshan
    VISUAL COMPUTER, 2022, 38 (08): : 2839 - 2851
  • [25] Hierarchical Features Driven Residual Learning for Depth Map Super-Resolution
    Guo, Chunle
    Li, Chongyi
    Guo, Jichang
    Cong, Runmin
    Fu, Huazhu
    Han, Ping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (05) : 2545 - 2557
  • [26] Multiscale Attention Fusion for Depth Map Super-Resolution Generative Adversarial Networks
    Xu, Dan
    Fan, Xiaopeng
    Gao, Wen
    ENTROPY, 2023, 25 (06)
  • [27] CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input
    Tian, Senmao
    Lu, Ming
    Liu, Jiaming
    Guo, Yandong
    Chen, Yurong
    Zhang, Shunli
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1756 - 1765
  • [28] Learning Piecewise Planar Representation for RGB Guided Depth Super-Resolution
    Xu, Ruikang
    Yao, Mingde
    Guan, Yuanshen
    Xiong, Zhiwei
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2024, 10 : 1266 - 1279
  • [29] BridgeNet: A Joint Learning Network of Depth Map Super-Resolution and Monocular Depth Estimation
    Tang, Qi
    Cong, Runmin
    Sheng, Ronghui
    He, Lingzhi
    Zhang, Dan
    Zhao, Yao
    Kwong, Sam
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2148 - 2157
  • [30] Learning Efficient Stereo Matching Network With Depth Discontinuity Aware Super-Resolution
    Guo, Chenggang
    Chen, Dongyi
    Huang, Zhiqi
    IEEE ACCESS, 2019, 7 : 159712 - 159723