Multi-step depth enhancement refine network with multi-view stereo

被引:0
|
作者
Ding, Yuxuan [1 ]
Li, Kefeng [1 ]
Zhang, Guangyuan [1 ]
Zhu, Zhenfang [1 ]
Wang, Peng [1 ]
Wang, Zhenfei [2 ]
Fu, Chen [1 ]
Li, Guangchen [1 ]
Pan, Ke [1 ]
机构
[1] Shandong Jiaotong Univ, Coll Informat Sci & Elect Engn, Jinan, Shandong, Peoples R China
[2] Shandong Zhengyuan Yeda Environm Technol Co Ltd, Jinan, Shandong, Peoples R China
来源
PLOS ONE | 2025年 / 20卷 / 02期
关键词
D O I
10.1371/journal.pone.0314418
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper introduces an innovative multi-view stereo matching network-the Multi-Step Depth Enhancement Refine Network (MSDER-MVS), aimed at improving the accuracy and computational efficiency of high-resolution 3D reconstruction. The MSDER-MVS network leverages the potent capabilities of modern deep learning in conjunction with the geometric intuition of traditional 3D reconstruction techniques, with a particular focus on optimizing the quality of the depth map and the efficiency of the reconstruction process.Our key innovations include a dual-branch fusion structure and a Feature Pyramid Network (FPN) to effectively extract and integrate multi-scale features. With this approach, we construct depth maps progressively from coarse to fine, continuously improving depth prediction accuracy at each refinement stage. For cost volume construction, we employ a variance-based metric to integrate information from multiple perspectives, optimizing the consistency of the estimates. Moreover, we introduce a differentiable depth optimization process that iteratively enhances the quality of depth estimation using residuals and the Jacobian matrix, without the need for additional learnable parameters. This innovation significantly increases the network's convergence rate and the fineness of depth prediction.Extensive experiments on the standard DTU dataset (Aanas H, 2016) show that MSDER-MVS surpasses current advanced methods in accuracy, completeness, and overall performance metrics. Particularly in scenarios rich in detail, our method more precisely recovers surface details and textures, demonstrating its effectiveness and superiority for practical applications.Overall, the MSDER-MVS network offers a robust solution for precise and efficient 3D scene reconstruction. Looking forward, we aim to extend this approach to more complex environments and larger-scale datasets, further enhancing the model's generalization and real-time processing capabilities, and promoting the widespread deployment of multi-view stereo matching technology in practical applications.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Adaptive Multi-Modality Residual Network for Compression Distorted Multi-View Depth Video Enhancement
    Chen, Siqi
    Liu, Qiong
    Yang, You
    IEEE ACCESS, 2020, 8 (08): : 97072 - 97081
  • [32] ICV-Net: An identity cost volume network for multi-view stereo depth inference
    He, Pengpeng
    Wang, Yueju
    Wen, Yangsen
    Hu, Yong
    He, Wei
    PATTERN RECOGNITION, 2025, 162
  • [33] Self-supervised Multi-view Stereo via Inter and Intra Network Pseudo Depth
    Qiu, Ke
    Lai, Yawen
    Liu, Shiyi
    Wang, Ronggang
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2305 - 2313
  • [34] Self-supervised Learning of Depth Inference for Multi-view Stereo
    Yang, Jiayu
    Alvarez, Jose M.
    Liu, Miaomiao
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7522 - 7530
  • [35] Multi-view Stereo by Fusing Monocular and a Combination of Depth Representation Methods
    Yu, Fanqi
    Sun, Xinyang
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 298 - 309
  • [36] Cost Volume Pyramid Based Depth Inference for Multi-View Stereo
    Yang, Jiayu
    Mao, Wei
    Alvarez, Jose M.
    Liu, Miaomiao
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4876 - 4885
  • [37] Cost Volume Pyramid Based Depth Inference for Multi-View Stereo
    Yang, Jiayu
    Mao, Wei
    Alvarez, Jose
    Liu, Miaomiao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 4748 - 4760
  • [38] IAFMVS: Iterative Depth Estimation with Adaptive Features for Multi-View Stereo
    Zhao, Guyu
    Wei, Huyixin
    He, Hongdou
    NEUROCOMPUTING, 2025, 629
  • [39] Expansion-Based Depth Map Estimation for Multi-View Stereo
    Song, Peng
    Wu, Xiaojun
    Wang, Michael Yu
    Wu, Jianhuang
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 3213 - 3218
  • [40] Long-range Attention Network for Multi-View Stereo
    Zhang, Xudong
    Hu, Yutao
    Wang, Haochen
    Cao, Xianbin
    Zhang, Baochang
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3781 - 3790