Multi-step depth enhancement refine network with multi-view stereo

被引:0
|
作者
Ding, Yuxuan [1 ]
Li, Kefeng [1 ]
Zhang, Guangyuan [1 ]
Zhu, Zhenfang [1 ]
Wang, Peng [1 ]
Wang, Zhenfei [2 ]
Fu, Chen [1 ]
Li, Guangchen [1 ]
Pan, Ke [1 ]
机构
[1] Shandong Jiaotong Univ, Coll Informat Sci & Elect Engn, Jinan, Shandong, Peoples R China
[2] Shandong Zhengyuan Yeda Environm Technol Co Ltd, Jinan, Shandong, Peoples R China
来源
PLOS ONE | 2025年 / 20卷 / 02期
关键词
D O I
10.1371/journal.pone.0314418
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper introduces an innovative multi-view stereo matching network-the Multi-Step Depth Enhancement Refine Network (MSDER-MVS), aimed at improving the accuracy and computational efficiency of high-resolution 3D reconstruction. The MSDER-MVS network leverages the potent capabilities of modern deep learning in conjunction with the geometric intuition of traditional 3D reconstruction techniques, with a particular focus on optimizing the quality of the depth map and the efficiency of the reconstruction process.Our key innovations include a dual-branch fusion structure and a Feature Pyramid Network (FPN) to effectively extract and integrate multi-scale features. With this approach, we construct depth maps progressively from coarse to fine, continuously improving depth prediction accuracy at each refinement stage. For cost volume construction, we employ a variance-based metric to integrate information from multiple perspectives, optimizing the consistency of the estimates. Moreover, we introduce a differentiable depth optimization process that iteratively enhances the quality of depth estimation using residuals and the Jacobian matrix, without the need for additional learnable parameters. This innovation significantly increases the network's convergence rate and the fineness of depth prediction.Extensive experiments on the standard DTU dataset (Aanas H, 2016) show that MSDER-MVS surpasses current advanced methods in accuracy, completeness, and overall performance metrics. Particularly in scenarios rich in detail, our method more precisely recovers surface details and textures, demonstrating its effectiveness and superiority for practical applications.Overall, the MSDER-MVS network offers a robust solution for precise and efficient 3D scene reconstruction. Looking forward, we aim to extend this approach to more complex environments and larger-scale datasets, further enhancing the model's generalization and real-time processing capabilities, and promoting the widespread deployment of multi-view stereo matching technology in practical applications.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] ARAI-MVSNet: A multi-view stereo depth estimation network with adaptive depth range and depth interval
    Zhang, Song
    Xu, Wenjia
    Wei, Zhiwei
    Zhang, Lili
    Wang, Yang
    Liu, Junyi
    PATTERN RECOGNITION, 2023, 144
  • [22] Feature distribution normalization network for multi-view stereo
    Chen, Ziyang
    Zhao, Yang
    He, Junling
    Lu, Yujie
    Cui, Zhongwei
    Li, Wenting
    Zhang, Yongjun
    VISUAL COMPUTER, 2025, 41 (01): : 409 - 421
  • [23] Multi-view Stereo Network with Attention Thin Volume
    Wan, Zihang
    Xu, Chao
    Hu, Jing
    Xiao, Jian
    Meng, Zhaopeng
    Chen, Jitai
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 410 - 423
  • [24] Multi-View Stereo Network With Gaussian Distribution Iteration
    Zhang, Xiaohan
    Li, Shikun
    IEEE ACCESS, 2023, 11 : 53359 - 53372
  • [25] Point-Based Multi-View Stereo Network
    Chen, Rui
    Han, Songfang
    Xu, Jing
    Su, Hao
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1538 - 1547
  • [26] Multi-view Stereo Network with Attention Thin Volume
    Wan, Zihang
    Xu, Chao
    Hu, Jing
    Xiao, Jian
    Meng, Zhaopeng
    Chen, Jitai
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13631 LNCS : 410 - 423
  • [27] Refractive Multi-view Stereo
    Cassidy, Matthew
    Melou, Jean
    Queau, Yvain
    Lauze, Francois
    Durou, Jean-Denis
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 384 - 393
  • [28] Polarimetric Multi-View Stereo
    Cui, Zhaopeng
    Gu, Jinwei
    Shi, Boxin
    Tan, Ping
    Kautz, Jan
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 369 - 378
  • [29] Multi-View Stereo: A Tutorial
    Furukawa, Yasutaka
    Hernandez, Carlos
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2013, 9 (1-2): : 1 - 148
  • [30] Multi-view multi-exposure stereo
    Troccoli, Alejandro
    Kang, Sing Bing
    Seitz, Steve
    THIRD INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING, VISUALIZATION, AND TRANSMISSION, PROCEEDINGS, 2007, : 861 - 868