ARAI-MVSNet: A multi-view stereo depth estimation network with adaptive depth range and depth interval

被引:3
|
作者
Zhang, Song [1 ,2 ,3 ]
Xu, Wenjia [4 ]
Wei, Zhiwei [1 ,2 ]
Zhang, Lili [1 ,2 ]
Wang, Yang [1 ,2 ]
Liu, Junyi [1 ,2 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Elect, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[4] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
关键词
Multi-view stereo; Depth estimation; Adaptive range; Adaptive interval;
D O I
10.1016/j.patcog.2023.109885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-View Stereo (MVS) is a fundamental problem in geometric computer vision which aims to reconstruct a scene using multi-view images with known camera parameters. However, the mainstream approaches represent the scene with a fixed all-pixel depth range and equal depth interval partition, which will result in inadequate utilization of depth planes and imprecise depth estimation. In this paper, we present a novel multi-stage coarse to-fine framework to achieve adaptive all-pixel depth range and depth interval. We predict a coarse depth map in the first stage, then an Adaptive Depth Range Prediction module is proposed in the second stage to zoom in the scene by leveraging the reference image and the obtained depth map in the first stage and predict a more accurate all-pixel depth range for the following stages. In the third and fourth stages, we propose an Adaptive Depth Interval Adjustment module to achieve adaptive variable interval partition for pixel-wise depth range. The depth interval distribution in this module is normalized by Z-score, which can allocate dense depth hypothesis planes around the potential ground truth depth value and vice versa to achieve more accurate depth estimation. Extensive experiments on four widely used benchmark datasets (DTU, TnT, BlendedMVS, ETH 3D) demonstrate that our model achieves state-of-the-art performance and yields competitive generalization ability. Particularly, our method achieves the highest Acc and Overall on the DTU dataset, while attaining the highest Recall and F1-score on the Tanks and Temples intermediate and advanced dataset. Moreover, our method also achieves the lowest e1 and e3 on the BlendedMVS dataset and the highest Acc and F1-score on the ETH 3D dataset, surpassing all listed methods. Project website: https://github.com/zs670980918/ARAI-MVSNet
引用
收藏
页数:10
相关论文
共 50 条
  • [31] PSP-MVSNet: Deep Patch-Based Similarity Perceptual for Multi-view Stereo Depth Inference
    Jie, Leiping
    Zhang, Hui
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 316 - 328
  • [32] EFFICIENT EDGE, MOTION AND DEPTH-RANGE ADAPTIVE PROCESSING FOR ENHANCEMENT OF MULTI-VIEW DEPTH MAP SEQUENCES
    Ekmekcioglu, Erhan
    Velisavljevic, Vladan
    Worrall, Stewart T.
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3537 - +
  • [33] Context-Aware Multi-view Stereo Network for Efficient Edge-Preserving Depth Estimation
    Su, Wanjuan
    Tao, Wenbing
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
  • [34] Non-parametric Depth Distribution Modelling based Depth Inference for Multi-view Stereo
    Yang, Jiayu
    Alvarez, Jose M.
    Liu, Miaomiao
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8616 - 8624
  • [35] Multi-View Stereo and Depth Priors Guided NeRF for View Synthesis
    Deng, Wang
    Zhang, Xuetao
    Guo, Yu
    Lu, Zheng
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3922 - 3928
  • [36] Unsupervised Multi-View Constrained Convolutional Network for Accurate Depth Estimation
    Zhang, Yuyang
    Xu, Shibiao
    Wu, Baoyuan
    Shi, Jian
    Meng, Weiliang
    Zhang, Xiaopeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7019 - 7031
  • [37] Bundled Depth-Map Merging for Multi-View Stereo
    Li, Jianguo
    Li, Eric
    Chen, Yurong
    Xu, Lin
    Zhang, Yimin
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2769 - 2776
  • [38] Recurrent Multi-view Stereo Depth Inference with Pyramid of Images
    Wang, Xiaobao
    Dong, Enzeng
    Tong, Jigang
    Sun, Zhe
    Li, Wenyu
    Duan, Feng
    PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, : 259 - 263
  • [39] Multi-View Stereo via Geometric Expansion and Depth Refinement
    Liu, Tao
    Yuan, Ding
    Zhao, Hongwei
    Yin, Jihao
    2017 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE ROBIO 2017), 2017, : 555 - 560
  • [40] Multiple Candidates and Multiple Constraints Based Accurate Depth Estimation for Multi-View Stereo
    Zhang, Chao
    Zhou, Fugen
    Xue, Bindang
    EIGHTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2016), 2017, 10225