ARAI-MVSNet: A multi-view stereo depth estimation network with adaptive depth range and depth interval

被引:3
|
作者
Zhang, Song [1 ,2 ,3 ]
Xu, Wenjia [4 ]
Wei, Zhiwei [1 ,2 ]
Zhang, Lili [1 ,2 ]
Wang, Yang [1 ,2 ]
Liu, Junyi [1 ,2 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Elect, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[4] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
关键词
Multi-view stereo; Depth estimation; Adaptive range; Adaptive interval;
D O I
10.1016/j.patcog.2023.109885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-View Stereo (MVS) is a fundamental problem in geometric computer vision which aims to reconstruct a scene using multi-view images with known camera parameters. However, the mainstream approaches represent the scene with a fixed all-pixel depth range and equal depth interval partition, which will result in inadequate utilization of depth planes and imprecise depth estimation. In this paper, we present a novel multi-stage coarse to-fine framework to achieve adaptive all-pixel depth range and depth interval. We predict a coarse depth map in the first stage, then an Adaptive Depth Range Prediction module is proposed in the second stage to zoom in the scene by leveraging the reference image and the obtained depth map in the first stage and predict a more accurate all-pixel depth range for the following stages. In the third and fourth stages, we propose an Adaptive Depth Interval Adjustment module to achieve adaptive variable interval partition for pixel-wise depth range. The depth interval distribution in this module is normalized by Z-score, which can allocate dense depth hypothesis planes around the potential ground truth depth value and vice versa to achieve more accurate depth estimation. Extensive experiments on four widely used benchmark datasets (DTU, TnT, BlendedMVS, ETH 3D) demonstrate that our model achieves state-of-the-art performance and yields competitive generalization ability. Particularly, our method achieves the highest Acc and Overall on the DTU dataset, while attaining the highest Recall and F1-score on the Tanks and Temples intermediate and advanced dataset. Moreover, our method also achieves the lowest e1 and e3 on the BlendedMVS dataset and the highest Acc and F1-score on the ETH 3D dataset, surpassing all listed methods. Project website: https://github.com/zs670980918/ARAI-MVSNet
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Multi-view depth video coding using depth view synthesis
    Na, Sang-Tae
    Oh, Kwan-Jung
    Lee, Cheon
    Ho, Yo-Sung
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 1400 - 1403
  • [42] Constraining Depth Map Geometry for Multi-View Stereo: A Dual-Depth Approach with Saddle-shaped Depth Cells
    Ye, Xinyi
    Zhao, Weiyue
    Liu, Tianqi
    Huang, Zihao
    Cao, Zhiguo
    Li, Xin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17615 - 17624
  • [43] Image depth estimation assisted by multi-view projection
    Liu, Liman
    Tian, Jinshan
    Luo, Guansheng
    Xu, Siyuan
    Zhang, Chen
    Hu, Huaifei
    Tao, Wenbing
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [44] A Benchmark and a Baseline for Robust Multi-view Depth Estimation
    Schroeppel, Philipp
    Bechtold, Jan
    Amiranashvili, Artemij
    Brox, Thomas
    2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 637 - 645
  • [45] Deep Multi-view Depth Estimation with Predicted Uncertainty
    Tong Ke
    Tien Do
    Khiem Vuong
    Sartipi, Kourosh
    Roumeliotis, Stergios, I
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 9235 - 9241
  • [46] Monocular depth estimation with multi-view attention autoencoder
    Geunho Jung
    Sang Min Yoon
    Multimedia Tools and Applications, 2022, 81 : 33759 - 33770
  • [47] Monocular depth estimation with multi-view attention autoencoder
    Jung, Geunho
    Yoon, Sang Min
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33759 - 33770
  • [48] PDE-based multi-view depth estimation
    Strecha, C
    Van Gool, L
    FIRST INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING VISUALIZATION AND TRANSMISSION, 2002, : 416 - 425
  • [49] Multi-View Stereo using Cross-View Depth Map Completion and Row-Column Depth Refinement
    Nair, Nirmal S.
    Nair, Madhu S.
    THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021), 2021, 11878
  • [50] Edge-Aware Spatial Propagation Network for Multi-view Depth Estimation
    Siyuan Xu
    Qingshan Xu
    Wanjuan Su
    Wenbing Tao
    Neural Processing Letters, 2023, 55 : 10905 - 10923