ARAI-MVSNet: A multi-view stereo depth estimation network with adaptive depth range and depth interval

Cited by: 3
Authors
Zhang, Song [1, 2, 3]
Xu, Wenjia [4]
Wei, Zhiwei [1, 2]
Zhang, Lili [1, 2]
Wang, Yang [1, 2]
Liu, Junyi [1, 2]
Affiliations
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Elect, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[4] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
Keywords
Multi-view stereo; Depth estimation; Adaptive range; Adaptive interval;
DOI
10.1016/j.patcog.2023.109885
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multi-View Stereo (MVS) is a fundamental problem in geometric computer vision that aims to reconstruct a scene from multi-view images with known camera parameters. However, mainstream approaches represent the scene with a fixed all-pixel depth range and an equal depth interval partition, which results in inadequate utilization of depth planes and imprecise depth estimation. In this paper, we present a novel multi-stage coarse-to-fine framework that achieves an adaptive all-pixel depth range and adaptive depth intervals. We predict a coarse depth map in the first stage. In the second stage, an Adaptive Depth Range Prediction module zooms in on the scene by leveraging the reference image and the depth map obtained in the first stage, and predicts a more accurate all-pixel depth range for the following stages. In the third and fourth stages, we propose an Adaptive Depth Interval Adjustment module to achieve an adaptive variable interval partition of the pixel-wise depth range. The depth interval distribution in this module is normalized by Z-score, which allocates dense depth hypothesis planes around the potential ground-truth depth value and sparser planes elsewhere, yielding more accurate depth estimation. Extensive experiments on four widely used benchmark datasets (DTU, TnT, BlendedMVS, ETH 3D) demonstrate that our model achieves state-of-the-art performance and competitive generalization ability. In particular, our method achieves the highest Acc and Overall on the DTU dataset, and the highest Recall and F1-score on the Tanks and Temples intermediate and advanced datasets. Moreover, our method achieves the lowest e1 and e3 on the BlendedMVS dataset and the highest Acc and F1-score on the ETH 3D dataset, surpassing all listed methods. Project website: https://github.com/zs670980918/ARAI-MVSNet
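The idea of a variable interval partition can be illustrated with a minimal sketch for a single pixel. This is not the authors' implementation: the abstract does not give the formula, so the function name `adaptive_depth_hypotheses`, the `sharpness` parameter, and the exponential mapping from Z-scores to interval widths are all assumptions made for illustration. The sketch only shows how Z-score normalization of distances to a coarse depth estimate can place hypothesis planes more densely near that estimate.

```python
import numpy as np

def adaptive_depth_hypotheses(d_min, d_max, d_est, num_planes=8, sharpness=1.0):
    """Hypothetical sketch: non-uniform depth planes for one pixel.

    d_min, d_max : depth range for the pixel
    d_est        : coarse depth estimate from an earlier stage
    sharpness    : assumed knob controlling how strongly planes cluster
    """
    # Start from a uniform partition and take the midpoint of each interval.
    uniform = np.linspace(d_min, d_max, num_planes)
    mids = 0.5 * (uniform[:-1] + uniform[1:])

    # Distance of each interval from the coarse estimate, Z-score normalized.
    dist = np.abs(mids - d_est)
    z = (dist - dist.mean()) / (dist.std() + 1e-8)

    # Assumed mapping: intervals far from the estimate become wider.
    weights = np.exp(sharpness * z)
    intervals = weights / weights.sum() * (d_max - d_min)

    # Accumulate the interval widths to obtain the plane positions.
    return d_min + np.concatenate([[0.0], np.cumsum(intervals)])

planes = adaptive_depth_hypotheses(400.0, 900.0, d_est=600.0)
widths = np.diff(planes)
```

Here the narrowest interval is the one whose midpoint lies closest to `d_est`, and the planes still span the full range from `d_min` to `d_max`.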
Pages: 10