PSP-MVSNet: Deep Patch-Based Similarity Perceptual for Multi-view Stereo Depth Inference

Cited by: 2
Authors
Jie, Leiping [1, 2]
Zhang, Hui [2 ]
Affiliations
[1] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[2] BNU HKBU United Int Coll, Guangdong Key Lab Interdisciplinary Res & Applica, Zhuhai, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Depth estimation; Patch-based similarity; Dynamic depth range; Multi-view stereo;
DOI
10.1007/978-3-031-15919-0_27
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes PSP-MVSNet for the depth inference problem in multi-view stereo (MVS). We first introduce a novel patch-based similarity perceptual (PSP) module for effectively constructing the 3D cost volume. Unlike previous methods that use variance-based operators to fuse the feature volumes of different views, our method uses a cosine similarity measure to calculate matching scores for pairs of deep feature vectors and then treats these scores as weights for constructing the 3D cost volume. This is based on the important observation that many performance degradation factors, e.g., illumination changes or occlusions, lead to pixel differences between multi-view images. We demonstrate that a patch-based cosine similarity can be used as explicit supervision for feature learning and can help speed up convergence. Furthermore, to adaptively set different depth ranges for different pixels, we extend an existing dynamic depth range searching method with a simple yet effective improvement. This improved searching method lets us train our model in an end-to-end manner and further improves its performance. Experimental results show that our method achieves state-of-the-art performance on the DTU dataset and comparable results on the intermediate set of the Tanks and Temples dataset.
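The similarity-weighted fusion described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the tensor layout, the choice to average per-pixel cosine scores over a square window, and the clipping of negative scores to zero are all assumptions made here for illustration. It only shows how cosine matching scores between reference features and warped source features might act as weights when fusing several views into one cost volume.

```python
import numpy as np


def patch_cosine_similarity(ref, src, patch=3, eps=1e-8):
    """Hypothetical helper: cosine similarity averaged over a patch window.

    ref: (C, H, W) reference-view features.
    src: (C, D, H, W) source-view features already warped to the reference
         view for D depth hypotheses (the warping itself is not shown).
    Returns a (D, H, W) array of scores in [-1, 1].
    """
    # Normalise feature vectors along the channel axis.
    ref_n = ref / (np.linalg.norm(ref, axis=0, keepdims=True) + eps)
    src_n = src / (np.linalg.norm(src, axis=0, keepdims=True) + eps)
    # Per-pixel cosine similarity for every depth hypothesis.
    sim = (ref_n[:, None] * src_n).sum(axis=0)  # (D, H, W)
    # Assumption: "patch-based" is modelled as a box average over a window.
    r = patch // 2
    padded = np.pad(sim, ((0, 0), (r, r), (r, r)), mode="edge")
    h, w = sim.shape[1:]
    out = np.zeros_like(sim)
    for dy in range(patch):
        for dx in range(patch):
            out += padded[:, dy:dy + h, dx:dx + w]
    return out / (patch * patch)


def weighted_cost_volume(ref, srcs, patch=3, eps=1e-8):
    """Hypothetical fusion: similarity scores weight each source volume."""
    c, h, w = ref.shape
    d = srcs[0].shape[1]
    num = np.zeros((c, d, h, w))
    den = np.zeros((d, h, w))
    for src in srcs:
        # Assumption: negative similarities contribute no weight.
        wgt = np.clip(patch_cosine_similarity(ref, src, patch), 0.0, None)
        num += wgt[None] * src
        den += wgt
    return num / (den[None] + eps)


rng = np.random.default_rng(0)
ref = rng.normal(size=(4, 5, 6))
# A source volume identical to the reference at every depth hypothesis.
src = np.repeat(ref[:, None], 3, axis=1)
sim = patch_cosine_similarity(ref, src)
vol = weighted_cost_volume(ref, [src, src])
```

In this toy setup a source view that matches the reference gets a score close to 1 everywhere, while a view whose features disagree (for example because of occlusion or an illumination change) would receive a lower weight and contribute less to the fused volume.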
Pages: 316-328
Number of pages: 13
Related papers
50 in total
  • [21] 360MVSNet: Deep Multi-view Stereo Network with 360° Images for Indoor Scene Reconstruction
    Chiu, Ching-Ya
    Wu, Yu-Ting
    Shen, I-Chao
    Chuang, Yung-Yu
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3056 - 3065
  • [22] Depth Estimation in Multi-View Stereo Based on Image Pyramid
    Xu, Hanfei
    Cai, Yangang
    Wang, Ronggang
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 345 - 349
  • [23] Multi-view stereo for large-scale scene reconstruction with MRF-based depth inference
    Sun, Shang
    Xu, Dan
    Wu, Hao
    Ying, Haocong
    Mou, Yurui
    COMPUTERS & GRAPHICS-UK, 2022, 106 : 248 - 258
  • [24] Multi-view stereo algorithms based on deep learning: a survey
    Huang, Hongbo
    Yan, Xiaoxu
    Zheng, Yaolin
    He, Jiayu
    Xu, Longfei
    Qin, Dechun
    Multimedia Tools and Applications, 2025, 84 (06) : 2877 - 2908
  • [25] Expansion-Based Depth Map Estimation for Multi-View Stereo
    Song, Peng
    Wu, Xiaojun
    Wang, Michael Yu
    Wu, Jianhuang
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 3213 - 3218
  • [26] Edge aware depth inference for large-scale aerial building multi-view stereo
    Zhang, Song
    Wei, Zhiwei
    Xu, Wenjia
    Zhang, Lili
    Wang, Yang
    Zhang, Jinming
    Liu, Junyi
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 207 : 27 - 42
  • [27] ICV-Net: An identity cost volume network for multi-view stereo depth inference
    He, Pengpeng
    Wang, Yueju
    Wen, Yangsen
    Hu, Yong
    He, Wei
    PATTERN RECOGNITION, 2025, 162
  • [28] Patch-based deep multi-modal learning framework for Alzheimer's disease diagnosis using multi-view neuroimaging
    Liu, Fangyu
    Yuan, Shizhong
    Li, Weimin
    Xu, Qun
    Sheng, Bin
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 80
  • [29] Charting the Landscape of Multi-view Stereo: An In-Depth Exploration of Deep Learning Techniques
    Zhou, Zhe
    Liu, Xiaozhang
    Tang, Xiangyan
    BIG DATA AND SECURITY, ICBDS 2023, PT I, 2024, 2099 : 152 - 165
  • [30] Unsupervised multi-view stereo network based on multi-stage depth estimation
    Qi, Shuai
    Sang, Xinzhu
    Yan, Binbin
    Wang, Peng
    Chen, Duo
    Wang, Huachun
    Ye, Xiaoqian
    IMAGE AND VISION COMPUTING, 2022, 122