RobustMVS: Single Domain Generalized Deep Multi-View Stereo

被引:0
|
作者
Xu, Hongbin [1 ]
Chen, Weitao [2 ]
Sun, Baigui [2 ]
Xie, Xuansong [2 ]
Kang, Wenxiong [1 ,3 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China
[2] Damo Acad, Alibaba Grp, Hangzhou, Peoples R China
[3] Pazhou Lab, Guangzhou 510335, Peoples R China
基金
中国国家自然科学基金;
关键词
Costs; Task analysis; Three-dimensional displays; Training; Covariance matrices; Visualization; Benchmark testing; Multi-view stereo; domain generalization; deep learning; 3D reconstruction; computer vision; NETWORK;
D O I
10.1109/TCSVT.2024.3399458
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Despite the impressive performance of Multi-view Stereo (MVS) approaches given plenty of training samples, the performance degradation when generalizing to unseen domains has not been clearly explored yet. In this work, we focus on the domain generalization problem in MVS. To evaluate the generalization results, we build a novel MVS domain generalization benchmark including synthetic and real-world datasets. In contrast to conventional domain generalization benchmarks, we consider a more realistic but challenging scenario, where only one source domain is available for training. The MVS problem can be analogized back to the feature matching task, and maintaining robust feature consistency among views is an important factor for improving generalization performance. To address the domain generalization problem in MVS, we propose a novel MVS framework, namely RobustMVS. A Depth-Clustering-guided Whitening (DCW) loss is further introduced to preserve the feature consistency among different views, which decorrelates multi-view features from viewpoint-specific style information based on geometric priors from depth maps. The experimental results further show that our method achieves superior performance on the domain generalization benchmark.
引用
收藏
页码:9181 / 9194
页数:14
相关论文
共 50 条
  • [1] Deep Multi-View Stereo Gone Wild
    Darmon, Francois
    Bascle, Benedicte
    Devaux, Jean-Clement
    Monasse, Pascal
    Aubry, Mathieu
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 484 - 493
  • [2] Multi-View Guided Multi-View Stereo
    Poggi, Matteo
    Conti, Andrea
    Mattoccia, Stefano
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8391 - 8398
  • [3] MVS2: Deep Unsupervised Multi-view Stereo with Multi-View Symmetry
    Dai, Yuchao
    Zhu, Zhidong
    Rao, Zhibo
    Li, Bo
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 1 - 8
  • [4] PIECEWISE SINGLE VIEW PHOTOMETRIC STEREO WITH MULTI-VIEW CONSTRAINTS
    Sabzevari, Reza
    Del Bue, Alessio
    Murino, Vittorio
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 21 - 24
  • [5] Sparse prior guided deep multi-view stereo
    Qi, Yuhang
    Su, Wanjuan
    Xu, Qingshan
    Tao, Wenbing
    COMPUTERS & GRAPHICS-UK, 2022, 107 : 1 - 9
  • [6] Semi-supervised Deep Multi-view Stereo
    Xu, Hongbin
    Chen, Weitao
    Liu, Yang
    Zhou, Zhipeng
    Xiao, Haihong
    Sun, Baigui
    Xie, Xuansong
    Kang, Wenxiong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4616 - 4625
  • [7] Multi-View Stereo with Single-View Semantic Mesh Refinement
    Romanoni, Andrea
    Ciccone, Marco
    Visin, Francesco
    Matteucci, Matteo
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 706 - 715
  • [8] Refractive Multi-view Stereo
    Cassidy, Matthew
    Melou, Jean
    Queau, Yvain
    Lauze, Francois
    Durou, Jean-Denis
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 384 - 393
  • [9] Deep Facial Non-Rigid Multi-View Stereo
    Bai, Ziqian
    Cui, Zhaopeng
    Rahim, Jamal Ahmed
    Liu, Xiaoming
    Tan, Ping
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5849 - 5859
  • [10] Multi-view stereo algorithms based on deep learning: a survey
    Huang, Hongbo
    Yan, Xiaoxu
    Zheng, Yaolin
    He, Jiayu
    Xu, Longfei
    Qin, Dechun
    Multimedia Tools and Applications, 2025, 84 (06) : 2877 - 2908