RobustMVS: Single Domain Generalized Deep Multi-View Stereo

被引:0
|
作者
Xu, Hongbin [1 ]
Chen, Weitao [2 ]
Sun, Baigui [2 ]
Xie, Xuansong [2 ]
Kang, Wenxiong [1 ,3 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China
[2] Damo Acad, Alibaba Grp, Hangzhou, Peoples R China
[3] Pazhou Lab, Guangzhou 510335, Peoples R China
基金
中国国家自然科学基金;
关键词
Costs; Task analysis; Three-dimensional displays; Training; Covariance matrices; Visualization; Benchmark testing; Multi-view stereo; domain generalization; deep learning; 3D reconstruction; computer vision; NETWORK;
D O I
10.1109/TCSVT.2024.3399458
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Despite the impressive performance of Multi-view Stereo (MVS) approaches given plenty of training samples, the performance degradation when generalizing to unseen domains has not been clearly explored yet. In this work, we focus on the domain generalization problem in MVS. To evaluate the generalization results, we build a novel MVS domain generalization benchmark including synthetic and real-world datasets. In contrast to conventional domain generalization benchmarks, we consider a more realistic but challenging scenario, where only one source domain is available for training. The MVS problem can be analogized back to the feature matching task, and maintaining robust feature consistency among views is an important factor for improving generalization performance. To address the domain generalization problem in MVS, we propose a novel MVS framework, namely RobustMVS. A Depth-Clustering-guided Whitening (DCW) loss is further introduced to preserve the feature consistency among different views, which decorrelates multi-view features from viewpoint-specific style information based on geometric priors from depth maps. The experimental results further show that our method achieves superior performance on the domain generalization benchmark.
引用
收藏
页码:9181 / 9194
页数:14
相关论文
共 50 条
  • [21] Probabilistic visibility for multi-view stereo
    Hernandez, Carlos
    Vogiatzis, George
    Cipolla, Roberto
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 1704 - 1711
  • [22] Multi-view stereo beyond Lambert
    Jin, HL
    Soatto, S
    Yezzi, AJ
    2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003, : 171 - 178
  • [23] Multi-View Photometric Stereo Revisited
    Kaya, Berk
    Kumar, Suryansh
    Oliveira, Carlos
    Ferrari, Vittorio
    Van Gool, Luc
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3125 - 3134
  • [24] Progressive Prioritized Multi-view Stereo
    Locher, Alex
    Perdoch, Michal
    Gool, Luc Van
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3244 - 3252
  • [25] Occluding Contours for Multi-View Stereo
    Shan, Qi
    Curless, Brian
    Furukawa, Yasutaka
    Hernandez, Carlos
    Seitz, Steven M.
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 4002 - 4009
  • [26] Assisted multi-view stereo reconstruction
    Dellepiane, Matteo
    Cavarretta, Emanuele
    Cignoni, Paolo
    Scopigno, Roberto
    2013 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2013), 2013, : 318 - 325
  • [27] Adaptive segmentation for multi-view stereo
    Khuboni, Ray
    Naidoo, Bashan
    IET COMPUTER VISION, 2017, 11 (01) : 10 - 21
  • [28] Learning a Multi-View Stereo Machine
    Kar, Abhishek
    Hane, Christian
    Malik, Jitendra
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [29] MULTI-VIEW STEREO WITH SEMANTIC PRIORS
    Stathopoulou, E. -K.
    Remondino, F.
    27TH CIPA INTERNATIONAL SYMPOSIUM: DOCUMENTING THE PAST FOR A BETTER FUTURE, 2019, 42-2 (W15): : 1135 - 1140
  • [30] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks
    Yao, Yao
    Luo, Zixin
    Li, Shiwei
    Zhang, Jingyang
    Ren, Yufan
    Zhou, Lei
    Fang, Tian
    Quan, Long
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1787 - 1796