RobustMVS: Single Domain Generalized Deep Multi-View Stereo

Citations: 0
Authors
Xu, Hongbin [1]
Chen, Weitao [2]
Sun, Baigui [2]
Xie, Xuansong [2]
Kang, Wenxiong [1, 3]
Affiliations
[1] South China University of Technology, School of Automation Science and Engineering, Guangzhou 510641, People's Republic of China
[2] DAMO Academy, Alibaba Group, Hangzhou, People's Republic of China
[3] Pazhou Lab, Guangzhou 510335, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Costs; Task analysis; Three-dimensional displays; Training; Covariance matrices; Visualization; Benchmark testing; Multi-view stereo; domain generalization; deep learning; 3D reconstruction; computer vision; NETWORK;
DOI
10.1109/TCSVT.2024.3399458
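Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract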
Despite the impressive performance of Multi-view Stereo (MVS) approaches given plenty of training samples, the performance degradation when generalizing to unseen domains has not been clearly explored yet. In this work, we focus on the domain generalization problem in MVS. To evaluate the generalization results, we build a novel MVS domain generalization benchmark including synthetic and real-world datasets. In contrast to conventional domain generalization benchmarks, we consider a more realistic but challenging scenario, where only one source domain is available for training. The MVS problem can be analogized back to the feature matching task, and maintaining robust feature consistency among views is an important factor for improving generalization performance. To address the domain generalization problem in MVS, we propose a novel MVS framework, namely RobustMVS. A Depth-Clustering-guided Whitening (DCW) loss is further introduced to preserve the feature consistency among different views, which decorrelates multi-view features from viewpoint-specific style information based on geometric priors from depth maps. The experimental results further show that our method achieves superior performance on the domain generalization benchmark.
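The abstract describes a whitening loss that decorrelates features, guided by depth clustering. The record gives no formula, so the following is only a minimal sketch of a generic covariance-whitening penalty, not the authors' DCW loss. The function names, the quantile-based depth binning, and the `num_clusters` parameter are all assumptions made for illustration.

```python
import numpy as np


def whitening_loss(features):
    """Generic whitening penalty (illustrative; not the paper's DCW loss).

    features: array of shape (N, C), N feature vectors with C channels.
    Returns the mean absolute off-diagonal entry of the channel covariance.
    """
    centered = features - features.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / max(features.shape[0] - 1, 1)
    off_diag = cov - np.diag(np.diag(cov))
    c = cov.shape[0]
    return float(np.abs(off_diag).sum() / max(c * (c - 1), 1))


def clustered_whitening_loss(features, depth, num_clusters=4):
    """Hypothetical depth-guided variant: average the penalty per depth bin.

    depth: array of shape (N,), one depth value per feature vector.
    Bins are depth quantiles, an assumption standing in for real clustering.
    """
    edges = np.quantile(depth, np.linspace(0.0, 1.0, num_clusters + 1))
    labels = np.searchsorted(edges, depth, side="right") - 1
    labels = np.clip(labels, 0, num_clusters - 1)
    losses = [
        whitening_loss(features[labels == k])
        for k in range(num_clusters)
        if np.count_nonzero(labels == k) > 1  # need 2+ samples for covariance
    ]
    return float(np.mean(losses)) if losses else 0.0
```

In this sketch, perfectly correlated channels produce a positive penalty and uncorrelated channels produce zero; how the published method forms its clusters and covariance targets is not stated in this record.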
Pages: 9181-9194
Page count: 14