RobustMVS: Single Domain Generalized Deep Multi-View Stereo

被引:0
|
作者
Xu, Hongbin [1 ]
Chen, Weitao [2 ]
Sun, Baigui [2 ]
Xie, Xuansong [2 ]
Kang, Wenxiong [1 ,3 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China
[2] Damo Acad, Alibaba Grp, Hangzhou, Peoples R China
[3] Pazhou Lab, Guangzhou 510335, Peoples R China
基金
中国国家自然科学基金;
关键词
Costs; Task analysis; Three-dimensional displays; Training; Covariance matrices; Visualization; Benchmark testing; Multi-view stereo; domain generalization; deep learning; 3D reconstruction; computer vision; NETWORK;
D O I
10.1109/TCSVT.2024.3399458
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Despite the impressive performance of Multi-view Stereo (MVS) approaches given plenty of training samples, the performance degradation when generalizing to unseen domains has not been clearly explored yet. In this work, we focus on the domain generalization problem in MVS. To evaluate the generalization results, we build a novel MVS domain generalization benchmark including synthetic and real-world datasets. In contrast to conventional domain generalization benchmarks, we consider a more realistic but challenging scenario, where only one source domain is available for training. The MVS problem can be analogized back to the feature matching task, and maintaining robust feature consistency among views is an important factor for improving generalization performance. To address the domain generalization problem in MVS, we propose a novel MVS framework, namely RobustMVS. A Depth-Clustering-guided Whitening (DCW) loss is further introduced to preserve the feature consistency among different views, which decorrelates multi-view features from viewpoint-specific style information based on geometric priors from depth maps. The experimental results further show that our method achieves superior performance on the domain generalization benchmark.
引用
收藏
页码:9181 / 9194
页数:14
相关论文
共 50 条
  • [31] Generalized Binary Search Network for Highly-Efficient Multi-View Stereo
    Mi, Zhenxing
    Di, Chang
    Xu, Dan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12981 - 12990
  • [32] Planar Catadioptric Stereo: Single and Multi-View Geometry for Calibration and Localization
    Mariottini, Gian Luca
    Scheggi, Stefano
    Morbidi, Fabio
    Prattichizzo, Domenico
    ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 1510 - 1515
  • [33] Discriminative Deep Generalized Dependency Analysis for Multi-View Data
    Kumar D.
    Maji P.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (04): : 1857 - 1868
  • [34] Pixelwise View Selection for Unstructured Multi-View Stereo
    Schonberger, Johannes L.
    Zheng, Enliang
    Frahm, Jan-Michael
    Pollefeys, Marc
    COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 501 - 518
  • [35] High frequency domain enhancement and channel attention module for multi-view stereo
    Yang, Yongjuan
    Cao, Jie
    Zhao, Hong
    Chang, Zhaobin
    Wang, Weijie
    Computers and Electrical Engineering, 2025, 121
  • [36] MULTI-VIEW IMAGE FEATURE CORRELATION GUIDED COST AGGREGATION FOR MULTI-VIEW STEREO
    Lai, Yawen
    Qiu, Ke
    Wang, Ronggang
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [37] Deep Neural Network for Handcrafted Cost-based Multi-view Stereo
    Jeon, Yoonbae
    Park, In Kyu
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2021, 2021, 11766
  • [38] nLMVS-Net: Deep Non-Lambertian Multi-View Stereo
    Yamashita, Kohei
    Enyo, Yuto
    Nobuhara, Shohei
    Nishino, Ko
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3036 - 3045
  • [39] DeepC-MVS: Deep Confidence Prediction for Multi-View Stereo Reconstruction
    Kuhn, Andreas
    Sormann, Christian
    Rossi, Mattia
    Erdler, Oliver
    Fraundorfer, Friedrich
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 404 - 413
  • [40] Multi-distribution fitting for multi-view stereo
    Chen, Jinguang
    Yu, Zonghua
    Ma, Lili
    Zhang, Kaibing
    MACHINE VISION AND APPLICATIONS, 2023, 34 (05)