RobustMVS: Single Domain Generalized Deep Multi-View Stereo

Cited by: 0

Authors:

Xu, Hongbin [1]

Chen, Weitao [2]

Sun, Baigui [2]

Xie, Xuansong [2]

Kang, Wenxiong [1,3]

Affiliations:

[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China

[2] Damo Acad, Alibaba Grp, Hangzhou, Peoples R China

[3] Pazhou Lab, Guangzhou 510335, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024 / Vol. 34 / Issue 10

Funding:

National Natural Science Foundation of China;

Keywords:

Costs; Task analysis; Three-dimensional displays; Training; Covariance matrices; Visualization; Benchmark testing; Multi-view stereo; domain generalization; deep learning; 3D reconstruction; computer vision; NETWORK;

DOI:

10.1109/TCSVT.2024.3399458

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Despite the impressive performance of Multi-view Stereo (MVS) approaches given plenty of training samples, the performance degradation when generalizing to unseen domains has not been clearly explored yet. In this work, we focus on the domain generalization problem in MVS. To evaluate the generalization results, we build a novel MVS domain generalization benchmark including synthetic and real-world datasets. In contrast to conventional domain generalization benchmarks, we consider a more realistic but challenging scenario, where only one source domain is available for training. The MVS problem can be analogized back to the feature matching task, and maintaining robust feature consistency among views is an important factor for improving generalization performance. To address the domain generalization problem in MVS, we propose a novel MVS framework, namely RobustMVS. A Depth-Clustering-guided Whitening (DCW) loss is further introduced to preserve the feature consistency among different views, which decorrelates multi-view features from viewpoint-specific style information based on geometric priors from depth maps. The experimental results further show that our method achieves superior performance on the domain generalization benchmark.


Pages: 9181-9194

Number of pages: 14
