NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

被引：69

作者：

Wei, Yi ^{[1
,2
]}

Liu, Shaohui ^{[3
]}

Rao, Yongming ^{[1
,2
]}

Zhao, Wang ^{[4
]}

Lu, Jiwen ^{[1
,2
]}

Zhou, Jie ^{[1
,2
]}

机构：

[1] Tsinghua Univ, Dept Automat, Beijing, Peoples R China

[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China

[3] Swiss Fed Inst Technol, Zurich, Switzerland

[4] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China

来源：

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/ICCV48922.2021.00556

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we present a new multi-view depth estimation method that utilizes both conventional SfM reconstruction and learning-based priors over the recently proposed neural radiance fields (NeRF). Unlike existing neural network based optimization method that relies on estimated correspondences, our method directly optimizes over implicit volumes, eliminating the challenging step of matching pixels in indoor scenes. The key to our approach is to utilize the learning-based priors to guide the optimization process of NeRF. Our system firstly adapts a monocular depth network over the target scene by finetuning on its sparse SfM reconstruction. Then, we show that the shape-radiance ambiguity of NeRF still exists in indoor environments and propose to address the issue by employing the adapted depth priors to monitor the sampling process of volume rendering. Finally, a per-pixel confidence map acquired by error computation on the rendered image can be used to further improve the depth quality. Experiments show that our proposed framework significantly outperforms state-of-the-art methods on indoor scenes, with surprising findings presented on the effectiveness of correspondence-based optimization and NeRF-based optimization over the adapted depth priors. In addition, we show that the guided optimization scheme does not sacrifice the original synthesis capability of neural radiance fields, improving the rendering quality on both seen and novel views. Code is available at https://github.com/weiyithu/NerfingMVS.

引用

页码：5590 / 5599

页数：10

共 50 条

[31] Polarimetric Multi-View Stereo
Cui, Zhaopeng
Gu, Jinwei
Shi, Boxin
Tan, Ping
Kautz, Jan
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 369 - 378
[32] Multi-View Stereo: A Tutorial
Furukawa, Yasutaka
Hernandez, Carlos
FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2013, 9 (1-2): : 1 - 148
[33] Piecewise planar scene reconstruction and optimization for multi-view stereo
Kim, Hyojin
Xiao, Hong
Max, Nelson
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2013, 7727 LNCS (PART 4): : 191 - 204
[34] Optimization of Plane Fits to Image Segments in Multi-View Stereo
Max, Nelson
Kim, Hyojin
2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 1130 - 1136
[35] STRUCTURE OPTIMIZATION FOR MULTI-VIEW ACQUISITION AND STEREO DISPLAY SYSTEM
Cheng, Hao
You, Zhixiang
An, Ping
Zhang, Zhaoyang
2013 IEEE 11TH IVMSP WORKSHOP: 3D IMAGE/VIDEO TECHNOLOGIES AND APPLICATIONS (IVMSP 2013), 2013,
[36] Transformer-guided Feature Pyramid Network for Multi-View Stereo
Wang, Lina
She, Jiangfeng
Zhao, Qiang
Wen, Xiang
Guan, Yuzheng
NEUROCOMPUTING, 2025, 617
[37] SatelliteRF: Accelerating 3D Reconstruction in Multi-View Satellite Images with Efficient Neural Radiance Fields
Zhou, Xin
Wang, Yang
Lin, Daoyu
Cao, Zehao
Li, Biqing
Liu, Junyi
APPLIED SCIENCES-BASEL, 2024, 14 (07):
[38] MosaicMVS: Mosaic-Based Omnidirectional Multi-View Stereo for Indoor Scenes
Shin, Min-Jung
Park, Woojune
Cho, Minji
Kong, Kyeongbo
Son, Hoseong
Kim, Joonsoo
Yun, Kug-Jin
Lee, Gwangsoon
Kang, Suk-Ju
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8279 - 8290
[39] Multi-view stereo for weakly textured indoor 3D reconstruction
Wang, Tao
Gan, Vincent J. L.
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (10) : 1469 - 1489
[40] RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
Chang, Di
Bozic, Aljaz
Zhang, Tong
Yan, Qingsong
Chen, Yingcong
Susstrunk, Sabine
Niessner, Matthias
COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 665 - 680

← 1 2 3 4 5 →