NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

被引:69
|
作者
Wei, Yi [1 ,2 ]
Liu, Shaohui [3 ]
Rao, Yongming [1 ,2 ]
Zhao, Wang [4 ]
Lu, Jiwen [1 ,2 ]
Zhou, Jie [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[3] Swiss Fed Inst Technol, Zurich, Switzerland
[4] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV48922.2021.00556
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present a new multi-view depth estimation method that utilizes both conventional SfM reconstruction and learning-based priors over the recently proposed neural radiance fields (NeRF). Unlike existing neural network based optimization method that relies on estimated correspondences, our method directly optimizes over implicit volumes, eliminating the challenging step of matching pixels in indoor scenes. The key to our approach is to utilize the learning-based priors to guide the optimization process of NeRF. Our system firstly adapts a monocular depth network over the target scene by finetuning on its sparse SfM reconstruction. Then, we show that the shape-radiance ambiguity of NeRF still exists in indoor environments and propose to address the issue by employing the adapted depth priors to monitor the sampling process of volume rendering. Finally, a per-pixel confidence map acquired by error computation on the rendered image can be used to further improve the depth quality. Experiments show that our proposed framework significantly outperforms state-of-the-art methods on indoor scenes, with surprising findings presented on the effectiveness of correspondence-based optimization and NeRF-based optimization over the adapted depth priors. In addition, we show that the guided optimization scheme does not sacrifice the original synthesis capability of neural radiance fields, improving the rendering quality on both seen and novel views. Code is available at https://github.com/weiyithu/NerfingMVS.
引用
收藏
页码:5590 / 5599
页数:10
相关论文
共 50 条
  • [31] Polarimetric Multi-View Stereo
    Cui, Zhaopeng
    Gu, Jinwei
    Shi, Boxin
    Tan, Ping
    Kautz, Jan
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 369 - 378
  • [32] Multi-View Stereo: A Tutorial
    Furukawa, Yasutaka
    Hernandez, Carlos
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2013, 9 (1-2): : 1 - 148
  • [33] Piecewise planar scene reconstruction and optimization for multi-view stereo
    Kim, Hyojin
    Xiao, Hong
    Max, Nelson
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2013, 7727 LNCS (PART 4): : 191 - 204
  • [34] Optimization of Plane Fits to Image Segments in Multi-View Stereo
    Max, Nelson
    Kim, Hyojin
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 1130 - 1136
  • [35] STRUCTURE OPTIMIZATION FOR MULTI-VIEW ACQUISITION AND STEREO DISPLAY SYSTEM
    Cheng, Hao
    You, Zhixiang
    An, Ping
    Zhang, Zhaoyang
    2013 IEEE 11TH IVMSP WORKSHOP: 3D IMAGE/VIDEO TECHNOLOGIES AND APPLICATIONS (IVMSP 2013), 2013,
  • [36] Transformer-guided Feature Pyramid Network for Multi-View Stereo
    Wang, Lina
    She, Jiangfeng
    Zhao, Qiang
    Wen, Xiang
    Guan, Yuzheng
    NEUROCOMPUTING, 2025, 617
  • [37] SatelliteRF: Accelerating 3D Reconstruction in Multi-View Satellite Images with Efficient Neural Radiance Fields
    Zhou, Xin
    Wang, Yang
    Lin, Daoyu
    Cao, Zehao
    Li, Biqing
    Liu, Junyi
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [38] MosaicMVS: Mosaic-Based Omnidirectional Multi-View Stereo for Indoor Scenes
    Shin, Min-Jung
    Park, Woojune
    Cho, Minji
    Kong, Kyeongbo
    Son, Hoseong
    Kim, Joonsoo
    Yun, Kug-Jin
    Lee, Gwangsoon
    Kang, Suk-Ju
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8279 - 8290
  • [39] Multi-view stereo for weakly textured indoor 3D reconstruction
    Wang, Tao
    Gan, Vincent J. L.
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (10) : 1469 - 1489
  • [40] RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
    Chang, Di
    Bozic, Aljaz
    Zhang, Tong
    Yan, Qingsong
    Chen, Yingcong
    Susstrunk, Sabine
    Niessner, Matthias
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 665 - 680