GenS: Generalizable Neural Surface Reconstruction from Multi-View Images

被引：0

作者：

Peng, Rui ^{[1
,2
]}

Gu, Xiaodong ^{[3
]}

Tang, Luyang ^{[1
]}

Shen, Shihe ^{[1
]}

Yu, Fanqi ^{[1
]}

Wang, Ronggang ^{[1
,2
]}

机构：

[1] Peking Univ, Sch Elect & Comp Engn, Beijing, Peoples R China

[2] Peng Cheng Lab, Shenzhen, Peoples R China

[3] Alibaba Grp, Hangzhou, Peoples R China

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Combining the signed distance function (SDF) and differentiable volume rendering has emerged as a powerful paradigm for surface reconstruction from multi-view images without 3D supervision. However, current methods are impeded by requiring long-time per-scene optimizations and cannot generalize to new scenes. In this paper, we present GenS, an end-to-end generalizable neural surface reconstruction model. Unlike coordinate-based methods that train a separate network for each scene, we construct a generalized multi-scale volume to directly encode all scenes. Compared with existing solutions, our representation is more powerful, which can recover high-frequency details while maintaining global smoothness. Meanwhile, we introduce a multi-scale feature-metric consistency to impose the multi-view consistency in a more discriminative multi-scale feature space, which is robust to the failures of the photometric consistency. And the learnable feature can be self-enhanced to continuously improve the matching accuracy and mitigate aggregation ambiguity. Furthermore, we design a view contrast loss to force the model to be robust to those regions covered by few viewpoints through distilling the geometric prior from dense input to sparse input. Extensive experiments on popular benchmarks show that our model can generalize well to new scenes and outperform existing state-of-the-art methods even those employing ground-truth depth supervision. Code will be available at https://github.com/prstrive/GenS.

引用

页数：14

共 50 条

[1] Improving Neural Surface Reconstruction with Feature Priors from Multi-view Images
Ren, Xinlin
Cao, Chenjie
Fu, Yanwei
Xu, Xiangyang
COMPUTER VISION - ECCV 2024, PT LVIII, 2025, 15116 : 445 - 463
[2] MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo
Liu, Tianqi
Wang, Guangcong
Hu, Shoukang
She, Liao
Ye, Xinyi
Zang, Yuhang
Cao, Zhiguo
Li, Wei
Liu, Ziwei
COMPUTER VISION-ECCV 2024, PT XVIII, 2025, 15076 : 37 - 53
[3] MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo
Chen, Anpei
Xu, Zexiang
Zhao, Fuqiang
Zhang, Xiaoshuai
Xiang, Fanbo
Yu, Jingyi
Su, Hao
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14104 - 14113
[4] A clustering approach to free form surface reconstruction from multi-view range images
Zhou, Hong
Liu, Yonghuai
Li, Longzhuang
Wei, Baogang
IMAGE AND VISION COMPUTING, 2009, 27 (06) : 725 - 747
[5] A clustering approach to free form surface reconstruction from multi-view range images
Zhou, Hong
Liu, Yonghuai
2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, VOLS 1-3, 2006, : 941 - +
[6] Generalizable Geometry-Aware Human Radiance Modeling from Multi-view Images
Wu, Weijun
Mo, Zhixiong
Yu, Weihao
Cheng, Yizhou
Zhang, Tinghua
Huang, Jin
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VI, 2025, 15036 : 95 - 109
[7] High Quality Texture Reconstruction from Multi-view Images
Kim, Hye-sun
Park, Chang-joon
2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 1112 - 1114
[8] Reconstruction of cloud geometry from multi-view satellite images
Seiz, G
Davies, R
REMOTE SENSING OF ENVIRONMENT, 2006, 100 (02) : 143 - 149
[9] GM-NeRF: Learning Generalizable Model-based Neural Radiance Fields from Multi-view Images
Chen, Jianchuan
Yi, Wentao
Ma, Liqian
Jia, Xu
Lu, Huchuan
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 20648 - 20658
[10] JOINT RECONSTRUCTION OF COMPRESSED MULTI-VIEW IMAGES
Chen, Xu
Frossard, Pascal
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1005 - +

← 1 2 3 4 5 →