GenS: Generalizable Neural Surface Reconstruction from Multi-View Images

Cited by: 0
Authors
Peng, Rui [1 ,2 ]
Gu, Xiaodong [3 ]
Tang, Luyang [1 ]
Shen, Shihe [1 ]
Yu, Fanqi [1 ]
Wang, Ronggang [1 ,2 ]
Affiliations
[1] Peking Univ, Sch Elect & Comp Engn, Beijing, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Alibaba Grp, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China
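Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes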
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Combining the signed distance function (SDF) with differentiable volume rendering has emerged as a powerful paradigm for surface reconstruction from multi-view images without 3D supervision. However, current methods require lengthy per-scene optimization and cannot generalize to new scenes. In this paper, we present GenS, an end-to-end generalizable neural surface reconstruction model. Unlike coordinate-based methods that train a separate network for each scene, we construct a generalized multi-scale volume to directly encode all scenes. Compared with existing solutions, our representation is more powerful: it recovers high-frequency details while maintaining global smoothness. We also introduce a multi-scale feature-metric consistency that imposes multi-view consistency in a more discriminative multi-scale feature space and is robust to failures of photometric consistency. The learnable features can be self-enhanced to continuously improve matching accuracy and mitigate aggregation ambiguity. Furthermore, we design a view contrast loss that makes the model robust to regions covered by few viewpoints by distilling the geometric prior from dense input to sparse input. Extensive experiments on popular benchmarks show that our model generalizes well to new scenes and outperforms existing state-of-the-art methods, even those employing ground-truth depth supervision. Code will be available at https://github.com/prstrive/GenS.
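The abstract does not say how SDF values are turned into rendering weights, so the following is only an illustrative sketch of the general "SDF plus volume rendering" idea, not the GenS implementation. It assumes the logistic-CDF opacity used by NeuS-style methods; the function names (`sdf_to_weights`, `render_depth`) and the `sharpness` parameter are hypothetical.

```python
import math


def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def sdf_to_weights(sdf_samples, sharpness=64.0):
    """Map SDF values at consecutive ray samples to rendering weights.

    Assumption: opacity follows the logistic-CDF form
    alpha_i = max((Phi(f_i) - Phi(f_{i+1})) / Phi(f_i), 0),
    with Phi(x) = sigmoid(sharpness * x). One weight per ray interval.
    """
    cdf = [sigmoid(sharpness * d) for d in sdf_samples]
    weights = []
    transmittance = 1.0  # fraction of light not yet absorbed
    for i in range(len(cdf) - 1):
        alpha = (cdf[i] - cdf[i + 1]) / max(cdf[i], 1e-8)
        alpha = min(max(alpha, 0.0), 1.0)
        weights.append(transmittance * alpha)
        transmittance *= 1.0 - alpha
    return weights


def render_depth(t_values, sdf_samples, sharpness=64.0):
    """Expected depth along the ray: weighted sum of interval midpoints."""
    weights = sdf_to_weights(sdf_samples, sharpness)
    mids = [0.5 * (t_values[i] + t_values[i + 1]) for i in range(len(weights))]
    return sum(w * m for w, m in zip(weights, mids))


if __name__ == "__main__":
    # Toy ray hitting a plane at distance 1.0, so sdf(t) = 1.0 - t.
    t = [0.1 * i for i in range(21)]
    sdf = [1.0 - ti for ti in t]
    print("rendered depth:", render_depth(t, sdf))
```

In this toy setup the weights concentrate on the two intervals around the zero crossing of the SDF, which is what lets a rendering loss supervise the surface location without 3D ground truth.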
Pages: 14