GenS: Generalizable Neural Surface Reconstruction from Multi-View Images

被引:0
|
作者
Peng, Rui [1 ,2 ]
Gu, Xiaodong [3 ]
Tang, Luyang [1 ]
Shen, Shihe [1 ]
Yu, Fanqi [1 ]
Wang, Ronggang [1 ,2 ]
机构
[1] Peking Univ, Sch Elect & Comp Engn, Beijing, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Alibaba Grp, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Combining the signed distance function (SDF) and differentiable volume rendering has emerged as a powerful paradigm for surface reconstruction from multi-view images without 3D supervision. However, current methods are impeded by requiring long-time per-scene optimizations and cannot generalize to new scenes. In this paper, we present GenS, an end-to-end generalizable neural surface reconstruction model. Unlike coordinate-based methods that train a separate network for each scene, we construct a generalized multi-scale volume to directly encode all scenes. Compared with existing solutions, our representation is more powerful, which can recover high-frequency details while maintaining global smoothness. Meanwhile, we introduce a multi-scale feature-metric consistency to impose the multi-view consistency in a more discriminative multi-scale feature space, which is robust to the failures of the photometric consistency. And the learnable feature can be self-enhanced to continuously improve the matching accuracy and mitigate aggregation ambiguity. Furthermore, we design a view contrast loss to force the model to be robust to those regions covered by few viewpoints through distilling the geometric prior from dense input to sparse input. Extensive experiments on popular benchmarks show that our model can generalize well to new scenes and outperform existing state-of-the-art methods even those employing ground-truth depth supervision. Code will be available at https://github.com/prstrive/GenS.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] MVPSNet: Fast Generalizable Multi-view Photometric Stereo
    Zhao, Dongxu
    Lichy, Daniel
    Perrin, Pierre-Nicolas
    Frahm, Jan-Michael
    Sengupta, Soumyadip
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12491 - 12502
  • [22] Anthropometric Measurements from Multi-View Images
    Li, Jie
    Sun, Mingui
    Chen, Hsin-Chen
    Li, Zhaoxin
    Jia, Wenyan
    2012 38TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE (NEBEC), 2012, : 426 - +
  • [23] VRML animation from multi-view images
    Iwadate, Y
    Katayama, M
    Tomiyama, K
    Imaizumi, H
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : 881 - 884
  • [24] 3D wireframe model reconstruction of buildings from multi-view images using neural implicit fields
    Fan, Weiwei
    Liu, Xinyi
    Zhang, Yongjun
    Wei, Dong
    Guo, Haoyu
    Yue, Dongdong
    AUTOMATION IN CONSTRUCTION, 2025, 174
  • [25] GRAPE: Generalizable and Robust Multi-view Facial Capture
    Li, Jing
    Kang, Di
    He, Zhenyu
    COMPUTER VISION - ECCV 2024, PT XLVII, 2025, 15105 : 403 - 418
  • [26] Coordinate Quantized Neural Implicit Representations for Multi-view Reconstruction
    Jiang, Sijia
    Hua, Jing
    Han, Zhizhong
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18312 - 18323
  • [27] NodeSLAM: Neural Object Descriptors for Multi-View Shape Reconstruction
    Sucar, Edgar
    Wada, Kentaro
    Davison, Andrew
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 949 - 958
  • [28] BRDF Reconstruction from Real Object using Reconstructed Geometry of Multi-view Images
    Ono, Taishi
    Kubo, Hiroyuki
    Funatomi, Takuya
    Mukaigawa, Yasuhiro
    IGGRAPH ASIA 2017 TECHNICAL BRIEFS (SA'17), 2017,
  • [29] An Extension of PatchMatch Stereo for 3D Reconstruction from Multi-View Images
    Hiradate, Mutsuki
    Ito, Koichi
    Aoki, Takafumi
    Watanabe, Takafumi
    Unten, Hiroki
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 61 - 65
  • [30] 3D Clothed Human Reconstruction from Sparse Multi-View Images
    Hong, Jin Gyu
    Noh, Seung Young
    Lee, Hee Kyung
    Cheong, Won Sik
    Chang, Ju Yong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 677 - 687