MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo

被引:0
|
作者
Liu, Tianqi [1 ]
Wang, Guangcong [2 ,3 ]
Hu, Shoukang [2 ]
She, Liao [1 ]
Ye, Xinyi [1 ]
Zang, Yuhang [4 ]
Cao, Zhiguo [1 ]
Li, Wei [2 ]
Liu, Ziwei [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch AIA, Wuhan, Peoples R China
[2] Nanyang Technol Univ, S Lab, Singapore, Singapore
[3] Great Bay Univ, Dongguan, Peoples R China
[4] Shanghai AI Lab, Shanghai, Peoples R China
来源
关键词
Generalizable Gaussian Splatting; Multi-View Stereo; Neural Radiance Field; Novel View Synthesis;
D O I
10.1007/978-3-031-72649-1_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present MVSGaussian, a new generalizable 3D Gaussian representation approach derived from Multi-View Stereo (MVS) that can efficiently reconstruct unseen scenes. Specifically, 1) we leverage MVS to encode geometry-aware Gaussian representations and decode them into Gaussian parameters. 2) To further enhance performance, we propose a hybrid Gaussian rendering that integrates an efficient volume rendering design for novel view synthesis. 3) To support fast fine-tuning for specific scenes, we introduce a multi-view geometric consistent aggregation strategy to effectively aggregate the point clouds generated by the generalizable model, serving as the initialization for per-scene optimization. Compared with previous generalizable NeRF-based methods, which typically require minutes of fine-tuning and seconds of rendering per image, MVSGaussian achieves real-time rendering with better synthesis quality for each scene. Compared with the vanilla 3D-GS, MVSGaussian achieves better view synthesis with less training computational cost. Extensive experiments on DTU, Real Forward-facing, NeRF Synthetic, and Tanks and Temples datasets validate that MVSGaussian attains state-of-the-art performance with convincing generalizability, real-time rendering speed, and fast per-scene optimization.
引用
收藏
页码:37 / 53
页数:17
相关论文
共 50 条
  • [31] Multi-View Stereo: A Tutorial
    Furukawa, Yasutaka
    Hernandez, Carlos
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2013, 9 (1-2): : 1 - 148
  • [32] Multi-Resolution Range Data Fusion for Multi-View Stereo Reconstruction
    Kuhn, Andreas
    Hirschmueller, Heiko
    Mayer, Helmut
    PATTERN RECOGNITION, GCPR 2013, 2013, 8142 : 41 - 50
  • [33] Practical BRDF reconstruction using reliable geometric regions from multi-view stereo
    Taishi Ono
    Hiroyuki Kubo
    Kenichiro Tanaka
    Takuya Funatomi
    Yasuhiro Mukaigawa
    Computational Visual Media, 2019, 5 : 325 - 336
  • [34] High-Resolution Plant Shape Measurements from Multi-view Stereo Reconstruction
    Klodt, Maria
    Cremers, Daniel
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT IV, 2015, 8928 : 174 - 184
  • [35] Practical BRDF reconstruction using reliable geometric regions from multi-view stereo
    Taishi Ono
    Hiroyuki Kubo
    Kenichiro Tanaka
    Takuya Funatomi
    Yasuhiro Mukaigawa
    ComputationalVisualMedia, 2019, 5 (04) : 325 - 336
  • [36] Practical BRDF reconstruction using reliable geometric regions from multi-view stereo
    Ono, Taishi
    Kubo, Hiroyuki
    Tanaka, Kenichiro
    Funatomi, Takuya
    Mukaigawa, Yasuhiro
    COMPUTATIONAL VISUAL MEDIA, 2019, 5 (04) : 325 - 336
  • [37] An Extension of PatchMatch Stereo for 3D Reconstruction from Multi-View Images
    Hiradate, Mutsuki
    Ito, Koichi
    Aoki, Takafumi
    Watanabe, Takafumi
    Unten, Hiroki
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 61 - 65
  • [38] Multi-view stereo reconstruction of total knee replacement from X-rays
    Varshney, Kush R.
    Paragios, Nikos
    Kulski, Alain
    Raymond, Remy
    Hernigou, Phillipe
    Rahmouni, Alain
    2007 4TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING : MACRO TO NANO, VOLS 1-3, 2007, : 1148 - +
  • [39] Textured mesh surface reconstruction of large buildings with multi-view stereo
    Zhu, Chen
    Leow, Wee Kheng
    VISUAL COMPUTER, 2013, 29 (6-8): : 609 - 615
  • [40] Textured mesh surface reconstruction of large buildings with multi-view stereo
    Chen Zhu
    Wee Kheng Leow
    The Visual Computer, 2013, 29 : 609 - 615