MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo

被引:0
|
作者
Liu, Tianqi [1 ]
Wang, Guangcong [2 ,3 ]
Hu, Shoukang [2 ]
She, Liao [1 ]
Ye, Xinyi [1 ]
Zang, Yuhang [4 ]
Cao, Zhiguo [1 ]
Li, Wei [2 ]
Liu, Ziwei [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch AIA, Wuhan, Peoples R China
[2] Nanyang Technol Univ, S Lab, Singapore, Singapore
[3] Great Bay Univ, Dongguan, Peoples R China
[4] Shanghai AI Lab, Shanghai, Peoples R China
来源
关键词
Generalizable Gaussian Splatting; Multi-View Stereo; Neural Radiance Field; Novel View Synthesis;
D O I
10.1007/978-3-031-72649-1_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present MVSGaussian, a new generalizable 3D Gaussian representation approach derived from Multi-View Stereo (MVS) that can efficiently reconstruct unseen scenes. Specifically, 1) we leverage MVS to encode geometry-aware Gaussian representations and decode them into Gaussian parameters. 2) To further enhance performance, we propose a hybrid Gaussian rendering that integrates an efficient volume rendering design for novel view synthesis. 3) To support fast fine-tuning for specific scenes, we introduce a multi-view geometric consistent aggregation strategy to effectively aggregate the point clouds generated by the generalizable model, serving as the initialization for per-scene optimization. Compared with previous generalizable NeRF-based methods, which typically require minutes of fine-tuning and seconds of rendering per image, MVSGaussian achieves real-time rendering with better synthesis quality for each scene. Compared with the vanilla 3D-GS, MVSGaussian achieves better view synthesis with less training computational cost. Extensive experiments on DTU, Real Forward-facing, NeRF Synthetic, and Tanks and Temples datasets validate that MVSGaussian attains state-of-the-art performance with convincing generalizability, real-time rendering speed, and fast per-scene optimization.
引用
收藏
页码:37 / 53
页数:17
相关论文
共 50 条
  • [21] Multi-View Stereo Reconstruction with High Dynamic Range Texture
    Lu, Feng
    Ji, Xiangyang
    Dai, Qionghai
    Er, Guihua
    COMPUTER VISION - ACCV 2010, PT II, 2011, 6493 : 412 - +
  • [22] Multi-view stereo reconstruction of dense shape and complex appearance
    Jin, HL
    Soatto, S
    Yezzi, AJ
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 63 (03) : 175 - 189
  • [23] A surface-growing approach to multi-view, stereo reconstruction
    Habbecke, Martin
    Kobbelt, Leif
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 1720 - +
  • [24] Multi-View Stereo Reconstruction of Dense Shape and Complex Appearance
    Hailin Jin
    Stefano Soatto
    Anthony J. Yezzi
    International Journal of Computer Vision, 2005, 63 : 175 - 189
  • [25] Hole-filling algorithm in multi-view stereo reconstruction
    Wu, Xiaojun
    Wen, Fei
    Wen, Peizhi
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2012, 24 (12): : 1606 - 1613
  • [26] Multi-View Stereo 3D Edge Reconstruction
    Bignoli, Andrea
    Romanoni, Andrea
    Matteucci, Matteo
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 867 - 875
  • [27] Fast level set multi-view stereo on graphics hardware
    Labatut, Patrick
    Keriven, Renaud
    Pons, Jean-Philippe
    THIRD INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING, VISUALIZATION, AND TRANSMISSION, PROCEEDINGS, 2007, : 774 - 781
  • [28] Conditional Single-view Shape Generation for Multi-view Stereo Reconstruction
    Wei, Yi
    Liu, Shaohui
    Zhao, Wang
    Lu, Jiwen
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9643 - 9652
  • [29] Refractive Multi-view Stereo
    Cassidy, Matthew
    Melou, Jean
    Queau, Yvain
    Lauze, Francois
    Durou, Jean-Denis
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 384 - 393
  • [30] Polarimetric Multi-View Stereo
    Cui, Zhaopeng
    Gu, Jinwei
    Shi, Boxin
    Tan, Ping
    Kautz, Jan
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 369 - 378