MG-MVSNet: Multiple granularities feature fusion network for multi-view stereo

被引:10
|
作者
Zhang, Xuedian [1 ]
Yang, Fanzhou [1 ]
Chang, Min [1 ]
Qin, Xiaofei [1 ]
机构
[1] Univ Shanghai Sci & Technol, Key Lab Opt Technol & Instrument Med, Minist Educ, Shanghai 200093, Peoples R China
基金
国家重点研发计划;
关键词
Multi-view stereo; 3D reconstruction; Deep learning; Multiple granularities feature fusion;
D O I
10.1016/j.neucom.2023.01.062
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of Multi-View Stereo is to reconstruct the 3D point cloud model from multiple views. With the development of deep learning, more and more learning-based research has achieved remarkable results. However, existing methods ignore the fine-grained features of the bottom layer, which leads to the poor quality of model reconstruction, especially in terms of completeness. Besides, current methods still rely on a large amount of consumed memory resources because of the application of 3D convolution. To this end, this paper proposes a Multiple Granularities Feature Fusion Network for Multi-View Stereo, an end-to-end depth estimation network combining global and local features, which is characterized by fine-granularity multi-feature fusion. Firstly, we propose a dense feature adaptive connection module, which can adaptively fuse the global and local features in the scene, provide a more complete and effective fea-ture map for inferring a more detailed depth map, and make the ultimate model more complete. Secondly, in order to further improve the accuracy and completeness of the reconstructed point cloud, we introduce normal and edge loss futead of only using depth loss functions as in the existing methods, which makes the network more sensitive to small depth structures. Finally, we propose distributed 3D convolution instead of traditional 3D convolution, which reduces memory consumption. The experimen-tal results on the DTU and Tanks & Temples datasets demonstrate that the proposed method in this papaer achieves the state-of-the-art performance, which proves the accuracy and effectiveness of the MG-MVSNet proposed in this paper.(c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:35 / 47
页数:13
相关论文
共 50 条
  • [1] LE-MVSNet: Lightweight Efficient Multi-view Stereo Network
    Kong, Changfei
    Zhang, Ziyi
    Mao, Jiafa
    Chan, Sixian
    Sheng, Weigou
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 484 - 497
  • [2] MVSNet: Depth Inference for Unstructured Multi-view Stereo
    Yao, Yao
    Luo, Zixin
    Li, Shiwei
    Fang, Tian
    Quan, Long
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 785 - 801
  • [3] Vis-MVSNet: Visibility-Aware Multi-view Stereo Network
    Zhang, Jingyang
    Li, Shiwei
    Luo, Zixin
    Fang, Tian
    Yao, Yao
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (01) : 199 - 214
  • [4] Vis-MVSNet: Visibility-Aware Multi-view Stereo Network
    Jingyang Zhang
    Shiwei Li
    Zixin Luo
    Tian Fang
    Yao Yao
    International Journal of Computer Vision, 2023, 131 : 199 - 214
  • [5] OD-MVSNet: Omni-dimensional dynamic multi-view stereo network
    Pan, Ke
    Li, Kefeng
    Zhang, Guangyuan
    Zhu, Zhenfang
    Wang, Peng
    Wang, Zhenfei
    Fu, Chen
    Li, Guangchen
    Ding, Yuxuan
    PLOS ONE, 2024, 19 (08):
  • [6] DAR-MVSNet: a novel dual attention residual network for multi-view stereo
    Li, Tingshuai
    Liang, Hu
    Wen, Changchun
    Qu, Jiacheng
    Zhao, Shengrong
    Zhang, Qingmeng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (8-9) : 5857 - 5866
  • [7] DRI-MVSNet: A depth residual inference network for multi-view stereo images
    Li, Ying
    Li, Wenyue
    Zhao, Zhijie
    Fan, JiaHao
    PLOS ONE, 2022, 17 (03):
  • [8] Feature distribution normalization network for multi-view stereo
    Chen, Ziyang
    Zhao, Yang
    He, Junling
    Lu, Yujie
    Cui, Zhongwei
    Li, Wenting
    Zhang, Yongjun
    VISUAL COMPUTER, 2025, 41 (01): : 409 - 421
  • [9] NTPP-MVSNet: Multi-View Stereo Network Based on Neighboring Tangent Plane Propagation
    Zhao, Qi
    Deng, Yangyan
    Yang, Yifan
    Li, Yawei
    Yuan, Ding
    APPLIED SCIENCES-BASEL, 2023, 13 (14):
  • [10] RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
    Chang, Di
    Bozic, Aljaz
    Zhang, Tong
    Yan, Qingsong
    Chen, Yingcong
    Susstrunk, Sabine
    Niessner, Matthias
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 665 - 680