MG-MVSNet: Multiple granularities feature fusion network for multi-view stereo

被引:10
|
作者
Zhang, Xuedian [1 ]
Yang, Fanzhou [1 ]
Chang, Min [1 ]
Qin, Xiaofei [1 ]
机构
[1] Univ Shanghai Sci & Technol, Key Lab Opt Technol & Instrument Med, Minist Educ, Shanghai 200093, Peoples R China
基金
国家重点研发计划;
关键词
Multi-view stereo; 3D reconstruction; Deep learning; Multiple granularities feature fusion;
D O I
10.1016/j.neucom.2023.01.062
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of Multi-View Stereo is to reconstruct the 3D point cloud model from multiple views. With the development of deep learning, more and more learning-based research has achieved remarkable results. However, existing methods ignore the fine-grained features of the bottom layer, which leads to the poor quality of model reconstruction, especially in terms of completeness. Besides, current methods still rely on a large amount of consumed memory resources because of the application of 3D convolution. To this end, this paper proposes a Multiple Granularities Feature Fusion Network for Multi-View Stereo, an end-to-end depth estimation network combining global and local features, which is characterized by fine-granularity multi-feature fusion. Firstly, we propose a dense feature adaptive connection module, which can adaptively fuse the global and local features in the scene, provide a more complete and effective fea-ture map for inferring a more detailed depth map, and make the ultimate model more complete. Secondly, in order to further improve the accuracy and completeness of the reconstructed point cloud, we introduce normal and edge loss futead of only using depth loss functions as in the existing methods, which makes the network more sensitive to small depth structures. Finally, we propose distributed 3D convolution instead of traditional 3D convolution, which reduces memory consumption. The experimen-tal results on the DTU and Tanks & Temples datasets demonstrate that the proposed method in this papaer achieves the state-of-the-art performance, which proves the accuracy and effectiveness of the MG-MVSNet proposed in this paper.(c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:35 / 47
页数:13
相关论文
共 50 条
  • [41] MFNet: Multi-level fusion aware feature pyramid based multi-view stereo network for 3D reconstruction
    Cai, Youcheng
    Li, Lin
    Wang, Dong
    Liu, Xiaoping
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4289 - 4301
  • [42] MFNet: Multi-level fusion aware feature pyramid based multi-view stereo network for 3D reconstruction
    Youcheng Cai
    Lin Li
    Dong Wang
    Xiaoping Liu
    Applied Intelligence, 2023, 53 : 4289 - 4301
  • [43] MVSNet plus plus : Learning Depth-Based Attention Pyramid Features for Multi-View Stereo
    Chen, Po-Heng
    Yang, Hsiao-Chien
    Chen, Kuan-Wen
    Chen, Yong-Sheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7261 - 7273
  • [44] Learning Efficient Photometric Feature Transform for Multi-view Stereo
    Kang, Kaizhang
    Xie, Cihui
    Zhu, Ruisheng
    Ma, Xiaohe
    Tan, Ping
    Wu, Hongzhi
    Zhou, Kun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5936 - 5945
  • [45] FADE: Feature Aggregation for Depth Estimation With Multi-View Stereo
    Yang, Hsiao-Chien
    Chen, Po-Heng
    Chen, Kuan-Wen
    Lee, Chen-Yi
    Chen, Yong-Sheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 6590 - 6600
  • [46] Feature-enhanced representation with transformers for multi-view stereo
    Xiang, Lintao
    Yin, Hujun
    IET IMAGE PROCESSING, 2024, 18 (06) : 1530 - 1539
  • [47] Multi-view Stereo Network with Attention Thin Volume
    Wan, Zihang
    Xu, Chao
    Hu, Jing
    Xiao, Jian
    Meng, Zhaopeng
    Chen, Jitai
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 410 - 423
  • [48] Multi-View Stereo Network With Gaussian Distribution Iteration
    Zhang, Xiaohan
    Li, Shikun
    IEEE ACCESS, 2023, 11 : 53359 - 53372
  • [49] Point-Based Multi-View Stereo Network
    Chen, Rui
    Han, Songfang
    Xu, Jing
    Su, Hao
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1538 - 1547
  • [50] Multi-view Stereo Network with Attention Thin Volume
    Wan, Zihang
    Xu, Chao
    Hu, Jing
    Xiao, Jian
    Meng, Zhaopeng
    Chen, Jitai
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13631 LNCS : 410 - 423