SA-MVSNet: Self-attention-based multi-view stereo network for 3D reconstruction of images with weak texture

被引:3
|
作者
Yang, Ronghao [1 ]
Miao, Wang [1 ]
Zhang, Zhenxin [2 ,3 ]
Liu, Zhenlong [1 ]
Li, Mubai [2 ,3 ]
Lin, Bin [1 ]
机构
[1] Chengdu Univ Technol, Coll Earth Sci, Chengdu 610059, Sichuan, Peoples R China
[2] Capital Normal Univ, Key Lab 3D Informat Acquisit & Applicat, MOE, Beijing 100048, Peoples R China
[3] Capital Normal Univ, Coll Resource Environm & Tourism, Beijing 100048, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Multi-view stereo; Depth estimation; Self-attention; Transformer; Weak texture; Adaptive propagation;
D O I
10.1016/j.engappai.2023.107800
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-view stereo (MVS) reconstruction is a key task of image-based 3D reconstruction, and deep learning-based methods can achieve better results than traditional algorithms. However, most of the current deep learning-based MVS methods use convolutional neural networks (CNNs) to extract image features, which cannot achieve the aggregation of long-distance context information and capture robust global information. In addition, in the process of fusing depth maps into point clouds, the confidence filters will filter out the depth values with low confidence in weak texture areas. These problems will lead to the low completeness of 3D reconstruction of weak texture and texture-less areas. To address the above problems, this paper proposes SA-MVSNet based on the PatchmatchNet with a self-attentive mechanism. First, we design a coarse-to-fine network framework to advance depth map estimation. In the feature extraction network, a module with a pyramid structure based on Swin Transformer Block is used to replace the original Feature Pyramid Network (FPN), and the self-correlation between weak texture areas is enhanced by applying a global self-attention mechanism. Then, we also propose a self-attention-based adaptive propagation module (SA-AP), which applies a self-attention calculation within depth value propagation window to obtain the relative weight values of current pixel and others, and then adaptively samples the depth values of neighbors on the same surface for propagation. Experiments show that SA-MVSNet has significantly improved the completeness of 3D reconstruction for the images with weak texture on DTU (provided by Danish Technical University), BlendedMVS, and Tanks and Temple datasets. Our code is available at https://github.com/miaowang525/SA-MVSNet.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Combining Photometric Normals and Multi-View Stereo for 3D Reconstruction
    Grochulla, Martin
    Thormaehlen, Thorsten
    CVMP 2015: PROCEEDINGS OF THE 12TH EUROPEAN CONFERENCE ON VISUAL MEDIA PRODUCTION, 2015,
  • [22] PlaneMVS: 3D Plane Reconstruction from Multi-View Stereo
    Liu, Jiachen
    Ji, Pan
    Bansal, Nitin
    Cai, Changjiang
    Yan, Qingan
    Huang, Xiaolei
    Xu, Yi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8655 - 8665
  • [23] Improvement on Matching Breakage of Multi-View Stereo 3D Reconstruction
    Lin, Hung-Lin
    Lin, Tsung-Yi
    Li, Yi-Xuan
    Tseng, Yu-Sheng
    Li, Xin-Yi
    Cal, Qlan-Wen
    Chen, Zheng
    Shi, Yi-Rou
    PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS FOR SCIENCE AND ENGINEERING (IEEE-ICAMSE 2016), 2016, : 423 - 425
  • [24] Multi-view stereo for weakly textured indoor 3D reconstruction
    Wang, Tao
    Gan, Vincent J. L.
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (10) : 1469 - 1489
  • [25] Pruning multi-view stereo net for efficient 3D reconstruction
    Xiang, Xiang
    Wang, Zhiyuan
    Lao, Shanshan
    Zhang, Baochang
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 168 (168) : 17 - 27
  • [26] Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction
    Orsingher, Marco
    Zani, Paolo
    Medici, Paolo
    Bertozzi, Massimo
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 190 - 196
  • [27] Enhanced multi view 3D reconstruction with improved MVSNet
    Li, Guangchen
    Li, Kefeng
    Zhang, Guangyuan
    Zhu, Zhenfang
    Wang, Peng
    Wang, Zhenfei
    Fu, Chen
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [28] Enhanced multi view 3D reconstruction with improved MVSNet
    Guangchen Li
    Kefeng Li
    Guangyuan Zhang
    Zhenfang Zhu
    Peng Wang
    Zhenfei Wang
    Chen Fu
    Scientific Reports, 14 (1)
  • [29] Multi-View Images 3D Reconstruction based on Spatial Geometric Constraint
    Liu, Haibo
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 1217 - 1220
  • [30] HC-MVSNet: A probability sampling-based multi-view-stereo network with hybrid cascade structure for 3D reconstruction
    Gao, Tianxiang
    Hong, Zijian
    Tan, Yixing
    Sun, Lizhuo
    Wei, Yichen
    Ma, Jianwei
    PATTERN RECOGNITION LETTERS, 2024, 185 : 59 - 65