DSC-MVSNet: attention aware cost volume regularization based on depthwise separable convolution for multi-view stereo

Authors
Song Zhang
Zhiwei Wei
Wenjia Xu
Lili Zhang
Yang Wang
Xin Zhou
Junyi Liu
Affiliations
[1] Chinese Academy of Sciences, Aerospace Information Research Institute
[2] Chinese Academy of Sciences, Key Laboratory of Network Information System Technology (NIST), Institute of Electronics
[3] University of Chinese Academy of Sciences, School of Electronic, Electrical and Communication Engineering
[4] University of Chinese Academy of Sciences, State Key Laboratory of Networking and Switching Technology
[5] Beijing University of Posts and Telecommunications
Source
Complex & Intelligent Systems, 2023, 9 (06)
Keywords
Multi-view stereo; Depth estimation; DSC-MVSNet
DOI
Not available
Abstract
Deep learning has recently been shown to deliver excellent performance in multi-view stereo (MVS). However, deep learning-based MVS approaches struggle to balance efficiency and effectiveness. To this end, we propose DSC-MVSNet, a novel coarse-to-fine, end-to-end framework for more efficient and more accurate depth estimation in MVS. In particular, we propose an attention-aware 3D UNet-shaped network, which is the first to use depthwise separable convolutions for cost volume regularization. This mechanism enables effective aggregation of information and significantly reduces the model parameters and computation by factoring the ordinary convolution on the cost volume into a depthwise convolution and a pointwise convolution. In addition, a 3D-Attention module is proposed to alleviate the feature mismatching problem in cost volume regularization and to aggregate the important information of the cost volume in three dimensions (i.e. channel, space, and depth). Moreover, we propose an efficient Feature Transfer Module that upsamples the low-resolution (LR) depth map to a high-resolution (HR) depth map to achieve higher accuracy. With extensive experiments on two benchmark datasets, i.e. DTU and Tanks & Temples, we demonstrate that the parameters of our model are significantly reduced to 25% of the state-of-the-art model MVSNet. In addition, our method outperforms or maintains on-par accuracy with the state-of-the-art models. Our source code is available at https://github.com/zs670980918/DSC-MVSNet.
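The parameter saving that the abstract attributes to depthwise separable convolution can be illustrated with simple weight counting. The sketch below is not taken from the DSC-MVSNet code; the function names and the channel and kernel sizes (8 input channels, 16 output channels, 3x3x3 kernel) are assumptions chosen only for illustration, and biases are ignored.

```python
def conv3d_params(c_in, c_out, k):
    """Weight count of an ordinary 3D convolution with a k x k x k kernel."""
    return c_in * c_out * k ** 3


def separable_conv3d_params(c_in, c_out, k):
    """Weight count of a depthwise separable 3D convolution.

    Depthwise step: one k x k x k filter per input channel.
    Pointwise step: a 1 x 1 x 1 convolution that mixes channels.
    """
    depthwise = c_in * k ** 3
    pointwise = c_in * c_out
    return depthwise + pointwise


# Hypothetical layer sizes, not values from the paper.
standard = conv3d_params(8, 16, 3)
separable = separable_conv3d_params(8, 16, 3)
ratio = separable / standard

print(standard, separable, round(ratio, 4))
```

For a single layer the ratio works out to 1/c_out + 1/k^3, so the saving grows with the number of output channels and the kernel size. This per-layer ratio is not the same quantity as the whole-model 25% figure reported in the abstract, which also depends on the parts of the network that are not replaced.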
Pages: 6953 - 6969
Page count: 16
Related papers
49 in total
  • [1] DSC-MVSNet: attention aware cost volume regularization based on depthwise separable convolution for multi-view stereo
    Zhang, Song
    Wei, Zhiwei
    Xu, Wenjia
    Zhang, Lili
    Wang, Yang
    Zhou, Xin
    Liu, Junyi
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (06) : 6953 - 6969
  • [2] ATLAS-MVSNet: Attention Layers for Feature Extraction and Cost Volume Regularization in Multi-View Stereo
    Weilharter, Rafael
    Fraundorfer, Friedrich
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3557 - 3563
  • [3] Multi-View Stereo Representation Revist: Region-Aware MVSNet
    Zhang, Yisu
    Zhu, Jianke
    Lin, Lixiang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17376 - 17385
  • [4] Attention aware cost volume pyramid based multi-view stereo network for 3D reconstruction
    Yu, Anzhu
    Guo, Wenyue
    Liu, Bing
    Chen, Xin
    Wang, Xin
    Cao, Xuefeng
    Jiang, Bingchuan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 175 : 448 - 460
  • [5] Attention-Aware Multi-View Stereo
    Luo, Keyang
    Guan, Tao
    Ju, Lili
    Wang, Yuesong
    Chen, Zhuo
    Luo, Yawei
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1587 - 1596
  • [6] Vis-MVSNet: Visibility-Aware Multi-view Stereo Network
    Zhang, Jingyang
    Li, Shiwei
    Luo, Zixin
    Fang, Tian
    Yao, Yao
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (01) : 199 - 214
  • [8] DAR-MVSNet: a novel dual attention residual network for multi-view stereo
    Li, Tingshuai
    Liang, Hu
    Wen, Changchun
    Qu, Jiacheng
    Zhao, Shengrong
    Zhang, Qingmeng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (8-9) : 5857 - 5866
  • [9] Attention-enhanced multi-source cost volume multi-view stereo
    Wang, Yucan
    Wang, Zhenzhen
    Tian, Hui
    Song, Yifan
    Cao, Yangjie
    Wei, Ronghan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
  • [10] MVSNet++: Learning Depth-Based Attention Pyramid Features for Multi-View Stereo
    Chen, Po-Heng
    Yang, Hsiao-Chien
    Chen, Kuan-Wen
    Chen, Yong-Sheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7261 - 7273