Deep Pruner and Adaptive Cost Volume Multiview Stereo Network for 3D Reconstruction

被引:0
|
作者
Jamshid, Junaid [1 ,2 ,3 ]
Wanggen, Wan [1 ,2 ]
Shahzad, Khurram [3 ]
Muzahid, A. A. M. [4 ]
Kang, Yuan [1 ,2 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Inst Smart City, Shanghai 200444, Peoples R China
[3] ILMA Univ, Fac Sci & Technol, Karachi 74900, Pakistan
[4] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai 201620, Peoples R China
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Costs; Feature extraction; Three-dimensional displays; Accuracy; Image reconstruction; Solid modeling; Computational modeling; Adaptation models; Memory management; Surface texture; Aggregated cost volume; feature network; pruning; memory efficient; 3D reconstruction; MVSNET;
D O I
10.1109/ACCESS.2025.3535616
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reconstructing three-dimensional (3D) images is imperative in computer vision because it assists in restoring the 3D structure of a scene. However, challenges like accurate matching in low-texture and reflective areas, along with inefficient feature extraction, degrade 3D reconstruction quality and increase computational complexity. To address these challenges, we propose a robust multi-view stereo network, DPrun-RMVSNet, designed to enhance matching in occluded regions and improve feature extraction for texture-less and reflective surfaces. Our model incorporates a recurrent neural network (RNN) with long-short-term memory (LSTM) to handle depth interference. The feature network captures essential information about the image content, such as edges, textures, and corners. To reduce computational costs, we introduce a novel deep pruner feature network (DPF) with an adaptive cost volume, enabling efficient and accurate 3D model creation. The proposed model was trained using the public DTU dataset and evaluated on two benchmark datasets including DTU, and Tank and Temple. Additionally, we conduct an ablation study to assess the impact of the proposed methods, offering both quantitative and qualitative evaluations to validate the model's effectiveness. Experimental results show that our model improves state-of-the-art (SOTA) approaches, achieving better reconstruction accuracy while using less execution time and memory.
引用
收藏
页码:28777 / 28788
页数:12
相关论文
共 50 条
  • [1] Multiview Stereo High-Completeness Network for 3D Reconstruction
    Yu, Cuihong
    Han, Cheng
    Yang, Zhengquan
    Zhang, Chao
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2024, 68 (05)
  • [2] AN EVALUATION OF STEREO AND MULTIVIEW ALGORITHMS FOR 3D RECONSTRUCTION WITH SYNTHETIC DATA
    Reyes, M. Fuentes
    d'Angelo, P.
    Fraundorfer, F.
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 1021 - 1028
  • [3] Attention aware cost volume pyramid based multi-view stereo network for 3D reconstruction
    Yu, Anzhu
    Guo, Wenyue
    Liu, Bing
    Chen, Xin
    Wang, Xin
    Cao, Xuefeng
    Jiang, Bingchuan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 175 : 448 - 460
  • [4] Fusing Multiview and Photometric Stereo for 3D Reconstruction under Uncalibrated Illumination
    Wu, Chenglei
    Liu, Yebin
    Dai, Qionghai
    Wilburn, Bennett
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (08) : 1082 - 1095
  • [5] 3D LiDAR and Stereo Fusion using Stereo Matching Network with Conditional Cost Volume Normalization
    Wang, Tsun-Hsuan
    Hu, Hou-Ning
    Lin, Chieh Hubert
    Tsai, Yi-Hsuan
    Chiu, Wei-Chen
    Sun, Min
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 5895 - 5902
  • [6] Multiview 3D reconstruction in geosciences
    Favalli, M.
    Fornaciai, A.
    Isola, I.
    Tarquini, S.
    Nannipieri, L.
    COMPUTERS & GEOSCIENCES, 2012, 44 : 168 - 176
  • [7] Quasi-Dense 3D Reconstruction using Tensor-based Multiview Stereo
    Wu, Tai-Pang
    Yeung, Sai-Kit
    Jia, Jiaya
    Tang, Chi-Keung
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1482 - 1489
  • [8] Adaptive Unimodal Cost Volume Filtering for Deep Stereo Matching
    Zhang, Youmin
    Chen, Yimin
    Bai, Xiao
    Yu, Suihanjin
    Yu, Kun
    Li, Zhiwei
    Yang, Kuiyuan
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12926 - 12934
  • [9] Multiview Compressive Coding for 3D Reconstruction
    Wu, Chao-Yuan
    Johnson, Justin
    Malik, Jitendra
    Feichtenhofer, Christoph
    Gkioxari, Georgia
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9065 - 9075
  • [10] MVS-Pheno: A Portable and Low-Cost Phenotyping Platform for Maize Shoots Using Multiview Stereo 3D Reconstruction
    Wu, Sheng
    Wen, Weiliang
    Wang, Yongjian
    Fan, Jiangchuan
    Wang, Chuanyu
    Gou, Wenbo
    Guo, Xinyu
    PLANT PHENOMICS, 2020, 2020