Deep Pruner and Adaptive Cost Volume Multiview Stereo Network for 3D Reconstruction

被引:0
|
作者
Jamshid, Junaid [1 ,2 ,3 ]
Wanggen, Wan [1 ,2 ]
Shahzad, Khurram [3 ]
Muzahid, A. A. M. [4 ]
Kang, Yuan [1 ,2 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Inst Smart City, Shanghai 200444, Peoples R China
[3] ILMA Univ, Fac Sci & Technol, Karachi 74900, Pakistan
[4] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai 201620, Peoples R China
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Costs; Feature extraction; Three-dimensional displays; Accuracy; Image reconstruction; Solid modeling; Computational modeling; Adaptation models; Memory management; Surface texture; Aggregated cost volume; feature network; pruning; memory efficient; 3D reconstruction; MVSNET;
D O I
10.1109/ACCESS.2025.3535616
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reconstructing three-dimensional (3D) images is imperative in computer vision because it assists in restoring the 3D structure of a scene. However, challenges like accurate matching in low-texture and reflective areas, along with inefficient feature extraction, degrade 3D reconstruction quality and increase computational complexity. To address these challenges, we propose a robust multi-view stereo network, DPrun-RMVSNet, designed to enhance matching in occluded regions and improve feature extraction for texture-less and reflective surfaces. Our model incorporates a recurrent neural network (RNN) with long-short-term memory (LSTM) to handle depth interference. The feature network captures essential information about the image content, such as edges, textures, and corners. To reduce computational costs, we introduce a novel deep pruner feature network (DPF) with an adaptive cost volume, enabling efficient and accurate 3D model creation. The proposed model was trained using the public DTU dataset and evaluated on two benchmark datasets including DTU, and Tank and Temple. Additionally, we conduct an ablation study to assess the impact of the proposed methods, offering both quantitative and qualitative evaluations to validate the model's effectiveness. Experimental results show that our model improves state-of-the-art (SOTA) approaches, achieving better reconstruction accuracy while using less execution time and memory.
引用
收藏
页码:28777 / 28788
页数:12
相关论文
共 50 条
  • [21] Continuous global optimization in multiview 3D reconstruction
    Kolev, Kalin
    Klodt, Maria
    Brox, Thomas
    Esedoglu, Selim
    Cremers, Daniel
    ENERGY MINIMIZATION METHODS IN COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 2007, 4679 : 441 - +
  • [22] A multiview 3D modeling system based on stereo vision techniques
    Park, SY
    Subbarao, M
    MACHINE VISION AND APPLICATIONS, 2005, 16 (03) : 148 - 156
  • [23] On adjustment of stereo parameters in multiview synthesis for planar 3D displays
    Li, Dongxiao
    Qiao, Xiaotian
    Zang, Dongning
    Wang, Lianghao
    Zhang, Ming
    JOURNAL OF THE SOCIETY FOR INFORMATION DISPLAY, 2015, 23 (10) : 491 - 502
  • [24] A multiview 3D modeling system based on stereo vision techniques
    Soon-Yong Park
    Murali Subbarao
    Machine Vision and Applications, 2005, 16 : 148 - 156
  • [25] Adaptive QoS framework for multiview 3D streaming
    Kim, JR
    Won, YJ
    Iwadate, Y
    COMPUTATIONAL SCIENCE - ICCS 2004, PT 1, PROCEEDINGS, 2004, 3036 : 519 - 522
  • [26] STEREO-IMAGING NETWORK DESIGN FOR PRECISE AND DENSE 3D RECONSTRUCTION
    Ahmadabadian, Ali Hosseininaveh
    Robson, Stuart
    Boehm, Jan
    Shortis, Mark
    PHOTOGRAMMETRIC RECORD, 2014, 29 (147): : 317 - 336
  • [27] Neural-network-based photometric stereo for 3D surface reconstruction
    Cheng, Wen-Chang
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 404 - 410
  • [28] OrangeStereo: A navel orange stereo matching network for 3D surface reconstruction
    Gao, Yuan
    Wang, Qingyu
    Rao, Xiuqin
    Xie, Lijuan
    Ying, Yibin
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 217
  • [29] Stereo Matching for 3D Building Reconstruction
    Gupta, Gaurav
    Balasubramanian, R.
    Rawat, M. S.
    Bhargava, R.
    Krishna, B. Gopala
    ADVANCES IN COMPUTING, COMMUNICATION AND CONTROL, 2011, 125 : 522 - +
  • [30] 3D Shape Estimation of Multiview RGB Images from Deep Convolutional Network
    Han B.-K.
    Park J.
    Seo H.
    Song S.-H.
    Journal of Institute of Control, Robotics and Systems, 2022, 28 (07): : 671 - 677