Deep learning based multi-view stereo matching and 3D scene reconstruction from oblique aerial images

被引:24
|
作者
Liu, Jin [1 ]
Gao, Jian [1 ]
Ji, Shunping [1 ]
Zeng, Chang [1 ]
Zhang, Shaoyi [1 ]
Gong, Jianya [1 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
3D scene reconstruction; Multi-view stereo; Oblique aerial images; Deep learning; Dense image matching;
D O I
10.1016/j.isprsjprs.2023.08.015
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
In this paper, we propose a practical three-dimensional (3D) real-scene reconstruction framework named Deep3D, which is paired with a deep learning based multi-view stereo (MVS) matching model named the adaptive multi-view aggregation matching (Ada-MVS) model, to obtain a 3D textured mesh model from multi view oblique aerial images. Deep3D is the first deep learning based framework for 3D scene reconstruction, in which aerial triangulation and view selection are first performed on the input images, and the depth map of each image is then inferred using the pretrained Ada-MVS model. All the inferred depth maps are then fused into a dense point cloud after filtering the outliers. Finally, the 3D textured mesh is extracted from the dense 3D points as the final product. In the Ada-MVS model, a novel adaptive inter-view aggregation module is specially proposed to address the inconsistent information among oblique views and to fuse the multi-view costs into a robust cost volume. A lightweight recurrent regularization module is also designed for high-efficiency processing of high-capacity aerial images with large depth variations. Moreover, as oblique aerial image datasets are currently lacking, we built a large-scale synthetic multi-view oblique aerial image dataset (WHU-OMVS dataset) for deep learning based model training and methodology evaluation for the task of 3D scene reconstruction. The experimental results show that, firstly, the proposed Ada-MVS model has obvious advantages when used with high capacity oblique aerial images, compared with several relevant learning-based MVS methods. Secondly, through a comprehensive comparison with popular commercial software packages and open-source solutions, it is shown that the proposed Deep3D framework outperforms all the other solutions in terms of reconstruction quality, and outperforms all the open-source solutions and some of the software packages in terms of efficiency on the WHU-OMVS dataset. Thirdly, the Deep3D framework shows a stable generalization ability and excellent performance when applied to other oblique or nadir aerial images, without any further fine-tuning. The dataset and code will be available at http://gpcv.whu.edu.cn/data.
引用
收藏
页码:42 / 60
页数:19
相关论文
共 50 条
  • [31] 3D Clothed Human Reconstruction from Sparse Multi-View Images
    Hong, Jin Gyu
    Noh, Seung Young
    Lee, Hee Kyung
    Cheong, Won Sik
    Chang, Ju Yong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 677 - 687
  • [32] Multi-view stereo reconstruction and scene flow estimation with a global image-based matching score
    Pons, Jean-Philippe
    Keriven, Renaud
    Faugeras, Olivier
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 72 (02) : 179 - 193
  • [33] View Planning for Multi-View Stereo 3D Reconstruction Using an Autonomous Multicopter
    Korbinian Schmid
    Heiko Hirschmüller
    Andreas Dömel
    Iris Grixa
    Michael Suppa
    Gerd Hirzinger
    Journal of Intelligent & Robotic Systems, 2012, 65 : 309 - 323
  • [34] Multi-View Stereo Reconstruction and Scene Flow Estimation with a Global Image-Based Matching Score
    Jean-Philippe Pons
    Renaud Keriven
    Olivier Faugeras
    International Journal of Computer Vision, 2007, 72 : 179 - 193
  • [35] The Construction Method of Measurable Aerial Panorama Based on Panoramic Image and Multi-view Oblique Images Matching
    Hu, Datian
    Wang, Yue
    Hu, Qingwu
    Hu, Wei
    2016 4RTH INTERNATIONAL WORKSHOP ON EARTH OBSERVATION AND REMOTE SENSING APPLICATIONS (EORSA), 2016,
  • [36] View Planning for Multi-View Stereo 3D Reconstruction Using an Autonomous Multicopter
    Schmid, Korbinian
    Hirschmueller, Heiko
    Doemel, Andreas
    Grixa, Iris
    Suppa, Michael
    Hirzinger, Gerd
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2012, 65 (1-4) : 309 - 323
  • [37] 3D Reconstruction from Multi-view Google Earth Satellite Stereo Images by Generating Virtual RPC based on 3D Homography-based Georeferencing
    Seo, D. U.
    Park, S. Y.
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 1075 - 1080
  • [38] REPRESENTATION LEARNING OF VERTEX HEATMAPS FOR 3D HUMAN MESH RECONSTRUCTION FROM MULTI-VIEW IMAGES
    Chun, Sungho
    Park, Sungbum
    Chang, Ju Yong
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 670 - 674
  • [39] GEMVS: a novel approach for automatic 3D reconstruction from uncalibrated multi-view Google Earth images using multi-view stereo and projective to metric 3D homography transformation
    Park, Soon-Yong
    Seo, DongUk
    Lee, Min-Jae
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (09) : 3005 - 3030
  • [40] DETransMVSnet: Research on Terahertz 3D Reconstruction of Multi-View Stereo Network With Deep Equilibrium Transformers
    Bai, Fan
    Li, Lun
    Wang, Wencheng
    Wu, Xiaojin
    IEEE ACCESS, 2023, 11 : 146042 - 146053