Charting the Landscape of Multi-view Stereo: An In-Depth Exploration of Deep Learning Techniques

被引:0
|
作者
Zhou, Zhe [1 ,2 ]
Liu, Xiaozhang [1 ]
Tang, Xiangyan [1 ,2 ]
机构
[1] Hainan Univ, Sch Comp Sci & Technol, Haikou 570228, Hainan, Peoples R China
[2] Hainan Blockchain Technol Engn Res Ctr, Haikou 570228, Hainan, Peoples R China
来源
基金
中国国家自然科学基金; 海南省自然科学基金;
关键词
Multi-view Stereo; Disparity Estimation; 3D Reconstruction; Deep Learning;
D O I
10.1007/978-981-97-4387-2_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Constructing a three-dimensional (3D) representation entails forming a 3D depiction of an object or scene using 2D images or a variety of data sources. It has gained substantial attention due to its versatile applications in virtual reality, augmented reality, medicine, cultural heritage preservation, intelligent transportation, and autonomous driving. Advancements in computational power, deep learning, and sensor technology have markedly improved the quality and efficiency of 3D reconstruction. Modern hardware and software tools have made it more practical. Nevertheless, the interpretability challenges have contributed to the ongoing evolution of deep learning approaches in 3D reconstruction. This paper provides an overview of the latest developments in a prominent 3D reconstruction method, Multi-View Stereo (MVS) deep learning. Notably, the application of 3D reconstruction in security-related scenarios is currently a hot topic. In intelligent transportation and autonomous driving, it enhances traffic safety and navigation accuracy by modeling road and traffic conditions. In medicine, it improves surgical safety through surgical navigation and virtual surgical simulation. In cultural heritage preservation, it aids in digital conservation and artifact security monitoring. The potential of 3D reconstruction in security applications is substantial, promising future research and innovation. In summary, 3D reconstruction holds promise in various application domains, including security. Ongoing advancements in deep learning and related technologies are expected to drive innovative applications in a broader range of security fields.
引用
收藏
页码:152 / 165
页数:14
相关论文
共 50 条
  • [41] Refractive Multi-view Stereo
    Cassidy, Matthew
    Melou, Jean
    Queau, Yvain
    Lauze, Francois
    Durou, Jean-Denis
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 384 - 393
  • [42] Deep Facial Non-Rigid Multi-View Stereo
    Bai, Ziqian
    Cui, Zhaopeng
    Rahim, Jamal Ahmed
    Liu, Xiaoming
    Tan, Ping
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5849 - 5859
  • [43] RobustMVS: Single Domain Generalized Deep Multi-View Stereo
    Xu, Hongbin
    Chen, Weitao
    Sun, Baigui
    Xie, Xuansong
    Kang, Wenxiong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9181 - 9194
  • [44] Polarimetric Multi-View Stereo
    Cui, Zhaopeng
    Gu, Jinwei
    Shi, Boxin
    Tan, Ping
    Kautz, Jan
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 369 - 378
  • [45] Uncertainty-Aware Deep Multi-View Photometric Stereo
    Kaya, Berk
    Kumar, Suryansh
    Oliveira, Carlos
    Ferrari, Vittorio
    Van Gool, Luc
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12591 - 12601
  • [46] Deep Multi-view Depth Estimation with Predicted Uncertainty
    Tong Ke
    Tien Do
    Khiem Vuong
    Sartipi, Kourosh
    Roumeliotis, Stergios, I
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 9235 - 9241
  • [47] Multi-view Stereo by Fusing Monocular and a Combination of Depth Representation Methods
    Yu, Fanqi
    Sun, Xinyang
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 298 - 309
  • [48] Multi-View Stereo: A Tutorial
    Furukawa, Yasutaka
    Hernandez, Carlos
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2013, 9 (1-2): : 1 - 148
  • [49] Cost Volume Pyramid Based Depth Inference for Multi-View Stereo
    Yang, Jiayu
    Mao, Wei
    Alvarez, Jose M.
    Liu, Miaomiao
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4876 - 4885
  • [50] Cost Volume Pyramid Based Depth Inference for Multi-View Stereo
    Yang, Jiayu
    Mao, Wei
    Alvarez, Jose
    Liu, Miaomiao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 4748 - 4760