Image depth estimation assisted by multi-view projection

被引:0
|
作者
Liu, Liman [1 ]
Tian, Jinshan [1 ]
Luo, Guansheng [1 ]
Xu, Siyuan [2 ]
Zhang, Chen [2 ]
Hu, Huaifei [1 ]
Tao, Wenbing [2 ]
机构
[1] South Cent Minzu Univ, Sch Biomed Engn, Key Lab Cognit Sci, Hubei Prov Key Lab Med Informat Anal & Tumor Diag, Minzu Rd, Wuhan 430074, Hubei, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Natl Key Lab Sci & Technol Multispectral Informat, Luoyu Rd, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-view projection; Depth estimation; Neural network; Optical flow; BENCHMARK;
D O I
10.1007/s40747-024-01688-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep learning has significantly advanced the development of image depth estimation algorithms. The depth estimation network with single-view input can only extract features from a single 2D image, often neglecting the information contained in neighboring views, resulting in learned features that lack real geometrical information in the 3D world and stricter constraints on the 3D structure, leading to limitations in the performance of image depth estimation. In the absence of accurate camera information, the multi-view geometric cues obtained by some methods may not accurately reflect the real 3D structure, resulting in a lack of multi-view geometric constraints in image depth estimation algorithms. To address this problem, a multi-view projection-assisted image depth estimation network is proposed, which integrates multi-view stereo vision into a deep learning-based encoding-decoding image depth estimation framework without pre-estimation of view bitmap. The network estimates optical flow for pixel-level matching across views, thereby projecting the features of neighboring views to the reference viewpoints for self-attentive feature aggregation, compensating for the lack of stereo geometry information in the image depth estimation framework. Additionally, a multi-view reprojection error is designed for supervised optical flow estimation to effectively constrain the optical flow estimation process. In addition, a long-distance attention decoding module is proposed to achieve effective extraction and aggregation of features in distant areas of the scene, which enhances the perception capability for outdoor long-distance. Experimental results on the KITTI dataset, vKITTI dataset, and SeasonDepth dataset demonstrate that our method achieves significant improvements compared to other state-of-the-art depth estimation techniques. This confirms its superior performance in image depth estimation.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Depth Estimation in Multi-View Stereo Based on Image Pyramid
    Xu, Hanfei
    Cai, Yangang
    Wang, Ronggang
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 345 - 349
  • [2] Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry
    Bae, Gwangbin
    Budvytis, Ignas
    Cipolla, Roberto
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2832 - 2841
  • [3] Are Multi-view Edges Incomplete for Depth Estimation?
    Khan, Numair
    Kim, Min H.
    Tompkin, James
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (07) : 2639 - 2673
  • [4] Continuous Depth Estimation for Multi-view Stereo
    Liu, Yebin
    Cao, Xun
    Dai, Qionghai
    Xu, Wenli
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 2121 - 2128
  • [5] Depth assisted object segmentation in multi-view video
    Cigla, Cevahir
    Alatan, A. Aydin
    2008 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO, 2008, : 165 - 168
  • [6] Adaptive depth estimation for pyramid multi-view stereo
    Liao, Jie
    Fu, Yanping
    Yan, Qingan
    Luo, Fei
    Xiao, Chunxia
    COMPUTERS & GRAPHICS-UK, 2021, 97 : 268 - 278
  • [7] A Benchmark and a Baseline for Robust Multi-view Depth Estimation
    Schroeppel, Philipp
    Bechtold, Jan
    Amiranashvili, Artemij
    Brox, Thomas
    2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 637 - 645
  • [8] Deep Multi-view Depth Estimation with Predicted Uncertainty
    Tong Ke
    Tien Do
    Khiem Vuong
    Sartipi, Kourosh
    Roumeliotis, Stergios, I
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 9235 - 9241
  • [9] Monocular depth estimation with multi-view attention autoencoder
    Jung, Geunho
    Yoon, Sang Min
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33759 - 33770
  • [10] REVISED DEPTH MAP ESTIMATION FOR MULTI-VIEW STEREO
    Yao, Yao
    Zhu, Hao
    Nie, Yongming
    Ji, Xiaoli
    Cao, Xun
    2014 INTERNATIONAL CONFERENCE ON 3D IMAGING (IC3D), 2014,