Image depth estimation assisted by multi-view projection

被引:0
|
作者
Liu, Liman [1 ]
Tian, Jinshan [1 ]
Luo, Guansheng [1 ]
Xu, Siyuan [2 ]
Zhang, Chen [2 ]
Hu, Huaifei [1 ]
Tao, Wenbing [2 ]
机构
[1] South Cent Minzu Univ, Sch Biomed Engn, Key Lab Cognit Sci, Hubei Prov Key Lab Med Informat Anal & Tumor Diag, Minzu Rd, Wuhan 430074, Hubei, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Natl Key Lab Sci & Technol Multispectral Informat, Luoyu Rd, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-view projection; Depth estimation; Neural network; Optical flow; BENCHMARK;
D O I
10.1007/s40747-024-01688-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep learning has significantly advanced the development of image depth estimation algorithms. The depth estimation network with single-view input can only extract features from a single 2D image, often neglecting the information contained in neighboring views, resulting in learned features that lack real geometrical information in the 3D world and stricter constraints on the 3D structure, leading to limitations in the performance of image depth estimation. In the absence of accurate camera information, the multi-view geometric cues obtained by some methods may not accurately reflect the real 3D structure, resulting in a lack of multi-view geometric constraints in image depth estimation algorithms. To address this problem, a multi-view projection-assisted image depth estimation network is proposed, which integrates multi-view stereo vision into a deep learning-based encoding-decoding image depth estimation framework without pre-estimation of view bitmap. The network estimates optical flow for pixel-level matching across views, thereby projecting the features of neighboring views to the reference viewpoints for self-attentive feature aggregation, compensating for the lack of stereo geometry information in the image depth estimation framework. Additionally, a multi-view reprojection error is designed for supervised optical flow estimation to effectively constrain the optical flow estimation process. In addition, a long-distance attention decoding module is proposed to achieve effective extraction and aggregation of features in distant areas of the scene, which enhances the perception capability for outdoor long-distance. Experimental results on the KITTI dataset, vKITTI dataset, and SeasonDepth dataset demonstrate that our method achieves significant improvements compared to other state-of-the-art depth estimation techniques. This confirms its superior performance in image depth estimation.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Learning Descriptor, Confidence, and Depth Estimation in Multi-view Stereo
    Choi, Sungil
    Kim, Seungryong
    Park, Kihong
    Sohn, Kwanghoon
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 389 - 395
  • [22] A semi-automatic multi-view depth estimation method
    Wildeboer, Meindert Onno
    Fukushima, Norishige
    Yendo, Tomohiro
    Tehrani, Mehrdad Panahpour
    Fujii, Toshiaki
    Tanimoto, Masayuki
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2010, 2010, 7744
  • [23] Context-Guided Multi-view Stereo with Depth Back-Projection
    Feng, Tianxing
    Zhang, Zhe
    Xiong, Kaiqiang
    Wang, Ronggang
    MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 91 - 102
  • [24] An effective epipolar geometry assisted motion estimation technique for multi-view image and video coding
    Lu, Jiangbo
    Cai, Hua
    Lou, Jian-Guang
    Li, Jiang
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 1089 - +
  • [25] Uniform Projection for Multi-View Learning
    Zhang, Zhenyue
    Zhai, Zheng
    Li, Limin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (08) : 1675 - 1689
  • [26] A multi-view video coding approach using layered depth image
    Cheng, Xiaoyu
    Sun, Lifeng
    Yang, Shiqiang
    2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 143 - 146
  • [27] Range-Agnostic Multi-View Depth Estimation with Keyframe Selection
    Conti, Andrea
    Poggi, Matteo
    Cambareri, Valerio
    Mattoccia, Stefano
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1350 - 1359
  • [28] Unsupervised Multi-View Constrained Convolutional Network for Accurate Depth Estimation
    Zhang, Yuyang
    Xu, Shibiao
    Wu, Baoyuan
    Shi, Jian
    Meng, Weiliang
    Zhang, Xiaopeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7019 - 7031
  • [29] IAFMVS: Iterative Depth Estimation with Adaptive Features for Multi-View Stereo
    Zhao, Guyu
    Wei, Huyixin
    He, Hongdou
    NEUROCOMPUTING, 2025, 629
  • [30] Expansion-Based Depth Map Estimation for Multi-View Stereo
    Song, Peng
    Wu, Xiaojun
    Wang, Michael Yu
    Wu, Jianhuang
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 3213 - 3218