Image depth estimation assisted by multi-view projection

被引:0
|
作者
Liu, Liman [1 ]
Tian, Jinshan [1 ]
Luo, Guansheng [1 ]
Xu, Siyuan [2 ]
Zhang, Chen [2 ]
Hu, Huaifei [1 ]
Tao, Wenbing [2 ]
机构
[1] South Cent Minzu Univ, Sch Biomed Engn, Key Lab Cognit Sci, Hubei Prov Key Lab Med Informat Anal & Tumor Diag, Minzu Rd, Wuhan 430074, Hubei, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Natl Key Lab Sci & Technol Multispectral Informat, Luoyu Rd, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-view projection; Depth estimation; Neural network; Optical flow; BENCHMARK;
D O I
10.1007/s40747-024-01688-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep learning has significantly advanced the development of image depth estimation algorithms. The depth estimation network with single-view input can only extract features from a single 2D image, often neglecting the information contained in neighboring views, resulting in learned features that lack real geometrical information in the 3D world and stricter constraints on the 3D structure, leading to limitations in the performance of image depth estimation. In the absence of accurate camera information, the multi-view geometric cues obtained by some methods may not accurately reflect the real 3D structure, resulting in a lack of multi-view geometric constraints in image depth estimation algorithms. To address this problem, a multi-view projection-assisted image depth estimation network is proposed, which integrates multi-view stereo vision into a deep learning-based encoding-decoding image depth estimation framework without pre-estimation of view bitmap. The network estimates optical flow for pixel-level matching across views, thereby projecting the features of neighboring views to the reference viewpoints for self-attentive feature aggregation, compensating for the lack of stereo geometry information in the image depth estimation framework. Additionally, a multi-view reprojection error is designed for supervised optical flow estimation to effectively constrain the optical flow estimation process. In addition, a long-distance attention decoding module is proposed to achieve effective extraction and aggregation of features in distant areas of the scene, which enhances the perception capability for outdoor long-distance. Experimental results on the KITTI dataset, vKITTI dataset, and SeasonDepth dataset demonstrate that our method achieves significant improvements compared to other state-of-the-art depth estimation techniques. This confirms its superior performance in image depth estimation.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Multi-view Depth Estimation with Adaptive Feature Extraction and Region-Aware Depth Prediction
    Zhang, Chi
    Li, Lingyu
    Zhou, Jijun
    Xu, Yong
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VI, 2025, 15036 : 32 - 45
  • [42] Single-View and Multi-View Depth Fusion
    Facil, Jose M.
    Concha, Alejo
    Montesano, Luis
    Civera, Javier
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (04): : 1994 - 2001
  • [43] VIEW-CONSISTENT MULTI-VIEW DEPTH ESTIMATION FOR THREE-DIMENSIONAL VIDEO GENERATION
    Lee, Sang-Beom
    Ho, Yo-Sung
    2010 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON 2010), 2010,
  • [44] IMPROVED MULTI-VIEW DEPTH ESTIMATION FOR VIEW SYNTHESIS IN 3D VIDEO CODING
    Zhang, Qiuwen
    An, Ping
    Zhang, Yan
    Shen, Liquan
    Zhang, Zhaoyang
    2011 3DTV CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON), 2011,
  • [45] REAL-TIME UNSUPERVISED MULTI-VIEW DEPTH ESTIMATION NETWORK FOR VIRTUAL VIEW SYNTHESIS
    Qiu, Ke
    Gu, Song
    Liu, Shiyi
    Lai, Yawen
    Cai, Yangang
    Wang, Ronggang
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [46] Enhancements of representation and interactivity for multi-view video based on layered depth image
    Xiaoyu Cheng
    Lifeng Sun
    Shiqiang Yang
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 4309 - +
  • [47] Edge-Aware Spatial Propagation Network for Multi-view Depth Estimation
    Siyuan Xu
    Qingshan Xu
    Wanjuan Su
    Wenbing Tao
    Neural Processing Letters, 2023, 55 : 10905 - 10923
  • [48] Global depth estimation for multi-view video coding using camera parameters
    Zhang, Xiaoyun
    Zhu, Weile
    Yang, George
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2008, : 631 - +
  • [49] MULTI-VIEW WIDE BASELINE DEPTH ESTIMATION ROBUST TO SPARSE INPUT SAMPLING
    Jorissen, Lode
    Goorts, Patrik
    Lafruit, Gauthier
    Bekaert, Philippe
    2016 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON), 2016,
  • [50] Integration of colour and affine invariant feature for multi-view depth video estimation
    Zuo, Y.
    An, P.
    Shen, L.
    Li, C.
    Ma, R.
    IMAGING SCIENCE JOURNAL, 2016, 64 (06): : 313 - 320