Image depth estimation assisted by multi-view projection

被引:0
|
作者
Liu, Liman [1 ]
Tian, Jinshan [1 ]
Luo, Guansheng [1 ]
Xu, Siyuan [2 ]
Zhang, Chen [2 ]
Hu, Huaifei [1 ]
Tao, Wenbing [2 ]
机构
[1] South Cent Minzu Univ, Sch Biomed Engn, Key Lab Cognit Sci, Hubei Prov Key Lab Med Informat Anal & Tumor Diag, Minzu Rd, Wuhan 430074, Hubei, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Natl Key Lab Sci & Technol Multispectral Informat, Luoyu Rd, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-view projection; Depth estimation; Neural network; Optical flow; BENCHMARK;
D O I
10.1007/s40747-024-01688-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep learning has significantly advanced the development of image depth estimation algorithms. The depth estimation network with single-view input can only extract features from a single 2D image, often neglecting the information contained in neighboring views, resulting in learned features that lack real geometrical information in the 3D world and stricter constraints on the 3D structure, leading to limitations in the performance of image depth estimation. In the absence of accurate camera information, the multi-view geometric cues obtained by some methods may not accurately reflect the real 3D structure, resulting in a lack of multi-view geometric constraints in image depth estimation algorithms. To address this problem, a multi-view projection-assisted image depth estimation network is proposed, which integrates multi-view stereo vision into a deep learning-based encoding-decoding image depth estimation framework without pre-estimation of view bitmap. The network estimates optical flow for pixel-level matching across views, thereby projecting the features of neighboring views to the reference viewpoints for self-attentive feature aggregation, compensating for the lack of stereo geometry information in the image depth estimation framework. Additionally, a multi-view reprojection error is designed for supervised optical flow estimation to effectively constrain the optical flow estimation process. In addition, a long-distance attention decoding module is proposed to achieve effective extraction and aggregation of features in distant areas of the scene, which enhances the perception capability for outdoor long-distance. Experimental results on the KITTI dataset, vKITTI dataset, and SeasonDepth dataset demonstrate that our method achieves significant improvements compared to other state-of-the-art depth estimation techniques. This confirms its superior performance in image depth estimation.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Differentiable Diffusion for Dense Depth Estimation from Multi-view Images
    Khan, Numair
    Kim, Min H.
    Tompkin, James
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8908 - 8917
  • [32] Fast multi-view disparity estimation for multi-view video systems
    Jiang, Gangyi
    Yu, Mei
    Shao, Feng
    Yang, You
    Dong, Haitao
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2006, 4179 : 493 - 500
  • [33] Segment-based multi-view depth map estimation using belief propagation from dense multi-view video
    Lee, Sang-Beom
    Oh, Kwan-Jung
    Ho, Yo-Sung
    2008 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO, 2008, : 173 - 176
  • [34] Multi-View Depth Estimation by Using Adaptive Point Graph to Fuse Single-View Depth Probabilities
    Wang, Ke
    Liu, Chuhao
    Liu, Zhanwen
    Xiao, Fangwei
    An, Yisheng
    Zhao, Xiangmo
    Shen, Shaojie
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6400 - 6407
  • [35] Multi-view Image Fusion
    Comino Trinidad, Marc
    Martin Brualla, Ricardo
    Kainz, Florian
    Kontkanen, Janne
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4100 - 4109
  • [36] Depth Detection in Multi-View Displays
    Yang-Mao, Shys-Fan
    Lin, Yu-Ting
    Wang, Te-Mei
    Ho, Chia-Hang
    Chen, Hsiao-Wei
    Wu, Chang-Shuo
    IDW/AD '12: PROCEEDINGS OF THE INTERNATIONAL DISPLAY WORKSHOPS, PT 1, 2012, 19 : 627 - 630
  • [37] Unsupervised multi-view stereo network based on multi-stage depth estimation
    Qi, Shuai
    Sang, Xinzhu
    Yan, Binbin
    Wang, Peng
    Chen, Duo
    Wang, Huachun
    Ye, Xiaoqian
    IMAGE AND VISION COMPUTING, 2022, 122
  • [38] Multi-view depth video coding using depth view synthesis
    Na, Sang-Tae
    Oh, Kwan-Jung
    Lee, Cheon
    Ho, Yo-Sung
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 1400 - 1403
  • [39] A cascade network with adaptive depth hypotheses estimation for multi-view stereo and image three-dimensional reconstruction
    Wang, Dong
    Liu, Zhong
    Yue, Haosong
    Wu, Xingming
    Chen, Weihai
    2024 IEEE 19TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ICIEA 2024, 2024,
  • [40] Depth map-based disparity estimation technique using multi-view and depth camera
    Um, Gi-Mun
    Kim, Seung-Man
    Hur, Namho
    Lee, Kwan Hang
    Lee, Soo In
    STEREOSCOPIC DISPLAYS AND VIRTUAL REALITY SYSTEMS XIII, 2006, 6055