Depth-Guided Aggregation for Real-Time Binocular Depth Estimation Network

被引:0
|
作者
Fu, Dongxin [1 ]
Zheng, Shaowu [1 ]
Xie, Pengcheng [1 ]
Li, Weihua [1 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
关键词
Costs; Estimation; Feature extraction; Three-dimensional displays; Convolution; Real-time systems; Data mining; Cameras;
D O I
10.1109/MMUL.2024.3395695
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Using binocular cameras to obtain depth information of target pixels offers a cost-effective and natural alternative to lidar systems. However, most of the current binocular depth estimation networks have difficulty achieving a better balance between speed and accuracy in real-world situations, and their prediction accuracy for long-range depth is often limited. In this article, we introduce the end-to-end real-time depth estimation network (RTDENet), which efficiently utilizes multiscale cost volumes for improved performance. We propose an efficient and flexible cost aggregation module that supplements residual information with high-resolution cost volumes. By replacing some computationally demanding 3-D convolutional layers with depth-guided excitation, we maintain accuracy while effectively controlling model computation. Alongside the distance-sensitive loss function, RTDENet achieves a global difference of 2.41 m and an inference time of 27 ms on the KITTI Stereo dataset. This balance of speed and accuracy outperforms other state-of-the-art algorithms in depth estimation tasks.
引用
收藏
页码:36 / 47
页数:12
相关论文
共 50 条
  • [31] Exploiting temporal consistency for real-time video depth estimation
    Zhang, Haokui
    Shen, Chunhua
    Li, Ying
    Cao, Yuanzhouhan
    Liu, Yu
    Yan, Youliang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1725 - 1734
  • [32] Real-Time Depth Estimation from a Monocular Moving Camera
    Handa, Aniket
    Sharma, Prateek
    CONTEMPORARY COMPUTING, 2012, 306 : 494 - 495
  • [33] Depth-guided deep filtering network for efficient single image bokeh rendering
    Quan Chen
    Bolun Zheng
    Xiaofei Zhou
    Aiai Huang
    Yaoqi Sun
    Chuqiao Chen
    Chenggang Yan
    Shanxin Yuan
    Neural Computing and Applications, 2023, 35 : 20869 - 20887
  • [34] Towards Real-Time Monocular Depth Estimation For Mobile Systems
    Deldjoo, Yashar
    Di Noia, Tommaso
    Di Sciascio, Eugenio
    Pernisco, Gaetano
    Reno, Vito
    Stella, Ettore
    MULTIMODAL SENSING AND ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS II, 2021, 11785
  • [35] A Real-Time Depth Estimation Approach for a Focused Plenoptic Camera
    Vasko, Ross
    Zeller, Niclas
    Quint, Franz
    Stilla, Uwe
    ADVANCES IN VISUAL COMPUTING, PT II (ISVC 2015), 2015, 9475 : 70 - 80
  • [36] Real-time Monocular Depth Estimation with Sparse Supervision on Mobile
    Yucel, Mehmet Kerim
    Dimaridou, Valia
    Drosou, Anastasios
    Saa-Garriga, Albert
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2428 - 2437
  • [37] Real-time monocular depth estimation with adaptive receptive fields
    Zhenyan Ji
    Xiaojun Song
    Xiaoxuan Guo
    Fangshi Wang
    José Enrique Armendáriz-Iñigo
    Journal of Real-Time Image Processing, 2021, 18 : 1369 - 1381
  • [38] FPGA Implementation of Full HD Real-time Depth Estimation
    Li, Hejian
    An, Ping
    Teng, Guowei
    Zhang, Zhaoyang
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS BERLIN (ICCE-BERLIN), 2014, : 249 - 253
  • [39] Towards real-time unsupervised monocular depth estimation on CPU
    Poggi, Matteo
    Aleotti, Filippo
    Tosi, Fabio
    Mattoccia, Stefano
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 5848 - 5854
  • [40] Real-time monocular depth estimation with adaptive receptive fields
    Ji, Zhenyan
    Song, Xiaojun
    Guo, Xiaoxuan
    Wang, Fangshi
    Armendariz-Inigo, Jose Enrique
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (04) : 1369 - 1381