Depth-Guided Aggregation for Real-Time Binocular Depth Estimation Network

被引:0
|
作者
Fu, Dongxin [1 ]
Zheng, Shaowu [1 ]
Xie, Pengcheng [1 ]
Li, Weihua [1 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
关键词
Costs; Estimation; Feature extraction; Three-dimensional displays; Convolution; Real-time systems; Data mining; Cameras;
D O I
10.1109/MMUL.2024.3395695
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Using binocular cameras to obtain depth information of target pixels offers a cost-effective and natural alternative to lidar systems. However, most of the current binocular depth estimation networks have difficulty achieving a better balance between speed and accuracy in real-world situations, and their prediction accuracy for long-range depth is often limited. In this article, we introduce the end-to-end real-time depth estimation network (RTDENet), which efficiently utilizes multiscale cost volumes for improved performance. We propose an efficient and flexible cost aggregation module that supplements residual information with high-resolution cost volumes. By replacing some computationally demanding 3-D convolutional layers with depth-guided excitation, we maintain accuracy while effectively controlling model computation. Alongside the distance-sensitive loss function, RTDENet achieves a global difference of 2.41 m and an inference time of 27 ms on the KITTI Stereo dataset. This balance of speed and accuracy outperforms other state-of-the-art algorithms in depth estimation tasks.
引用
收藏
页码:36 / 47
页数:12
相关论文
共 50 条
  • [41] A Compact Light Field Camera for Real-Time Depth Estimation
    Anisimov, Yuriy
    Wasenmuller, Oliver
    Stricker, Didier
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2019, PT I, 2019, 11678 : 52 - 63
  • [42] Real-time Stereo Matching for Depth Estimation Using GPU
    Cheng, Fang-Hsuan
    Huang, Kuan-Yu
    2015 8TH INTERNATIONAL CONFERENCE ON UBI-MEDIA COMPUTING (UMEDIA) CONFERENCE PROCEEDINGS, 2015, : 3 - 6
  • [43] Real-time Depth Estimation Using Recurrent CNN with Sparse Depth Cues for SLAM System
    Lee, Sang Jun
    Choi, Heeyoul
    Hwang, Sung Soo
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2020, 18 (01) : 206 - 216
  • [44] Real-time Depth Estimation Using Recurrent CNN with Sparse Depth Cues for SLAM System
    Sang Jun Lee
    Heeyoul Choi
    Sung Soo Hwang
    International Journal of Control, Automation and Systems, 2020, 18 : 206 - 216
  • [45] Semantic attention and relative scene depth-guided network for underwater image enhancement
    Chen, Tingkai
    Wang, Ning
    Chen, Yanzheng
    Kong, Xiangjun
    Lin, Yejin
    Zhao, Hong
    Karimi, Hamid Reza
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [46] Depth-guided deep filtering network for efficient single image bokeh rendering
    Chen, Quan
    Zheng, Bolun
    Zhou, Xiaofei
    Huang, Aiai
    Sun, Yaoqi
    Chen, Chuqiao
    Yan, Chenggang
    Yuan, Shanxin
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (28): : 20869 - 20887
  • [47] Real-Time Free Viewpoint Video Synthesis System Based on DIBR and a Depth Estimation Network
    Guo, Shuai
    Hu, Jingchuan
    Zhou, Kai
    Wang, Jionghao
    Song, Li
    Xie, Rong
    Zhang, Wenjun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6701 - 6716
  • [48] MiniNet: An extremely lightweight convolutional neural network for real-time unsupervised monocular depth estimation
    Liu, Jun
    Li, Qing
    Cao, Rui
    Tang, Wenming
    Qiu, Guoping
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 166 (166) : 255 - 267
  • [49] REAL-TIME UNSUPERVISED MULTI-VIEW DEPTH ESTIMATION NETWORK FOR VIRTUAL VIEW SYNTHESIS
    Qiu, Ke
    Gu, Song
    Liu, Shiyi
    Lai, Yawen
    Cai, Yangang
    Wang, Ronggang
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [50] Driving Scene Perception Network: Real-time Joint Detection, Depth Estimation and Semantic Segmentation
    Chen, Liangfu
    Yang, Zeng
    Ma, Jianjun
    Luo, Zheng
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1283 - 1291