Depth-Guided Aggregation for Real-Time Binocular Depth Estimation Network

被引:0
|
作者
Fu, Dongxin [1 ]
Zheng, Shaowu [1 ]
Xie, Pengcheng [1 ]
Li, Weihua [1 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
关键词
Costs; Estimation; Feature extraction; Three-dimensional displays; Convolution; Real-time systems; Data mining; Cameras;
D O I
10.1109/MMUL.2024.3395695
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Using binocular cameras to obtain depth information of target pixels offers a cost-effective and natural alternative to lidar systems. However, most of the current binocular depth estimation networks have difficulty achieving a better balance between speed and accuracy in real-world situations, and their prediction accuracy for long-range depth is often limited. In this article, we introduce the end-to-end real-time depth estimation network (RTDENet), which efficiently utilizes multiscale cost volumes for improved performance. We propose an efficient and flexible cost aggregation module that supplements residual information with high-resolution cost volumes. By replacing some computationally demanding 3-D convolutional layers with depth-guided excitation, we maintain accuracy while effectively controlling model computation. Alongside the distance-sensitive loss function, RTDENet achieves a global difference of 2.41 m and an inference time of 27 ms on the KITTI Stereo dataset. This balance of speed and accuracy outperforms other state-of-the-art algorithms in depth estimation tasks.
引用
收藏
页码:36 / 47
页数:12
相关论文
共 50 条
  • [21] Depth-Guided Dehazing Network for Long-Range Aerial Scenes
    Wang, Yihu
    Zhao, Jilin
    Yao, Liangliang
    Fu, Changhong
    REMOTE SENSING, 2024, 16 (12)
  • [22] A Depth-Guided Attention Strategy for Crowd Counting
    Chen, Hao
    Li, Zhan
    Bhanu, Bir
    Lu, Dongping
    Han, Xuming
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 25 - 37
  • [23] Real-time Monocular Depth Estimation with Extremely Light-Weight Neural Network
    Chiu, Mian-Jhong
    Chiu, Wei-Chen
    Chen, Hua-Tsung
    Chuang, Jen-Hui
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7050 - 7057
  • [24] A real-time railway fastener inspection method using the lightweight depth estimation network
    Zhong, Haoyu
    Liu, Long
    Wang, Jie
    Fu, Qinyi
    Yi, Bing
    MEASUREMENT, 2022, 189
  • [25] Underwater variable zoom: Depth-guided perception network for underwater image enhancement
    Huang, Zhixiong
    Wang, Xinying
    Xu, Chengpei
    Li, Jinjiang
    Feng, Lin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [26] Efficient Depth-Guided Urban View Synthesis
    Miao, Sheng
    Huang, Jiaxin
    Bai, Dongfeng
    Qiu, Weichao
    Liu, Bingbing
    Geiger, Andreas
    Liao, Yiyi
    COMPUTER VISION - ECCV 2024, PT XXX, 2025, 15088 : 90 - 107
  • [27] Tracking Based Depth-guided Video Inpainting
    Hatheele, Saroj
    Zaveri, Mukesh A.
    2013 FOURTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2013,
  • [28] Depth-Guided Texture Diffusion for Image Semantic Segmentation
    Sun, Wei
    Li, Yuan
    Ye, Qixiang
    Jiao, Jianbin
    Zhou, Yanzhao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1287 - 1302
  • [29] DGECN: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation
    Cao, Tuo
    Luo, Fei
    Fu, Yanping
    Zhang, Wenxiao
    Zheng, Shengjie
    Xiao, Chunxia
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3773 - 3782
  • [30] Real-time Depth Estimation for Aerial Panoramas in Virtual Reality
    Xu, Di
    Liu, Xiaojun
    Zhang, Yanning
    2020 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES WORKSHOPS (VRW 2020), 2020, : 705 - 706