Multi-Scale Monocular Depth Estimation Based on Global Understanding

被引:1
|
作者
Xiao, Jiejie [1 ]
Li, Lihong [2 ]
Su, Xu [1 ]
Tan, Guopeng [1 ]
机构
[1] Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056038, Peoples R China
[2] Hebei Univ Engn, Hebei Key Lab Secur & Protect Informat Sensing & P, Handan 056038, Peoples R China
关键词
Convolutional neural networks; Network architecture; Transformers; Spatial resolution; depth estimation; global understanding module; difference module; cascade module;
D O I
10.1109/ACCESS.2024.3382572
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the advancement of Convolutional Neural Networks, numerous convolutional neural network-based methods have been proposed for depth estimation and have achieved significant achievements. However, the repetitive convolutional layers and spatial pooling layers in these networks often lead to a reduction in spatial resolution and loss of local information, such as edge contours. To address this issue, this study presents a multi-scale monocular depth estimation model. Specifically, a Global Understanding Module was introduced on top of a generic encoder to increase the receptive field and capture contextual information. Additionally, the decoding process incorporates a Difference Module and a Multi-scale Cascade Module to guide the decoding information and refine edge contour details. Finally, extensive experiments were conducted using the KITTI and NYUv2 datasets. For the KITTI dataset, the Absolute Relative Error (Abs. Rel) was 0.057, and the Root Mean Squared Error (RMSE) was 2.415. On the NYUv2 dataset, Abs.Rel was 0.104, and RMSE was 0.380. These results indicate that the model performs well in accurately estimating depth information.
引用
收藏
页码:46930 / 46939
页数:10
相关论文
共 50 条
  • [41] R-MSFM: Recurrent Multi-Scale Feature Modulation for Monocular Depth Estimating
    Zhou, Zhongkai
    Fan, Xinnan
    Shi, Pengfei
    Xin, Yuanxue
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12757 - 12766
  • [42] Monocular Depth Estimation Using Multi Scale Neural Network And Feature Fusion
    Sagar, Abhinav
    2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 656 - 662
  • [43] Multi-Scale Dilated Convolution Network Based Depth Estimation in Intelligent Transportation Systems
    Tian, Yanling
    Zhang, Qieshi
    Ren, Ziliang
    Wu, Fuxiang
    Hao, Pengyi
    Hu, Jinglu
    IEEE ACCESS, 2019, 7 : 185179 - 185188
  • [44] Solving Monocular Sensors Depth Prediction Using MLP-Based Architecture and Multi-Scale Inverse Attention
    Cheng, Zeyu
    Zhang, Yi
    Tang, Chengkai
    IEEE SENSORS JOURNAL, 2022, 22 (16) : 16178 - 16189
  • [45] Dyna-MSDepth: multi-scale self-supervised monocular depth estimation network for visual SLAM in dynamic scenes
    Yao, Jianjun
    Li, Yingzhao
    Li, Jiajia
    MACHINE VISION AND APPLICATIONS, 2024, 35 (05)
  • [46] Self-Supervised Monocular Depth Estimation Using Global and Local Mixed Multi-Scale Feature Enhancement Network for Low-Altitude UAV Remote Sensing
    Chang, Rong
    Yu, Kailong
    Yang, Yang
    REMOTE SENSING, 2023, 15 (13)
  • [47] Monocular Depth and Velocity Estimation Based on Multi-Cue Fusion
    Qi, Chunyang
    Zhao, Hongxiang
    Song, Chuanxue
    Zhang, Naifu
    Song, Sinxin
    Xu, Haigang
    Xiao, Feng
    MACHINES, 2022, 10 (05)
  • [48] Unsupervised Learning of Depth Estimation and Camera Pose With Multi-Scale GANs
    Xu, Yufan
    Wang, Yan
    Huang, Rui
    Lei, Zeyu
    Yang, Junyao
    Li, Zijian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 17039 - 17047
  • [49] MANET: MULTI-SCALE AGGREGATED NETWORK FOR LIGHT FIELD DEPTH ESTIMATION
    Li, Yan
    Zhang, Lu
    Wang, Qiong
    Lafruit, Gauthier
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1998 - 2002
  • [50] THE IMAGE DEPTH ESTIMATION BASED ON MULTI-SCALE TEXTURE FEATURES AND LEAST-SQUARE METHOD
    Zhang, Lizhi
    Chen, Tingting
    Sun, Huadong
    Zhao, Zhijie
    Jin, Xuesong
    Huang, Ju
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 816 - 820