Multi-Scale Monocular Depth Estimation Based on Global Understanding

被引:1
|
作者
Xiao, Jiejie [1 ]
Li, Lihong [2 ]
Su, Xu [1 ]
Tan, Guopeng [1 ]
机构
[1] Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056038, Peoples R China
[2] Hebei Univ Engn, Hebei Key Lab Secur & Protect Informat Sensing & P, Handan 056038, Peoples R China
关键词
Convolutional neural networks; Network architecture; Transformers; Spatial resolution; depth estimation; global understanding module; difference module; cascade module;
D O I
10.1109/ACCESS.2024.3382572
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the advancement of Convolutional Neural Networks, numerous convolutional neural network-based methods have been proposed for depth estimation and have achieved significant achievements. However, the repetitive convolutional layers and spatial pooling layers in these networks often lead to a reduction in spatial resolution and loss of local information, such as edge contours. To address this issue, this study presents a multi-scale monocular depth estimation model. Specifically, a Global Understanding Module was introduced on top of a generic encoder to increase the receptive field and capture contextual information. Additionally, the decoding process incorporates a Difference Module and a Multi-scale Cascade Module to guide the decoding information and refine edge contour details. Finally, extensive experiments were conducted using the KITTI and NYUv2 datasets. For the KITTI dataset, the Absolute Relative Error (Abs. Rel) was 0.057, and the Root Mean Squared Error (RMSE) was 2.415. On the NYUv2 dataset, Abs.Rel was 0.104, and RMSE was 0.380. These results indicate that the model performs well in accurately estimating depth information.
引用
收藏
页码:46930 / 46939
页数:10
相关论文
共 50 条
  • [31] SAU-Net: Monocular Depth Estimation Combining Multi-Scale Features and Attention Mechanisms
    Zhao, Wei
    Song, Yunqing
    Wang, Tingting
    IEEE ACCESS, 2023, 11 : 137734 - 137746
  • [32] The enhancement of depth estimation based on multi-scale convolution kernels
    Hua, Heng
    Sang, Xinzhu
    Tian, Xiyu
    Sun, Wanqi
    Chen, Duo
    Wang, Peng
    OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY V, 2018, 10817
  • [33] DEPTH ESTIMATION OF MULTI-MODAL SCENE BASED ON MULTI-SCALE MODULATION
    Wang, Anjie
    Fang, Zhijun
    Jiang, Xiaoyan
    Gao, Yongbin
    Cao, Gaofeng
    Ma, Siwei
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2795 - 2799
  • [34] HIGH QUALITY MONOCULAR DEPTH ESTIMATION VIA A MULTI-SCALE NETWORK AND A DETAIL-PRESERVING OBJECTIVE
    Jiang, Hualie
    Huang, Rui
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1920 - 1924
  • [35] MSD-CRFS: MULTI-SCALE DUAL AGGREGATION CONDITIONAL RANDOM FIELDS FOR MONOCULAR DEPTH ESTIMATION
    Zhang, Xidan
    Wei, Jianing
    Moteki, Atsunori
    Kobayashi, Yoshie
    Suzuki, Genta
    Tan, Zhiming
    2024 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2024, : 2001 - 2007
  • [36] GAMNet: Global attention via multi-scale context for depth estimation algorithm and application
    Yang, Huitong
    Lei, Liang
    Sang, Haiwei
    IET IMAGE PROCESSING, 2024, 18 (01) : 247 - 264
  • [37] Super-Resolution for Monocular Depth Estimation With Multi-Scale Sub-Pixel Convolutions and a Smoothness Constraint
    Zhao, Shiyu
    Zhang, Lin
    Shen, Ying
    Zhao, Shengjie
    Zhang, Huijuan
    IEEE ACCESS, 2019, 7 : 16323 - 16335
  • [38] Binocular Depth Estimation Algorithm Based on Multi-Scale Attention Feature Fusion
    Yang Huitong
    Lei Lang
    Lin Yongchun
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [39] Unsupervised Monocular Depth Estimation Based on Scale Clue Enhancement
    Qu, Yi
    Chen, Ying
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (09): : 3217 - 3227
  • [40] An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling
    Zhi, Henghui
    Yin, Chenyang
    Li, Huibin
    Pang, Shanmin
    SENSORS, 2022, 22 (14)