Multi-Scale Monocular Depth Estimation Based on Global Understanding

被引:1
|
作者
Xiao, Jiejie [1 ]
Li, Lihong [2 ]
Su, Xu [1 ]
Tan, Guopeng [1 ]
机构
[1] Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056038, Peoples R China
[2] Hebei Univ Engn, Hebei Key Lab Secur & Protect Informat Sensing & P, Handan 056038, Peoples R China
关键词
Convolutional neural networks; Network architecture; Transformers; Spatial resolution; depth estimation; global understanding module; difference module; cascade module;
D O I
10.1109/ACCESS.2024.3382572
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the advancement of Convolutional Neural Networks, numerous convolutional neural network-based methods have been proposed for depth estimation and have achieved significant achievements. However, the repetitive convolutional layers and spatial pooling layers in these networks often lead to a reduction in spatial resolution and loss of local information, such as edge contours. To address this issue, this study presents a multi-scale monocular depth estimation model. Specifically, a Global Understanding Module was introduced on top of a generic encoder to increase the receptive field and capture contextual information. Additionally, the decoding process incorporates a Difference Module and a Multi-scale Cascade Module to guide the decoding information and refine edge contour details. Finally, extensive experiments were conducted using the KITTI and NYUv2 datasets. For the KITTI dataset, the Absolute Relative Error (Abs. Rel) was 0.057, and the Root Mean Squared Error (RMSE) was 2.415. On the NYUv2 dataset, Abs.Rel was 0.104, and RMSE was 0.380. These results indicate that the model performs well in accurately estimating depth information.
引用
收藏
页码:46930 / 46939
页数:10
相关论文
共 50 条
  • [1] Monocular Depth Estimation Based on Multi-Scale Depth Map Fusion
    Yang, Xin
    Chang, Qingling
    Liu, Xinglin
    He, Siyuan
    Cui, Yan
    IEEE ACCESS, 2021, 9 : 67696 - 67705
  • [2] Multi-scale depth classification network for monocular depth estimation
    Yang, Yi
    Tian, Lihua
    Li, Chen
    Zhang, Botong
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
  • [3] Monocular Depth Estimation With Multi-Scale Feature Fusion
    Xu, Xianfa
    Chen, Zhe
    Yin, Fuliang
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 678 - 682
  • [4] Monocular depth estimation with multi-scale feature fusion
    Wang Q.
    Zhang S.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2020, 48 (05): : 7 - 12
  • [5] Monocular Depth Estimation Based on Multi-Scale Graph Convolution Networks
    Fu, Junwei
    Liang, Jun
    Wang, Ziyang
    IEEE ACCESS, 2020, 8 : 997 - 1009
  • [6] DEEP MULTI-SCALE ARCHITECTURES FOR MONOCULAR DEPTH ESTIMATION
    Moukari, M.
    Picard, S.
    Simon, L.
    Jurie, F.
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2940 - 2944
  • [7] Monocular Image Depth Estimation Based on Multi-Scale Attention Oriented Network
    Liu J.
    Wen J.
    Liang Y.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2020, 48 (12): : 52 - 62
  • [8] Saliency Driven Monocular Depth Estimation Based on Multi-scale Graph Convolutional Network
    Wu, Dunquan
    Chen, Chenglizhao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 445 - 456
  • [9] Swin-Depth: Using Transformers and Multi-Scale Fusion for Monocular-Based Depth Estimation
    Cheng, Zeyu
    Zhang, Yi
    Tang, Chengkai
    IEEE SENSORS JOURNAL, 2021, 21 (23) : 26912 - 26920
  • [10] Multi-scale Residual Pyramid Attention Network for Monocular Depth Estimation
    Liu, Jing
    Zhang, Xiaona
    Li, Zhaoxin
    Mao, Tianlu
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5137 - 5144