Multi-Scale Monocular Depth Estimation Based on Global Understanding

被引：1

作者：

Xiao, Jiejie ^{[1
]}

Li, Lihong ^{[2
]}

Su, Xu ^{[1
]}

Tan, Guopeng ^{[1
]}

机构：

[1] Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056038, Peoples R China

[2] Hebei Univ Engn, Hebei Key Lab Secur & Protect Informat Sensing & P, Handan 056038, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Convolutional neural networks; Network architecture; Transformers; Spatial resolution; depth estimation; global understanding module; difference module; cascade module;

D O I：

10.1109/ACCESS.2024.3382572

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the advancement of Convolutional Neural Networks, numerous convolutional neural network-based methods have been proposed for depth estimation and have achieved significant achievements. However, the repetitive convolutional layers and spatial pooling layers in these networks often lead to a reduction in spatial resolution and loss of local information, such as edge contours. To address this issue, this study presents a multi-scale monocular depth estimation model. Specifically, a Global Understanding Module was introduced on top of a generic encoder to increase the receptive field and capture contextual information. Additionally, the decoding process incorporates a Difference Module and a Multi-scale Cascade Module to guide the decoding information and refine edge contour details. Finally, extensive experiments were conducted using the KITTI and NYUv2 datasets. For the KITTI dataset, the Absolute Relative Error (Abs. Rel) was 0.057, and the Root Mean Squared Error (RMSE) was 2.415. On the NYUv2 dataset, Abs.Rel was 0.104, and RMSE was 0.380. These results indicate that the model performs well in accurately estimating depth information.

引用

页码：46930 / 46939

页数：10

共 50 条

[41] R-MSFM: Recurrent Multi-Scale Feature Modulation for Monocular Depth Estimating
Zhou, Zhongkai
Fan, Xinnan
Shi, Pengfei
Xin, Yuanxue
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12757 - 12766
[42] Monocular Depth Estimation Using Multi Scale Neural Network And Feature Fusion
Sagar, Abhinav
2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 656 - 662
[43] Multi-Scale Dilated Convolution Network Based Depth Estimation in Intelligent Transportation Systems
Tian, Yanling
Zhang, Qieshi
Ren, Ziliang
Wu, Fuxiang
Hao, Pengyi
Hu, Jinglu
IEEE ACCESS, 2019, 7 : 185179 - 185188
[44] Solving Monocular Sensors Depth Prediction Using MLP-Based Architecture and Multi-Scale Inverse Attention
Cheng, Zeyu
Zhang, Yi
Tang, Chengkai
IEEE SENSORS JOURNAL, 2022, 22 (16) : 16178 - 16189
[45] Dyna-MSDepth: multi-scale self-supervised monocular depth estimation network for visual SLAM in dynamic scenes
Yao, Jianjun
Li, Yingzhao
Li, Jiajia
MACHINE VISION AND APPLICATIONS, 2024, 35 (05)
[46] Self-Supervised Monocular Depth Estimation Using Global and Local Mixed Multi-Scale Feature Enhancement Network for Low-Altitude UAV Remote Sensing
Chang, Rong
Yu, Kailong
Yang, Yang
REMOTE SENSING, 2023, 15 (13)
[47] Monocular Depth and Velocity Estimation Based on Multi-Cue Fusion
Qi, Chunyang
Zhao, Hongxiang
Song, Chuanxue
Zhang, Naifu
Song, Sinxin
Xu, Haigang
Xiao, Feng
MACHINES, 2022, 10 (05)
[48] Unsupervised Learning of Depth Estimation and Camera Pose With Multi-Scale GANs
Xu, Yufan
Wang, Yan
Huang, Rui
Lei, Zeyu
Yang, Junyao
Li, Zijian
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 17039 - 17047
[49] MANET: MULTI-SCALE AGGREGATED NETWORK FOR LIGHT FIELD DEPTH ESTIMATION
Li, Yan
Zhang, Lu
Wang, Qiong
Lafruit, Gauthier
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1998 - 2002
[50] THE IMAGE DEPTH ESTIMATION BASED ON MULTI-SCALE TEXTURE FEATURES AND LEAST-SQUARE METHOD
Zhang, Lizhi
Chen, Tingting
Sun, Huadong
Zhao, Zhijie
Jin, Xuesong
Huang, Ju
2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 816 - 820

← 1 2 3 4 5 →