Integrating convolutional guidance and Transformer fusion with Markov Random Fields smoothing for monocular depth estimation

Cited by: 0

Authors:

Peng, Xiaorui [1]

Meng, Yu [1]

Shi, Boqiang [1]

Zheng, Chao [1]

Wang, Meijun [1]

Affiliation:

[1] Univ Sci & Technol Beijing, XueYuan Rd 30, Beijing 100083, Peoples R China

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025 / Vol. 143

Keywords:

Monocular depth estimation; Intelligent transportation; Environment perception;

DOI:

10.1016/j.engappai.2025.110011

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Monocular depth estimation is a challenging and prominent problem in computer vision research and is widely used in intelligent transportation tasks such as environment perception, navigation, and localization. Accurately delineating object boundaries and ensuring smooth transitions in depth maps estimated from a single image remain significant challenges, and both place high demands on the network's global and local feature extraction capabilities. In response, we propose a depth estimation framework designed to address both detection accuracy and globally smooth transitions in predicted depth maps. Our method introduces a novel feature decoding structure named Convolutional Guided Fusion (CoGF), which uses local features extracted by a convolutional neural network as a guide and fuses them with long-range dependent features extracted by a Transformer. This approach enables the model to retain both local details and global contextual information during decoding. To ensure global smoothness in the depth estimation results, we incorporate a smoothing strategy based on Markov Random Fields (MRF), which enhances pixel-to-pixel continuity and spatial consistency in the generated depth maps. We evaluate the proposed method on current mainstream benchmarks, and the experimental results show that it outperforms previous approaches. The code is available at https://github.com/pxrw/CGTF-Depth.git.
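
The abstract does not state which MRF formulation the authors use, so the following is only a minimal sketch of the general idea of MRF-based depth smoothing, not the paper's method. It assumes a generic quadratic (Gaussian) MRF on a 4-connected pixel grid, with energy E(d) = sum_i (d_i - o_i)^2 + lam * sum_(i,j) (d_i - d_j)^2, minimized by Jacobi iterations. The names `mrf_energy`, `mrf_smooth`, `lam`, and `iters` are hypothetical and chosen for illustration.

```python
# Illustrative sketch only: a generic quadratic (Gaussian) MRF smoother.
# All function and parameter names here are assumptions made for this
# example; they are not taken from the paper or its repository.


def neighbours(y, x, h, w):
    """Yield the 4-connected neighbours of pixel (y, x) in an h x w grid."""
    if y > 0:
        yield y - 1, x
    if y + 1 < h:
        yield y + 1, x
    if x > 0:
        yield y, x - 1
    if x + 1 < w:
        yield y, x + 1


def mrf_energy(depth, observed, lam):
    """Data term plus lam times the pairwise smoothness term."""
    h, w = len(depth), len(depth[0])
    data = 0.0
    smooth = 0.0
    for y in range(h):
        for x in range(w):
            data += (depth[y][x] - observed[y][x]) ** 2
            # Count each edge once by looking only right and down.
            if x + 1 < w:
                smooth += (depth[y][x] - depth[y][x + 1]) ** 2
            if y + 1 < h:
                smooth += (depth[y][x] - depth[y + 1][x]) ** 2
    return data + lam * smooth


def mrf_smooth(observed, lam=1.0, iters=50):
    """Approximately minimize the quadratic MRF energy by Jacobi updates."""
    h, w = len(observed), len(observed[0])
    depth = [list(row) for row in observed]  # start from the raw prediction
    for _ in range(iters):
        new = [[0.0] * w for _ in range(h)]
        for y in range(h):
            for x in range(w):
                vals = [depth[ny][nx] for ny, nx in neighbours(y, x, h, w)]
                # Closed-form minimizer for one pixel with neighbours fixed.
                new[y][x] = (observed[y][x] + lam * sum(vals)) / (
                    1.0 + lam * len(vals)
                )
        depth = new
    return depth


if __name__ == "__main__":
    # A flat depth map with one outlier pixel in the centre.
    noisy = [[1.0, 1.0, 1.0], [1.0, 5.0, 1.0], [1.0, 1.0, 1.0]]
    smoothed = mrf_smooth(noisy, lam=1.0, iters=50)
    print("energy before:", mrf_energy(noisy, noisy, 1.0))
    print("energy after: ", mrf_energy(smoothed, noisy, 1.0))
```

A larger `lam` weights neighbour agreement more heavily and yields a smoother map. A plain quadratic penalty like this one also blurs true depth discontinuities, which is why practical systems often use edge-aware or robust pairwise terms instead; whether the paper does so is not stated in the abstract.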

Pages: 10

50 items in total

[1] CATNet: Convolutional attention and transformer for monocular depth estimation
Tang, Shuai
Lu, Tongwei
Liu, Xuanxuan
Zhou, Huabing
Zhang, Yanduo
PATTERN RECOGNITION, 2024, 145
[2] Lightweight monocular depth estimation using a fusion-improved transformer
Sui, Xin
Gao, Song
Xu, Aigong
Zhang, Cong
Wang, Changqiang
Shi, Zhengxu
SCIENTIFIC REPORTS, 2024, 14 (01)
[3] Residual Vision Transformer and Adaptive Fusion Autoencoders for Monocular Depth Estimation
Yang, Wei-Jong
Wu, Chih-Chen
Yang, Jar-Ferr
SENSORS, 2025, 25 (01)
[4] Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation
Xu, Dan
Wang, Wei
Tang, Hao
Liu, Hong
Sebe, Nicu
Ricci, Elisa
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 3917 - 3925
[5] Monocular Depth Estimation Algorithm Integrating Parallel Transformer and Multi-Scale Features
Wang, Weiqiang
Tan, Chao
Yan, Yunbing
ELECTRONICS, 2023, 12 (22)
[6] DEPTHFORMER: MULTISCALE VISION TRANSFORMER FOR MONOCULAR DEPTH ESTIMATION WITH GLOBAL LOCAL INFORMATION FUSION
Agarwal, Ashutosh
Arora, Chetan
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022: 3873 - 3877
[7] Multimodal Monocular Dense Depth Estimation with Event-Frame Fusion Using Transformer
Xiao, Baihui
Xu, Jingzehua
Zhang, Zekai
Xing, Tianyu
Wang, Jingjing
Ren, Yong
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017: 419 - 433
[8] Transformer-based monocular depth estimation with hybrid attention fusion and progressive regression
Liu, Peng
Zhang, Zonghua
Meng, Zhaozong
Gao, Nan
NEUROCOMPUTING, 2025, 620
[9] DTTNet: Depth Transverse Transformer Network for Monocular Depth Estimation
Kamath, Shreyas K. M.
Rajeev, Srijith
Panetta, Karen
Agaian, Sos S.
MULTIMODAL IMAGE EXPLOITATION AND LEARNING 2022, 2022, 12100
[10] Triple-Supervised Convolutional Transformer Aggregation for Robust Monocular Endoscopic Dense Depth Estimation
Fan, Wenkang
Jiang, Wenjing
Shi, Hong
Zeng, Hui-Qing
Chen, Yinran
Luo, Xiongbiao
IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2024, 6 (03): 1017 - 1029