Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation

Cited by: 250

Authors:

Xu, Dan [1]

Wang, Wei [1]

Tang, Hao [1]

Liu, Hong [2]

Sebe, Nicu [1]

Ricci, Elisa [1,3]

Affiliations:

[1] Univ Trento, Multimedia & Human Understanding Grp, Trento, Italy

[2] Peking Univ, Shenzhen Grad Sch, Key Lab Machine Percept, Beijing, Peoples R China

[3] Fdn Bruno Kessler, Technol Vis Grp, Trento, Italy

Source:

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018

Funding:

National Natural Science Foundation of China;

Keywords:

DOI:

10.1109/CVPR.2018.00412

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recent works have shown the benefit of integrating Conditional Random Fields (CRFs) models into deep architectures for improving pixel-level prediction tasks. Following this line of research, in this paper we introduce a novel approach for monocular depth estimation. Similarly to previous works, our method employs a continuous CRF to fuse multi-scale information derived from different layers of a front-end Convolutional Neural Network (CNN). Differently from past works, our approach benefits from a structured attention model which automatically regulates the amount of information transferred between corresponding features at different scales. Importantly, the proposed attention model is seamlessly integrated into the CRF, allowing end-to-end training of the entire architecture. Our extensive experimental evaluation demonstrates the effectiveness of the proposed method, which is competitive with previous methods on the KITTI benchmark and outperforms the state of the art on the NYU Depth V2 dataset.

Citation

Pages: 3917-3925

Page count: 9

50 items in total

[1] Monocular depth estimation via convolutional neural network with attention module
Lan, Lingling
Zhang, Yaping
Yang, Yuwei
Journal of Physics: Conference Series, 2021, 2025 (01):
[2] CATNet: Convolutional attention and transformer for monocular depth estimation
Tang, Shuai
Lu, Tongwei
Liu, Xuanxuan
Zhou, Huabing
Zhang, Yanduo
PATTERN RECOGNITION, 2024, 145
[3] Visualization of Convolutional Neural Networks for Monocular Depth Estimation
Hu, Junjie
Zhang, Yan
Okatani, Takayuki
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3868 - 3877
[4] Attention based multilayer feature fusion convolutional neural network for unsupervised monocular depth estimation
Lei, Zeyu
Wang, Yan
Li, Zijian
Yang, Junyao
NEUROCOMPUTING, 2021, 423 : 343 - 352
[5] Depth estimation for monocular image based on convolutional neural networks
Niu B.
Tang M.
Chen X.
International Journal of Circuits, Systems and Signal Processing, 2021, 15 : 533 - 540
[6] MobileXNet: An Efficient Convolutional Neural Network for Monocular Depth Estimation
Dong, Xingshuai
Garratt, Matthew A.
Anavatti, Sreenatha G.
Abbass, Hussein A.
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 20134 - 20147
[7] MONOCULAR DEPTH ESTIMATION OF GOOGLE EARTH IMAGES USING CONVOLUTIONAL NEURAL NETWORKS
Najaf, M.
Arefi, H.
Amirkolaee, H. Amini
Farajelahi, B.
ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 10-4, 2023, : 589 - 594
[8] Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields
Liu, Fayao
Shen, Chunhua
Lin, Guosheng
Reid, Ian
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (10) : 2024 - 2039
[9] Deep neural networks with attention mechanism for monocular depth estimation on embedded devices
Liu, Siping
Tu, Xiaohan
Xu, Cheng
Li, Renfa
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 131 : 137 - 150
[10] EdgeConv with Attention Module for Monocular Depth Estimation
Lee, Minhyeok
Hwang, Sangwon
Park, Chaewon
Lee, Sangyoun
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2364 - 2373
