Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation

被引：250

作者：

Xu, Dan ^{[1
]}

Wang, Wei ^{[1
]}

Tang, Hao ^{[1
]}

Liu, Hong ^{[2
]}

Sebe, Nicu ^{[1
]}

Ricci, Elisa ^{[1
,3
]}

机构：

[1] Univ Trento, Multimedia & Human Understanding Grp, Trento, Italy

[2] Peking Univ, Shenzhen Grad Sch, Key Lab Machine Percept, Beijing, Peoples R China

[3] Fdn Bruno Kessler, Technol Vis Grp, Trento, Italy

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/CVPR.2018.00412

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent works have shown the benefit of integrating Conditional Random Fields (CRFs) models into deep architectures for improving pixel-level prediction tasks. Following this line of research, in this paper we introduce a novel approach for monocular depth estimation. Similarly to previous works, our method employs a continuous CRF to fuse multi-scale information derived from different layers of a front-end Convolutional Neural Network (CNN). Differently from past works, our approach benefits from a structured attention model which automatically regulates the amount of information transferred between corresponding features at different scales. Importantly, the proposed attention model is seamlessly integrated into the CRF allowing end-to-end training of the entire architecture. Our extensive experimental evaluation demonstrates the effectiveness of the proposed method which is competitive with previous methods on the KITH benchmark and outperforms the state of the art on the NYU Depth V2 dataset.

引用

页码：3917 / 3925

页数：9

共 50 条

[11] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
Sumanta Bhattacharyya
Ju Shen
Stephen Welch
Chen Chen
Journal of Real-Time Image Processing, 2021, 18 : 1357 - 1368
[12] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
Bhattacharyya, Sumanta
Shen, Ju
Welch, Stephen
Chen, Chen
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (04) : 1357 - 1368
[13] Bidirectional Attention Network for Monocular Depth Estimation
Aich, Shubhra
Vianney, Jean Marie Uwabeza
Islam, Md Amirul
Kaur, Mannat
Liu, Bingbing
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 11746 - 11752
[14] Monocular Depth Estimation with Adaptive Geometric Attention
Naderi, Taher
Sadovnik, Amir
Hayward, Jason
Qi, Hairong
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 617 - 627
[15] Dynamic Guided Network for Monocular Depth Estimation
Xing, Xiaoxia
Cai, Yinghao
Wang, Yanqing
Lu, Tao
Yang, Yiping
Wen, Dayong
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5459 - 5465
[16] AGNet: Attention Guided Sparse Depth Completion Using Convolutional Neural Networks
Liang, Xiaolong
Jung, Cheolkon
IEEE ACCESS, 2022, 10 : 10514 - 10522
[17] Depth-Relative Self Attention for Monocular Depth Estimation
Shim, Kyuhong
Kim, Jiyoung
Lee, Gusang
Shim, Byonghyo
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1396 - 1404
[18] Deep Convolutional Neural Fields for Depth Estimation from a Single Image
Liu, Fayao
Shen, Chunhua
Lin, Guosheng
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 5162 - 5170
[19] Scale Recovery for Monocular Visual Odometry Using Depth Estimated with Deep Convolutional Neural Fields
Yin, Xiaochuan
Wang, Xiangwei
Du, Xiaoguo
Chen, Qijun
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5871 - 5879
[20] Integrating convolutional guidance and Transformer fusion with Markov Random Fields smoothing for monocular depth estimation
Peng, Xiaorui
Meng, Yu
Shi, Boqiang
Zheng, Chao
Wang, Meijun
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 143

← 1 2 3 4 5 →