Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation

被引：250

作者：

Xu, Dan ^{[1
]}

Wang, Wei ^{[1
]}

Tang, Hao ^{[1
]}

Liu, Hong ^{[2
]}

Sebe, Nicu ^{[1
]}

Ricci, Elisa ^{[1
,3
]}

机构：

[1] Univ Trento, Multimedia & Human Understanding Grp, Trento, Italy

[2] Peking Univ, Shenzhen Grad Sch, Key Lab Machine Percept, Beijing, Peoples R China

[3] Fdn Bruno Kessler, Technol Vis Grp, Trento, Italy

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/CVPR.2018.00412

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent works have shown the benefit of integrating Conditional Random Fields (CRFs) models into deep architectures for improving pixel-level prediction tasks. Following this line of research, in this paper we introduce a novel approach for monocular depth estimation. Similarly to previous works, our method employs a continuous CRF to fuse multi-scale information derived from different layers of a front-end Convolutional Neural Network (CNN). Differently from past works, our approach benefits from a structured attention model which automatically regulates the amount of information transferred between corresponding features at different scales. Importantly, the proposed attention model is seamlessly integrated into the CRF allowing end-to-end training of the entire architecture. Our extensive experimental evaluation demonstrates the effectiveness of the proposed method which is competitive with previous methods on the KITH benchmark and outperforms the state of the art on the NYU Depth V2 dataset.

引用

页码：3917 / 3925

页数：9

共 50 条

[31] Single Image Depth Estimation With Normal Guided Scale Invariant Deep Convolutional Fields
Yan, Han
Yu, Xin
Zhang, Yu
Zhang, Shunli
Zhao, Xiaolin
Zhang, Li
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (01) : 80 - 92
[32] Feature Enhanced Fully Convolutional Networks for Monocular Depth Estimation
Shi, Chunxiu
Chen, Jie
Chen, Juan
Zhang, Zheng
2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 270 - 276
[33] Research on Monocular Depth Estimation Algorithm Based on Structured Loss
Huo Z.
Qiao L.
Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2021, 50 (05): : 728 - 733
[34] A Self-Supervised Monocular Depth Estimation Method Based on High Resolution Convolutional Neural Network
Pu, Zhengdong
Chen, Shu
Zou, Beiji
Pu, Baoxing
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (01): : 118 - 127
[35] MiniNet: An extremely lightweight convolutional neural network for real-time unsupervised monocular depth estimation
Liu, Jun
Li, Qing
Cao, Rui
Tang, Wenming
Qiu, Guoping
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 166 (166) : 255 - 267
[36] Depth estimation for a road scene using a monocular image sequence based on fully convolutional neural network
Wang, Haixia
Sun, Yehao
Zhang, Zhiguo
Lu, Xiao
Sheng, Chunyang
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (03)
[37] Monocular depth estimation with multi-view attention autoencoder
Geunho Jung
Sang Min Yoon
Multimedia Tools and Applications, 2022, 81 : 33759 - 33770
[38] Lightweight monocular absolute depth estimation based on attention mechanism
Jin, Jiayu
Tao, Bo
Qian, Xinbo
Hu, Jiaxin
Li, Gongfa
JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (02)
[39] Attention-Based Grasp Detection With Monocular Depth Estimation
Xuan Tan, Phan
Hoang, Dinh-Cuong
Nguyen, Anh-Nhat
Nguyen, Van-Thiep
Vu, Van-Duc
Nguyen, Thu-Uyen
Hoang, Ngoc-Anh
Phan, Khanh-Toan
Tran, Duc-Thanh
Vu, Duy-Quang
Ngo, Phuc-Quan
Duong, Quang-Tri
Ho, Ngoc-Trung
Tran, Cong-Trinh
Duong, Van-Hiep
Mai, Anh-Truong
IEEE ACCESS, 2024, 12 : 65041 - 65057
[40] Attention meets Geometry: Geometry Guided Spatial-Temporal Attention for Consistent Self-Supervised Monocular Depth Estimation
Ruhkamp, Patrick
Gao, Daoyi
Chen, Hanzhi
Navab, Nassir
Busam, Beniamin
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 837 - 847

← 1 2 3 4 5 →