Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation

Cited by: 250
Authors
Xu, Dan [1]
Wang, Wei [1]
Tang, Hao [1]
Liu, Hong [2]
Sebe, Nicu [1]
Ricci, Elisa [1,3]
Affiliations
[1] Univ Trento, Multimedia & Human Understanding Grp, Trento, Italy
[2] Peking Univ, Shenzhen Grad Sch, Key Lab Machine Percept, Beijing, Peoples R China
[3] Fdn Bruno Kessler, Technol Vis Grp, Trento, Italy
Funding
National Natural Science Foundation of China
DOI
10.1109/CVPR.2018.00412
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recent works have shown the benefit of integrating Conditional Random Field (CRF) models into deep architectures for improving pixel-level prediction tasks. Following this line of research, in this paper we introduce a novel approach for monocular depth estimation. Similarly to previous works, our method employs a continuous CRF to fuse multi-scale information derived from different layers of a front-end Convolutional Neural Network (CNN). Differently from past works, our approach benefits from a structured attention model which automatically regulates the amount of information transferred between corresponding features at different scales. Importantly, the proposed attention model is seamlessly integrated into the CRF, allowing end-to-end training of the entire architecture. Our extensive experimental evaluation demonstrates the effectiveness of the proposed method, which is competitive with previous methods on the KITTI benchmark and outperforms the state of the art on the NYU Depth V2 dataset.
Pages: 3917 - 3925
Number of pages: 9
Related papers
50 in total
  • [31] Single Image Depth Estimation With Normal Guided Scale Invariant Deep Convolutional Fields
    Yan, Han
    Yu, Xin
    Zhang, Yu
    Zhang, Shunli
    Zhao, Xiaolin
    Zhang, Li
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (01) : 80 - 92
  • [32] Feature Enhanced Fully Convolutional Networks for Monocular Depth Estimation
    Shi, Chunxiu
    Chen, Jie
    Chen, Juan
    Zhang, Zheng
    2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 270 - 276
  • [33] Research on Monocular Depth Estimation Algorithm Based on Structured Loss
    Huo Z.
    Qiao L.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2021, 50 (05) : 728 - 733
  • [34] A Self-Supervised Monocular Depth Estimation Method Based on High Resolution Convolutional Neural Network
    Pu, Zhengdong
    Chen, Shu
    Zou, Beiji
    Pu, Baoxing
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (01) : 118 - 127
  • [35] MiniNet: An extremely lightweight convolutional neural network for real-time unsupervised monocular depth estimation
    Liu, Jun
    Li, Qing
    Cao, Rui
    Tang, Wenming
    Qiu, Guoping
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 166 (166) : 255 - 267
  • [36] Depth estimation for a road scene using a monocular image sequence based on fully convolutional neural network
    Wang, Haixia
    Sun, Yehao
    Zhang, Zhiguo
    Lu, Xiao
    Sheng, Chunyang
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (03)
  • [37] Monocular depth estimation with multi-view attention autoencoder
    Geunho Jung
    Sang Min Yoon
    Multimedia Tools and Applications, 2022, 81 : 33759 - 33770
  • [38] Lightweight monocular absolute depth estimation based on attention mechanism
    Jin, Jiayu
    Tao, Bo
    Qian, Xinbo
    Hu, Jiaxin
    Li, Gongfa
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (02)
  • [39] Attention-Based Grasp Detection With Monocular Depth Estimation
    Xuan Tan, Phan
    Hoang, Dinh-Cuong
    Nguyen, Anh-Nhat
    Nguyen, Van-Thiep
    Vu, Van-Duc
    Nguyen, Thu-Uyen
    Hoang, Ngoc-Anh
    Phan, Khanh-Toan
    Tran, Duc-Thanh
    Vu, Duy-Quang
    Ngo, Phuc-Quan
    Duong, Quang-Tri
    Ho, Ngoc-Trung
    Tran, Cong-Trinh
    Duong, Van-Hiep
    Mai, Anh-Truong
    IEEE ACCESS, 2024, 12 : 65041 - 65057
  • [40] Attention meets Geometry: Geometry Guided Spatial-Temporal Attention for Consistent Self-Supervised Monocular Depth Estimation
    Ruhkamp, Patrick
    Gao, Daoyi
    Chen, Hanzhi
    Navab, Nassir
    Busam, Beniamin
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 837 - 847