Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation

被引:250
|
作者
Xu, Dan [1 ]
Wang, Wei [1 ]
Tang, Hao [1 ]
Liu, Hong [2 ]
Sebe, Nicu [1 ]
Ricci, Elisa [1 ,3 ]
机构
[1] Univ Trento, Multimedia & Human Understanding Grp, Trento, Italy
[2] Peking Univ, Shenzhen Grad Sch, Key Lab Machine Percept, Beijing, Peoples R China
[3] Fdn Bruno Kessler, Technol Vis Grp, Trento, Italy
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR.2018.00412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent works have shown the benefit of integrating Conditional Random Fields (CRFs) models into deep architectures for improving pixel-level prediction tasks. Following this line of research, in this paper we introduce a novel approach for monocular depth estimation. Similarly to previous works, our method employs a continuous CRF to fuse multi-scale information derived from different layers of a front-end Convolutional Neural Network (CNN). Differently from past works, our approach benefits from a structured attention model which automatically regulates the amount of information transferred between corresponding features at different scales. Importantly, the proposed attention model is seamlessly integrated into the CRF allowing end-to-end training of the entire architecture. Our extensive experimental evaluation demonstrates the effectiveness of the proposed method which is competitive with previous methods on the KITH benchmark and outperforms the state of the art on the NYU Depth V2 dataset.
引用
收藏
页码:3917 / 3925
页数:9
相关论文
共 50 条
  • [11] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
    Sumanta Bhattacharyya
    Ju Shen
    Stephen Welch
    Chen Chen
    Journal of Real-Time Image Processing, 2021, 18 : 1357 - 1368
  • [12] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
    Bhattacharyya, Sumanta
    Shen, Ju
    Welch, Stephen
    Chen, Chen
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (04) : 1357 - 1368
  • [13] Bidirectional Attention Network for Monocular Depth Estimation
    Aich, Shubhra
    Vianney, Jean Marie Uwabeza
    Islam, Md Amirul
    Kaur, Mannat
    Liu, Bingbing
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 11746 - 11752
  • [14] Monocular Depth Estimation with Adaptive Geometric Attention
    Naderi, Taher
    Sadovnik, Amir
    Hayward, Jason
    Qi, Hairong
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 617 - 627
  • [15] Dynamic Guided Network for Monocular Depth Estimation
    Xing, Xiaoxia
    Cai, Yinghao
    Wang, Yanqing
    Lu, Tao
    Yang, Yiping
    Wen, Dayong
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5459 - 5465
  • [16] AGNet: Attention Guided Sparse Depth Completion Using Convolutional Neural Networks
    Liang, Xiaolong
    Jung, Cheolkon
    IEEE ACCESS, 2022, 10 : 10514 - 10522
  • [17] Depth-Relative Self Attention for Monocular Depth Estimation
    Shim, Kyuhong
    Kim, Jiyoung
    Lee, Gusang
    Shim, Byonghyo
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1396 - 1404
  • [18] Deep Convolutional Neural Fields for Depth Estimation from a Single Image
    Liu, Fayao
    Shen, Chunhua
    Lin, Guosheng
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 5162 - 5170
  • [19] Scale Recovery for Monocular Visual Odometry Using Depth Estimated with Deep Convolutional Neural Fields
    Yin, Xiaochuan
    Wang, Xiangwei
    Du, Xiaoguo
    Chen, Qijun
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5871 - 5879
  • [20] Integrating convolutional guidance and Transformer fusion with Markov Random Fields smoothing for monocular depth estimation
    Peng, Xiaorui
    Meng, Yu
    Shi, Boqiang
    Zheng, Chao
    Wang, Meijun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 143