Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation

被引:250
|
作者
Xu, Dan [1 ]
Wang, Wei [1 ]
Tang, Hao [1 ]
Liu, Hong [2 ]
Sebe, Nicu [1 ]
Ricci, Elisa [1 ,3 ]
机构
[1] Univ Trento, Multimedia & Human Understanding Grp, Trento, Italy
[2] Peking Univ, Shenzhen Grad Sch, Key Lab Machine Percept, Beijing, Peoples R China
[3] Fdn Bruno Kessler, Technol Vis Grp, Trento, Italy
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR.2018.00412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent works have shown the benefit of integrating Conditional Random Fields (CRFs) models into deep architectures for improving pixel-level prediction tasks. Following this line of research, in this paper we introduce a novel approach for monocular depth estimation. Similarly to previous works, our method employs a continuous CRF to fuse multi-scale information derived from different layers of a front-end Convolutional Neural Network (CNN). Differently from past works, our approach benefits from a structured attention model which automatically regulates the amount of information transferred between corresponding features at different scales. Importantly, the proposed attention model is seamlessly integrated into the CRF allowing end-to-end training of the entire architecture. Our extensive experimental evaluation demonstrates the effectiveness of the proposed method which is competitive with previous methods on the KITH benchmark and outperforms the state of the art on the NYU Depth V2 dataset.
引用
收藏
页码:3917 / 3925
页数:9
相关论文
共 50 条
  • [1] Monocular depth estimation via convolutional neural network with attention module
    Lan, Lingling
    Zhang, Yaping
    Yang, Yuwei
    Journal of Physics: Conference Series, 2021, 2025 (01):
  • [2] CATNet: Convolutional attention and transformer for monocular depth estimation
    Tang, Shuai
    Lu, Tongwei
    Liu, Xuanxuan
    Zhou, Huabing
    Zhang, Yanduo
    PATTERN RECOGNITION, 2024, 145
  • [3] Visualization of Convolutional Neural Networks for Monocular Depth Estimation
    Hu, Junjie
    Zhang, Yan
    Okatani, Takayuki
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3868 - 3877
  • [4] Attention based multilayer feature fusion convolutional neural network for unsupervised monocular depth estimation
    Lei, Zeyu
    Wang, Yan
    Li, Zijian
    Yang, Junyao
    NEUROCOMPUTING, 2021, 423 : 343 - 352
  • [5] Depth estimation for monocular image based on convolutional neural networks
    Niu B.
    Tang M.
    Chen X.
    International Journal of Circuits, Systems and Signal Processing, 2021, 15 : 533 - 540
  • [6] MobileXNet: An Efficient Convolutional Neural Network for Monocular Depth Estimation
    Dong, Xingshuai
    Garratt, Matthew A.
    Anavatti, Sreenatha G.
    Abbass, Hussein A.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 20134 - 20147
  • [7] MONOCULAR DEPTH ESTIMATION OF GOOGLE EARTH IMAGES USING CONVOLUTIONAL NEURAL NETWORKS
    Najaf, M.
    Arefi, H.
    Amirkolaee, H. Amini
    Farajelahi, B.
    ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 10-4, 2023, : 589 - 594
  • [8] Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields
    Liu, Fayao
    Shen, Chunhua
    Lin, Guosheng
    Reid, Ian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (10) : 2024 - 2039
  • [9] Deep neural networks with attention mechanism for monocular depth estimation on embedded devices
    Liu, Siping
    Tu, Xiaohan
    Xu, Cheng
    Li, Renfa
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 131 : 137 - 150
  • [10] EdgeConv with Attention Module for Monocular Depth Estimation
    Lee, Minhyeok
    Hwang, Sangwon
    Park, Chaewon
    Lee, Sangyoun
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2364 - 2373