Monocular Depth Distribution Alignment with Low Computation

被引:0
|
作者
Sheng, Fei [1 ]
Xue, Feng [1 ]
Chang, Yicong [1 ]
Liang, Wenteng [1 ]
Ming, Anlong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
关键词
D O I
10.1109/ICRA46639.2022.9811937
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of monocular depth estimation generally depends on the amount of parameters and computational cost. It leads to a large accuracy contrast between light-weight networks and heavy-weight networks, which limits their application in the real world. In this paper, we model the majority of accuracy contrast between them as the difference of depth distribution, which we call 'Distribution drift'. To this end, a distribution alignment network (DANet) is proposed. We firstly design a pyramid scene transformer (PST) module to capture inter-region interaction in multiple scales. By perceiving the difference of depth features between every two regions, DANet tends to predict a reasonable scene structure, which fits the shape of distribution to ground truth. Then, we propose a local-global optimization (LGO) scheme to realize the supervision of global range of scene depth. Thanks to the alignment of depth distribution shape and scene depth range, DANet sharply alleviates the distribution drift, and achieves a comparable performance with prior heavy-weight methods, but uses only 1% floating-point operations per second (FLOPs) of them. The experiments on two datasets, namely the widely used NYUDv2 dataset and the more challenging iBims-1 dataset, demonstrate the effectiveness of our method. The source code is available at https://github.com/YiLiM1/DANet.
引用
收藏
页码:6548 / 6555
页数:8
相关论文
共 50 条
  • [21] Electrophysiological correlates of monocular depth
    Spang, K.
    Gillam, B.
    Fahle, M.
    PERCEPTION, 2011, 40 : 147 - 148
  • [22] Perceptual Monocular Depth Estimation
    Janice Pan
    Alan C. Bovik
    Neural Processing Letters, 2021, 53 : 1205 - 1228
  • [23] MONOCULAR DEPTH PERCEPTION IN RATS
    ASHIDA, S
    PSYCHOLOGICAL REPORTS, 1972, 30 (02) : 427 - &
  • [24] Depth from monocular half images: Occlusion or low-level processing?
    Harris, JM
    Wilcox, L
    McKee, S
    AUSTRALIAN JOURNAL OF PSYCHOLOGY, 2004, 56 : 117 - 117
  • [25] Retinal Prostheses: Functional Use of Monocular Depth Perception in the Low Resolution Limit
    Stiles, Noelle
    McIntosh, Ben
    Tanguay, Armand
    Humayun, Mark
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2013, 54 (15)
  • [26] MONOCULAR SEGMENT-WISE DEPTH: MONOCULAR DEPTH ESTIMATION BASED ON A SEMANTIC SEGMENTATION PRIOR
    Atapour-Abarghouei, Amir
    Breckon, Toby P.
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 4295 - 4299
  • [27] Self-Supervised Monocular Depth Learning in Low-Texture Areas
    Xu, Wanpeng
    Zou, Ling
    Wu, Lingda
    Fu, Zhipeng
    REMOTE SENSING, 2021, 13 (09)
  • [28] Unsupervised Monocular Depth Estimation for Monocular Visual SLAM Systems
    Liu, Feng
    Huang, Ming
    Ge, Hongyu
    Tao, Dan
    Gao, Ruipeng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 13
  • [29] MONOCULAR CUES CONTROL BINOCULAR ALIGNMENT
    KINSBOURNE, M
    SEABER, J
    OPTICA ACTA, 1971, 18 (10): : 759 - +
  • [30] Monocular Depth Estimation Using Relative Depth Maps
    Lee, Jae-Han
    Kim, Chang-Su
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9721 - 9730