Monocular Depth Distribution Alignment with Low Computation

Cited by: 0
Authors
Sheng, Fei [1]
Xue, Feng [1]
Chang, Yicong [1]
Liang, Wenteng [1]
Ming, Anlong [1]
Affiliations
[1] Beijing University of Posts and Telecommunications, Beijing, People's Republic of China
Source
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022 | 2022
DOI
10.1109/ICRA46639.2022.9811937
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The performance of monocular depth estimation generally depends on the number of parameters and the computational cost. This leads to a large accuracy gap between light-weight and heavy-weight networks, which limits their real-world application. In this paper, we model the majority of the accuracy gap between them as a difference in depth distribution, which we call "distribution drift". To address it, we propose a distribution alignment network (DANet). We first design a pyramid scene transformer (PST) module to capture inter-region interaction at multiple scales. By perceiving the difference in depth features between every two regions, DANet tends to predict a reasonable scene structure, which fits the shape of the distribution to the ground truth. Then, we propose a local-global optimization (LGO) scheme to supervise the global range of scene depth. Thanks to the alignment of depth distribution shape and scene depth range, DANet sharply alleviates the distribution drift and achieves performance comparable to prior heavy-weight methods while using only 1% of their floating-point operations (FLOPs). Experiments on two datasets, the widely used NYUDv2 dataset and the more challenging iBims-1 dataset, demonstrate the effectiveness of our method. The source code is available at https://github.com/YiLiM1/DANet.
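The notion of distribution drift in the abstract can be illustrated with a small sketch. This is not the DANet implementation or its loss: it only compares the histogram of predicted depths with the histogram of ground-truth depths and reports their ranges. The function names, the bin count, and the 0 to 10 m range are assumptions chosen for illustration.

```python
from typing import List, Sequence, Tuple


def depth_histogram(depths: Sequence[float], lo: float, hi: float, bins: int) -> List[float]:
    """Normalized histogram of depth values over [lo, hi]."""
    if not depths:
        raise ValueError("depths must be non-empty")
    counts = [0] * bins
    width = (hi - lo) / bins
    for d in depths:
        # Clamp so out-of-range values fall into the edge bins.
        idx = int((min(max(d, lo), hi) - lo) / width)
        counts[min(idx, bins - 1)] += 1
    total = len(depths)
    return [c / total for c in counts]


def distribution_drift(pred: Sequence[float], gt: Sequence[float],
                       lo: float = 0.0, hi: float = 10.0, bins: int = 10) -> float:
    """Total variation distance between the two depth histograms.

    0.0 means identical histograms, 1.0 means no overlap.
    This metric is an illustrative assumption, not the paper's definition.
    """
    p = depth_histogram(pred, lo, hi, bins)
    q = depth_histogram(gt, lo, hi, bins)
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))


def depth_range(depths: Sequence[float]) -> Tuple[float, float]:
    """Minimum and maximum depth, i.e. the global range of the scene."""
    return min(depths), max(depths)


# Toy depths in metres (made-up values): the prediction is squeezed
# toward the middle of the scene, so both shape and range differ.
gt = [1.0, 2.0, 3.0, 4.0]
pred = [2.0, 2.2, 2.8, 3.0]
drift = distribution_drift(pred, gt)
pred_range = depth_range(pred)
gt_range = depth_range(gt)
```

In this toy case the predicted range (2.0 to 3.0) is narrower than the ground-truth range (1.0 to 4.0), and the histogram distance is nonzero. These are the two aspects the abstract describes aligning: distribution shape and scene depth range.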
Pages: 6548-6555
Page count: 8