Monocular Depth Distribution Alignment with Low Computation

被引:0
|
作者
Sheng, Fei [1 ]
Xue, Feng [1 ]
Chang, Yicong [1 ]
Liang, Wenteng [1 ]
Ming, Anlong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022 | 2022年
关键词
D O I
10.1109/ICRA46639.2022.9811937
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of monocular depth estimation generally depends on the amount of parameters and computational cost. It leads to a large accuracy contrast between light-weight networks and heavy-weight networks, which limits their application in the real world. In this paper, we model the majority of accuracy contrast between them as the difference of depth distribution, which we call 'Distribution drift'. To this end, a distribution alignment network (DANet) is proposed. We firstly design a pyramid scene transformer (PST) module to capture inter-region interaction in multiple scales. By perceiving the difference of depth features between every two regions, DANet tends to predict a reasonable scene structure, which fits the shape of distribution to ground truth. Then, we propose a local-global optimization (LGO) scheme to realize the supervision of global range of scene depth. Thanks to the alignment of depth distribution shape and scene depth range, DANet sharply alleviates the distribution drift, and achieves a comparable performance with prior heavy-weight methods, but uses only 1% floating-point operations per second (FLOPs) of them. The experiments on two datasets, namely the widely used NYUDv2 dataset and the more challenging iBims-1 dataset, demonstrate the effectiveness of our method. The source code is available at https://github.com/YiLiM1/DANet.
引用
收藏
页码:6548 / 6555
页数:8
相关论文
共 50 条
  • [31] Monocular Depth Estimation With Augmented Ordinal Depth Relationships
    Cao, Yuanzhouhan
    Zhao, Tianqi
    Xian, Ke
    Shen, Chunhua
    Cao, Zhiguo
    Xu, Shugong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (08) : 2674 - 2682
  • [32] Self-Supervised Monocular Depth Estimation in the Dark: Towards Data Distribution Compensation
    Yang, Haolin
    Zhao, Chaoqiang
    Sheng, Lu
    Tang, Yang
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 1561 - 1569
  • [33] IMPROVING MONOCULAR SLAM INVERSE DEPTH PARAMETERIZATION COMPUTATION TIME VIA SOFTWARE PROFILING AND PARALLEL MATRIX MULTIPLICATION
    Idris, Mohd Yamani Idna
    Arof, Hamzah
    Noor, Noorzaily Mohamed
    Tamil, Emran Mohd
    Razak, Zaidi
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2011, 7 (11): : 6273 - 6287
  • [34] Monocular and binocular depth discrimination thresholds
    Kaye, SB
    Siddiqui, A
    Ward, A
    Noonan, C
    Fisher, AC
    Green, JR
    Brown, MC
    Wareing, PA
    Watt, P
    OPTOMETRY AND VISION SCIENCE, 1999, 76 (11) : 770 - 782
  • [35] Inverse Depth Parametrization for Monocular SLAM
    Civera, Javier
    Davison, Andrew J.
    Montiel, J. M. Martinez
    IEEE TRANSACTIONS ON ROBOTICS, 2008, 24 (05) : 932 - 945
  • [36] EVALUATING MONOCULAR DEPTH ESTIMATION METHODS
    Padkan, N.
    Trybala, P.
    Battisti, R.
    Remondino, F.
    Bergeret, C.
    2ND GEOBENCH WORKSHOP ON EVALUATION AND BENCHMARKING OF SENSORS, SYSTEMS AND GEOSPATIAL DATA IN PHOTOGRAMMETRY AND REMOTE SENSING, VOL. 48-1, 2023, : 137 - 144
  • [37] DepthNet: A Monocular Depth Estimation Framework
    Anunay
    Pankaj
    Dhiman, Chhavi
    2021 7TH INTERNATIONAL CONFERENCE ON ENGINEERING AND EMERGING TECHNOLOGIES (ICEET 2021), 2021, : 495 - 500
  • [38] Monocular Depth Estimation for Equirectangular Videos
    Fraser, Helmi
    Wang, Sen
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 5293 - 5299
  • [39] MONOCULAR DEPTH ESTIMATION IN FOREST ENVIRONMENTS
    Hristova, H.
    Abegg, M.
    Fischer, C.
    Rehush, N.
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 43-B2 : 1017 - 1023
  • [40] The role of colour as a monocular depth cue
    Troscianko, T.
    Montagnon, R.
    Le Clerc, J.
    Malbert, E.
    Chanteau, P-L
    PERCEPTION, 1990, 19 (04) : 340 - 340