Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem

被引:15
|
作者
Chen, Xingyu [1 ]
Zhang, Ruonan [1 ]
Jiang, Ji [1 ]
Wang, Yan [1 ]
Li, Ge [1 ]
Li, Thomas H. [1 ,2 ,3 ]
机构
[1] Peking Univ, Sch Elect & Comp Engn, Beijing, Peoples R China
[2] Peking Univ, Adv Inst Informat Technol, Beijing, Peoples R China
[3] Peking Univ, Informat Technol R&D Innovat Ctr, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/WACV56688.2023.00573
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-supervised monocular depth estimation (MDE) models universally suffer from the notorious edge-fattening issue. Triplet loss, as a widespread metric learning strategy, has largely succeeded in many computer vision applications. In this paper, we redesign the patch-based triplet loss in MDE to alleviate the ubiquitous edge-fattening issue. We show two drawbacks of the raw triplet loss in MDE and demonstrate our problem-driven redesigns. First, we present a min. operator based strategy applied to all negative samples, to prevent well-performing negatives sheltering the error of edge-fattening negatives. Second, we split the anchor-positive distance and anchor-negative distance from within the original triplet, which directly optimizes the positives without any mutual effect with the negatives. Extensive experiments show the combination of these two small redesigns can achieve unprecedented results: Our powerful and versatile triplet loss not only makes our model outperform all previous SoTA by a large margin, but also provides substantial performance boosts to a large number of existing models, while introducing no extra inference computation at all.
引用
收藏
页码:5765 / 5775
页数:11
相关论文
共 50 条
  • [41] Self-Supervised Deep Monocular Depth Estimation With Ambiguity Boosting
    Bello, Juan Luis Gonzalez
    Kim, Munchurl
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 9131 - 9149
  • [42] MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer
    Zhao, Chaoqiang
    Zhang, Youmin
    Poggi, Matteo
    Tosi, Fabio
    Guo, Xianda
    Zhu, Zheng
    Huang, Guan
    Tang, Yang
    Mattoccia, Stefano
    2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 668 - 678
  • [43] Excavating the Potential Capacity of Self-Supervised Monocular Depth Estimation
    Peng, Rui
    Wang, Ronggang
    Lai, Yawen
    Tang, Luyang
    Cai, Yangang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15540 - 15549
  • [44] A LIGHTWEIGHT SELF-SUPERVISED TRAINING FRAMEWORK FOR MONOCULAR DEPTH ESTIMATION
    Heydrich, Tim
    Yang, Yimin
    Du, Shan
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2265 - 2269
  • [45] Constant Velocity Constraints for Self-Supervised Monocular Depth Estimation
    Zhou, Hang
    Greenwood, David
    Taylor, Sarah
    Gong, Han
    CVMP 2020: THE 17TH ACM SIGGRAPH EUROPEAN CONFERENCE ON VISUAL MEDIA PRODUCTION, 2020,
  • [46] Transferring knowledge from monocular completion for self-supervised monocular depth estimation
    Sun, Lin
    Li, Yi
    Liu, Bingzheng
    Xu, Liying
    Zhang, Zhe
    Zhu, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42485 - 42495
  • [47] Transferring knowledge from monocular completion for self-supervised monocular depth estimation
    Lin Sun
    Yi Li
    Bingzheng Liu
    Liying Xu
    Zhe Zhang
    Jie Zhu
    Multimedia Tools and Applications, 2022, 81 : 42485 - 42495
  • [48] Self-Supervised Monocular Depth Hints
    Watson, Jamie
    Firman, Michael
    Brostow, Gabriel J.
    Turmukhambetov, Daniyar
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2162 - 2171
  • [49] Self-Supervised Monocular Depth Underwater
    Amitai, Shlomi
    Klein, Itzik
    Treibitz, Tali
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 1098 - 1104
  • [50] RA-Depth: Resolution Adaptive Self-supervised Monocular Depth Estimation
    He, Mu
    Hui, Le
    Bian, Yikai
    Ren, Jian
    Xie, Jin
    Yang, Jian
    COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 565 - 581