Geometric Pretraining for Monocular Depth Estimation

被引:0
|
作者
Wang, Kaixuan [1 ]
Chen, Yao [2 ]
Guo, Hengkai [2 ]
Wen, Linfu [2 ]
Shen, Shaojie [1 ]
机构
[1] HKUST, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[2] ByteDance AI Lab, Beijing, Peoples R China
关键词
D O I
10.1109/icra40945.2020.9196847
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
ImageNet-pretrained networks have been widely used in transfer learning for monocular depth estimation. These pretrained networks are trained with classification losses for which only semantic information is exploited while spatial information is ignored. However, both semantic and spatial information is important for per-pixel depth estimation. In this paper, we design a novel self-supervised geometric pretraining task that is tailored for monocular depth estimation using uncalibrated videos. The designed task decouples the structure information from input videos by a simple yet effective conditional autoencoder-decoder structure. Using almost unlimited videos from the internet, networks are pretrained to capture a variety of structures of the scene and can be easily transferred to depth estimation tasks using calibrated images. Extensive experiments are used to demonstrate that the proposed geometric-pretrained networks perform better than ImageNet-pretrained networks in terms of accuracy, few-shot learning and generalization ability. Using existing learning methods, geometric-transferred networks achieve new state-of-the-art results by a large margin. The pretrained networks will be open source soon(1).
引用
收藏
页码:4782 / 4788
页数:7
相关论文
共 50 条
  • [21] Monocular depth estimation using self-supervised learning with more effective geometric constraints
    Xiong, Mingkang
    Zhang, Zhenghong
    Liu, Jiyuan
    Zhang, Tao
    Xiong, Huilin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 128
  • [22] Monocular Depth Estimation Using Relative Depth Maps
    Lee, Jae-Han
    Kim, Chang-Su
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9721 - 9730
  • [23] Monocular Depth Estimation with Sharp Boundary
    Yang, Xin
    Chang, Qingling
    Xu, Shiting
    Liu, Xinlin
    Cui, Yan
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 136 (01): : 573 - 592
  • [24] Aperture Supervision for Monocular Depth Estimation
    Srinivasan, Pratul P.
    Garg, Rahul
    Wadhwa, Neal
    Ng, Ren
    Barron, Jonathan T.
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6393 - 6401
  • [25] Monocular Depth Estimation for Mobile Device
    Lee, Yongsik
    Lee, Seungjae
    Ko, Jong Gook
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2021,
  • [26] Unsupervised Monocular Depth and Pose Estimation Using Multiple Masks Based on Photometric and Geometric Consistency
    Kong, Huifang
    Liu, Tiankuo
    Hu, Jie
    Fang, Yao
    Sun, Jixing
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3558 - 3563
  • [27] Monocular Depth Estimation With Augmented Ordinal Depth Relationships
    Cao, Yuanzhouhan
    Zhao, Tianqi
    Xian, Ke
    Shen, Chunhua
    Cao, Zhiguo
    Xu, Shugong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (08) : 2674 - 2682
  • [28] Monocular depth estimation with SPN loss
    Mathew, Alwyn
    Mathew, Jimson
    IMAGE AND VISION COMPUTING, 2020, 100
  • [29] Monocular Depth Estimation: A Thorough Review
    Arampatzakis, Vasileios
    Pavlidis, George
    Mitianoudis, Nikolaos
    Papamarkos, Nikos
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (04) : 2396 - 2414
  • [30] Fast Monocular Depth Estimation on an FPGA
    Sada, Youki
    Soga, Naoto
    Shimoda, Masayuki
    Jinguji, Akira
    Sato, Shimpei
    Nakahara, Hiroki
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 143 - 146