Switchable-Encoder-Based Self-Supervised Learning Framework for Monocular Depth and Pose Estimation

被引:1
|
作者
Kim, Junoh [1 ]
Gao, Rui [1 ]
Park, Jisun [1 ]
Yoon, Jinsoo [2 ]
Cho, Kyungeun [3 ]
机构
[1] Dongguk Univ Seoul, Dept Multimedia Engn, 30 Pildongro 1 Gil, Seoul 04620, South Korea
[2] KoROAD Korea Rd Traff Author, Autonomous Driving Res Dept, 2 Hyeoksin Ro, Wonu Si 26466, Gangwon Do, South Korea
[3] Dongguk Univ Seoul, Div AI Software Convergence, 30,Pildongro 1 Gil, Seoul 04620, South Korea
关键词
structure from motion; self-supervised learning; monocular depth estimation; VISUAL ODOMETRY; DEEP;
D O I
10.3390/rs15245739
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Monocular depth prediction research is essential for expanding meaning from 2D to 3D. Recent studies have focused on the application of a newly proposed encoder; however, the development within the self-supervised learning framework remains unexplored, an aspect critical for advancing foundational models of 3D semantic interpretation. Addressing the dynamic nature of encoder-based research, especially in performance evaluations for feature extraction and pre-trained models, this research proposes the switchable encoder learning framework (SELF). SELF enhances versatility by enabling the seamless integration of diverse encoders in a self-supervised learning context for depth prediction. This integration is realized through the direct transfer of feature information from the encoder and by standardizing the input structure of the decoder to accommodate various encoder architectures. Furthermore, the framework is extended and incorporated into an adaptable decoder for depth prediction and camera pose learning, employing standard loss functions. Comparative experiments with previous frameworks using the same encoder reveal that SELF achieves a 7% reduction in parameters while enhancing performance. Remarkably, substituting newly proposed algorithms in place of an encoder improves the outcomes as well as significantly decreases the number of parameters by 23%. The experimental findings highlight the ability of SELF to broaden depth factors, such as depth consistency. This framework facilitates the objective selection of algorithms as a backbone for extended research in monocular depth prediction.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] Adaptive Self-supervised Depth Estimation in Monocular Videos
    Mendoza, Julio
    Pedrini, Helio
    IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 687 - 699
  • [32] Self-supervised monocular depth estimation with direct methods
    Wang, Haixia
    Sun, Yehao
    Wu, Q. M. Jonathan
    Lu, Xiao
    Wang, Xiuling
    Zhang, Zhiguo
    NEUROCOMPUTING, 2021, 421 : 340 - 348
  • [33] Self-supervised monocular depth estimation with direct methods
    Wang H.
    Sun Y.
    Wu Q.M.J.
    Lu X.
    Wang X.
    Zhang Z.
    Neurocomputing, 2021, 421 : 340 - 348
  • [34] Self-Supervised Monocular Depth Estimation With Extensive Pretraining
    Choi, Hyukdoo
    IEEE ACCESS, 2021, 9 : 157236 - 157246
  • [35] Self-Supervised Monocular Depth Estimation with Extensive Pretraining
    Choi, Hyukdoo
    IEEE Access, 2021, 9 : 157236 - 157246
  • [36] TinyDepth: Lightweight self-supervised monocular depth estimation based on transformer
    Cheng, Zeyu
    Zhang, Yi
    Yu, Yang
    Song, Zhe
    Tang, Chengkai
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [37] Self-Supervised Learning for Monocular Depth Estimation on Minimally Invasive Surgery Scenes
    Shao, Shuwei
    Pei, Zhongcai
    Chen, Weihai
    Zhang, Baochang
    Wu, Xingming
    Sun, Dianmin
    Doermann, David
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 7159 - 7165
  • [38] Self-Supervised Monocular Depth Estimation via Binocular Geometric Correlation Learning
    Peng, Bo
    Sun, Lin
    Lei, Jianjun
    Liu, Bingzheng
    Shen, Haifeng
    Li, Wanqing
    Huang, Qingming
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (08)
  • [39] SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning
    Choi, Jaehoon
    Jung, Dongki
    Lee, Yonghan
    Kim, Deokhwa
    Manocha, Dinesh
    Lee, Donghwan
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6511 - 6518
  • [40] Monocular Depth Estimation with Self-Supervised Learning for Vineyard Unmanned Agricultural Vehicle
    Cui, Xue-Zhi
    Feng, Quan
    Wang, Shu-Zhi
    Zhang, Jian-Hua
    SENSORS, 2022, 22 (03)