Switchable-Encoder-Based Self-Supervised Learning Framework for Monocular Depth and Pose Estimation

被引:1
|
作者
Kim, Junoh [1 ]
Gao, Rui [1 ]
Park, Jisun [1 ]
Yoon, Jinsoo [2 ]
Cho, Kyungeun [3 ]
机构
[1] Dongguk Univ Seoul, Dept Multimedia Engn, 30 Pildongro 1 Gil, Seoul 04620, South Korea
[2] KoROAD Korea Rd Traff Author, Autonomous Driving Res Dept, 2 Hyeoksin Ro, Wonu Si 26466, Gangwon Do, South Korea
[3] Dongguk Univ Seoul, Div AI Software Convergence, 30,Pildongro 1 Gil, Seoul 04620, South Korea
关键词
structure from motion; self-supervised learning; monocular depth estimation; VISUAL ODOMETRY; DEEP;
D O I
10.3390/rs15245739
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Monocular depth prediction research is essential for expanding meaning from 2D to 3D. Recent studies have focused on the application of a newly proposed encoder; however, the development within the self-supervised learning framework remains unexplored, an aspect critical for advancing foundational models of 3D semantic interpretation. Addressing the dynamic nature of encoder-based research, especially in performance evaluations for feature extraction and pre-trained models, this research proposes the switchable encoder learning framework (SELF). SELF enhances versatility by enabling the seamless integration of diverse encoders in a self-supervised learning context for depth prediction. This integration is realized through the direct transfer of feature information from the encoder and by standardizing the input structure of the decoder to accommodate various encoder architectures. Furthermore, the framework is extended and incorporated into an adaptable decoder for depth prediction and camera pose learning, employing standard loss functions. Comparative experiments with previous frameworks using the same encoder reveal that SELF achieves a 7% reduction in parameters while enhancing performance. Remarkably, substituting newly proposed algorithms in place of an encoder improves the outcomes as well as significantly decreases the number of parameters by 23%. The experimental findings highlight the ability of SELF to broaden depth factors, such as depth consistency. This framework facilitates the objective selection of algorithms as a backbone for extended research in monocular depth prediction.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] Dense Depth Estimation in Monocular Endoscopy With Self-Supervised Learning Methods
    Liu, Xingtong
    Sinha, Ayushi
    Ishii, Masaru
    Hager, Gregory D.
    Reiter, Austin
    Taylor, Russell H.
    Unberath, Mathias
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (05) : 1438 - 1447
  • [22] ATTENTION-BASED SELF-SUPERVISED LEARNING MONOCULAR DEPTH ESTIMATION WITH EDGE REFINEMENT
    Jiang, Chenweinan
    Liu, Haichun
    Li, Lanzhen
    Pan, Changchun
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3218 - 3222
  • [23] Monocular depth estimation for vision-based vehicles based on a self-supervised learning method
    Tektonidis, Marco
    Monnin, David
    AUTONOMOUS SYSTEMS: SENSORS, PROCESSING, AND SECURITY FOR VEHICLES AND INFRASTRUCTURE 2020, 2020, 11415
  • [24] Semantically guided self-supervised monocular depth estimation
    Lu, Xiao
    Sun, Haoran
    Wang, Xiuling
    Zhang, Zhiguo
    Wang, Haixia
    IET IMAGE PROCESSING, 2022, 16 (05) : 1293 - 1304
  • [25] Self-Supervised Monocular Scene Decomposition and Depth Estimation
    Safadoust, Sadra
    Guney, Fatma
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 627 - 636
  • [26] Joint Self-Supervised Monocular Depth Estimation and SLAM
    Xing, Xiaoxia
    Cai, Yinghao
    Lu, Tao
    Yang, Yiping
    Wen, Dayong
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4030 - 4036
  • [27] Learn to Adapt for Self-Supervised Monocular Depth Estimation
    Sun, Qiyu
    Yen, Gary G.
    Tang, Yang
    Zhao, Chaoqiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15647 - 15659
  • [28] Self-Supervised Monocular Depth Estimation With Multiscale Perception
    Zhang, Yourun
    Gong, Maoguo
    Li, Jianzhao
    Zhang, Mingyang
    Jiang, Fenlong
    Zhao, Hongyu
    IEEE Transactions on Image Processing, 2022, 31 : 3251 - 3266
  • [29] Self-Supervised Monocular Depth Estimation With Multiscale Perception
    Zhang, Yourun
    Gong, Maoguo
    Li, Jianzhao
    Zhang, Mingyang
    Jiang, Fenlong
    Zhao, Hongyu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3251 - 3266
  • [30] Self-supervised monocular depth estimation for gastrointestinal endoscopy
    Liu, Yuying
    Zuo, Siyang
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 238