Switchable-Encoder-Based Self-Supervised Learning Framework for Monocular Depth and Pose Estimation

被引：1

作者：

Kim, Junoh ^{[1
]}

Gao, Rui ^{[1
]}

Park, Jisun ^{[1
]}

Yoon, Jinsoo ^{[2
]}

Cho, Kyungeun ^{[3
]}

机构：

[1] Dongguk Univ Seoul, Dept Multimedia Engn, 30 Pildongro 1 Gil, Seoul 04620, South Korea

[2] KoROAD Korea Rd Traff Author, Autonomous Driving Res Dept, 2 Hyeoksin Ro, Wonu Si 26466, Gangwon Do, South Korea

[3] Dongguk Univ Seoul, Div AI Software Convergence, 30,Pildongro 1 Gil, Seoul 04620, South Korea

来源：

REMOTE SENSING | 2023年 / 15卷 / 24期

关键词：

structure from motion; self-supervised learning; monocular depth estimation; VISUAL ODOMETRY; DEEP;

D O I：

10.3390/rs15245739

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Monocular depth prediction research is essential for expanding meaning from 2D to 3D. Recent studies have focused on the application of a newly proposed encoder; however, the development within the self-supervised learning framework remains unexplored, an aspect critical for advancing foundational models of 3D semantic interpretation. Addressing the dynamic nature of encoder-based research, especially in performance evaluations for feature extraction and pre-trained models, this research proposes the switchable encoder learning framework (SELF). SELF enhances versatility by enabling the seamless integration of diverse encoders in a self-supervised learning context for depth prediction. This integration is realized through the direct transfer of feature information from the encoder and by standardizing the input structure of the decoder to accommodate various encoder architectures. Furthermore, the framework is extended and incorporated into an adaptable decoder for depth prediction and camera pose learning, employing standard loss functions. Comparative experiments with previous frameworks using the same encoder reveal that SELF achieves a 7% reduction in parameters while enhancing performance. Remarkably, substituting newly proposed algorithms in place of an encoder improves the outcomes as well as significantly decreases the number of parameters by 23%. The experimental findings highlight the ability of SELF to broaden depth factors, such as depth consistency. This framework facilitates the objective selection of algorithms as a backbone for extended research in monocular depth prediction.

引用

页数：25

共 50 条

[21] Dense Depth Estimation in Monocular Endoscopy With Self-Supervised Learning Methods
Liu, Xingtong
Sinha, Ayushi
Ishii, Masaru
Hager, Gregory D.
Reiter, Austin
Taylor, Russell H.
Unberath, Mathias
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (05) : 1438 - 1447
[22] ATTENTION-BASED SELF-SUPERVISED LEARNING MONOCULAR DEPTH ESTIMATION WITH EDGE REFINEMENT
Jiang, Chenweinan
Liu, Haichun
Li, Lanzhen
Pan, Changchun
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3218 - 3222
[23] Monocular depth estimation for vision-based vehicles based on a self-supervised learning method
Tektonidis, Marco
Monnin, David
AUTONOMOUS SYSTEMS: SENSORS, PROCESSING, AND SECURITY FOR VEHICLES AND INFRASTRUCTURE 2020, 2020, 11415
[24] Semantically guided self-supervised monocular depth estimation
Lu, Xiao
Sun, Haoran
Wang, Xiuling
Zhang, Zhiguo
Wang, Haixia
IET IMAGE PROCESSING, 2022, 16 (05) : 1293 - 1304
[25] Self-Supervised Monocular Scene Decomposition and Depth Estimation
Safadoust, Sadra
Guney, Fatma
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 627 - 636
[26] Joint Self-Supervised Monocular Depth Estimation and SLAM
Xing, Xiaoxia
Cai, Yinghao
Lu, Tao
Yang, Yiping
Wen, Dayong
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4030 - 4036
[27] Learn to Adapt for Self-Supervised Monocular Depth Estimation
Sun, Qiyu
Yen, Gary G.
Tang, Yang
Zhao, Chaoqiang
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15647 - 15659
[28] Self-Supervised Monocular Depth Estimation With Multiscale Perception
Zhang, Yourun
Gong, Maoguo
Li, Jianzhao
Zhang, Mingyang
Jiang, Fenlong
Zhao, Hongyu
IEEE Transactions on Image Processing, 2022, 31 : 3251 - 3266
[29] Self-Supervised Monocular Depth Estimation With Multiscale Perception
Zhang, Yourun
Gong, Maoguo
Li, Jianzhao
Zhang, Mingyang
Jiang, Fenlong
Zhao, Hongyu
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3251 - 3266
[30] Self-supervised monocular depth estimation for gastrointestinal endoscopy
Liu, Yuying
Zuo, Siyang
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 238

← 1 2 3 4 5 →