Switchable-Encoder-Based Self-Supervised Learning Framework for Monocular Depth and Pose Estimation

被引：1

作者：

Kim, Junoh ^{[1
]}

Gao, Rui ^{[1
]}

Park, Jisun ^{[1
]}

Yoon, Jinsoo ^{[2
]}

Cho, Kyungeun ^{[3
]}

机构：

[1] Dongguk Univ Seoul, Dept Multimedia Engn, 30 Pildongro 1 Gil, Seoul 04620, South Korea

[2] KoROAD Korea Rd Traff Author, Autonomous Driving Res Dept, 2 Hyeoksin Ro, Wonu Si 26466, Gangwon Do, South Korea

[3] Dongguk Univ Seoul, Div AI Software Convergence, 30,Pildongro 1 Gil, Seoul 04620, South Korea

来源：

REMOTE SENSING | 2023年 / 15卷 / 24期

关键词：

structure from motion; self-supervised learning; monocular depth estimation; VISUAL ODOMETRY; DEEP;

D O I：

10.3390/rs15245739

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Monocular depth prediction research is essential for expanding meaning from 2D to 3D. Recent studies have focused on the application of a newly proposed encoder; however, the development within the self-supervised learning framework remains unexplored, an aspect critical for advancing foundational models of 3D semantic interpretation. Addressing the dynamic nature of encoder-based research, especially in performance evaluations for feature extraction and pre-trained models, this research proposes the switchable encoder learning framework (SELF). SELF enhances versatility by enabling the seamless integration of diverse encoders in a self-supervised learning context for depth prediction. This integration is realized through the direct transfer of feature information from the encoder and by standardizing the input structure of the decoder to accommodate various encoder architectures. Furthermore, the framework is extended and incorporated into an adaptable decoder for depth prediction and camera pose learning, employing standard loss functions. Comparative experiments with previous frameworks using the same encoder reveal that SELF achieves a 7% reduction in parameters while enhancing performance. Remarkably, substituting newly proposed algorithms in place of an encoder improves the outcomes as well as significantly decreases the number of parameters by 23%. The experimental findings highlight the ability of SELF to broaden depth factors, such as depth consistency. This framework facilitates the objective selection of algorithms as a backbone for extended research in monocular depth prediction.

引用

页数：25

共 50 条

[31] Adaptive Self-supervised Depth Estimation in Monocular Videos
Mendoza, Julio
Pedrini, Helio
IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 687 - 699
[32] Self-supervised monocular depth estimation with direct methods
Wang, Haixia
Sun, Yehao
Wu, Q. M. Jonathan
Lu, Xiao
Wang, Xiuling
Zhang, Zhiguo
NEUROCOMPUTING, 2021, 421 : 340 - 348
[33] Self-supervised monocular depth estimation with direct methods
Wang H.
Sun Y.
Wu Q.M.J.
Lu X.
Wang X.
Zhang Z.
Neurocomputing, 2021, 421 : 340 - 348
[34] Self-Supervised Monocular Depth Estimation With Extensive Pretraining
Choi, Hyukdoo
IEEE ACCESS, 2021, 9 : 157236 - 157246
[35] Self-Supervised Monocular Depth Estimation with Extensive Pretraining
Choi, Hyukdoo
IEEE Access, 2021, 9 : 157236 - 157246
[36] TinyDepth: Lightweight self-supervised monocular depth estimation based on transformer
Cheng, Zeyu
Zhang, Yi
Yu, Yang
Song, Zhe
Tang, Chengkai
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
[37] Self-Supervised Learning for Monocular Depth Estimation on Minimally Invasive Surgery Scenes
Shao, Shuwei
Pei, Zhongcai
Chen, Weihai
Zhang, Baochang
Wu, Xingming
Sun, Dianmin
Doermann, David
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 7159 - 7165
[38] Self-Supervised Monocular Depth Estimation via Binocular Geometric Correlation Learning
Peng, Bo
Sun, Lin
Lei, Jianjun
Liu, Bingzheng
Shen, Haifeng
Li, Wanqing
Huang, Qingming
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (08)
[39] SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning
Choi, Jaehoon
Jung, Dongki
Lee, Yonghan
Kim, Deokhwa
Manocha, Dinesh
Lee, Donghwan
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6511 - 6518
[40] Monocular Depth Estimation with Self-Supervised Learning for Vineyard Unmanned Agricultural Vehicle
Cui, Xue-Zhi
Feng, Quan
Wang, Shu-Zhi
Zhang, Jian-Hua
SENSORS, 2022, 22 (03)

← 1 2 3 4 5 →