Switchable-Encoder-Based Self-Supervised Learning Framework for Monocular Depth and Pose Estimation

被引:1
|
作者
Kim, Junoh [1 ]
Gao, Rui [1 ]
Park, Jisun [1 ]
Yoon, Jinsoo [2 ]
Cho, Kyungeun [3 ]
机构
[1] Dongguk Univ Seoul, Dept Multimedia Engn, 30 Pildongro 1 Gil, Seoul 04620, South Korea
[2] KoROAD Korea Rd Traff Author, Autonomous Driving Res Dept, 2 Hyeoksin Ro, Wonu Si 26466, Gangwon Do, South Korea
[3] Dongguk Univ Seoul, Div AI Software Convergence, 30,Pildongro 1 Gil, Seoul 04620, South Korea
关键词
structure from motion; self-supervised learning; monocular depth estimation; VISUAL ODOMETRY; DEEP;
D O I
10.3390/rs15245739
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Monocular depth prediction research is essential for expanding meaning from 2D to 3D. Recent studies have focused on the application of a newly proposed encoder; however, the development within the self-supervised learning framework remains unexplored, an aspect critical for advancing foundational models of 3D semantic interpretation. Addressing the dynamic nature of encoder-based research, especially in performance evaluations for feature extraction and pre-trained models, this research proposes the switchable encoder learning framework (SELF). SELF enhances versatility by enabling the seamless integration of diverse encoders in a self-supervised learning context for depth prediction. This integration is realized through the direct transfer of feature information from the encoder and by standardizing the input structure of the decoder to accommodate various encoder architectures. Furthermore, the framework is extended and incorporated into an adaptable decoder for depth prediction and camera pose learning, employing standard loss functions. Comparative experiments with previous frameworks using the same encoder reveal that SELF achieves a 7% reduction in parameters while enhancing performance. Remarkably, substituting newly proposed algorithms in place of an encoder improves the outcomes as well as significantly decreases the number of parameters by 23%. The experimental findings highlight the ability of SELF to broaden depth factors, such as depth consistency. This framework facilitates the objective selection of algorithms as a backbone for extended research in monocular depth prediction.
引用
收藏
页数:25
相关论文
共 50 条
  • [41] Depth Estimation Based on Monocular Camera Sensors in Autonomous Vehicles: A Self-supervised Learning Approach
    Li, Guofa
    Chi, Xingyu
    Qu, Xingda
    AUTOMOTIVE INNOVATION, 2023, 6 (02) : 268 - 280
  • [42] Depth Estimation Based on Monocular Camera Sensors in Autonomous Vehicles: A Self-supervised Learning Approach
    Guofa Li
    Xingyu Chi
    Xingda Qu
    Automotive Innovation, 2023, 6 : 268 - 280
  • [43] Enhancing Self-supervised Monocular Depth Estimation via Piece-Wise Pose Estimation and Geometric Constraints
    Shyam, Pranjay
    Okon, Alexandre
    Yoo, HyunJin
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 221 - 231
  • [44] Monocular Depth Estimation via Self-Supervised Self-Distillation
    Hu, Haifeng
    Feng, Yuyang
    Li, Dapeng
    Zhang, Suofei
    Zhao, Haitao
    SENSORS, 2024, 24 (13)
  • [45] Self-Supervised Monocular Depth Estimation by Digging into Uncertainty Quantification
    Li, Yuan-Zhen
    Zheng, Sheng-Jie
    Tan, Zi-Xin
    Cao, Tuo
    Luo, Fei
    Xiao, Chun-Xia
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (03) : 510 - 525
  • [46] MonoVAN: Visual Attention for Self-Supervised Monocular Depth Estimation
    Indyk, Ilia
    Makarov, Ilya
    2023 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY, ISMAR, 2023, : 1211 - 1220
  • [47] Frequency-Aware Self-Supervised Monocular Depth Estimation
    Chen, Xingyu
    Li, Thomas H.
    Zhang, Ruonan
    Li, Ge
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5797 - 5806
  • [48] Self-Supervised Scale Recovery for Monocular Depth and Egomotion Estimation
    Wagstaff, Brandon
    Kelly, Jonathan
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2620 - 2627
  • [49] Graph semantic information for self-supervised monocular depth estimation
    Zhang, Dongdong
    Wang, Chunping
    Wang, Huiying
    Fu, Qiang
    PATTERN RECOGNITION, 2024, 156
  • [50] Image Masking for Robust Self-Supervised Monocular Depth Estimation
    Chawla, Hemang
    Jeeveswaran, Kishaan
    Arani, Elahe
    Zonooz, Bahram
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10054 - 10060