APAC-Net: Unsupervised Learning of Depth and Ego-Motion from Monocular Video

被引:0
|
作者
Lin, Rui [1 ]
Lu, Yao [1 ]
Lu, Guangming [1 ]
机构
[1] Harbin Inst Technol ShenZhen, Shenzhen 518055, Peoples R China
关键词
Depth estimation; Ego-motion estimation; Attention mechanism;
D O I
10.1007/978-3-030-36189-1_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an unsupervised novel method, Attention-Pixel and Attention-Channel Network (APAC-Net), for unsupervised monocular learning of estimating scene depth and ego-motion. Our model only utilizes monocular image sequences and does not need additional sensor information, such as IMU and GPS, for supervising. The attention mechanism is employed in APAC-Net to improve the networks' efficiency. Specifically, three attention modules are proposed to adjust feature weights when training. Moreover, to minimum the effect of noise, which is produced in the reconstruction processing, the Image-reconstruction loss based on PSNR LPSNR is used to evaluation the reconstruction quality. In addition, due to the fail depth estimation of the objects closed to camera, the Temporal-consistency loss LTemp between adjacent frames and the Scale-based loss LScale among different scales are proposed. Experimental results showed APAC-Net can perform well in both the depth and ego-motion tasks, and it even behaved better in several items on KITTI and Cityscapes.
引用
收藏
页码:336 / 348
页数:13
相关论文
共 50 条
  • [31] Influence of Neural Network Receptive Field on Monocular Depth and Ego-Motion Estimation
    S. A. Linok
    D. A. Yudin
    Optical Memory and Neural Networks, 2023, 32 : S206 - S213
  • [32] Influence of Neural Network Receptive Field on Monocular Depth and Ego-Motion Estimation
    Linok, S. A.
    Yudin, D. A.
    OPTICAL MEMORY AND NEURAL NETWORKS, 2023, 32 (Suppl 2) : S206 - S213
  • [33] Monocular Depth and Ego-motion Estimation with Scale Based on Superpixel and Normal Constraints
    Lu, Junxin
    Gao, Yongbin
    Chen, Jieyu
    Hwang, Jeng-Neng
    Fuji, Hamido
    Fang, Zhijun
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (10)
  • [34] Unsupervised Depth Estimation from Monocular Video based on Relative Motion
    Cao, Hui
    Wang, Chao
    Wang, Ping
    Zou, Qingquan
    Xiao, Xiao
    2018 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MACHINE LEARNING (SPML 2018), 2018, : 159 - 165
  • [35] Unsupervised Learning of Monocular Depth and Large-Ego-Motion With Multiple Loop Consistency Losses
    Zhang, Junning
    Su, Qunxing
    Liu, Pengyuan
    Xu, Chao
    Chen, Yanlong
    IEEE ACCESS, 2019, 7 : 77839 - 77848
  • [36] Deep learning of monocular depth, optical flow and ego-motion with geometric guidance for UAV navigation in dynamic environments
    Mumuni, Fuseini
    Mumuni, Alhassan
    Amuzuvi, Christian Kwaku
    MACHINE LEARNING WITH APPLICATIONS, 2022, 10
  • [37] Monocular Ego-motion Estimation with a Compact Omnidirectional Camera
    Stuerzl, Wolfgang
    Burschka, Darius
    Suppa, Michael
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 822 - 828
  • [38] Unsupervised learning for monocular depth and motion with real scale
    Liu, Sibo
    Fang, Lijin
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 630 - 635
  • [39] Self-Supervised Depth and Ego-Motion Estimation for Monocular Thermal Video Using Multi-Spectral Consistency Loss
    Shin, Ukcheol
    Lee, Kyunghyun
    Lee, Seokju
    Kweon, In So
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 1103 - 1110
  • [40] Self-Supervised Attention Learning for Depth and Ego-motion Estimation
    Sadek, Assent
    Chidlovskii, Boris
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10054 - 10060