Progressive Fusion for Unsupervised Binocular Depth Estimation Using Cycled Networks

被引:21
|
作者
Pilzer, Andrea [1 ]
Lathuiliere, Stephane [1 ]
Xu, Dan [1 ,2 ]
Puscas, Mihai Marian [1 ]
Ricci, Elisa [1 ,3 ]
Sebe, Nicu [1 ,4 ]
机构
[1] Univ Trento, Dept Informat Engn & Comp Sci, I-38122 Trento, Italy
[2] Univ Oxford, Dept Engn Sci, Oxford OX1 2JD, England
[3] Fdn Bruno Kessler, I-38122 Trento, Italy
[4] Huawei Technol Ireland, Dublin D02 R156, Ireland
关键词
Estimation; Training; Deep learning; Cameras; Solid modeling; Predictive models; Network architecture; Stereo depth estimation; convolutional neural networks (ConvNet); deep multi-scale fusion; cycle network;
D O I
10.1109/TPAMI.2019.2942928
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance. However, they require costly ground truth annotations during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps. We introduce a new network architecture, named Progressive Fusion Network (PFN), that is specifically designed for binocular stereo depth estimation. This network is based on a multi-scale refinement strategy that combines the information provided by both stereo views. In addition, we propose to stack twice this network in order to form a cycle. This cycle approach can be interpreted as a form of data-augmentation since, at training time, the network learns both from the training set images (in the forward half-cycle) but also from the synthesized images (in the backward half-cycle). The architecture is jointly trained with adversarial learning. Extensive experiments on the publicly available datasets KITTI, Cityscapes and ApolloScape demonstrate the effectiveness of the proposed model which is competitive with other unsupervised deep learning methods for depth prediction.
引用
收藏
页码:2380 / 2395
页数:16
相关论文
共 50 条
  • [41] Recognition and Depth Estimation of Ships Based on Binocular Stereo Vision
    Zheng, Yuanzhou
    Liu, Peng
    Qian, Long
    Qin, Shiquan
    Liu, Xinyu
    Ma, Yong
    Cheng, Ganjun
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (08)
  • [42] Stereo Where-What Networks: Unsupervised Binocular Feature Learning
    Solgi, Mojtaba
    Weng, Juyang
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [43] Unsupervised Monocular Depth Estimation for Autonomous Flight of Drones
    Zhao Shuanfeng
    Huang Tao
    Xu Qian
    Geng Longlong
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)
  • [44] Unsupervised monocular depth estimation based on edge enhancement
    Qu Y.
    Chen Y.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2024, 46 (01): : 71 - 79
  • [45] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
    Sumanta Bhattacharyya
    Ju Shen
    Stephen Welch
    Chen Chen
    Journal of Real-Time Image Processing, 2021, 18 : 1357 - 1368
  • [46] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
    Bhattacharyya, Sumanta
    Shen, Ju
    Welch, Stephen
    Chen, Chen
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (04) : 1357 - 1368
  • [47] Depth-Guided Aggregation for Real-Time Binocular Depth Estimation Network
    Fu, Dongxin
    Zheng, Shaowu
    Xie, Pengcheng
    Li, Weihua
    IEEE MULTIMEDIA, 2024, 31 (02) : 36 - 47
  • [48] Unsupervised Monocular Depth Estimation With Channel and Spatial Attention
    Wang, Zhuping
    Dai, Xinke
    Guo, Zhanyu
    Huang, Chao
    Zhang, Hao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7860 - 7870
  • [49] AsiANet: Autoencoders in Autoencoder for Unsupervised Monocular Depth Estimation
    Yusiong, John Paul T.
    Naval, Prospero C., Jr.
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 443 - 451
  • [50] Structured Adversarial Training for Unsupervised Monocular Depth Estimation
    Mehta, Ishit
    Sakurikar, Parikshit
    Narayanan, P. J.
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 314 - 323