Progressive Fusion for Unsupervised Binocular Depth Estimation Using Cycled Networks

被引：21

作者：

Pilzer, Andrea ^{[1
]}

Lathuiliere, Stephane ^{[1
]}

Xu, Dan ^{[1
,2
]}

Puscas, Mihai Marian ^{[1
]}

Ricci, Elisa ^{[1
,3
]}

Sebe, Nicu ^{[1
,4
]}

机构：

[1] Univ Trento, Dept Informat Engn & Comp Sci, I-38122 Trento, Italy

[2] Univ Oxford, Dept Engn Sci, Oxford OX1 2JD, England

[3] Fdn Bruno Kessler, I-38122 Trento, Italy

[4] Huawei Technol Ireland, Dublin D02 R156, Ireland

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2020年 / 42卷 / 10期

关键词：

Estimation; Training; Deep learning; Cameras; Solid modeling; Predictive models; Network architecture; Stereo depth estimation; convolutional neural networks (ConvNet); deep multi-scale fusion; cycle network;

D O I：

10.1109/TPAMI.2019.2942928

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance. However, they require costly ground truth annotations during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps. We introduce a new network architecture, named Progressive Fusion Network (PFN), that is specifically designed for binocular stereo depth estimation. This network is based on a multi-scale refinement strategy that combines the information provided by both stereo views. In addition, we propose to stack twice this network in order to form a cycle. This cycle approach can be interpreted as a form of data-augmentation since, at training time, the network learns both from the training set images (in the forward half-cycle) but also from the synthesized images (in the backward half-cycle). The architecture is jointly trained with adversarial learning. Extensive experiments on the publicly available datasets KITTI, Cityscapes and ApolloScape demonstrate the effectiveness of the proposed model which is competitive with other unsupervised deep learning methods for depth prediction.

引用

页码：2380 / 2395

页数：16

共 50 条

[41] Recognition and Depth Estimation of Ships Based on Binocular Stereo Vision
Zheng, Yuanzhou
Liu, Peng
Qian, Long
Qin, Shiquan
Liu, Xinyu
Ma, Yong
Cheng, Ganjun
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (08)
[42] Stereo Where-What Networks: Unsupervised Binocular Feature Learning
Solgi, Mojtaba
Weng, Juyang
2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
[43] Unsupervised Monocular Depth Estimation for Autonomous Flight of Drones
Zhao Shuanfeng
Huang Tao
Xu Qian
Geng Longlong
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)
[44] Unsupervised monocular depth estimation based on edge enhancement
Qu Y.
Chen Y.
Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2024, 46 (01): : 71 - 79
[45] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
Sumanta Bhattacharyya
Ju Shen
Stephen Welch
Chen Chen
Journal of Real-Time Image Processing, 2021, 18 : 1357 - 1368
[46] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
Bhattacharyya, Sumanta
Shen, Ju
Welch, Stephen
Chen, Chen
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (04) : 1357 - 1368
[47] Depth-Guided Aggregation for Real-Time Binocular Depth Estimation Network
Fu, Dongxin
Zheng, Shaowu
Xie, Pengcheng
Li, Weihua
IEEE MULTIMEDIA, 2024, 31 (02) : 36 - 47
[48] Unsupervised Monocular Depth Estimation With Channel and Spatial Attention
Wang, Zhuping
Dai, Xinke
Guo, Zhanyu
Huang, Chao
Zhang, Hao
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7860 - 7870
[49] AsiANet: Autoencoders in Autoencoder for Unsupervised Monocular Depth Estimation
Yusiong, John Paul T.
Naval, Prospero C., Jr.
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 443 - 451
[50] Structured Adversarial Training for Unsupervised Monocular Depth Estimation
Mehta, Ishit
Sakurikar, Parikshit
Narayanan, P. J.
2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 314 - 323

← 1 2 3 4 5 →