Multitask Deep Neural Networks for Tele-Wide Stereo Matching

Cited by: 2

Authors:

El-Khamy, Mostafa [1,2]

Ren, Haoyu [1]

Du, Xianzhi [1]

Lee, Jungwon [1]

Affiliations:

[1] Samsung Semicond Inc SSI, DSA SOC Res & Dev, San Diego, CA 92121 USA

[2] Alexandria Univ, Fac Engn, Alexandria 21544, Egypt

Source:

IEEE ACCESS | 2020 / Vol. 8

Keywords:

Estimation; Cameras; Optical imaging; Neural networks; Feature extraction; Optical sensors; Lenses; Stereo disparity; single-image depth estimation; stereo matching; tele-wide disparity; deep network fusion; CLASSIFIER FUSION;

DOI:

10.1109/ACCESS.2020.3029085

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

In this article, we propose deep learning solutions for estimating the real-world depth of elements in a scene captured by two cameras with different fields of view. We consider a realistic smartphone scenario, where the first field of view (FOV) is a wide FOV with 1x optical zoom, and the second FOV, captured by a tele zoom lens with 2x optical zoom, is contained in the first. We refer to the problem of estimating the depth for all elements in the union of the FOVs, which corresponds to the Wide FOV, as 'tele-wide stereo matching'. Traditional stereo matching algorithms can only estimate the disparity or depth in the overlapped FOV, which corresponds to the Tele FOV. To benchmark this novel problem, we introduce a single-image inverse-depth estimation (SIDE) solution that estimates the disparity from the image corresponding to the union Wide FOV only. We also design a novel multitask tele-wide stereo matching deep neural network (MT-TW-SMNet), which is the first to combine the stereo matching and single-image depth tasks in one network. Moreover, we propose multiple methods for fusing the above networks. For example, input feature fusion uses the disparity estimated by stereo matching as an additional input feature for SIDE. We also design networks for decision fusion, which deploy a stacked hourglass (SHG) network for fusion and refinement of the disparity maps from both the SIDE network and the MT-TW-SMNet. These fusion schemes significantly improve the accuracy. Experimental results on the KITTI and SceneFlow datasets demonstrate that our proposed approaches provide a reasonable solution to the tele-wide stereo matching problem. We demonstrate the effectiveness of our solutions in generating the Bokeh effect on the full Wide FOV.

Pages: 184383-184398

Page count: 16
