Multitask Deep Neural Networks for Tele-Wide Stereo Matching

Cited by: 2
Authors
El-Khamy, Mostafa [1 ,2 ]
Ren, Haoyu [1 ]
Du, Xianzhi [1 ]
Lee, Jungwon [1 ]
Affiliations
[1] Samsung Semicond Inc SSI, DSA SOC Res & Dev, San Diego, CA 92121 USA
[2] Alexandria Univ, Fac Engn, Alexandria 21544, Egypt
Keywords
Estimation; Cameras; Optical imaging; Neural networks; Feature extraction; Optical sensors; Lenses; Stereo disparity; single-image depth estimation; stereo matching; tele-wide disparity; deep network fusion; CLASSIFIER FUSION;
DOI
10.1109/ACCESS.2020.3029085
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this article, we propose deep learning solutions for estimating the real-world depth of elements in a scene captured by two cameras with different fields of view. We consider a realistic smartphone scenario, where the first field of view (FOV) is a wide FOV with 1x optical zoom, and the second FOV is contained in the first FOV and is captured by a tele zoom lens with 2x optical zoom. We refer to the problem of estimating the depth for all elements in the union of the FOVs, which corresponds to the Wide FOV, as 'tele-wide stereo matching'. Traditional approaches can only estimate the disparity or depth in the overlapped FOV, corresponding to the Tele FOV, using stereo matching algorithms. To benchmark this novel problem, we introduce a single-image inverse-depth estimation (SIDE) solution that estimates the disparity from the image corresponding to the union Wide FOV only. We also design a novel multitask tele-wide stereo matching deep neural network (MT-TW-SMNet), which is the first to combine the stereo matching and the single-image depth tasks in one network. Moreover, we propose multiple methods for fusion between the above networks. For example, input feature fusion uses the disparity estimated by stereo matching as an additional input feature for SIDE. We also design networks for decision fusion, which deploy a stacked hourglass (SHG) network for fusion and refinement of the disparity maps from both the SIDE network and the MT-TW-SMNet. These fusion schemes significantly improve the accuracy. Experimental results on the KITTI and SceneFlow datasets demonstrate that our proposed approaches provide a reasonable solution to the tele-wide stereo matching problem. We demonstrate the effectiveness of our solutions in generating the Bokeh effect on the full Wide FOV.
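The geometry in the abstract (a 2x Tele FOV contained in a 1x Wide FOV, with stereo disparity available only in the overlap) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes idealized cameras that share an optical centre and aspect ratio, so the Tele FOV is a centred crop of the Wide image scaled by 1/zoom. The function names `tele_region_in_wide` and `pad_tele_disparity`, and the zero-fill with a validity mask, are hypothetical choices made for illustration only.

```python
def tele_region_in_wide(width, height, zoom=2.0):
    """Return (left, top, right, bottom) of the Tele FOV inside the Wide image.

    Assumption (not from the paper): both cameras share an optical centre,
    so the Tele FOV is a centred crop whose sides are scaled by 1/zoom.
    """
    crop_w = round(width / zoom)
    crop_h = round(height / zoom)
    left = (width - crop_w) // 2
    top = (height - crop_h) // 2
    return left, top, left + crop_w, top + crop_h


def pad_tele_disparity(tele_disp, width, height, zoom=2.0, fill=0.0):
    """Embed a Tele-region disparity map into a Wide-sized map.

    tele_disp is a list of rows already resampled to the crop size.
    Returns (disp, mask), where mask is 1 inside the overlap and 0 outside.
    A map like this could serve as an extra input channel for a
    single-image network (hypothetical illustration of input feature fusion).
    """
    left, top, right, bottom = tele_region_in_wide(width, height, zoom)
    disp = [[fill] * width for _ in range(height)]
    mask = [[0] * width for _ in range(height)]
    for r in range(top, bottom):
        for c in range(left, right):
            disp[r][c] = tele_disp[r - top][c - left]
            mask[r][c] = 1
    return disp, mask
```

Under these assumptions the overlap covers one quarter of the Wide image area at 2x zoom, which is why a method limited to the overlap leaves most of the Wide FOV without a disparity estimate.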
Pages: 184383-184398
Page count: 16