Multitask Deep Neural Networks for Tele-Wide Stereo Matching

被引:2
|
作者
El-Khamy, Mostafa [1 ,2 ]
Ren, Haoyu [1 ]
Du, Xianzhi [1 ]
Lee, Jungwon [1 ]
机构
[1] Samsung Semicond Inc SSI, DSA SOC Res & Dev, San Diego, CA 92121 USA
[2] Alexandria Univ, Fac Engn, Alexandria 21544, Egypt
关键词
Estimation; Cameras; Optical imaging; Neural networks; Feature extraction; Optical sensors; Lenses; Stereo disparity; single-image depth estimation; stereo matching; tele-wide disparity; deep network fusion; CLASSIFIER FUSION;
D O I
10.1109/ACCESS.2020.3029085
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we propose deep learning solutions for the estimation of the real world depth of elements in a scene captured by two cameras with different field of views. We consider a realistic smart-phone scenario, where the first field of view (FOV) is a wide FOV with 1 x the optical zoom, and the second FOV is contained in thefirst FOV captured by a tele zoom lens with 2 x the optical zoom. We refer to the problem of estimating the depth for all elements in the union of the FOVs which corresponds to the Wide FOV as `tele-wide stereo matching'. Traditional approaches can only estimate the disparity or depth in the overlapped FOV, corresponding to the Tele FOV, using stereo matching algorithms. To benchmark this novel problem, we introduce a single-image inverse-depth estimation (SIDE) solution to estimate the disparity from the image corresponding to the union Wide FOV only. We also design a novel multitask tele-wide stereo matching deep neural network (MT-TW-SMNet), which is the first to combine the stereo matching and the single image depth tasks in one network. Moreover, we propose multiple methods for the fusion between the above networks. For example, we have input feature fusion, that utilizes the disparity estimated by stereo-matching as an additional input feature for SIDE. We also designed networks for decision fusion, that deploys a stacked hour glass (SHG) network for fusion and refnement of the disparity maps from both the SIDE network and the MT-TW-SMNet. These fusion schemes signifcantly improve the accuracy. Experimental results on KITTI and SceneFlow datasets demonstrate that our proposed approaches provide a reasonable solution to the tele-wide stereo matching problem. We demonstrate the effectiveness of our solutions in generating the Bokeh effect on the full Wide FOV.
引用
收藏
页码:184383 / 184398
页数:16
相关论文
共 50 条
  • [1] Stereo Matching through Squeeze Deep Neural Networks
    Caffaratti, Gabriel D.
    Marehetta, Martin G.
    Forradellas, Raymundo Q.
    INTELIGENCIA ARTIFICIAL-IBEROAMERICAN JOURNAL OF ARTIFICIAL INTELLIGENCE, 2019, 22 (63): : 16 - 38
  • [2] Copolymer Informatics with Multitask Deep Neural Networks
    Kuenneth, Christopher
    Schertzer, William
    Ramprasad, Rampi
    MACROMOLECULES, 2021, 54 (13) : 5957 - 5961
  • [3] Cellular neural networks for the stereo matching problem
    Taraglio, S
    Zanela, A
    1996 FOURTH IEEE INTERNATIONAL WORKSHOP ON CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS, PROCEEDINGS (CNNA-96), 1996, : 93 - 98
  • [4] Bioplastic design using multitask deep neural networks
    Kuenneth, Christopher
    Lalonde, Jessica
    Marrone, Babetta L. L.
    Iverson, Carl N. N.
    Ramprasad, Rampi
    Pilania, Ghanshyam
    COMMUNICATIONS MATERIALS, 2022, 3 (01)
  • [5] Multitask Deep Neural Networks for Ames Mutagenicity Prediction
    Jimena Martinez, Maria
    Virginia Sabando, Maria
    Soto, Axel J.
    Roca, Carlos
    Requena-Triguero, Carlos
    Campillo, Nuria E.
    Paez, Juan A.
    Ponzoni, Ignacio
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (24) : 6342 - 6351
  • [6] Bioplastic design using multitask deep neural networks
    Christopher Kuenneth
    Jessica Lalonde
    Babetta L. Marrone
    Carl N. Iverson
    Rampi Ramprasad
    Ghanshyam Pilania
    Communications Materials, 3
  • [7] Hierarchical Neural Architecture Search for Deep Stereo Matching
    Cheng, Xuelian
    Zhong, Yiran
    Harandi, Mehrtash
    Dai, Yuchao
    Chang, Xiaojun
    Drummond, Tom
    Li, Hongdong
    Ge, Zongyuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [8] ROBUST FULL-FOV DEPTH ESTIMATION IN TELE-WIDE CAMERA SYSTEM
    Guo, Kai
    Song, Seongwook
    Chang, Soonkeun
    Kim, Tae-ui
    Han, Seungmin
    Kim, Irina
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1993 - 1997
  • [9] MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching
    Shamsafar, Faranak
    Woerz, Samuel
    Rahim, Rafia
    Zell, Andreas
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 677 - 686
  • [10] 3D reconstruction of curvilinear structures with stereo matching deep convolutional neural networks
    Altingovde, Okan
    Mishchuk, Anastasiia
    Ganeeva, Gulnaz
    Oveisi, Emad
    Hebert, Cecile
    Fua, Pascal
    ULTRAMICROSCOPY, 2022, 234