Depth estimation using an improved stereo network

被引:1
|
作者
Xu, Wanpeng [1 ]
Zou, Ling [3 ]
Wu, Lingda [1 ]
Qi, Yue [2 ]
Qian, Zhaoyong [1 ]
机构
[1] Space Engn Univ, Sci & Technol Complex Elect Syst Simulat Lab, Beijing 101416, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[3] Beijing Film Acad, Digital Media Sch, Beijing 100088, Peoples R China
基金
中国国家自然科学基金;
关键词
Monocular depth estimation; Self-supervised; Image reconstruction; TP391; 4;
D O I
10.1631/FITEE.2000676
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised depth estimation approaches present excellent results that are comparable to those of the fully supervised approaches, by employing view synthesis between the target and reference images in the training data. ResNet, which serves as a backbone network, has some structural deficiencies when applied to downstream fields, because its original purpose was to cope with classification problems. The low-texture area also deteriorates the performance. To address these problems, we propose a set of improvements that lead to superior predictions. First, we boost the information flow in the network and improve the ability to learn spatial structures by improving the network structures. Second, we use a binary mask to remove the pixels in low-texture areas between the target and reference images to more accurately reconstruct the image. Finally, we input the target and reference images randomly to expand the dataset and pre-train it on ImageNet, so that the model obtains a favorable general feature representation. We demonstrate state-of-the-art performance on an Eigen split of the KITTI driving dataset using stereo pairs.
引用
收藏
页码:777 / 789
页数:13
相关论文
共 50 条
  • [41] A tunable perceptual microsystem for stereo depth estimation
    Bruccoleri, F
    Sabatini, SP
    Bisio, GM
    Raffo, L
    1997 2ND IEEE-CAS REGION 8 WORKSHOP ON ANALOG AND MIXED IC DESIGN, PROCEEDINGS, 1997, : 47 - 52
  • [42] A Depth Map Estimation Approach for Trinocular Stereo
    Zhou, Jun
    Wang, Ling
    Gu, Xiao
    Xu, Kang
    Zhang, Ya
    2015 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2015,
  • [43] Improved Estimation of Hand Postures Using Depth Images
    Hamester, Dennis
    Jirak, Doreen
    Wermter, Stefan
    2013 16TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2013,
  • [44] Learning to refine depth for robust stereo estimation
    Cheng, Feiyang
    He, Xuming
    Zhang, Hong
    PATTERN RECOGNITION, 2018, 74 : 122 - 133
  • [45] Stereo depth estimation: a confidence interval approach
    Mandelbaum, R
    Kamberova, G
    Mintz, M
    SIXTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, 1998, : 503 - 509
  • [46] Stereo-disparity estimation using a supervised neural network
    Venkatesh, YV
    Venkatesh, BS
    Kumar, AJ
    MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 785 - 793
  • [47] SBIN: A stereo disparity estimation network using binary convolutions
    Aguilera, Cristhian A.
    IEEE LATIN AMERICA TRANSACTIONS, 2022, 20 (04) : 693 - 699
  • [48] Object Depth and Size Estimation Using Stereo-Vision and Integration With SLAM
    Hamad, Layth
    Khan, Muhammad Asif
    Mohamed, Amr
    IEEE SENSORS LETTERS, 2024, 8 (04) : 1 - 4
  • [49] DEPTH MAP ESTIMATION IN LIGHT FIELDS USING AN STEREO-LIKE TAXONOMY
    Calderon, Francisco C.
    Parra, Carlos A.
    Nino, Cesar L.
    REVISTA DE INVESTIGACIONES-UNIVERSIDAD DEL QUINDIO, 2016, 28 (01): : 92 - 100
  • [50] Depth Estimation Using an Infrared Dot Projector and an Infrared Color Stereo Camera
    Hisatomi, Kensuke
    Kano, Masanori
    Ikeya, Kensuke
    Katayama, Miwa
    Mishina, Tomoyuki
    Iwadate, Yuichi
    Aizawa, Kiyoharu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (10) : 2086 - 2097