Depth estimation for advancing intelligent transport systems based on self-improving pyramid stereo network

被引:13
|
作者
Tian, Yanling [1 ,2 ]
Du, Yubo [1 ]
Zhang, Qieshi [1 ,3 ]
Cheng, Jun [1 ,3 ]
Yang, Zhuo [4 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, CAS Key Lab Human Machine Intelligence Synergy Sy, Shenzhen, Peoples R China
[2] Waseda Univ, Grad Sch Informat Prod & Syst, Tokyo, Japan
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] Guangdong Univ Technol, Sch Comp, Guangzhou, Peoples R China
关键词
computer vision; stereo image processing; neural nets; learning (artificial intelligence); intelligent transport systems; pyramid stereo network; autonomous driving; stereo vision-based depth estimation technology; stereo depth estimation problem; deep learning model; convolutional neural networks; strong adaptive capabilities; ground truth depth; training data; complicated post-processing; ill-posed area; online learning; data limitation problem;
D O I
10.1049/iet-its.2019.0462
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In autonomous driving, stereo vision-based depth estimation technology can help to estimate the distance of obstacles accurately, which is crucial for correctly planning the path of the vehicle. Recent work has formulated the stereo depth estimation problem into a deep learning model with convolutional neural networks. However, these methods need a lot of post-processing and do not have strong adaptive capabilities to ill-posed regions or new scenes. In addition, due to the difficulty of the labelling the ground truth depth for real circumstance, training data for the system is limited. To overcome the above problems, the authors came up with self-improving pyramid stereo network, which can not only get a direct regression disparity without complicated post-processing but also be robust in ill-posed area. Moreover, by online learning, the proposed model can not only address the data limitation problem but also save the time spent on training and hardware resources in practice. At the same time, the proposed model has a self-improving ability to new scenes, which can quickly adjust the model according to the test data in time and improve the accuracy of prediction. Experiments on Scene Flow and KITTI data set demonstrate the effectiveness of the proposed network.
引用
收藏
页码:338 / 345
页数:8
相关论文
共 41 条
  • [1] Uncertainty-Aware Self-Improving Framework for Depth Estimation
    Nie, Xinyu
    Shi, Dianxi
    Li, Ruihao
    Liu, Zhe
    Chen, Xucan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01): : 41 - 48
  • [2] Depth Estimation in Multi-View Stereo Based on Image Pyramid
    Xu, Hanfei
    Cai, Yangang
    Wang, Ronggang
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 345 - 349
  • [3] A Self-Improving Framework for Joint Depth Estimation and Underwater Target Detection from Hyperspectral Imagery
    Qi, Jiahao
    Wan, Pengcheng
    Gong, Zhiqiang
    Xue, Wei
    Yao, Aihuan
    Liu, Xingyue
    Zhong, Ping
    REMOTE SENSING, 2021, 13 (09)
  • [4] Omnidirectional stereo depth estimation based on spherical deep network
    Li, Ming
    Hu, Xuejiao
    Dai, Jingzhao
    Li, Yang
    Du, Sidan
    IMAGE AND VISION COMPUTING, 2021, 114
  • [5] Towards Self-Improving Activity Recognition Systems based on Probabilistic, Generative Models
    Jaenicke, Martin
    Tomforde, Sven
    Sick, Bernhard
    2016 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING (ICAC), 2016, : 285 - 291
  • [6] Depth Estimation of Monocular Road Images Based on Pyramid Scene Analysis Network
    Zhou Wujie
    Pan Ting
    Gu Pengli
    Zhai Zhinian
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (10) : 2509 - 2515
  • [7] Multi-Scale Dilated Convolution Network Based Depth Estimation in Intelligent Transportation Systems
    Tian, Yanling
    Zhang, Qieshi
    Ren, Ziliang
    Wu, Fuxiang
    Hao, Pengyi
    Hu, Jinglu
    IEEE ACCESS, 2019, 7 : 185179 - 185188
  • [8] Multilevel Pyramid Network for Monocular Depth Estimation Based on Feature Refinement and Adaptive Fusion
    Xu, Huihui
    Li, Fei
    ELECTRONICS, 2022, 11 (16)
  • [9] A novel depth estimation approach based on bidirectional matching for stereo vision systems
    Okae, J.
    Du, J.
    Huang, T.
    ADVANCED ROBOTICS, 2020, 34 (15) : 998 - 1011
  • [10] OTFPF: Optimal transport based feature pyramid fusion network for brain age estimation
    Fu, Yu
    Huang, Yanyan
    Zhang, Zhe
    Dong, Shunjie
    Xue, Le
    Niu, Meng
    Li, Yunxin
    Shi, Zhiguo
    Wang, Yalin
    Zhang, Hong
    Tian, Mei
    Zhuo, Cheng
    INFORMATION FUSION, 2023, 100