Depth estimation for advancing intelligent transport systems based on self-improving pyramid stereo network

被引：13

作者：

Tian, Yanling ^{[1
,2
]}

Du, Yubo ^{[1
]}

Zhang, Qieshi ^{[1
,3
]}

Cheng, Jun ^{[1
,3
]}

Yang, Zhuo ^{[4
]}

机构：

[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, CAS Key Lab Human Machine Intelligence Synergy Sy, Shenzhen, Peoples R China

[2] Waseda Univ, Grad Sch Informat Prod & Syst, Tokyo, Japan

[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[4] Guangdong Univ Technol, Sch Comp, Guangzhou, Peoples R China

来源：

IET INTELLIGENT TRANSPORT SYSTEMS | 2020年 / 14卷 / 05期

关键词：

computer vision; stereo image processing; neural nets; learning (artificial intelligence); intelligent transport systems; pyramid stereo network; autonomous driving; stereo vision-based depth estimation technology; stereo depth estimation problem; deep learning model; convolutional neural networks; strong adaptive capabilities; ground truth depth; training data; complicated post-processing; ill-posed area; online learning; data limitation problem;

D O I：

10.1049/iet-its.2019.0462

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In autonomous driving, stereo vision-based depth estimation technology can help to estimate the distance of obstacles accurately, which is crucial for correctly planning the path of the vehicle. Recent work has formulated the stereo depth estimation problem into a deep learning model with convolutional neural networks. However, these methods need a lot of post-processing and do not have strong adaptive capabilities to ill-posed regions or new scenes. In addition, due to the difficulty of the labelling the ground truth depth for real circumstance, training data for the system is limited. To overcome the above problems, the authors came up with self-improving pyramid stereo network, which can not only get a direct regression disparity without complicated post-processing but also be robust in ill-posed area. Moreover, by online learning, the proposed model can not only address the data limitation problem but also save the time spent on training and hardware resources in practice. At the same time, the proposed model has a self-improving ability to new scenes, which can quickly adjust the model according to the test data in time and improve the accuracy of prediction. Experiments on Scene Flow and KITTI data set demonstrate the effectiveness of the proposed network.

引用

页码：338 / 345

页数：8

共 41 条

[31] Spatio-temporal layers based intra-operative stereo depth estimation network via hierarchical prediction and progressive training
Chen, Ziyang
Cruciani, Laura
Lievore, Elena
Fontana, Matteo
De Cobelli, Ottavio
Musi, Gennaro
Ferrigno, Giancarlo
De Momi, Elena
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 244
[32] Efficient Stereo Depth Estimation for Pseudo-LiDAR: A Self-Supervised Approach Based on Multi-Input ResNet Encoder
Hossain, Sabir
Lin, Xianke
SENSORS, 2023, 23 (03)
[33] Unsupervised Monocular Depth Estimation and Visual Odometry Based on Generative Adversarial Network and Self-attention Mechanism
Ye X.
He Y.
Ru S.
Jiqiren/Robot, 2021, 43 (02): : 203 - 213
[34] Cognitive Network Architecture Systems to Provide Intelligent Services: An Intelligent Self-Organization Approach With a Game-Based Incentive Mechanism
Liu, Yuxin
Gui, Jinsong
Xiong, N.
IEEE SYSTEMS MAN AND CYBERNETICS MAGAZINE, 2023, 9 (01): : 25 - 36
[35] Depth Estimation Using a Self-Supervised Network Based on Cross-Layer Feature Fusion and the Quadtree Constraint
Tian, Fangzheng
Gao, Yongbin
Fang, Zhijun
Fang, Yuming
Gu, Jia
Fujita, Hamido
Hwang, Jenq-Neng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 1751 - 1766
[36] Improving Vision-based Self-positioning in Intelligent Transportation Systems via Integrated Lane and Vehicle Detection
Chandakkar, Parag S.
Wang, Yilin
Li, Baoxin
2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 404 - 411
[37] Intelligent control of chaotic systems via self-organizing Hermite-polynomial-based neural network
Hsu, Chun-Fei
NEUROCOMPUTING, 2014, 123 : 197 - 206
[38] MARL-Based AUV Formation for Underwater Intelligent Autonomous Transport Systems Supported by 6G Network
He, Jingyi
Xi, Meng
Wen, Jiabao
Xiao, Shuai
Yang, Jiachen
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024,
[39] LANet: Stereo matching network based on linear-attention mechanism for depth estimation optimization in 3D reconstruction of inter-forest scene
Liu, Lina
Liu, Yaqiu
Lv, Yunlei
Xing, Jian
FRONTIERS IN PLANT SCIENCE, 2022, 13
[40] MLDA-Net: Multi-Level Dual Attention-Based Network for Self-Supervised Monocular Depth Estimation
Song, Xibin
Li, Wei
Zhou, Dingfu
Dai, Yuchao
Fang, Jin
Li, Hongdong
Zhang, Liangjun
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4691 - 4705

← 1 2 3 4 5 →