BEV-TP: End-to-End Visual Perception and Trajectory Prediction for Autonomous Driving

被引：1

作者：

Lang, Bo ^{[1
]}

Li, Xin ^{[2
]}

Chuah, Mooi Choo ^{[1
]}

机构：

[1] Lehigh Univ, Comp Sci & Engn, Bethlehem, PA 18015 USA

[2] Qualcomm Technol Inc, Qualcomm AI Res, San Diego, CA 92121 USA

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024年 / 25卷 / 11期

基金：

美国国家科学基金会;

关键词：

Three-dimensional displays; Trajectory; Transformers; Feature extraction; Visualization; Object detection; Task analysis; Vision-based; end-to-end perception and prediction; autonomous driving; TRANSFORMER; TRACKING;

D O I：

10.1109/TITS.2024.3433591

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

For autonomous vehicles (AVs), the ability for effective end-to-end perception and future trajectory prediction is critical in planning a safe automatic maneuver. In the current AVs systems, perception and prediction are two separate modules. The prediction module receives only a restricted amount of information from the perception module. Furthermore, perception errors will propagate into the prediction module, ultimately having a negative impact on the accuracy of the prediction results. In this paper, we present a novel framework termed BEV-TP, a visual context-guided center-based transformer network for joint 3D perception and trajectory prediction. BEV-TP exploits visual information from consecutive multi-view images and context information from HD semantic maps, to predict better objects' centers whose locations are then used to query visual features and context features via the attention mechanism. Generated agent queries and map queries facilitate learning of the transformer module for further feature aggregation. Finally, multiple regression heads are used to perform 3D bounding box detection and future velocity prediction. This center-based approach achieves a differentiable, simple, and efficient E2E trajectory prediction framework. Extensive experiments conducted on the nuScenes dataset demonstrate the effectiveness of BEV-TP over traditional pipelines with sequential paradigms.

引用

页码：18537 / 18546

页数：10

共 50 条

[1] Explaining Autonomous Driving by Learning End-to-End Visual Attention
Cultrera, Luca
Seidenari, Lorenzo
Becattini, Federico
Pala, Pietro
Del Bimbo, Alberto
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1389 - 1398
[2] End-to-end deep learning for reverse driving trajectory of autonomous bulldozer
You, Ke
Ding, Lieyun
Jiang, Yutian
Wu, Zhangang
Zhou, Cheng
KNOWLEDGE-BASED SYSTEMS, 2022, 252
[3] End-to-end Autonomous Driving Perception with Sequential Latent Representation Learning
Chen, Jianyu
Xu, Zhuo
Tomizuka, Masayoshi
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 1999 - 2006
[4] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline
Wu, Penghao
Jia, Xiaosong
Chen, Li
Yan, Junchi
Li, Hongyang
Qiao, Yu
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[5] Multimodal End-to-End Autonomous Driving
Xiao, Yi
Codevilla, Felipe
Gurram, Akhil
Urfalioglu, Onay
Lopez, Antonio M.
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (01) : 537 - 547
[6] PPAD: Iterative Interactions of Prediction and Planning for End-to-End Autonomous Driving
Chen, Zhili
Ye, Maosheng
Xu, Shuangjie
Cao, Tongyi
Chen, Qifeng
COMPUTER VISION-ECCV 2024, PT XXXV, 2025, 15093 : 239 - 256
[7] Adversarial Driving: Attacking End-to-End Autonomous Driving
Wu, Han
Yunas, Syed
Rowlands, Sareh
Ruan, Wenjie
Wahlstrom, Johan
2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
[8] ICOP: Image-based Cooperative Perception for End-to-End Autonomous Driving
Li, Lantao
Cheng, Yujie
Sun, Chen
Zhang, Wenqi
2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 2367 - 2374
[9] Attacking vision-based perception in end-to-end autonomous driving models
Boloor, Adith
Garimella, Karthik
He, Xin
Gill, Christopher
Vorobeychik, Yevgeniy
Zhang, Xuan
JOURNAL OF SYSTEMS ARCHITECTURE, 2020, 110
[10] DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous Driving
Jia, Xiaosong
Gao, Yulu
Chen, Li
Yan, Junchi
Liu, Patrick Langechuan
Li, Hongyang
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 7919 - 7929

← 1 2 3 4 5 →