BEV-TP: End-to-End Visual Perception and Trajectory Prediction for Autonomous Driving

被引:1
|
作者
Lang, Bo [1 ]
Li, Xin [2 ]
Chuah, Mooi Choo [1 ]
机构
[1] Lehigh Univ, Comp Sci & Engn, Bethlehem, PA 18015 USA
[2] Qualcomm Technol Inc, Qualcomm AI Res, San Diego, CA 92121 USA
基金
美国国家科学基金会;
关键词
Three-dimensional displays; Trajectory; Transformers; Feature extraction; Visualization; Object detection; Task analysis; Vision-based; end-to-end perception and prediction; autonomous driving; TRANSFORMER; TRACKING;
D O I
10.1109/TITS.2024.3433591
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
For autonomous vehicles (AVs), the ability for effective end-to-end perception and future trajectory prediction is critical in planning a safe automatic maneuver. In the current AVs systems, perception and prediction are two separate modules. The prediction module receives only a restricted amount of information from the perception module. Furthermore, perception errors will propagate into the prediction module, ultimately having a negative impact on the accuracy of the prediction results. In this paper, we present a novel framework termed BEV-TP, a visual context-guided center-based transformer network for joint 3D perception and trajectory prediction. BEV-TP exploits visual information from consecutive multi-view images and context information from HD semantic maps, to predict better objects' centers whose locations are then used to query visual features and context features via the attention mechanism. Generated agent queries and map queries facilitate learning of the transformer module for further feature aggregation. Finally, multiple regression heads are used to perform 3D bounding box detection and future velocity prediction. This center-based approach achieves a differentiable, simple, and efficient E2E trajectory prediction framework. Extensive experiments conducted on the nuScenes dataset demonstrate the effectiveness of BEV-TP over traditional pipelines with sequential paradigms.
引用
收藏
页码:18537 / 18546
页数:10
相关论文
共 50 条
  • [1] Explaining Autonomous Driving by Learning End-to-End Visual Attention
    Cultrera, Luca
    Seidenari, Lorenzo
    Becattini, Federico
    Pala, Pietro
    Del Bimbo, Alberto
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1389 - 1398
  • [2] End-to-end deep learning for reverse driving trajectory of autonomous bulldozer
    You, Ke
    Ding, Lieyun
    Jiang, Yutian
    Wu, Zhangang
    Zhou, Cheng
    KNOWLEDGE-BASED SYSTEMS, 2022, 252
  • [3] End-to-end Autonomous Driving Perception with Sequential Latent Representation Learning
    Chen, Jianyu
    Xu, Zhuo
    Tomizuka, Masayoshi
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 1999 - 2006
  • [4] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline
    Wu, Penghao
    Jia, Xiaosong
    Chen, Li
    Yan, Junchi
    Li, Hongyang
    Qiao, Yu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] Multimodal End-to-End Autonomous Driving
    Xiao, Yi
    Codevilla, Felipe
    Gurram, Akhil
    Urfalioglu, Onay
    Lopez, Antonio M.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (01) : 537 - 547
  • [6] PPAD: Iterative Interactions of Prediction and Planning for End-to-End Autonomous Driving
    Chen, Zhili
    Ye, Maosheng
    Xu, Shuangjie
    Cao, Tongyi
    Chen, Qifeng
    COMPUTER VISION-ECCV 2024, PT XXXV, 2025, 15093 : 239 - 256
  • [7] Adversarial Driving: Attacking End-to-End Autonomous Driving
    Wu, Han
    Yunas, Syed
    Rowlands, Sareh
    Ruan, Wenjie
    Wahlstrom, Johan
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [8] ICOP: Image-based Cooperative Perception for End-to-End Autonomous Driving
    Li, Lantao
    Cheng, Yujie
    Sun, Chen
    Zhang, Wenqi
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 2367 - 2374
  • [9] Attacking vision-based perception in end-to-end autonomous driving models
    Boloor, Adith
    Garimella, Karthik
    He, Xin
    Gill, Christopher
    Vorobeychik, Yevgeniy
    Zhang, Xuan
    JOURNAL OF SYSTEMS ARCHITECTURE, 2020, 110
  • [10] DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous Driving
    Jia, Xiaosong
    Gao, Yulu
    Chen, Li
    Yan, Junchi
    Liu, Patrick Langechuan
    Li, Hongyang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 7919 - 7929