HTTNet: hybrid transformer-based approaches for trajectory prediction

Authors
Ge, Xianlei [1, 3]
Shen, Xiaobo [1, 4]
Zhou, Xuanxin [1]
Li, Xiaoyan [2, 3]
Affiliations
[1] Huainan Normal Univ, Sch Elect Engn, Huainan, Peoples R China
[2] Huainan Normal Univ, Sch Comp, Huainan, Peoples R China
[3] Natl Univ, Coll Comp & Informat Technol, Manila, Philippines
[4] Technol Univ Philippines, Coll Ind Educ, Manila, Philippines
Keywords
trajectory prediction; transformer; convolutional neural network; multimodal data; LSTM
DOI
10.24425/bpasts.2024.150811
Chinese Library Classification (CLC) number
T [Industrial Technology]
Subject classification code
08
Abstract
Forecasting the future trajectories of intelligent agents is challenging because it requires analyzing complex scenes and the uncertainty that arises from agent interactions. It is therefore important to model inter-agent relationships and to incorporate contextual semantic information. In this paper, we introduce HTTNet, a framework that models information along three dimensions: (1) the temporal dimension, where a time encoder represents time sequences and captures the influence of past and future trajectories; (2) the social dimension, where a trajectory encoder accepts trajectories from multiple agents, which simplifies modeling the interactions among them; and (3) the contextual dimension, where a TF-map encoder integrates semantic scene input, strengthening HTTNet's understanding of the scene. HTTNet also adopts a hybrid modeling paradigm that combines a CNN with a transformer, converting map scenes into feature information for the transformer. Qualitative and quantitative analyses on the nuScenes and INTERACTION datasets show strong performance, with HTTNet achieving a minADE10 of 1.03 and a miss rate of 0.31 on nuScenes, which indicates its effectiveness for multi-agent trajectory prediction in complex scenarios.
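The two metrics quoted in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code. It assumes the common convention in which minADE over k predictions is the lowest average pointwise L2 error among the k predicted trajectories, and in which an agent counts as a miss when every one of its k predictions deviates from the ground truth by more than a threshold at some time step. The 2 m threshold and all function names below are assumptions chosen for illustration; the abstract does not state them.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
Trajectory = Sequence[Point]


def pointwise_l2(pred: Trajectory, truth: Trajectory) -> List[float]:
    """Euclidean distance between prediction and ground truth at each time step."""
    return [math.hypot(px - tx, py - ty) for (px, py), (tx, ty) in zip(pred, truth)]


def min_ade(preds: Sequence[Trajectory], truth: Trajectory) -> float:
    """Lowest average displacement error among the k predicted trajectories."""
    errors = []
    for pred in preds:
        dists = pointwise_l2(pred, truth)
        errors.append(sum(dists) / len(dists))
    return min(errors)


def is_miss(preds: Sequence[Trajectory], truth: Trajectory, threshold: float = 2.0) -> bool:
    """True when every prediction exceeds the threshold at some time step.

    The 2.0 m default is an assumption, not a value taken from the abstract.
    """
    return all(max(pointwise_l2(pred, truth)) > threshold for pred in preds)


def miss_rate(
    all_preds: Sequence[Sequence[Trajectory]],
    all_truths: Sequence[Trajectory],
    threshold: float = 2.0,
) -> float:
    """Fraction of agents for which all k predictions are misses."""
    misses = [is_miss(p, t, threshold) for p, t in zip(all_preds, all_truths)]
    return sum(misses) / len(misses)


# Toy data: two agents, made-up coordinates in meters.
truth_a = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
preds_a = [
    [(0.0, 1.0), (1.0, 1.0), (2.0, 1.0)],  # constant 1 m lateral offset
    [(0.0, 0.0), (1.0, 0.0), (2.0, 3.0)],  # exact until the last step
]
truth_b = [(0.0, 0.0), (1.0, 0.0)]
preds_b = [[(0.0, 3.0), (1.0, 3.0)]]  # always 3 m away

ade_a = min_ade(preds_a, truth_a)
rate = miss_rate([preds_a, preds_b], [truth_a, truth_b])
print(ade_a, rate)
```

In the toy data, agent A has one prediction that stays within the threshold at every step, so it is not a miss, while agent B's single prediction is always 3 m off, so half of the agents are misses.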
Pages: 12