M2Tames: Interaction and Semantic Context Enhanced Pedestrian Trajectory Prediction

被引:0
|
作者
Gao, Xu [1 ,2 ]
Wang, Yanan [1 ,2 ]
Zhao, Yaqian [1 ,2 ]
Li, Yilong [3 ]
Wu, Gang [1 ,2 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
[2] Natl Supercomp Ctr Zhengzhou, Zhengzhou 450001, Peoples R China
[3] Henan Univ, Sch Comp & Informat Engn, Kaifeng 475000, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 18期
关键词
trajectory prediction; attention mechanism; autonomous driving; deep learning;
D O I
10.3390/app14188497
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Autonomous driving pays considerable attention to pedestrian trajectory prediction as a crucial task. Constructing effective pedestrian trajectory prediction models depends heavily on utilizing the motion characteristics of pedestrians, along with their interactions among themselves and between themselves and their environment. However, traditional trajectory prediction models often fall short of capturing complex real-world scenarios. To address these challenges, this paper proposes an enhanced pedestrian trajectory prediction model, M(2)Tames, which incorporates comprehensive motion, interaction, and semantic context factors. M(2)Tames provides an interaction module (IM), which consists of an improved multi-head mask temporal attention mechanism (M(2)Tea) and an Interaction Inference Module (I-2). M(2)Tea thoroughly characterizes the historical trajectories and potential interactions, while I-2 determines the precise interaction types. Then, IM adaptively aggregates useful neighbor features to generate a more accurate interactive feature map and feeds it into the final layer of the U-Net encoder to fuse with the encoder's output. Furthermore, by adopting the U-Net architecture, M(2)Tames can learn and interpret scene semantic information, enhancing its understanding of the spatial relationships between pedestrians and their surroundings. These innovations improve the accuracy and adaptability of the model for predicting pedestrian trajectories. Finally, M(2)Tames is evaluated on the ETH/UCY and SDD datasets for short- and long-term settings, respectively. The results demonstrate that M(2)Tames outperforms the state-of-the-art model MSRL by 2.49% (ADE) and 8.77% (FDE) in the short-term setting and surpasses the optimum Y-Net by 6.89% (ADE) and 1.12% (FDE) in the long-term prediction. Excellent performance is also shown on the ETH/UCY datasets.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Causal Temporal-Spatial Pedestrian Trajectory Prediction With Goal Point Estimation and Contextual Interaction
    Lian, Jing
    Yu, Fengning
    Li, Linhui
    Zhou, Yafu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 24499 - 24509
  • [32] FRVO: A Filter Enhanced Interaction Model for Pedestrian Path Prediction in Crowded Scenarios
    Wei, Baoshan
    Zhang, Xing
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 538 - 543
  • [33] Social interaction model enhanced with speculation stage for human trajectory prediction
    Pi, Lei
    Zhang, Qiang
    Yang, Lingfang
    Huang, Zhi
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 161
  • [34] Mode Normalization Enhanced Recurrent Model for Multi-Modal Semantic Trajectory Prediction
    Zhu, Shaojie
    Zhang, Lei
    Liu, Bailong
    Cui, Shumin
    Shao, Changxing
    Li, Yun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (01): : 174 - 176
  • [35] SISGAN: A Generative Adversarial Network Pedestrian Trajectory Prediction Model Combining Interaction Information and Scene Information
    Dou, Wanqing
    Lu, Lili
    APPLIED SCIENCES-BASEL, 2024, 14 (20):
  • [36] STIGCN: spatial-temporal interaction-aware graph convolution network for pedestrian trajectory prediction
    Chen, Wangxing
    Sang, Haifeng
    Wang, Jinyu
    Zhao, Zishan
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (08): : 10695 - 10719
  • [37] Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks
    Saleh, Khaled
    SENSORS, 2022, 22 (19)
  • [38] CSGAT-Net: a conditional pedestrian trajectory prediction network based on scene semantic maps and spatiotemporal graph attention
    Yang X.
    Fan J.
    Wang X.
    Li T.
    Neural Computing and Applications, 2024, 36 (19) : 11409 - 11423
  • [39] Social-ATPGNN: Prediction of multi-modal pedestrian trajectory of non-homogeneous social interaction
    Wang, Kehao
    Zou, Han
    IET COMPUTER VISION, 2024, 18 (07) : 907 - 921
  • [40] Polar Collision Grids: Effective Interaction Modelling for Pedestrian Trajectory Prediction in Shared Space Using Collision Checks
    Golchoubian, Mahsa
    Ghafurian, Moojan
    Dautenhahn, Kerstin
    Azad, Nasser Lashgarian
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 791 - 798