M²Tames: Interaction and Semantic Context Enhanced Pedestrian Trajectory Prediction

Authors
Gao, Xu [1, 2]
Wang, Yanan [1, 2]
Zhao, Yaqian [1, 2]
Li, Yilong [3]
Wu, Gang [1, 2]
Affiliations
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
[2] Natl Supercomp Ctr Zhengzhou, Zhengzhou 450001, Peoples R China
[3] Henan Univ, Sch Comp & Informat Engn, Kaifeng 475000, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 18
Keywords
trajectory prediction; attention mechanism; autonomous driving; deep learning
DOI
10.3390/app14188497
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Pedestrian trajectory prediction is a crucial task in autonomous driving. Building an effective prediction model depends heavily on exploiting pedestrians' motion characteristics, their interactions with one another, and their interactions with the environment. However, traditional trajectory prediction models often fall short of capturing complex real-world scenarios. To address these challenges, this paper proposes an enhanced pedestrian trajectory prediction model, M²Tames, which jointly incorporates motion, interaction, and semantic context factors. M²Tames provides an interaction module (IM), which consists of an improved multi-head mask temporal attention mechanism (M²Tea) and an Interaction Inference Module (I²). M²Tea characterizes the historical trajectories and potential interactions, while I² determines the precise interaction types. IM then adaptively aggregates useful neighbor features to generate a more accurate interactive feature map and feeds it into the final layer of the U-Net encoder, where it is fused with the encoder's output. Furthermore, by adopting the U-Net architecture, M²Tames can learn and interpret scene semantic information, enhancing its understanding of the spatial relationships between pedestrians and their surroundings. These innovations improve the accuracy and adaptability of the model for predicting pedestrian trajectories. Finally, M²Tames is evaluated on the ETH/UCY and SDD datasets in short- and long-term settings. The results demonstrate that M²Tames outperforms the state-of-the-art model MSRL by 2.49% (ADE) and 8.77% (FDE) in the short-term setting and surpasses the best-performing baseline, Y-Net, by 6.89% (ADE) and 1.12% (FDE) in the long-term setting. Excellent performance is also shown on the ETH/UCY datasets.
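The abstract reports results as ADE and FDE. As a minimal sketch, assuming ADE is the mean Euclidean distance between predicted and ground-truth positions over all predicted time steps and FDE is that distance at the final step (the usual definitions in this literature), the two metrics could be computed as follows. The function names and the toy trajectories are hypothetical, and reading the reported percentages as relative error reductions over the baseline is an assumption, not something the abstract states.

```python
import math


def displacement_errors(pred, truth):
    """Return (ADE, FDE) for one predicted trajectory.

    pred, truth: equal-length sequences of (x, y) positions.
    Assumed definitions: ADE = mean per-step Euclidean error,
    FDE = Euclidean error at the last predicted step.
    """
    if not pred or len(pred) != len(truth):
        raise ValueError("trajectories must be non-empty and of equal length")
    dists = [math.dist(p, t) for p, t in zip(pred, truth)]
    ade = sum(dists) / len(dists)
    fde = dists[-1]
    return ade, fde


def relative_improvement(baseline_error, model_error):
    """Percentage reduction of an error metric relative to a baseline (assumed reading)."""
    return 100.0 * (baseline_error - model_error) / baseline_error


# Toy data, not taken from the paper.
truth = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
pred = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]
ade, fde = displacement_errors(pred, truth)
```

Benchmarks in this area often report the minimum ADE/FDE over several sampled predictions per pedestrian; the sketch above covers only the single-trajectory case.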
Pages: 19