Enhanced spatial-temporal dynamics in pose forecasting through multi-graph convolution networks

被引:0
|
作者
Ren, Hongwei [1 ]
Zhang, Xiangran [1 ]
Shi, Yuhong [1 ]
Liang, Kewei [2 ]
机构
[1] Zhejiang Univ, Polytech Inst, Shixiang Rd, Hangzhou 310058, Zhejiang, Peoples R China
[2] Zhejiang Univ, Sch Math Sci, Yuhangtang Rd, Hangzhou 310015, Zhejiang, Peoples R China
关键词
Graph convolutional network; Pose prediction; Attention mechanism; MOTION;
D O I
10.1007/s13042-024-02254-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, there has been a growing interest in predicting human motion, which involves forecasting future body poses based on observed pose sequences. This task is complex due to modeling spatial and temporal relationships. Autoregressive models, including recurrent neural networks (RNNs) and their variants, as well as transformer networks, are commonly used for addressing this challenge. However, autoregressive models have several serious drawbacks, such as vanishing or exploding gradients. Other researchers have attempted to solve the communication problem in the spatial dimension by integrating graph convolutional networks (GCNs) and long short-term memory (LSTM) or convolutional neural network (CNN) models. These approaches process temporal and spatial information separately and fuse them to extract features, whereas this sequential processing hampers the model's ability to capture spatiotemporal information and perform feature extraction simultaneously. To address this in human pose forecasting, we propose a novel approach called the multi-graph convolution network (MGCN). By introducing an augmented graph for pose sequences, our model captures spatial and temporal information in one step only using GCN. Multiple frames provide multiple parts, which are joined together in a unified graph instance. Furthermore, our model investigates the impact of natural structure and sequence-aware attention. In the experimental evaluation of the large-scale benchmark datasets (Human3.6M, AMSS, and 3DPW), MGCN outperforms the state-of-the-art methods in human pose prediction.
引用
收藏
页码:5453 / 5467
页数:15
相关论文
共 50 条
  • [41] Orthogonal Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting
    Fei, Yanhong
    Hu, Ming
    Wei, Xian
    Chen, Mingsong
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 71 - 76
  • [42] Forecasting traffic flow with spatial-temporal convolutional graph attention networks
    Zhang, Xiyue
    Xu, Yong
    Shao, Yizhen
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18): : 15457 - 15479
  • [43] Predicting Traffic Flow Using Dynamic Spatial-Temporal Graph Convolution Networks
    Liu, Yunchang
    Wan, Fei
    Liang, Chengwu
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (03): : 4343 - 4361
  • [44] DMGCRN: Dynamic multi-graph convolution recurrent network for traffic forecasting
    Qin, Yanjun
    Fang, Yuchen
    Luo, Haiyong
    Zhao, Fang
    Wang, Chenxing
    arXiv, 2021,
  • [45] Multi-Modal Graph Interaction for Multi-Graph Convolution Network in Urban Spatiotemporal Forecasting
    Zhang, Lingyu
    Geng, Xu
    Qin, Zhiwei
    Wang, Hongjun
    Wang, Xiao
    Zhang, Ying
    Liang, Jian
    Wu, Guobin
    Song, Xuan
    Wang, Yunhai
    SUSTAINABILITY, 2022, 14 (19)
  • [46] Adaptive Spatial-Temporal Convolution Network for Traffic Forecasting
    Li, Zhao
    Zhang, Yong
    Zhang, Zhao
    Wang, Xing
    Zhu, Lin
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT II, 2022, 13369 : 287 - 299
  • [47] Multi-Graph Attention Networks With Bilinear Convolution for Diagnosis of Schizophrenia
    Yu, Renping
    Pan, Cong
    Fei, Xuan
    Chen, Mingming
    Shen, Dinggang
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (03) : 1443 - 1454
  • [48] Crowd Flow Forecasting with Multi-Graph Neural Networks
    Zhang, Xu
    Cao, Ruixu
    Zhang, Zuyu
    Xia, Ying
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [49] MDSTGCN : Multi-Scale Dynamic Spatial-Temporal Graph Convolution Network With Edge Feature Embedding for Traffic Forecasting
    Liu, Sijia
    Xu, Hui
    Meng, Fanyu
    Ren, Qianqian
    2024 IEEE 24TH INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID 2024, 2024, : 284 - 290
  • [50] Multi-Stream and Enhanced Spatial-Temporal Graph Convolution Network for Skeleton-Based Action Recognition
    Li, Fanjia
    Zhu, Aichun
    Xu, Yonggang
    Cui, Ran
    Hua, Gang
    IEEE ACCESS, 2020, 8 : 97757 - 97770