KSOF: Leveraging kinematics and spatio-temporal optimal fusion for human motion prediction

Citations: 0
Authors
Ding, Rui [1]
Qu, Kehua [1]
Tang, Jin [2]
Affiliations
[1] Capital Normal Univ, Informat Engn Coll, Beijing 100048, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Intelligent Engn & Automat, Beijing 100876, Peoples R China
Keywords
Human motion prediction; Kinematic constraints; Spatio-temporal optimal fusion;
DOI
10.1016/j.patcog.2024.111206
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Ignoring meaningful kinematic laws, which leads to improbable or impractical predictions, is one of the obstacles in human motion prediction. Current methods attempt to tackle this problem by taking simple kinematic information as auxiliary features to improve predictions. However, it remains challenging to make deeper use of human prior knowledge, for example that the trajectory formed by the same joint should be smooth and continuous. In this paper, we advocate explicitly describing kinematic information via velocity and acceleration by proposing a novel loss called joint point smoothness (JPS) loss, which calculates the acceleration of joints to smooth sudden changes in joint velocity. In addition, capturing spatio-temporal dependencies to make feature representations more informative is another obstacle in this task. Therefore, we propose a dual-path network (KSOF) that models temporal and spatial dependencies with a kinematic temporal convolutional network (K-TCN) and spatial graph convolutional networks (S-GCN), respectively. Moreover, we propose a novel multi-scale fusion module named spatio-temporal optimal fusion (SOF) to enhance the extraction of essential correlations and important features at different scales from spatio-temporal coupling features. We evaluate our approach on three standard benchmark datasets: Human3.6M, CMU-Mocap, and 3DPW. For both short-term and long-term predictions, our method achieves outstanding performance on all these datasets. The code is available at https://github.com/qukehua/KSOF.
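The abstract describes the JPS loss only in words: it computes joint acceleration in order to smooth sudden changes in joint velocity. The record does not give the exact formula, so the following is only a minimal sketch of one plausible acceleration-based smoothness penalty, not the authors' implementation. The function names, the (frames, joints, 3) array layout, the unit time step, and the use of a mean Euclidean norm are all assumptions made for illustration.

```python
import numpy as np


def finite_differences(positions):
    """Return first and second temporal differences of joint positions.

    positions: array of shape (frames, joints, 3); this layout is an assumption.
    With a unit time step, the first difference approximates velocity and the
    second difference approximates acceleration.
    """
    positions = np.asarray(positions, dtype=float)
    if positions.ndim != 3 or positions.shape[0] < 3:
        raise ValueError("expected shape (frames >= 3, joints, 3)")
    velocity = positions[1:] - positions[:-1]
    acceleration = velocity[1:] - velocity[:-1]
    return velocity, acceleration


def smoothness_penalty(positions):
    """Mean acceleration magnitude over all frames and joints (hypothetical loss).

    The value is zero for constant-velocity motion and grows when joint
    velocity changes abruptly between consecutive frames.
    """
    _, acceleration = finite_differences(positions)
    return float(np.mean(np.linalg.norm(acceleration, axis=-1)))


if __name__ == "__main__":
    # One joint moving along a straight line at constant velocity.
    t = np.arange(10, dtype=float)
    line = np.stack([t, 2 * t, 0 * t], axis=-1)[:, None, :]
    print(smoothness_penalty(line))

    # The same trajectory with a single-frame jump in the x coordinate.
    jitter = line.copy()
    jitter[5, 0, 0] += 1.0
    print(smoothness_penalty(jitter))
```

In a training setup, a term of this kind would typically be added to a position reconstruction loss with a weighting coefficient; how the paper combines them is not stated in this record.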
Pages: 11