KSOF: Leveraging kinematics and spatio-temporal optimal fusion for human motion prediction

被引:0
|
作者
Ding, Rui [1 ]
Qu, Kehua [1 ]
Tang, Jin [2 ]
机构
[1] Capital Normal Univ, Informat Engn Coll, Beijing 100048, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Intelligent Engn & Automat, Beijing 100876, Peoples R China
关键词
Human motion prediction; Kinematic constraints; Spatio-temporal optimal fusion;
D O I
10.1016/j.patcog.2024.111206
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ignoring the meaningful kinematics law, which generates improbable or impractical predictions, is one of the obstacles to human motion prediction. Current methods attempt to tackle this problem by taking simple kinematics information as auxiliary features to improve predictions. However, it remains challenging to utilize human prior knowledge deeply, such as the trajectory formed by the same joint should be smooth and continuous in this task. In this paper, we advocate explicitly describing kinematics information via velocity and acceleration by proposing a novel loss called joint point smoothness (JPS) loss, which calculates the acceleration of joints to smooth the sudden change in joint velocity. In addition, capturing spatio-temporal dependencies to make feature representations more informative is also one of the obstacles in this task. Therefore, we propose a dual-path network (KSOF) that models the temporal and spatial dependencies from kinematic temporal convolutional network (K-TCN) and spatial graph convolutional networks (S-GCN), respectively. Moreover, we propose a novel multi-scale fusion module named spatio-temporal optimal fusion (SOF) to enhance extraction of the essential correlation and important features at different scales from spatiotemporal coupling features. We evaluate our approach on three standard benchmark datasets, including Human3.6M, CMU-Mocap, and 3DPW datasets. For both short-term and long-term predictions, our method achieves outstanding performance on all these datasets. The code is available at https://github.com/qukehua/ KSOF.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] MOTION DETECTION IN SPATIO-TEMPORAL SPACE
    LIOU, SP
    JAIN, RC
    COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1989, 45 (02): : 227 - 250
  • [32] Spatio-temporal model for image motion
    Park, E
    Wohn, K
    ELECTRONICS LETTERS, 1998, 34 (16) : 1574 - 1575
  • [33] GraphTCN: Spatio-Temporal Interaction Modeling for Human Trajectory Prediction
    Wang, Chengxin
    Cai, Shaofeng
    An, Gary
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3449 - 3458
  • [34] Learning hierarchical spatio-temporal pattern for human activity prediction
    Ding, Wenwen
    Liu, Kai
    Cheng, Fei
    Zhang, Jin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 35 : 103 - 111
  • [35] Few-shot human motion prediction using deformable spatio-temporal CNN with parameter generation
    Zang, Chuanqi
    Li, Menghao
    Pei, Mingtao
    NEUROCOMPUTING, 2022, 513 : 46 - 58
  • [36] Precipitation nowcasting leveraging spatial correlation feature extraction and deep spatio-temporal fusion network
    Yu, Wenbin
    Li, Yangsong
    Fan, Cheng
    Fu, Daoyong
    Zhang, Chengjun
    Chen, Yadang
    Qian, Ming
    Liu, Jie
    Liu, Gaoping
    EARTH SCIENCE INFORMATICS, 2024, 17 (05) : 4739 - 4755
  • [37] Motion tracking as spatio-temporal motion boundary detection
    Mitiche, A
    Feghali, R
    Mansouri, A
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2003, 43 (01) : 39 - 50
  • [38] Prediction of spatio-temporal AQI data
    Kim, Kyeong Eun
    Ma, Mi Ru
    Lee, Kyeong Won
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2023, 30 (02) : 119 - 133
  • [39] Improved prediction for a spatio-temporal model
    Gen Nowak
    A. H. Welsh
    Environmental and Ecological Statistics, 2020, 27 : 631 - 648
  • [40] Improved prediction for a spatio-temporal model
    Nowak, Gen
    Welsh, A. H.
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2020, 27 (04) : 631 - 648