Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction

被引:23
|
作者
Li, Maosen [1 ,2 ]
Chen, Siheng [1 ,2 ]
Zhang, Zijing [3 ]
Xie, Lingxi [4 ]
Tian, Qi [4 ]
Zhang, Ya [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China
[2] Shanghai AI Lab, Shanghai, Peoples R China
[3] Zhejiang Univ, Hangzhou, Peoples R China
[4] Huawei Cloud & AI, Shenzhen, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Human motion prediction; Adaptive graph scattering; Spatial separation; Bipartite cross-part fusion;
D O I
10.1007/978-3-031-20068-7_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph convolutional network based methods that model the body-joints' relations, have recently shown great promise in 3D skeleton-based human motion prediction. However, these methods have two critical issues: first, deep graph convolutions filter features within only limited graph spectrums, losing sufficient information in the full band; second, using a single graph to model the whole body underestimates the diverse patterns on various body-parts. To address the first issue, we propose adaptive graph scattering, which leverages multiple trainable band-pass graph filters to decompose pose features into richer graph spectrum bands. To address the second issue, body-parts are modeled separately to learn diverse dynamics, which enables finer feature extraction along the spatial dimensions. Integrating the above two designs, we propose a novel skeleton-parted graph scattering network (SPGSN). The cores of the model are cascaded multi-part graph scattering blocks (MPGSBs), building adaptive graph scattering on diverse body-parts, as well as fusing the decomposed features based on the inferred spectrum importance and body-part interactions. Extensive experiments have shown that SPGSN outperforms state-of-the-art methods by remarkable margins of 13.8%, 9.3% and 2.7% in terms of 3D mean per joint position error (MPJPE) on Human3.6M, CMU Mocap and 3DPW datasets, respectively.
引用
收藏
页码:18 / 36
页数:19
相关论文
共 50 条
  • [21] Contextually Plausible and Diverse 3D Human Motion Prediction
    Aliakbarian, Sadegh
    Saleh, Fatemeh
    Petersson, Lars
    Gould, Stephen
    Salzmann, Mathieu
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11313 - 11322
  • [22] Action-guided 3D Human Motion Prediction
    Sun, Jiangxin
    Lin, Zihang
    Han, Xintong
    Hu, Jian-Fang
    Xu, Jia
    Zheng, Wei-Shi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [23] Human Motion Recognition Using 3D-Skeleton-Data and Neural Networks
    Vox, Jan P.
    Wallhoff, Frank
    ARTIFICIAL INTELLIGENCE XXXV (AI 2018), 2018, 11311 : 204 - 209
  • [24] Fusion of Temporal Transformer and Spatial Graph Convolutional Network for 3-D Skeleton-Parts-Based Human Motion Prediction
    Lovanshi, Mayank
    Tiwari, Vivek
    Ingle, Rajesh
    Jain, Swati
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2024, 54 (06) : 788 - 797
  • [25] MFOGCN: multi-feature-based orthogonal graph convolutional network for 3D human motion prediction
    Tu, Jianfeng
    Zang, Tuo
    Duan, Mengran
    Jiang, Hanrui
    Zhao, Jiahui
    Jiang, Nan
    Liu, Lingfeng
    VISUAL COMPUTER, 2024, 40 (09): : 6047 - 6062
  • [26] 3D human motion retrieval using graph kernels based on adaptive graph construction
    Li, Meng
    Leung, Howard
    Liu, Zhiguang
    Zhou, Liuyang
    COMPUTERS & GRAPHICS-UK, 2016, 54 : 104 - 112
  • [27] MotioNet: 3D Human Motion Reconstruction from Monocular Video with Skeleton Consistency
    Shi, Mingyi
    Aberman, Kfir
    Aristidou, Andreas
    Komura, Taku
    Lischinski, Dani
    Cohen-Or, Daniel
    Chen, Baoquan
    ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (01):
  • [28] Improved 3D Human Motion Capture Using Kinect Skeleton and Depth Sensor
    Bilesan, Alireza
    Komizunai, Shunsuke
    Tsujita, Teppei
    Konno, Atsushi
    JOURNAL OF ROBOTICS AND MECHATRONICS, 2021, 33 (06) : 1407 - 1421
  • [29] MotioNet: 3D Human motion reconstruction from monocular video with skeleton consistency
    Shi, Mingyi
    Aberman, Kfir
    Aristidou, Andreas
    Komura, Taku
    Lischinski, Dani
    Cohen-Or, Daniel
    Chen, Baoquan
    ACM Transactions on Graphics, 2020, 40 (01):
  • [30] MHFDN:Multi-branch Hybrid Frequency Domain Networks for 3D Human Motion Prediction
    Zhou, Xin
    Liu, Rui
    Dong, Jing
    Yi, Pengfei
    Wang, Ling
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 696 - 701