Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction

被引:23
|
作者
Li, Maosen [1 ,2 ]
Chen, Siheng [1 ,2 ]
Zhang, Zijing [3 ]
Xie, Lingxi [4 ]
Tian, Qi [4 ]
Zhang, Ya [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China
[2] Shanghai AI Lab, Shanghai, Peoples R China
[3] Zhejiang Univ, Hangzhou, Peoples R China
[4] Huawei Cloud & AI, Shenzhen, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Human motion prediction; Adaptive graph scattering; Spatial separation; Bipartite cross-part fusion;
D O I
10.1007/978-3-031-20068-7_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph convolutional network based methods that model the body-joints' relations, have recently shown great promise in 3D skeleton-based human motion prediction. However, these methods have two critical issues: first, deep graph convolutions filter features within only limited graph spectrums, losing sufficient information in the full band; second, using a single graph to model the whole body underestimates the diverse patterns on various body-parts. To address the first issue, we propose adaptive graph scattering, which leverages multiple trainable band-pass graph filters to decompose pose features into richer graph spectrum bands. To address the second issue, body-parts are modeled separately to learn diverse dynamics, which enables finer feature extraction along the spatial dimensions. Integrating the above two designs, we propose a novel skeleton-parted graph scattering network (SPGSN). The cores of the model are cascaded multi-part graph scattering blocks (MPGSBs), building adaptive graph scattering on diverse body-parts, as well as fusing the decomposed features based on the inferred spectrum importance and body-part interactions. Extensive experiments have shown that SPGSN outperforms state-of-the-art methods by remarkable margins of 13.8%, 9.3% and 2.7% in terms of 3D mean per joint position error (MPJPE) on Human3.6M, CMU Mocap and 3DPW datasets, respectively.
引用
收藏
页码:18 / 36
页数:19
相关论文
共 50 条
  • [31] Graph Stacked Hourglass Networks for 3D Human Pose Estimation
    Xu, Tianhan
    Takano, Wataru
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16100 - 16109
  • [32] Semantic Graph Convolutional Networks for 3D Human Pose Regression
    Zhao, Long
    Peng, Xi
    Tian, Yu
    Kapadia, Mubbasir
    Metaxas, Dimitris N.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3420 - 3430
  • [33] Compositional Graph Convolutional Networks for 3D Human Pose Estimation
    Zou, Zhiming
    Liu, Tianqi
    Wu, Dapeng
    Tang, Wei
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [34] Human body motion capture via 3D graph-cuts
    Wan, Chengkai
    Yuan, Baozong
    Sun, Yunda
    Miao, Zhenjiang
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 1563 - +
  • [35] 3D Human Skeleton Estimation Based on RGB Image Sequence and Graph Convolution Network
    Lie, Wen-Nung
    Yang, Pei-Hsuan
    Vann, Veasna
    Chiang, Jui-Chiu
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [36] Dynamic Dense Graph Convolutional Network for Skeleton-Based Human Motion Prediction
    Wang, Xinshun
    Zhang, Wanying
    Wang, Can
    Gao, Yuan
    Liu, Mengyuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1 - 15
  • [37] Dynamic Dense Graph Convolutional Network for Skeleton-Based Human Motion Prediction
    Wang, Xinshun
    Zhang, Wanying
    Wang, Can
    Gao, Yuan
    Liu, Mengyuan
    IEEE Transactions on Image Processing, 2024, 33 : 1 - 15
  • [38] MoML: Online Meta Adaptation for 3D Human Motion Prediction
    Sun, Xiaoning
    Sun, Huaijiang
    Li, Bin
    Wei, Dong
    Li, Weiqing
    Lu, Jianfeng
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 1042 - 1051
  • [39] 3D HUMAN LIFTING MOTION PREDICTION WITH DIFFERENT PERFORMANCE MEASURES
    Xiang, Yujiang
    Arora, Jasbir S.
    Abdel-Malek, Karim
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2012, 9 (02)
  • [40] A Spatio-temporal Transformer for 3D Human Motion Prediction
    Aksan, Emre
    Kaufmann, Manuel
    Cao, Peng
    Hilliges, Otmar
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 565 - 574