Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction

被引：23

作者：

Li, Maosen ^{[1
,2
]}

Chen, Siheng ^{[1
,2
]}

Zhang, Zijing ^{[3
]}

Xie, Lingxi ^{[4
]}

Tian, Qi ^{[4
]}

Zhang, Ya ^{[1
,2
]}

机构：

[1] Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China

[2] Shanghai AI Lab, Shanghai, Peoples R China

[3] Zhejiang Univ, Hangzhou, Peoples R China

[4] Huawei Cloud & AI, Shenzhen, Peoples R China

来源：

COMPUTER VISION - ECCV 2022, PT VI | 2022年 / 13666卷

基金：

中国国家自然科学基金;

关键词：

Human motion prediction; Adaptive graph scattering; Spatial separation; Bipartite cross-part fusion;

D O I：

10.1007/978-3-031-20068-7_2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Graph convolutional network based methods that model the body-joints' relations, have recently shown great promise in 3D skeleton-based human motion prediction. However, these methods have two critical issues: first, deep graph convolutions filter features within only limited graph spectrums, losing sufficient information in the full band; second, using a single graph to model the whole body underestimates the diverse patterns on various body-parts. To address the first issue, we propose adaptive graph scattering, which leverages multiple trainable band-pass graph filters to decompose pose features into richer graph spectrum bands. To address the second issue, body-parts are modeled separately to learn diverse dynamics, which enables finer feature extraction along the spatial dimensions. Integrating the above two designs, we propose a novel skeleton-parted graph scattering network (SPGSN). The cores of the model are cascaded multi-part graph scattering blocks (MPGSBs), building adaptive graph scattering on diverse body-parts, as well as fusing the decomposed features based on the inferred spectrum importance and body-part interactions. Extensive experiments have shown that SPGSN outperforms state-of-the-art methods by remarkable margins of 13.8%, 9.3% and 2.7% in terms of 3D mean per joint position error (MPJPE) on Human3.6M, CMU Mocap and 3DPW datasets, respectively.

引用

页码：18 / 36

页数：19

共 50 条

[31] Graph Stacked Hourglass Networks for 3D Human Pose Estimation
Xu, Tianhan
Takano, Wataru
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16100 - 16109
[32] Semantic Graph Convolutional Networks for 3D Human Pose Regression
Zhao, Long
Peng, Xi
Tian, Yu
Kapadia, Mubbasir
Metaxas, Dimitris N.
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3420 - 3430
[33] Compositional Graph Convolutional Networks for 3D Human Pose Estimation
Zou, Zhiming
Liu, Tianqi
Wu, Dapeng
Tang, Wei
2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
[34] Human body motion capture via 3D graph-cuts
Wan, Chengkai
Yuan, Baozong
Sun, Yunda
Miao, Zhenjiang
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 1563 - +
[35] 3D Human Skeleton Estimation Based on RGB Image Sequence and Graph Convolution Network
Lie, Wen-Nung
Yang, Pei-Hsuan
Vann, Veasna
Chiang, Jui-Chiu
2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
[36] Dynamic Dense Graph Convolutional Network for Skeleton-Based Human Motion Prediction
Wang, Xinshun
Zhang, Wanying
Wang, Can
Gao, Yuan
Liu, Mengyuan
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1 - 15
[37] Dynamic Dense Graph Convolutional Network for Skeleton-Based Human Motion Prediction
Wang, Xinshun
Zhang, Wanying
Wang, Can
Gao, Yuan
Liu, Mengyuan
IEEE Transactions on Image Processing, 2024, 33 : 1 - 15
[38] MoML: Online Meta Adaptation for 3D Human Motion Prediction
Sun, Xiaoning
Sun, Huaijiang
Li, Bin
Wei, Dong
Li, Weiqing
Lu, Jianfeng
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 1042 - 1051
[39] 3D HUMAN LIFTING MOTION PREDICTION WITH DIFFERENT PERFORMANCE MEASURES
Xiang, Yujiang
Arora, Jasbir S.
Abdel-Malek, Karim
INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2012, 9 (02)
[40] A Spatio-temporal Transformer for 3D Human Motion Prediction
Aksan, Emre
Kaufmann, Manuel
Cao, Peng
Hilliges, Otmar
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 565 - 574

← 1 2 3 4 5 →