LEARNING MONOCULAR 3D HUMAN POSE ESTIMATION WITH SKELETAL INTERPOLATION

被引：2

作者：

Chen, Ziyi ^{[1
]}

Sugimoto, Akihiro ^{[2
]}

Lai, Shang-Hong ^{[1
]}

机构：

[1] Natl Tsing Hua Univ, Hsinchu, Taiwan

[2] Natl Inst Informat, Tokyo, Japan

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年

关键词：

Data augmentation; skeletal interpolation; transformer; 3D human pose estimation;

D O I：

10.1109/ICASSP43922.2022.9746410

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Deep learning has achieved unprecedented accuracy for monocular 3D human pose estimation. However, current learning-based 3D human pose estimation still suffers from poor generalization. Inspired by skeletal animation, which is popular in game development and animation production, we put forward an simple, intuitive yet effective interpolation-based data augmentation approach to synthesize continuous and diverse 3D human body sequences to enhance model generalization. The Transformer-based lifting network, trained with the augmented data, utilizes the self-attention mechanism to perform 2D-to-3D lifting and successfully infer high-quality predictions in the qualitative experiment. The quantitative result of cross-dataset experiment demonstrates that our resulting model achieves superior generalization accuracy on the publicly available dataset.

引用

页码：4218 / 4222

页数：5

共 50 条

[21] Recent Advances of Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective
Liu, Wu
Bao, Qian
Sun, Yu
Mei, Tao
ACM COMPUTING SURVEYS, 2023, 55 (04)
[22] 3D Human Pose Estimation With Adversarial Learning
Meng, Wenming
Hu, Tao
Shuai, Li
2019 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV), 2019, : 93 - 99
[23] Monocular 3D Pose Estimation and Tracking by Detection
Andriluka, Mykhaylo
Roth, Stefan
Schiele, Bernt
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 623 - 630
[24] Limb Pose Aware Networks for Monocular 3D Pose Estimation
Wu, Lele
Yu, Zhenbo
Liu, Yijiang
Liu, Qingshan
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 906 - 917
[25] SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation
Xu, Xiangyu
Liu, Lijuan
Yan, Shuicheng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 3275 - 3289
[26] Evaluation of Human Pose Estimation in 3D with Monocular Camera for Clinical Application
Carrasco-Plaza, Jose
Cerda, Mauricio
INTELLIGENT COMPUTING SYSTEMS (ISICS 2022), 2022, 1569 : 121 - 134
[27] Personalized Graph Generation for Monocular 3D Human Pose and Shape Estimation
Hu, Junxing
Zhang, Hongwen
Wang, Yunlong
Ren, Min
Sun, Zhenan
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2399 - 2413
[28] Boosting Monocular 3D Human Pose Estimation With Part Aware Attention
Xue, Youze
Chen, Jiansheng
Gu, Xiangming
Ma, Huimin
Ma, Hongbing
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4278 - 4291
[29] Human Context: Modeling Human-Human Interactions for Monocular 3D Pose Estimation
Andriluka, Mykhaylo
Sigal, Leonid
ARTICULATED MOTION AND DEFORMABLE OBJECTS, 2012, 7378 : 260 - 272
[30] Monocular 3D Pose Estimation via Pose Grammar and Data Augmentation
Xu, Yuanlu
Wang, Wenguan
Liu, Tengyu
Liu, Xiaobai
Xie, Jianwen
Zhu, Song-Chun
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6327 - 6344

← 1 2 3 4 5 →