A generic diffusion-based approach for 3D human pose prediction in the wild

被引：19

作者：

Saadatnejad, Saeed ^{[1
]}

Rasekh, Ali ^{[1
]}

Mofayezi, Mohammadreza ^{[1
]}

Medghalchi, Yasamin ^{[1
]}

Rajahzadeh, Sara ^{[1
]}

Mordan, Taylor ^{[1
]}

Alahi, Alexandre ^{[1
]}

机构：

[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023) | 2023年

关键词：

D O I：

10.1109/ICRA48891.2023.10160399

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can predict given noisy observations. We frame the prediction task as a denoising problem, where both observation and prediction are considered as a single sequence containing missing elements (whether in the observation or prediction horizon). All missing elements are treated as noise and denoised with our conditional diffusion model. To better handle long-term forecasting horizon, we present a temporal cascaded diffusion model. We demonstrate the benefits of our approach on four publicly available datasets (Human3.6M, HumanEva-I, AMASS, and 3DPW), outperforming the state-of-the-art. Additionally, we show that our framework is generic enough to improve any 3D pose prediction model as a preprocessing step to repair their inputs and a post-processing step to refine their outputs. The code is available online: https://github.com/vita- epfl/DePOSit.

引用

页码：8246 / 8253

页数：8

共 50 条

[21] Diffusion-Based Facial Aesthetics Enhancement With 3D Structure Guidance
Li, Lisha
Hou, Jingwen
Liu, Weide
Fang, Yuming
Yan, Jiebin
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1879 - 1894
[22] DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion
Rommel, Cedric
Valle, Eduardo
Chen, Mickael
Khalfaoui, Souhaiel
Marlet, Renaud
Cord, Matthieu
Perez, Patrick
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3212 - 3221
[23] SMPL-Based 3D Pedestrian Pose Prediction
Kunchala, Anil
Bouroche, Melanie
D'Arcy, Lorraine
Schoen-Phelan, Bianca
2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
[24] 3D Point Cloud Attribute Compression Using Diffusion-Based Texture-Aware Intra Prediction
Shao, Yiting
Yang, Xiaodong
Gao, Wei
Liu, Shan
Li, Ge
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9633 - 9646
[25] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
Liu, Shuangjun
Sehgal, Naveen
Ostadabbas, Sarah
APPLIED INTELLIGENCE, 2022, 52 (12) : 14491 - 14506
[26] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
Shuangjun Liu
Naveen Sehgal
Sarah Ostadabbas
Applied Intelligence, 2022, 52 : 14491 - 14506
[27] Estimating Egocentric 3D Human Pose in the Wild with External Weak Supervision
Wang, Jian
Liu, Lingjie
Xu, Weipeng
Sarkar, Kripasindhu
Luvizon, Diogo
Theobalt, Christian
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13147 - 13156
[28] EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild
Kaufmann, Manuel
Song, Jie
Guo, Chen
Shen, Kaiyue
Jiang, Tianjian
Tang, Chengcheng
Zarate, Juan Jose
Hilliges, Otmar
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14586 - 14597
[29] RFID-based 3D human pose tracking: A subject generalization approach
Yang, Chao
Wang, Xuyu
Mao, Shiwen
DIGITAL COMMUNICATIONS AND NETWORKS, 2022, 8 (03) : 278 - 288
[30] A Bayesian Part-based Approach to 3D Human Pose and Camera Estimation
Brau, Ernesto
Jiang, Hao
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1762 - 1767

← 1 2 3 4 5 →