A generic diffusion-based approach for 3D human pose prediction in the wild

被引：19

作者：

Saadatnejad, Saeed ^{[1
]}

Rasekh, Ali ^{[1
]}

Mofayezi, Mohammadreza ^{[1
]}

Medghalchi, Yasamin ^{[1
]}

Rajahzadeh, Sara ^{[1
]}

Mordan, Taylor ^{[1
]}

Alahi, Alexandre ^{[1
]}

机构：

[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023) | 2023年

关键词：

D O I：

10.1109/ICRA48891.2023.10160399

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can predict given noisy observations. We frame the prediction task as a denoising problem, where both observation and prediction are considered as a single sequence containing missing elements (whether in the observation or prediction horizon). All missing elements are treated as noise and denoised with our conditional diffusion model. To better handle long-term forecasting horizon, we present a temporal cascaded diffusion model. We demonstrate the benefits of our approach on four publicly available datasets (Human3.6M, HumanEva-I, AMASS, and 3DPW), outperforming the state-of-the-art. Additionally, we show that our framework is generic enough to improve any 3D pose prediction model as a preprocessing step to repair their inputs and a post-processing step to refine their outputs. The code is available online: https://github.com/vita- epfl/DePOSit.

引用

页码：8246 / 8253

页数：8

共 50 条

[31] RFID-based 3D human pose tracking:A subject generalization approach
Chao Yang
Xuyu Wang
Shiwen Mao
Digital Communications and Networks, 2022, 8 (03) : 278 - 288
[32] 3D Pose Estimation and 3D Model Retrieval for Objects in the Wild
Grabner, Alexander
Roth, Peter M.
Lepetit, Vincent
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3022 - 3031
[33] An approach to 3D pose determination
Ezquerra, N
Mullick, R
ACM TRANSACTIONS ON GRAPHICS, 1996, 15 (02): : 99 - 120
[34] Diff3DHPE: A Diffusion Model for 3D Human Pose Estimation
Zhou, Jieming
Zhang, Tong
Hayder, Zeeshan
Petersson, Lars
Harandi, Mehrtash
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2084 - 2094
[35] 3D generic object categorization, localization and pose estimation
Savarese, Silvio
Fei-Fei, Li
2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 1245 - 1252
[36] Generic 3D Representation via Pose Estimation and Matching
Zamir, Amir R.
Wekel, Tilman
Agrawal, Pulkit
Wei, Colin
Malik, Jitendra
Savarese, Silvio
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 535 - 553
[37] Exemplar Fine-Tuning for 3D Human Model Fitting Towards In-the-Wild 3D Human Pose Estimation
Joo, Hanbyul
Neverova, Natalia
Vedaldi, Andrea
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 42 - 52
[38] An Image Cues Coding Approach for 3D Human Pose Estimation
Xing, Meng
Feng, Zhiyong
Su, Yong
Zhang, Jianhai
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (04)
[39] Recognizing Conversational Interaction Based on 3D Human Pose
Deng, Jingjing
Xie, Xianghua
Daubney, Ben
Fang, Hui
Grant, Phil W.
ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2013, 2013, 8192 : 138 - 149
[40] 3D Human Pose Estimation based on Center of Gravity
Xu, Liao
Wu, Suping
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,

← 1 2 3 4 5 →