A generic diffusion-based approach for 3D human pose prediction in the wild

被引:19
|
作者
Saadatnejad, Saeed [1 ]
Rasekh, Ali [1 ]
Mofayezi, Mohammadreza [1 ]
Medghalchi, Yasamin [1 ]
Rajahzadeh, Sara [1 ]
Mordan, Taylor [1 ]
Alahi, Alexandre [1 ]
机构
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
D O I
10.1109/ICRA48891.2023.10160399
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can predict given noisy observations. We frame the prediction task as a denoising problem, where both observation and prediction are considered as a single sequence containing missing elements (whether in the observation or prediction horizon). All missing elements are treated as noise and denoised with our conditional diffusion model. To better handle long-term forecasting horizon, we present a temporal cascaded diffusion model. We demonstrate the benefits of our approach on four publicly available datasets (Human3.6M, HumanEva-I, AMASS, and 3DPW), outperforming the state-of-the-art. Additionally, we show that our framework is generic enough to improve any 3D pose prediction model as a preprocessing step to repair their inputs and a post-processing step to refine their outputs. The code is available online: https://github.com/vita- epfl/DePOSit.
引用
收藏
页码:8246 / 8253
页数:8
相关论文
共 50 条
  • [21] Diffusion-Based Facial Aesthetics Enhancement With 3D Structure Guidance
    Li, Lisha
    Hou, Jingwen
    Liu, Weide
    Fang, Yuming
    Yan, Jiebin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1879 - 1894
  • [22] DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion
    Rommel, Cedric
    Valle, Eduardo
    Chen, Mickael
    Khalfaoui, Souhaiel
    Marlet, Renaud
    Cord, Matthieu
    Perez, Patrick
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3212 - 3221
  • [23] SMPL-Based 3D Pedestrian Pose Prediction
    Kunchala, Anil
    Bouroche, Melanie
    D'Arcy, Lorraine
    Schoen-Phelan, Bianca
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [24] 3D Point Cloud Attribute Compression Using Diffusion-Based Texture-Aware Intra Prediction
    Shao, Yiting
    Yang, Xiaodong
    Gao, Wei
    Liu, Shan
    Li, Ge
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9633 - 9646
  • [25] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Liu, Shuangjun
    Sehgal, Naveen
    Ostadabbas, Sarah
    APPLIED INTELLIGENCE, 2022, 52 (12) : 14491 - 14506
  • [26] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Shuangjun Liu
    Naveen Sehgal
    Sarah Ostadabbas
    Applied Intelligence, 2022, 52 : 14491 - 14506
  • [27] Estimating Egocentric 3D Human Pose in the Wild with External Weak Supervision
    Wang, Jian
    Liu, Lingjie
    Xu, Weipeng
    Sarkar, Kripasindhu
    Luvizon, Diogo
    Theobalt, Christian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13147 - 13156
  • [28] EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild
    Kaufmann, Manuel
    Song, Jie
    Guo, Chen
    Shen, Kaiyue
    Jiang, Tianjian
    Tang, Chengcheng
    Zarate, Juan Jose
    Hilliges, Otmar
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14586 - 14597
  • [29] RFID-based 3D human pose tracking: A subject generalization approach
    Yang, Chao
    Wang, Xuyu
    Mao, Shiwen
    DIGITAL COMMUNICATIONS AND NETWORKS, 2022, 8 (03) : 278 - 288
  • [30] A Bayesian Part-based Approach to 3D Human Pose and Camera Estimation
    Brau, Ernesto
    Jiang, Hao
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1762 - 1767