A generic diffusion-based approach for 3D human pose prediction in the wild

被引:19
|
作者
Saadatnejad, Saeed [1 ]
Rasekh, Ali [1 ]
Mofayezi, Mohammadreza [1 ]
Medghalchi, Yasamin [1 ]
Rajahzadeh, Sara [1 ]
Mordan, Taylor [1 ]
Alahi, Alexandre [1 ]
机构
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
D O I
10.1109/ICRA48891.2023.10160399
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can predict given noisy observations. We frame the prediction task as a denoising problem, where both observation and prediction are considered as a single sequence containing missing elements (whether in the observation or prediction horizon). All missing elements are treated as noise and denoised with our conditional diffusion model. To better handle long-term forecasting horizon, we present a temporal cascaded diffusion model. We demonstrate the benefits of our approach on four publicly available datasets (Human3.6M, HumanEva-I, AMASS, and 3DPW), outperforming the state-of-the-art. Additionally, we show that our framework is generic enough to improve any 3D pose prediction model as a preprocessing step to repair their inputs and a post-processing step to refine their outputs. The code is available online: https://github.com/vita- epfl/DePOSit.
引用
收藏
页码:8246 / 8253
页数:8
相关论文
共 50 条
  • [31] RFID-based 3D human pose tracking:A subject generalization approach
    Chao Yang
    Xuyu Wang
    Shiwen Mao
    Digital Communications and Networks, 2022, 8 (03) : 278 - 288
  • [32] 3D Pose Estimation and 3D Model Retrieval for Objects in the Wild
    Grabner, Alexander
    Roth, Peter M.
    Lepetit, Vincent
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3022 - 3031
  • [33] An approach to 3D pose determination
    Ezquerra, N
    Mullick, R
    ACM TRANSACTIONS ON GRAPHICS, 1996, 15 (02): : 99 - 120
  • [34] Diff3DHPE: A Diffusion Model for 3D Human Pose Estimation
    Zhou, Jieming
    Zhang, Tong
    Hayder, Zeeshan
    Petersson, Lars
    Harandi, Mehrtash
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2084 - 2094
  • [35] 3D generic object categorization, localization and pose estimation
    Savarese, Silvio
    Fei-Fei, Li
    2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 1245 - 1252
  • [36] Generic 3D Representation via Pose Estimation and Matching
    Zamir, Amir R.
    Wekel, Tilman
    Agrawal, Pulkit
    Wei, Colin
    Malik, Jitendra
    Savarese, Silvio
    COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 535 - 553
  • [37] Exemplar Fine-Tuning for 3D Human Model Fitting Towards In-the-Wild 3D Human Pose Estimation
    Joo, Hanbyul
    Neverova, Natalia
    Vedaldi, Andrea
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 42 - 52
  • [38] An Image Cues Coding Approach for 3D Human Pose Estimation
    Xing, Meng
    Feng, Zhiyong
    Su, Yong
    Zhang, Jianhai
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (04)
  • [39] Recognizing Conversational Interaction Based on 3D Human Pose
    Deng, Jingjing
    Xie, Xianghua
    Daubney, Ben
    Fang, Hui
    Grant, Phil W.
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2013, 2013, 8192 : 138 - 149
  • [40] 3D Human Pose Estimation based on Center of Gravity
    Xu, Liao
    Wu, Suping
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,