Neural Descent for Visual 3D Human Pose and Shape

Cited by: 35
Authors
Zanfir, Andrei [1]
Bazavan, Eduard Gabriel [1]
Zanfir, Mihai [1]
Freeman, William T. [1]
Sukthankar, Rahul [1]
Sminchisescu, Cristian [1]
Affiliations
[1] Google Res, Bangalore, Karnataka, India
DOI
10.1109/CVPR46437.2021.01425
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a deep neural network methodology to reconstruct the 3D pose and shape of people, including hand gestures and facial expression, given an input RGB image. We rely on a recently introduced, expressive full-body statistical 3D human model, GHUM, trained end-to-end, and learn to reconstruct its pose and shape state in a self-supervised regime. Central to our methodology is a learning-to-learn-and-optimize approach, referred to as HUman Neural Descent (HUND), which avoids both second-order differentiation when training the model parameters and expensive state gradient descent to accurately minimize a semantic differentiable rendering loss at test time. Instead, we rely on novel recurrent stages to update the pose and shape parameters such that not only are losses minimized effectively, but the process is also meta-regularized to ensure end-progress. HUND's symmetry between training and testing makes it the first 3D human sensing architecture to natively support different operating regimes, including self-supervised ones. In diverse tests, we show that HUND achieves very competitive results on datasets such as H3.6M and 3DPW, as well as good-quality 3D reconstructions for complex imagery collected in the wild.
Pages: 14479-14488
Page count: 10