NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation

被引:21
|
作者
Li, Jiefeng [1 ]
Bian, Siyuan [1 ]
Liu, Qi [1 ]
Tang, Jiasheng [3 ]
Wang, Fan [3 ]
Lu, Cewu [1 ,2 ,4 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[3] Alibaba Grp, Hangzhou, Peoples R China
[4] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Qi Zhi Inst, Shanghai, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/CVPR52729.2023.01243
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the progress of 3D human pose and shape estimation, state-of-the-art methods can either be robust to occlusions or obtain pixel-aligned accuracy in non-occlusion cases. However, they cannot obtain robustness and mesh-image alignment at the same time. In this work, we present NIKI (Neural Inverse Kinematics with Invertible Neural Network), which models bi-directional errors to improve the robustness to occlusions and obtain pixel-aligned accuracy. NIKI can learn from both the forward and inverse processes with invertible networks. In the inverse process, the model separates the error from the plausible 3D pose manifold for a robust 3D human pose estimation. In the forward process, we enforce the zero-error boundary conditions to improve the sensitivity to reliable joint positions for better mesh-image alignment. Furthermore, NIKI emulates the analytical inverse kinematics algorithms with the twist-and-swing decomposition for better interpretability. Experiments on standard and occlusion-specific benchmarks demonstrate the effectiveness of NIKI, where we exhibit robust and well-aligned results simultaneously. Code is available at https://github.com/Jeff-sjtu/NIKI.
引用
收藏
页码:12933 / 12942
页数:10
相关论文
共 50 条
  • [41] Semi-Dynamic Hypergraph Neural Network for 3D Pose Estimation
    Liu, Shengyuan
    Lv, Pei
    Zhang, Yuzhen
    Fu, Jie
    Cheng, Junjin
    Li, Wanqing
    Zhou, Bing
    Xu, Mingliang
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 782 - 788
  • [43] Depth-aware Convolutional Neural Networks for accurate 3D Pose Estimation in RGB-D Images
    Porzi, Lorenzo
    Penate-Sanchez, Adrian
    Ricci, Elisa
    Moreno-Noguer, Francesc
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 5777 - 5783
  • [44] Neural networks for the recognition and pose estimation of 3D objects from a single 2D perspective view
    Yuan, C
    Niemann, H
    IMAGE AND VISION COMPUTING, 2001, 19 (9-10) : 585 - 592
  • [45] DeepPose: Human Pose Estimation via Deep Neural Networks
    Toshev, Alexander
    Szegedy, Christian
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1653 - 1660
  • [46] Double chain networks for monocular 3D human pose estimation
    Bai, Guihu
    Luo, Yanmin
    Pan, Xueliang
    Wang, Youjie
    Wang, Jia
    Guo, Jingming
    IMAGE AND VISION COMPUTING, 2022, 123
  • [47] Compositional Graph Convolutional Networks for 3D Human Pose Estimation
    Zou, Zhiming
    Liu, Tianqi
    Wu, Dapeng
    Tang, Wei
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [48] A Joint Relationship Aware Neural Network for Single-Image 3D Human Pose Estimation
    Zheng, Xiangtao
    Chen, Xiumei
    Lu, Xiaoqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 4747 - 4758
  • [49] Graph Stacked Hourglass Networks for 3D Human Pose Estimation
    Xu, Tianhan
    Takano, Wataru
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16100 - 16109
  • [50] MANIKIN: Biomechanically Accurate Neural Inverse Kinematics for Human Motion Estimation
    Jiang, Jiaxi
    Streli, Paul
    Luo, Xuejing
    Gebhardt, Christoph
    Holz, Christian
    COMPUTER VISION - ECCV 2024, PT II, 2025, 15060 : 128 - 146