NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation

被引:21
|
作者
Li, Jiefeng [1 ]
Bian, Siyuan [1 ]
Liu, Qi [1 ]
Tang, Jiasheng [3 ]
Wang, Fan [3 ]
Lu, Cewu [1 ,2 ,4 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[3] Alibaba Grp, Hangzhou, Peoples R China
[4] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Qi Zhi Inst, Shanghai, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/CVPR52729.2023.01243
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the progress of 3D human pose and shape estimation, state-of-the-art methods can either be robust to occlusions or obtain pixel-aligned accuracy in non-occlusion cases. However, they cannot obtain robustness and mesh-image alignment at the same time. In this work, we present NIKI (Neural Inverse Kinematics with Invertible Neural Network), which models bi-directional errors to improve the robustness to occlusions and obtain pixel-aligned accuracy. NIKI can learn from both the forward and inverse processes with invertible networks. In the inverse process, the model separates the error from the plausible 3D pose manifold for a robust 3D human pose estimation. In the forward process, we enforce the zero-error boundary conditions to improve the sensitivity to reliable joint positions for better mesh-image alignment. Furthermore, NIKI emulates the analytical inverse kinematics algorithms with the twist-and-swing decomposition for better interpretability. Experiments on standard and occlusion-specific benchmarks demonstrate the effectiveness of NIKI, where we exhibit robust and well-aligned results simultaneously. Code is available at https://github.com/Jeff-sjtu/NIKI.
引用
收藏
页码:12933 / 12942
页数:10
相关论文
共 50 条
  • [31] NEURAL NETWORKS AND THE INVERSE KINEMATICS PROBLEM
    JACK, H
    LEE, DMA
    BUCHAL, RO
    ELMARAGHY, WH
    JOURNAL OF INTELLIGENT MANUFACTURING, 1993, 4 (01) : 43 - 66
  • [32] 3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network
    Li, Sijin
    Chan, Antoni B.
    COMPUTER VISION - ACCV 2014, PT II, 2015, 9004 : 332 - 347
  • [33] Pose prediction using 3D deep convolutional neural networks
    Wallach, Izhar
    Dzamba, Michael
    Schrodl, Stefan
    Rampasek, Ladislav
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
  • [34] Learnable Human Mesh Triangulation for 3D Human Pose and Shape Estimation
    Chun, Sungho
    Park, Sungbum
    Chang, Ju Yong
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2849 - 2858
  • [35] HYRE: Hybrid Regressor for 3D Human Pose and Shape Estimation
    Li, Wenhao
    Liu, Mengyuan
    Liu, Hong
    Ren, Bin
    Li, Xia
    You, Yingxuan
    Sebe, Nicu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 235 - 246
  • [36] Evaluating Shape and Appearance Descriptors for 3D Human Pose Estimation
    Sedai, S.
    Bennamoun, M.
    Huynh, D. Q.
    2011 6TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2011, : 293 - 298
  • [37] ADVERSARIAL LEARNING ENHANCEMENT FOR 3D HUMAN POSE AND SHAPE ESTIMATION
    Sun, Yidian
    Zhang, Jiwei
    Wang, Wendong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3743 - 3747
  • [38] 3D shape estimation in a constraint optimization neural network
    Mishra, Pallavi
    Helie, Sebastien
    VISION RESEARCH, 2020, 177 : 118 - 129
  • [39] Deep 3D Pose Dictionary: 3D Human Pose Estimation from Single RGB Image Using Deep Convolutional Neural Network
    Elbasiony, Reda
    Gomaa, Walid
    Ogata, Tetsuya
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 310 - 320
  • [40] Improved numerical inverse kinematics for human pose estimation
    Seo, Yung-Ho
    Lee, Chil-Woo
    Choi, Jong-Soo
    OPTICAL ENGINEERING, 2011, 50 (03)