HandVoxNet++: 3D Hand Shape and Pose Estimation Using Voxel-Based Neural Networks

被引:17
|
作者
Malik, Jameel [1 ,2 ]
Shimada, Soshi [3 ,4 ]
Elhayek, Ahmed [5 ]
Ali, Sk Aziz [1 ,6 ]
Theobalt, Christian [3 ]
Golyanik, Vladislav [3 ]
Stricker, Didier [1 ,6 ]
机构
[1] TU Kaiserslautern, D-67663 Kaiserslautern, Germany
[2] NUST, Islamabad 44000, Pakistan
[3] MPI Informat, Saarbrcken, Germany
[4] Saarland Informat Campus, D-66123 Saarbrcken, Germany
[5] UPM, Medina 42241, Saudi Arabia
[6] DFKI, D-67663 Kaiserslautern, Germany
关键词
3D hand shape and pose from a single depth map; voxelized hand shape; graph convolutions; TSDF; 3D data augmentation; shape registration; GCN-MeshReg; NRGA plus;
D O I
10.1109/TPAMI.2021.3122874
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D hand shape and pose estimation from a single depth map is a new and challenging computer vision problem with many applications. Existing methods addressing it directly regress hand meshes via 2D convolutional neural networks, which leads to artifacts due to perspective distortions in the images. To address the limitations of the existing methods, we develop HandVoxNet++, i.e., a voxel-based deep network with 3D and graph convolutions trained in a fully supervised manner. The input to our network is a 3D voxelized-depth-map-based on the truncated signed distance function (TSDF). HandVoxNet++ relies on two hand shape representations. The first one is the 3D voxelized grid of hand shape, which does not preserve the mesh topology and which is the most accurate representation. The second representation is the hand surface that preserves the mesh topology. We combine the advantages of both representations by aligning the hand surface to the voxelized hand shape either with a new neural Graph-Convolutions-based Mesh Registration (GCN-MeshReg) or classical segment-wise Non-Rigid Gravitational Approach (NRGA++) which does not rely on training data. In extensive evaluations on three public benchmarks, i.e., SynHand5M, depth-based HANDS19 challenge and HO-3D, the proposed HandVoxNet++ achieves the state-of-the-art performance. In this journal extension of our previous approach presented at CVPR 2020, we gain 41.09% and 13.7% higher shape alignment accuracy on SynHand5M and HANDS19 datasets, respectively. Our method is ranked first on the HANDS19 challenge dataset (Task 1: Depth-Based 3D Hand Pose Estimation) at the moment of the submission of our results to the portal in August 2020.
引用
收藏
页码:8962 / 8974
页数:13
相关论文
共 50 条
  • [1] HandVoxNet++: 3D Hand Shape and Pose Estimation Using Voxel-Based Neural Networks
    Malik, Jameel
    Shimada, Soshi
    Elhayek, Ahmed
    Ali, Sk Aziz
    Theobalt, Christian
    Golyanik, Vladislav
    Stricker, Didier
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44 (12): : 8962 - 8974
  • [2] HandVoxNet: Deep Voxel-Based Network for 3D Hand Shape and Pose Estimation from a Single Depth Map
    Malik, Jameel
    Abdelaziz, Ibrahim
    Elhayek, Ahmed
    Shimada, Soshi
    Ali, Sk Aziz
    Golyanik, Vladislav
    Theobalt, Christian
    Stricker, Didier
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7111 - 7120
  • [3] Voxel-Based 3D Shape Segmentation Using Deep Volumetric Convolutional Neural Networks
    Liu, Yuqi
    Long, Wei
    Shu, Zhenyu
    Yi, Shun
    Xin, Shiqing
    ADVANCES IN COMPUTER GRAPHICS, CGI 2022, 2022, 13443 : 489 - 500
  • [4] Hand Pose Estimation using Voxel-based Individualized Hand Model
    Causo, Albert
    Matsuo, Mai
    Ueda, Etsuko
    Takemura, Kentaro
    Matsumoto, Yoshio
    Takamatsu, Jun
    Ogasawara, Tsukasa
    2009 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, VOLS 1-3, 2009, : 451 - +
  • [5] 3D Hand Pose Estimation with Neural Networks
    Antonio Serra, Jose
    Garcia-Rodriguez, Jose
    Orts-Escolano, Sergio
    Manuel Garcia-Chamizo, Juan
    Angelopoulou, Anastassia
    Psarrou, Alexandra
    Mentzelopoulos, Markos
    Montoyo-Bojo, Javier
    Dominguez, Enrique
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT II, 2013, 7903 : 504 - +
  • [6] 3D Hand Shape and Pose Estimation based on 2D Hand Keypoints
    Drosakis, Drosakis
    Argyros, Antonis
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2023, 2023, : 148 - 153
  • [7] Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks
    Ge, Liuhao
    Liang, Hui
    Yuan, Junsong
    Thalmann, Daniel
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (04) : 956 - 970
  • [8] A Convolutional Neural Networks Oriented Approach for Voxel-Based 3D Object Classification
    Sirma, Ridvan
    Dinar, Berkan
    Sahin, Yusuf Huseyin
    Unal, Gozde
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [9] Graph Convolutional Neural Networks-based 3D Hand Pose Estimation over Point Clouds
    Alejandro Castro-Vargas, John
    Martinez-Gonzalez, Pablo
    Oprea, Sergiu
    Garcia-Garcia, Alberto
    Orts-Escolano, Sergio
    Garcia-Rodriguez, Jose
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [10] NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation
    Li, Jiefeng
    Bian, Siyuan
    Liu, Qi
    Tang, Jiasheng
    Wang, Fan
    Lu, Cewu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12933 - 12942