HandVoxNet++: 3D Hand Shape and Pose Estimation Using Voxel-Based Neural Networks

被引:17
|
作者
Malik, Jameel [1 ,2 ]
Shimada, Soshi [3 ,4 ]
Elhayek, Ahmed [5 ]
Ali, Sk Aziz [1 ,6 ]
Theobalt, Christian [3 ]
Golyanik, Vladislav [3 ]
Stricker, Didier [1 ,6 ]
机构
[1] TU Kaiserslautern, D-67663 Kaiserslautern, Germany
[2] NUST, Islamabad 44000, Pakistan
[3] MPI Informat, Saarbrcken, Germany
[4] Saarland Informat Campus, D-66123 Saarbrcken, Germany
[5] UPM, Medina 42241, Saudi Arabia
[6] DFKI, D-67663 Kaiserslautern, Germany
关键词
3D hand shape and pose from a single depth map; voxelized hand shape; graph convolutions; TSDF; 3D data augmentation; shape registration; GCN-MeshReg; NRGA plus;
D O I
10.1109/TPAMI.2021.3122874
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D hand shape and pose estimation from a single depth map is a new and challenging computer vision problem with many applications. Existing methods addressing it directly regress hand meshes via 2D convolutional neural networks, which leads to artifacts due to perspective distortions in the images. To address the limitations of the existing methods, we develop HandVoxNet++, i.e., a voxel-based deep network with 3D and graph convolutions trained in a fully supervised manner. The input to our network is a 3D voxelized-depth-map-based on the truncated signed distance function (TSDF). HandVoxNet++ relies on two hand shape representations. The first one is the 3D voxelized grid of hand shape, which does not preserve the mesh topology and which is the most accurate representation. The second representation is the hand surface that preserves the mesh topology. We combine the advantages of both representations by aligning the hand surface to the voxelized hand shape either with a new neural Graph-Convolutions-based Mesh Registration (GCN-MeshReg) or classical segment-wise Non-Rigid Gravitational Approach (NRGA++) which does not rely on training data. In extensive evaluations on three public benchmarks, i.e., SynHand5M, depth-based HANDS19 challenge and HO-3D, the proposed HandVoxNet++ achieves the state-of-the-art performance. In this journal extension of our previous approach presented at CVPR 2020, we gain 41.09% and 13.7% higher shape alignment accuracy on SynHand5M and HANDS19 datasets, respectively. Our method is ranked first on the HANDS19 challenge dataset (Task 1: Depth-Based 3D Hand Pose Estimation) at the moment of the submission of our results to the portal in August 2020.
引用
收藏
页码:8962 / 8974
页数:13
相关论文
共 50 条
  • [21] Recurrent 3D Hand Pose Estimation Using Cascaded Pose-Guided 3D Alignments
    Deng, Xiaoming
    Zuo, Dexin
    Zhang, Yinda
    Cui, Zhaopeng
    Cheng, Jian
    Tan, Ping
    Chang, Liang
    Pollefeys, Marc
    Fanello, Sean
    Wang, Hongan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 932 - 945
  • [22] Head pose estimation using deep neural networks and 3D point clouds
    Xu, Yuanquan
    Jung, Cheolkon
    Chang, Yakun
    PATTERN RECOGNITION, 2022, 121
  • [23] 3D hand pose estimation using RGBD images and hybrid deep learning networks
    Mofarreh-Bonab, Mohammad
    Seyedarabi, Hadi
    Mozaffari Tazehkand, Behzad
    Kasaei, Shohreh
    VISUAL COMPUTER, 2022, 38 (06): : 2023 - 2032
  • [24] 3D hand pose estimation using RGBD images and hybrid deep learning networks
    Mohammad Mofarreh-Bonab
    Hadi Seyedarabi
    Behzad Mozaffari Tazehkand
    Shohreh Kasaei
    The Visual Computer, 2022, 38 : 2023 - 2032
  • [25] Hand Shape and 3D Pose Estimation Using Depth Data from a Single Cluttered Frame
    Doliotis, Paul
    Athitsos, Vassilis
    Kosmopoulos, Dimitrios
    Perantonis, Stavros
    ADVANCES IN VISUAL COMPUTING, ISVC 2012, PT I, 2012, 7431 : 148 - 158
  • [26] 3D hand pose and shape estimation from RGB images for keypoint-based hand gesture recognition
    Avola, Danilo
    Cinque, Luigi
    Fagioli, Alessio
    Foresti, Gian Luca
    Fragomeni, Adriano
    Pannone, Daniele
    PATTERN RECOGNITION, 2022, 129
  • [27] Attention-Based Pose Sequence Machine for 3D Hand Pose Estimation
    Guo, Fangtai
    He, Zaixing
    Zhang, Shuyou
    Zhao, Xinyue
    Tan, Jianrong
    IEEE ACCESS, 2020, 8 : 18258 - 18269
  • [28] An Improved Method for 3D Shape Estimation Using Cascade of Neural Networks
    Van-Thanh Hoang
    Van-Dung Hoang
    Jo, Kang-Hyun
    2017 IEEE 15TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2017, : 285 - 289
  • [29] FasterVoxelPose plus : Fast and Accurate Voxel-based 3D Human Pose Estimation by Depth-wise Projection Decay
    Zhuang, Zonghuang
    Zhou, Yue
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [30] Voxel-based shape decomposition for feature-preserving 3D thumbnail creation
    Chiang, Pei-Ying
    Kuo, C. -C. Jay
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2012, 23 (01) : 1 - 11