EHFusion: an efficient heterogeneous fusion model for group-based 3D human pose estimation

被引:0
|
作者
Peng, Jihua [1 ]
Zhou, Yanghong [1 ,3 ]
Mok, P. Y. [1 ,2 ]
机构
[1] Hong Kong Polytech Univ, Sch Fash & Text, Hung Hom, Hong Kong, Peoples R China
[2] Lab Artificial Intelligence Design Sci Pk, Hong Kong, Peoples R China
[3] Hong Kong Polytech Univ, Res Ctr Text Future Fash, Hung Hom, Hong Kong, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Efficient network; 3D human pose estimation; Feature fusion; Topology-based grouping strategy; NETWORK;
D O I
10.1007/s00371-024-03724-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Stimulated by its important applications in animation, gaming, virtual reality, augmented reality, and healthcare, 3D human pose estimation has received considerable attention in recent years. To improve the accuracy of 3D human pose estimation, most approaches have converted this challenging task into a local pose estimation problem by dividing the body joints of the human body into different groups based on the human body topology. The body joint features of different groups are then fused to predict the overall pose of the whole body, which requires a joint feature fusion scheme. Nevertheless, the joint feature fusion schemes adopted in existing methods involve the learning of extensive parameters and hence are computationally very expensive. This paper reports a new topology-based grouped method 'EHFusion' for 3D human pose estimation, which involves a heterogeneous feature fusion (HFF) module that integrates grouped pose features. The HFF module reduces the computational complexity of the model while achieving promising accuracy. Moreover, we introduce motion amplitude information and a camera intrinsic embedding module to provide better global information and 2D-to-3D conversion knowledge, thereby improving the overall robustness and accuracy of the method. In contrast to previous methods, the proposed new network can be trained end-to-end in one single stage. Experimental results not only demonstrate the advantageous trade-offs between estimation accuracy and computational complexity achieved by our method but also showcase the competitive performance in comparison with various existing state-of-the-art methods (e.g., transformer-based) when evaluated on two public datasets, Human3.6M and HumanEva. The data and code are available at doi:10.5281/zenodo.11113132
引用
收藏
页数:23
相关论文
共 50 条
  • [1] GTPT: Group-Based Token Pruning Transformer for Efficient Human Pose Estimation
    Wang, Haonan
    Liu, Jie
    Tang, Jie
    Wu, Gangshan
    Xu, Bo
    Chou, Yanbing
    Wang, Yong
    COMPUTER VISION - ECCV 2024, PT LXIX, 2025, 15127 : 213 - 230
  • [2] 3d human pose estimation based on multi view information fusion
    Zhang, Shuo
    Liu, Ming
    Zhao, Yuejin
    Dong, Liquan
    Kong, Lingqin
    OPTICAL METROLOGY AND INSPECTION FOR INDUSTRIAL APPLICATIONS IX, 2022, 12319
  • [3] Cross View Fusion for 3D Human Pose Estimation
    Qiu, Haibo
    Wang, Chunyu
    Wang, Jingdong
    Wang, Naiyan
    Zeng, Wenjun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4341 - 4350
  • [4] Group Spatial Attention for 3D Human Pose Estimation
    Tran, Tien-Dat
    Cao, Ge
    Ashraf, Russo
    Jo, Kang-Hyun
    2024 33RD INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, ISIE 2024, 2024,
  • [5] Efficient Hierarchical Multi-view Fusion Transformer for 3D Human Pose Estimation
    Zhou, Kangkang
    Zhang, Lijun
    Lu, Feng
    Zhou, Xiang-Dong
    Shi, Yu
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7512 - 7520
  • [6] Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
    Li, Wenhao
    Liu, Mengyuan
    Liu, Hong
    Wang, Pichao
    Cai, Jialun
    Sebe, Nicu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 604 - 613
  • [7] STAFFormer: Spatio-temporal adaptive fusion transformer for efficient 3D human pose estimation
    Hao, Feng
    Zhong, Fujin
    Yu, Hong
    Hu, Jun
    Yang, Yan
    IMAGE AND VISION COMPUTING, 2024, 149
  • [8] RGB-D FUSION FOR POINT-CLOUD-BASED 3D HUMAN POSE ESTIMATION
    Ying, Jiaming
    Zhao, Xu
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3108 - 3112
  • [9] Efficient 3D human pose estimation from RGBD sensors
    Pascual-Hernandez, David
    de Frutos, Nuria Oyaga
    Mora-Jimenez, Inmaculada
    Canas-Plaza, Jose Maria
    DISPLAYS, 2022, 74
  • [10] 3D Human Pose Estimation based on Center of Gravity
    Xu, Liao
    Wu, Suping
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,