GTPT: Group-Based Token Pruning Transformer for Efficient Human Pose Estimation

被引:0
|
作者
Wang, Haonan [1 ,2 ]
Liu, Jie [1 ]
Tang, Jie [1 ]
Wu, Gangshan [1 ]
Xu, Bo [2 ]
Chou, Yanbing [2 ]
Wang, Yong [2 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Cainiao Network, Hangzhou, Peoples R China
来源
关键词
Efficient human pose estimation; Whole-body pose estimation; Transformer; Token pruning; Group;
D O I
10.1007/978-3-031-72890-7_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, 2D human pose estimation has made significant progress on public benchmarks. However, many of these approaches face challenges of less applicability in the industrial community due to the large number of parametric quantities and computational overhead. Efficient human pose estimation remains a hurdle, especially for whole-body pose estimation with numerous keypoints. While most current methods for efficient human pose estimation primarily rely on CNNs, we propose the Group-based Token Pruning Transformer (GTPT) that fully harnesses the advantages of the Transformer. GTPT alleviates the computational burden by gradually introducing keypoints in a coarse-to-fine manner. It minimizes the computation overhead while ensuring high performance. Besides, GTPT groups keypoint tokens and prunes visual tokens to improve model performance while reducing redundancy. We propose the Multi-Head Group Attention (MHGA) between different groups to achieve global interaction with little computational overhead. We conducted experiments on COCO and COCO-WholeBody. Compared to other methods, the experimental results show that GTPT can achieve higher performance with less computation, especially in whole-body with numerous keypoints.
引用
收藏
页码:213 / 230
页数:18
相关论文
共 50 条
  • [1] An efficient sparse pruning method for human pose estimation
    Wang, Mingyang
    Sun, Tianyi
    Song, Kang
    Li, Shuang
    Jiang, Jing
    Sun, Linjun
    CONNECTION SCIENCE, 2022, 34 (01) : 960 - 974
  • [2] EHFusion: an efficient heterogeneous fusion model for group-based 3D human pose estimation
    Peng, Jihua
    Zhou, Yanghong
    Mok, P. Y.
    VISUAL COMPUTER, 2024,
  • [3] Pruning-guided feature distillation for an efficient transformer-based pose estimation model
    Kim, Dong-hwi
    Lee, Dong-hun
    Kim, Aro
    Jeong, Jinwoo
    Lee, Jong Taek
    Kim, Sungjei
    Park, Sang-hyo
    IET COMPUTER VISION, 2024, 18 (06) : 745 - 758
  • [4] PPT: Token-Pruned Pose Transformer for Monocular and Multi-view Human Pose Estimation
    Ma, Haoyu
    Wang, Zhe
    Chen, Yifei
    Kong, Deying
    Chen, Liangjian
    Liu, Xingwei
    Yan, Xiangyi
    Tang, Hao
    Xie, Xiaohui
    COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 424 - 442
  • [5] Lightweight and Efficient Human Pose Estimation Fusing Transformer and Attention
    Wu, Chengpeng
    Tan, Guangxing
    Chen, Haifeng
    Li, Chunyu
    Computer Engineering and Applications, 2024, 60 (22) : 197 - 208
  • [6] LIGHTPOSE: A LIGHTWEIGHT AND EFFICIENT MODEL WITH TRANSFORMER FOR HUMAN POSE ESTIMATION
    Liu, Xiyang
    Li, Peng
    Ni, Ding
    Wang, Yan
    Xue, Hui
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2674 - 2678
  • [7] EfficientPose: A Lightweight and Efficient Model with Transformer for Human Pose Estimation
    Liang, Wei
    Cheng, Zhang
    Han, Junjia
    Wang, Yanxia
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 : 120 - 131
  • [8] Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
    Li, Wenhao
    Liu, Mengyuan
    Liu, Hong
    Wang, Pichao
    Cai, Jialun
    Sebe, Nicu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 604 - 613
  • [9] Aggregation Transformer for Human Pose Estimation
    Dong, Hao
    Wang, Guodong
    Zhang, Xinyue
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3660 - 3667
  • [10] Transformer-based rapid human pose estimation network
    Wang, Dong
    Xie, Wenjun
    Cai, Youcheng
    Li, Xinjie
    Liu, Xiaoping
    COMPUTERS & GRAPHICS-UK, 2023, 116 : 317 - 326