Multi-Person Pose Estimation With Accurate Heatmap Regression and Greedy Association

Cited by: 14
Authors
Li, Jia [1]
Wang, Meng [1]
Affiliations
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Bottom-up; human pose estimation; Gaussian heatmap; aggregation; offset
DOI
10.1109/TCSVT.2022.3153044
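Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract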
Multi-person pose estimation aims to localize the 2D keypoints (body joints) of all people in an image. There are two main paradigms for this task: top-down and bottom-up. In this paper, we present an advanced bottom-up approach based on accurate keypoint heatmap regression and greedy keypoint association. First, we develop an encoding-decoding method that uses Gaussian heatmaps and guiding offset fields to represent multi-person pose information, covering the keypoint positions and adjacent-keypoint associations of all individuals in the scene. In particular, we analyze the deficiency of the Gaussian heatmap representation with respect to keypoint localization precision when a conventional element-wise L2-type loss alone is used for heatmap supervision. We therefore introduce a peak regularization loss to jointly supervise the heatmap regression. In addition, we present an improved Hourglass Network with multi-scale heatmap aggregation to simultaneously infer this encoding. Finally, we propose a novel focal L2 loss to help the network cope with the imbalance problem of keypoint detection in heatmaps. Our results show that the proposed approach surpasses other bottom-up approaches on the COCO dataset and even outperforms top-down approaches on the CrowdPose dataset, which contains more crowded scenes.
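As a rough illustration of the Gaussian heatmap representation mentioned in the abstract, the sketch below renders keypoints into a single heatmap and reads them back as local maxima. It is a generic textbook-style example, not the paper's encoding: the function names `encode_gaussian_heatmap` and `decode_peaks`, the `sigma` and `threshold` values, and the max-merge of overlapping Gaussians are all assumptions made here, and the guiding offset fields, peak regularization loss, and focal L2 loss are not modeled.

```python
import math


def encode_gaussian_heatmap(keypoints, height, width, sigma=2.0):
    """Render (x, y) keypoints into one heatmap (hypothetical helper).

    Each keypoint contributes an unnormalized 2D Gaussian; overlapping
    Gaussians are merged with an element-wise maximum (an assumption).
    """
    heatmap = [[0.0] * width for _ in range(height)]
    for kx, ky in keypoints:
        for y in range(height):
            for x in range(width):
                d2 = (x - kx) ** 2 + (y - ky) ** 2
                value = math.exp(-d2 / (2.0 * sigma ** 2))
                if value > heatmap[y][x]:
                    heatmap[y][x] = value
    return heatmap


def decode_peaks(heatmap, threshold=0.5):
    """Return (x, y, score) for pixels that reach the threshold and are
    not smaller than any of their (up to 8) in-bounds neighbours.

    Exact ties between neighbouring pixels would yield duplicate peaks;
    this sketch does not deduplicate them.
    """
    height, width = len(heatmap), len(heatmap[0])
    peaks = []
    for y in range(height):
        for x in range(width):
            score = heatmap[y][x]
            if score < threshold:
                continue
            is_peak = True
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    ny, nx = y + dy, x + dx
                    if (dy or dx) and 0 <= ny < height and 0 <= nx < width:
                        if heatmap[ny][nx] > score:
                            is_peak = False
            if is_peak:
                peaks.append((x, y, score))
    return peaks


# Two keypoints; the first one sits between pixel centres on purpose.
heatmap = encode_gaussian_heatmap([(10.4, 7.0), (25.0, 20.0)], height=32, width=40)
for x, y, score in decode_peaks(heatmap):
    print(x, y, round(score, 3))
```

In this toy setup the keypoint placed at x = 10.4 is recovered at the integer column 10, so pure arg-max decoding carries a quantization error of up to half a pixel. That is one generic reason heatmap methods add sub-pixel refinement; it should not be read as the specific deficiency the authors analyze.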
Pages: 5521-5535
Page count: 15