MULTI-SCALE SUPERVISED NETWORK FOR HUMAN POSE ESTIMATION

Times cited: 0
Authors
Ke, Lipeng [1]
Chang, Ming-Ching [2]
Qi, Honggang [1]
Lyu, Siwei [2]
Affiliations
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
[2] SUNY Albany, Albany, NY 12222 USA
Keywords
human pose estimation; conv-deconv module; multi-scale supervision; regression network
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Subject classification code
081202; 0835
Abstract
Human pose estimation is an important topic in computer vision with many applications, including gesture and activity recognition. However, pose estimation from images is challenging due to appearance variations, occlusions, cluttered backgrounds, and complex activities. To alleviate these problems, we develop a robust pose estimation method based on recent deep conv-deconv modules with two improvements: (1) multi-scale supervision of body keypoints, and (2) a global regression to improve the structural consistency of keypoints. We refine keypoint detection heatmaps using layer-wise multi-scale supervision to better capture local contexts. Pose inference via keypoint association is optimized globally using a regression network at the end. Our method can effectively disambiguate keypoint matches in close proximity, including the mismatch of left-right body parts, and better infer occluded parts. Experimental results show that our method achieves competitive performance among state-of-the-art methods on the MPII and FLIC datasets.
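The idea of multi-scale supervision of keypoint heatmaps mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the loss, the scales, or the downsampling scheme, so the Gaussian target, the average-pooling downsampler, the mean-squared-error term, and all function names below are assumptions chosen only for illustration. The sketch compares a prediction at each scale against the ground-truth heatmap reduced to the same resolution and sums the per-scale errors.

```python
import math


def gaussian_heatmap(size, cx, cy, sigma):
    """Return a size x size heatmap with a Gaussian peak at (cx, cy).

    Hypothetical helper: the target shape is an assumption, not from the paper.
    """
    return [
        [math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2 * sigma ** 2))
         for x in range(size)]
        for y in range(size)
    ]


def downsample(heatmap, factor):
    """Average-pool a square heatmap by an integer factor (assumed scheme)."""
    n = len(heatmap) // factor
    return [
        [sum(heatmap[y * factor + dy][x * factor + dx]
             for dy in range(factor) for dx in range(factor)) / factor ** 2
         for x in range(n)]
        for y in range(n)
    ]


def mse(a, b):
    """Mean squared error between two equally sized 2D lists."""
    count = len(a) * len(a[0])
    return sum((pa - pb) ** 2
               for row_a, row_b in zip(a, b)
               for pa, pb in zip(row_a, row_b)) / count


def multi_scale_loss(predictions, target, factors):
    """Sum the MSE at every scale.

    predictions[i] is the heatmap predicted at the resolution obtained by
    downsampling the full-size target by factors[i].
    """
    return sum(mse(pred, downsample(target, f))
               for pred, f in zip(predictions, factors))


# Usage: a 64 x 64 target supervised at four assumed scales.
target = gaussian_heatmap(64, 20, 30, 2.0)
factors = [1, 2, 4, 8]
perfect = [downsample(target, f) for f in factors]
shifted = [downsample(gaussian_heatmap(64, 40, 30, 2.0), f) for f in factors]
print(multi_scale_loss(perfect, target, factors))
print(multi_scale_loss(shifted, target, factors) > 0)
```

In this sketch a prediction that matches the target at every scale gives zero loss, while a keypoint placed at the wrong location is penalized at each resolution. In a real network the per-scale predictions would come from intermediate deconvolution layers rather than from downsampling a full-size heatmap.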
Pages: 564 - 568
Number of pages: 5
Related papers
50 in total
  • [31] Multi-scale supervised network for crowd counting
    Wang, Yongjie
    Zhang, Wei
    Huang, Dongxiao
    Liu, Yanyan
    Zhu, Jianghua
    IET IMAGE PROCESSING, 2020, 14 (17) : 4701 - 4707
  • [32] VehiPose: Multi-Scale Framework for Vehicle Pose Estimation
    Gupta, Divyansh
    Artacho, Bruno
    Savakis, Andreas
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [33] Face Pose Estimation with Ensemble Multi-scale Representations
    Han, Zhaocui
    Song, Weiwei
    Yang, Xue
    Ou, Zongying
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION (AIPR 2019), 2019 : 97 - 101
  • [34] Combining detailed appearance and multi-scale representation: a structure-context complementary network for human pose estimation
    Dong, Kaiwen
    Sun, Yanjing
    Cheng, Xiaozhou
    Wang, Xiaolin
    Wang, Bin
    APPLIED INTELLIGENCE, 2023, 53 (07) : 8097 - 8113
  • [35] Combining detailed appearance and multi-scale representation: a structure-context complementary network for human pose estimation
    Kaiwen Dong
    Yanjing Sun
    Xiaozhou Cheng
    Xiaolin Wang
    Bin Wang
    Applied Intelligence, 2023, 53 : 8097 - 8113
  • [36] Multi-scale information transport generative adversarial network for human pose transfer
    Zhang, Jinsong
    Lai, Yu-Kun
    Ma, Jian
    Li, Kun
    DISPLAYS, 2024, 84
  • [37] Joint multi-scale transformers and pose equivalence constraints for 3D human pose estimation
    Wu, Yongpeng
    Kong, Dehui
    Gao, Junna
    Li, Jinghua
    Yin, Baocai
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 103
  • [38] MSRT: multi-scale representation transformer for regression-based human pose estimation
    Shan, Beiguang
    Shi, Qingxuan
    Yang, Fang
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (02) : 591 - 603
  • [39] Human pose estimation with gated multi-scale feature fusion and spatial mutual information
    Zhao, Xiaoming
    Guo, Chenchen
    Zou, Qiang
    VISUAL COMPUTER, 2023, 39 (01) : 119 - 137
  • [40] Human pose estimation with gated multi-scale feature fusion and spatial mutual information
    Xiaoming Zhao
    Chenchen Guo
    Qiang Zou
    The Visual Computer, 2023, 39 : 119 - 137