MULTI-SCALE SUPERVISED NETWORK FOR HUMAN POSE ESTIMATION

Citations: 0
Authors
Ke, Lipeng [1 ]
Chang, Ming-Ching [2 ]
Qi, Honggang [1 ]
Lyu, Siwei [2 ]
Affiliations
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
[2] SUNY Albany, Albany, NY 12222 USA
Keywords
human pose estimation; conv-deconv module; multi-scale supervision; regression network
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Human pose estimation is an important topic in computer vision with many applications, including gesture and activity recognition. However, pose estimation from images is challenging due to appearance variations, occlusions, cluttered backgrounds, and complex activities. To alleviate these problems, we develop a robust pose estimation method based on the recent deep conv-deconv modules with two improvements: (1) multi-scale supervision of body keypoints, and (2) a global regression to improve the structural consistency of keypoints. We refine keypoint detection heatmaps using layer-wise multi-scale supervision to better capture local contexts. Pose inference via keypoint association is optimized globally using a regression network at the end. Our method can effectively disambiguate keypoint matches in close proximity, including the mismatch of left-right body parts, and better infer occluded parts. Experimental results show that our method achieves competitive performance among state-of-the-art methods on the MPII and FLIC datasets.
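The idea of layer-wise multi-scale supervision of keypoint heatmaps can be illustrated with a minimal sketch. This is not the authors' implementation: the Gaussian ground-truth heatmap, the average-pooling downsampler, the scale factors (1, 2, 4), the mean-squared-error loss, and all function names are assumptions chosen for illustration. The sketch only shows how one ground-truth heatmap might be compared against predictions at several resolutions and the per-scale losses summed.

```python
import numpy as np


def gaussian_heatmap(height, width, cx, cy, sigma=2.0):
    """Hypothetical ground truth: a 2D Gaussian centered on keypoint (cx, cy)."""
    ys, xs = np.mgrid[0:height, 0:width]
    return np.exp(-((xs - cx) ** 2 + (ys - cy) ** 2) / (2.0 * sigma ** 2))


def downsample(heatmap, factor):
    """Average-pool a heatmap by an integer factor (sides must be divisible by it)."""
    h, w = heatmap.shape
    return heatmap.reshape(h // factor, factor, w // factor, factor).mean(axis=(1, 3))


def multi_scale_loss(predictions, target, factors=(1, 2, 4)):
    """Sum of mean squared errors between each predicted heatmap and the
    ground truth resized to that prediction's scale (assumed loss form)."""
    total = 0.0
    for pred, factor in zip(predictions, factors):
        scaled_target = downsample(target, factor)
        total += float(np.mean((pred - scaled_target) ** 2))
    return total


# Toy usage: one keypoint on a 64x64 heatmap, supervised at three scales.
target = gaussian_heatmap(64, 64, cx=20, cy=30)
perfect = [downsample(target, f) for f in (1, 2, 4)]
noisy = [p + 0.1 for p in perfect]  # constant offset at every scale

print(multi_scale_loss(perfect, target))
print(multi_scale_loss(noisy, target))
```

In this toy setup, predictions that match the resized ground truth at every scale incur no loss, while an error at any single scale contributes to the total, which is the sense in which each scale is supervised.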
Pages: 564-568
Page count: 5