Multi-Scale Contrastive Learning for Human Pose Estimation

被引:0
|
作者
Bao, Wenxia [1 ]
Lin, An [1 ]
Huang, Hua [1 ]
Yang, Xianjun [1 ]
Chen, Hemu [1 ]
机构
[1] Anhui Univ, Sch Elect & Informat Engn, Hefei 230601, Anhui, Peoples R China
关键词
human pose estimation; contrastive learning; multi-scale fea-; ture; feature pyramid network;
D O I
10.1587/transinf.2024EDP7048
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent years have seen remarkable progress in human pose estimation. However, manual annotation of keypoints remains tedious and imprecise. To alleviate this problem, this paper proposes a novel method called Multi-Scale Contrastive Learning (MSCL). This method uses a siamese network structure with upper and lower branches that capture diffirent views of the same image. Each branch uses a backbone network to extract image representations, employing multi-scale feature vectors to capture information. These feature vectors are then passed through an enhanced feature pyramid for fusion, producing more robust feature representations. The feature vectors are then further encoded by mapping and prediction heads to predict the feature vector of another view. Using negative cosine similarity between vectors as a loss function, the backbone network is pre-trained on a large-scale unlabeled dataset, enhancing its capacity to extract visual representations. Finally, transfer learning is performed on a small amount of labelled data for the pose estimation task. Experiments on COCO datasets show significant improvements in Average Precision (AP) of 1.8%, 0.9%, and 1.2% with 1%, 5%, and 10% labelled data on COCO. In addition, the Percentage of Correct Keypoints (PCK) improves by 0.5% on MPII&AIC, outperforming mainstream contrastive learning methods.
引用
收藏
页码:1332 / 1341
页数:10
相关论文
共 50 条
  • [11] Human Pose Estimation Based on Lightweight Multi-Scale Coordinate Attention
    Li, Xin
    Guo, Yuxin
    Pan, Weiguo
    Liu, Hongzhe
    Xu, Bingxin
    APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [12] Hand pose estimation with multi-scale network
    Hu, Zhongxu
    Hu, Youmin
    Wu, Bo
    Liu, Jie
    Han, Dongmin
    Kurfess, Thomas
    APPLIED INTELLIGENCE, 2018, 48 (08) : 2501 - 2515
  • [13] Multi-Scale Structure-Aware Network for Human Pose Estimation
    Ke, Lipeng
    Chang, Ming-Ching
    Qi, Honggang
    Lyu, Siwei
    COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 : 731 - 746
  • [14] Human Pose Estimation With Deeply Learned Multi-Scale Compositional Models
    Wang, Rui
    Cao, Zhongzheng
    Wang, Xiangyang
    Liu, Zhi
    Zhu, Xiaoqiang
    IEEE ACCESS, 2019, 7 : 71158 - 71166
  • [15] Multi-scale Contrastive Learning for Gastroenteroscopy Classification
    Li, Dan
    Li, Xuechen
    Peng, Zhibin
    Chen, Wenting
    Shen, Linlin
    Wu, Guangyao
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 852 - +
  • [16] Multi-scale Attention Aided Multi-Resolution Network for Human Pose Estimation
    Selvam, Srinika
    Mishra, Deepak
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 461 - 472
  • [17] Human pose estimation based on feature enhancement and multi-scale feature fusion
    Cao, Dandan
    Liu, Weibin
    Xing, Weiwei
    Wei, Xiang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (03) : 643 - 650
  • [18] Enhancing multi-scale information exchange and feature fusion for human pose estimation
    Rui Wang
    Wanyu Wu
    Xiangyang Wang
    The Visual Computer, 2023, 39 : 4751 - 4765
  • [19] MTPose: Human Pose Estimation with High-Resolution Multi-scale Transformers
    Wang, Rui
    Geng, Fudi
    Wang, Xiangyang
    NEURAL PROCESSING LETTERS, 2022, 54 (05) : 3941 - 3964
  • [20] A Multi-scale Recalibrated Approach for 3D Human Pose Estimation
    Xie, Ziwei
    Xia, Hailun
    Feng, Chunyan
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT III, 2019, 11441 : 400 - 411