Multi-Scale Contrastive Learning for Human Pose Estimation

被引:0
|
作者
Bao, Wenxia [1 ]
Lin, An [1 ]
Huang, Hua [1 ]
Yang, Xianjun [1 ]
Chen, Hemu [1 ]
机构
[1] Anhui Univ, Sch Elect & Informat Engn, Hefei 230601, Anhui, Peoples R China
关键词
human pose estimation; contrastive learning; multi-scale fea-; ture; feature pyramid network;
D O I
10.1587/transinf.2024EDP7048
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent years have seen remarkable progress in human pose estimation. However, manual annotation of keypoints remains tedious and imprecise. To alleviate this problem, this paper proposes a novel method called Multi-Scale Contrastive Learning (MSCL). This method uses a siamese network structure with upper and lower branches that capture diffirent views of the same image. Each branch uses a backbone network to extract image representations, employing multi-scale feature vectors to capture information. These feature vectors are then passed through an enhanced feature pyramid for fusion, producing more robust feature representations. The feature vectors are then further encoded by mapping and prediction heads to predict the feature vector of another view. Using negative cosine similarity between vectors as a loss function, the backbone network is pre-trained on a large-scale unlabeled dataset, enhancing its capacity to extract visual representations. Finally, transfer learning is performed on a small amount of labelled data for the pose estimation task. Experiments on COCO datasets show significant improvements in Average Precision (AP) of 1.8%, 0.9%, and 1.2% with 1%, 5%, and 10% labelled data on COCO. In addition, the Percentage of Correct Keypoints (PCK) improves by 0.5% on MPII&AIC, outperforming mainstream contrastive learning methods.
引用
收藏
页码:1332 / 1341
页数:10
相关论文
共 50 条
  • [41] Multi-scale Contrastive Learning with Attention for Histopathology Image Classification
    Tan, Jing Wei
    Khoa Tuan Nguyen
    Lee, Kyoungbun
    Jeong, Won-Ki
    MEDICAL IMAGING 2023, 2023, 12471
  • [42] Multi-scale contrastive learning method for PolSAR image classification
    Hua, Wenqiang
    Wang, Chen
    Sun, Nan
    Liu, Lin
    JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (01)
  • [43] ANEMONE: Graph Anomaly Detection with Multi-Scale Contrastive Learning
    Jin, Ming
    Liu, Yixin
    Zheng, Yu
    Chi, Lianhua
    Li, Yuan-Fang
    Pan, Shirui
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3122 - 3126
  • [44] Enhancement and optimisation of human pose estimation with multi-scale spatial attention and adversarial data augmentation
    Zhang, Tong
    Li, Qilin
    Wen, Jingtao
    Chen, C. L. Philip
    INFORMATION FUSION, 2024, 111
  • [45] Multi-scale spatial-temporal transformer for 3D human pose estimation
    Wu, Yongpeng
    Gao, Junna
    2021 5TH INTERNATIONAL CONFERENCE ON VISION, IMAGE AND SIGNAL PROCESSING (ICVISP 2021), 2021, : 242 - 247
  • [46] MCFNet: Multi-scale Cross Fusion Network for 3D Human Pose Estimation
    Wang, Dazhong
    Liu, Rui
    Yi, Pengfei
    Dong, Jing
    Zhou, Dongsheng
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 684 - 688
  • [47] Hierarchical parallel multi-scale graph network for 3d human pose estimation
    Yang, Honghong
    Liu, Hongxi
    Zhang, Yumei
    Wu, Xiaojun
    APPLIED SOFT COMPUTING, 2023, 140
  • [48] MS-HRNet: multi-scale high-resolution network for human pose estimation
    Wang, Yanxia
    Wang, Renjie
    Shi, Hu
    Liu, Dan
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (12): : 17269 - 17291
  • [49] Multi-scale Feature Injection for Occluded 3D Human Pose and Shape Estimation
    Shi, Yunhui
    Ge, Yangyang
    Wang, Jin
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4881 - 4886
  • [50] MsF-HigherHRNet: Multi-scale Feature Fusion for Human Pose Estimation in Crowded Scenes
    Yu, Cuihong
    Han, Cheng
    Zhang, Qi
    Zhang, Chao
    COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS, CAD/GRAPHICS 2023, 2024, 14250 : 16 - 29