Image-Based Fitness Yoga Pose Recognition: Using Ensemble Learning and Multi-head Attention

被引:0
|
作者
Kou, Yue [1 ]
Li, Hai [2 ,3 ]
机构
[1] Civil Aviat Flight Univ China, Coll Aviat Secur, Deyang 618307, Peoples R China
[2] Civil Aviat Flight Univ China, Coll Civil Aviat Safety Engn, Deyang 618307, Peoples R China
[3] Civil Aviat Flight Univ China, Civil Aircraft Fire Sci & Safety Engn Key Lab Sich, Deyang 618307, Peoples R China
关键词
Fitness yoga movements; Residual network; VGG network; Multi-head attention; Ensemble learning;
D O I
10.1007/s44196-024-00662-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing awareness of fitness, more and more people are choosing to participate in fitness activities. Yoga, as a form of exercise that improves both physical and mental health, is becoming increasingly popular worldwide. In order to assist yoga practitioners in more effective training through automated or semi automated systems, improve training effectiveness, assist professional athletes in training through intelligent recognition systems, correct movements, and improve athletic performance. This paper proposes a method that addresses the low accuracy issue of current yoga pose recognition algorithms by integrating multi-head attention mechanism and ensemble learning. Firstly, the Mixup algorithm is used to enhance yoga movement images. Subsequently, convolutional features are extracted from the images using the ResNet101 and VGGNet19 transfer learning models. Finally, the extracted convolutional features are combined and stacked using a multi-head attention mechanism. Model training, validation, and testing are performed using the Soft target cross-entropy loss function. Experimental results demonstrate that the proposed method achieves a training accuracy of 100%, a validation accuracy of 89.94%, a testing accuracy of 93.79%, and a detection speed of 297 frames per second. Overall, this method demonstrates high stability and robustness, providing a technological foundation for intelligent recognition of yoga poses.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Nested Deformable Multi-head Attention for Facial Image Inpainting
    Phutke, Shruti S.
    Murala, Subrahmanyam
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6067 - 6076
  • [22] Multi-head attention with CNN and wavelet for classification of hyperspectral image
    Harshula Tulapurkar
    Biplab Banerjee
    Krishna Mohan Buddhiraju
    Neural Computing and Applications, 2023, 35 : 7595 - 7609
  • [23] A personalized federated learning method based on the residual multi-head attention mechanism
    Li, Zhaobin
    Zhong, Zixuan
    Zuo, Peiliang
    Zhao, Hong
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (04)
  • [24] Classification of Facial Expression In-the-Wild based on Ensemble of Multi-head Cross Attention Networks
    Jeong, Jae Yeop
    Hong, Yeong-Gi
    Kim, Daun
    Jeong, Jin-Woo
    Jung, Yuchul
    Kim, Sang-Ho
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2352 - 2357
  • [25] Omni-Frequency Image Denoising with Multi-Head Attention
    Jiang, Jielin
    Shi, Mingyue
    Yang, Haidong
    Cui, Yan
    Computer Engineering and Applications, 60 (16): : 236 - 247
  • [26] MoMA: Momentum contrastive learning with multi-head attention-based knowledge distillation for histopathology image analysis
    Vuong, Trinh Thi Le
    Kwak, Jin Tae
    MEDICAL IMAGE ANALYSIS, 2025, 101
  • [27] Multi-head attention-based two-stream EfficientNet for action recognition
    Zhou, Aihua
    Ma, Yujun
    Ji, Wanting
    Zong, Ming
    Yang, Pei
    Wu, Min
    Liu, Mingzhe
    MULTIMEDIA SYSTEMS, 2023, 29 (02) : 487 - 498
  • [28] Temporal Residual Network Based Multi-Head Attention Model for Arabic Handwriting Recognition
    Zouari, Ramzi
    Othmen, Dalila
    Boubaker, Houcine
    Kherallah, Monji
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 469 - 476
  • [29] Lip Recognition Based on Bi-GRU with Multi-Head Self-Attention
    Ni, Ran
    Jiang, Haiyang
    Zhou, Lu
    Lu, Yuanyao
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT III, AIAI 2024, 2024, 713 : 99 - 110
  • [30] A facial depression recognition method based on hybrid multi-head cross attention network
    Li, Yutong
    Liu, Zhenyu
    Zhou, Li
    Yuan, Xiaoyan
    Shangguan, Zixuan
    Hu, Xiping
    Hu, Bin
    FRONTIERS IN NEUROSCIENCE, 2023, 17