Image-Based Fitness Yoga Pose Recognition: Using Ensemble Learning and Multi-head Attention

被引:0
|
作者
Kou, Yue [1 ]
Li, Hai [2 ,3 ]
机构
[1] Civil Aviat Flight Univ China, Coll Aviat Secur, Deyang 618307, Peoples R China
[2] Civil Aviat Flight Univ China, Coll Civil Aviat Safety Engn, Deyang 618307, Peoples R China
[3] Civil Aviat Flight Univ China, Civil Aircraft Fire Sci & Safety Engn Key Lab Sich, Deyang 618307, Peoples R China
关键词
Fitness yoga movements; Residual network; VGG network; Multi-head attention; Ensemble learning;
D O I
10.1007/s44196-024-00662-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing awareness of fitness, more and more people are choosing to participate in fitness activities. Yoga, as a form of exercise that improves both physical and mental health, is becoming increasingly popular worldwide. In order to assist yoga practitioners in more effective training through automated or semi automated systems, improve training effectiveness, assist professional athletes in training through intelligent recognition systems, correct movements, and improve athletic performance. This paper proposes a method that addresses the low accuracy issue of current yoga pose recognition algorithms by integrating multi-head attention mechanism and ensemble learning. Firstly, the Mixup algorithm is used to enhance yoga movement images. Subsequently, convolutional features are extracted from the images using the ResNet101 and VGGNet19 transfer learning models. Finally, the extracted convolutional features are combined and stacked using a multi-head attention mechanism. Model training, validation, and testing are performed using the Soft target cross-entropy loss function. Experimental results demonstrate that the proposed method achieves a training accuracy of 100%, a validation accuracy of 89.94%, a testing accuracy of 93.79%, and a detection speed of 297 frames per second. Overall, this method demonstrates high stability and robustness, providing a technological foundation for intelligent recognition of yoga poses.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Research on Transportation Mode Recognition Based on Multi-Head Attention Temporal Convolutional Network
    Cheng, Shuyu
    Liu, Yingan
    SENSORS, 2023, 23 (07)
  • [32] Multi-head attention-based two-stream EfficientNet for action recognition
    Aihua Zhou
    Yujun Ma
    Wanting Ji
    Ming Zong
    Pei Yang
    Min Wu
    Mingzhe Liu
    Multimedia Systems, 2023, 29 : 487 - 498
  • [33] Lightweight Facial Expression Recognition Based on Hybrid Multiscale and Multi-Head Collaborative Attention
    Zhang, Haitao
    Zhuang, Xufei
    Gao, Xudong
    Mao, Rui
    Ren, Qing-Dao-Er-Ji
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT II, 2025, 15032 : 304 - 316
  • [34] A new joint CTC-attention-based speech recognition model with multi-level multi-head attention
    Chu-Xiong Qin
    Wen-Lin Zhang
    Dan Qu
    EURASIP Journal on Audio, Speech, and Music Processing, 2019
  • [35] A new joint CTC-attention-based speech recognition model with multi-level multi-head attention
    Qin, Chu-Xiong
    Zhang, Wen-Lin
    Qu, Dan
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (01)
  • [36] Image-Based Humanoid Robot Pose Recognition System
    Guo, Sin-Hong
    Liu, Chih-Cheng
    Wong, Ching-Chang
    Lee, Tsu-Tian
    2019 INTERNATIONAL CONFERENCE ON ADVANCED MECHATRONIC SYSTEMS (ICAMECHS), 2019, : 126 - 129
  • [37] Word embedding factor based multi-head attention
    Li, Zhengren
    Zhao, Yumeng
    Zhang, Xiaohang
    Han, Huawei
    Huang, Cui
    ARTIFICIAL INTELLIGENCE REVIEW, 2025, 58 (04)
  • [38] Multi-head attention with reinforcement learning for supervised video summarization
    Kadam, Bhakti Deepak
    Deshpande, Ashwini Mangesh
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (05)
  • [39] Multi-Head Attention Graph Network for Few Shot Learning
    Zhang, Baiyan
    Ling, Hefei
    Li, Ping
    Wang, Qian
    Shi, Yuxuan
    Wu, Lei
    Wang, Runsheng
    Shen, Jialie
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (02): : 1505 - 1517
  • [40] Distract Your Attention: Multi-Head Cross Attention Network for Facial Expression Recognition
    Wen, Zhengyao
    Lin, Wenzhong
    Wang, Tao
    Xu, Ge
    BIOMIMETICS, 2023, 8 (02)