Real-Time Dynamic Gesture Recognition Algorithm Based on Adaptive Information Fusion and Multi-Scale Optimization Transformer

Cited by: 1
Authors
Lu, Guangda [1 ,2 ]
Sun, Wenhao [1 ,2 ]
Qin, Zhuanping [1 ,2 ]
Guo, Tinghang [1 ,2 ]
Affiliations
[1] Tianjin Univ Technol & Educ, Sch Automat & Elect Engn, 1310 Dagu South Rd, Tianjin 300222, Peoples R China
[2] Tianjin Key Lab Informat Sensing & Intelligent Co, 1310 DaGu South Rd, Tianjin 300222, Peoples R China
Keywords
dynamic gesture recognition; Transformer; optical flow; information fusion
DOI
10.20965/jaciii.2023.p1096
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Gesture recognition is a popular technology in the field of computer vision and an important technical means of achieving human-computer interaction. To address problems such as the limited long-range feature extraction capability of existing dynamic gesture recognition networks based on convolutional operators, we propose a dynamic gesture recognition algorithm based on a spatial pyramid pooling Transformer and optical flow information fusion. We take advantage of the Transformer's large receptive field to reduce model computation, and we embed spatial pyramid pooling to improve the model's ability to extract features at different scales. We use an optical flow algorithm with a global motion aggregation module to obtain an optical flow map of hand motion and to extract key frames based on the similarity minimization principle. We also design an adaptive feature fusion method to fuse the spatial and temporal features of the dual channels. Finally, we demonstrate through ablation experiments how each model component contributes to recognition performance. We conduct training and validation on the SCUT-DHGA dynamic gesture dataset and on a dataset we collected, and we perform real-time dynamic gesture recognition tests using the trained model. The results show that our algorithm achieves high accuracy even while keeping the parameters balanced. It also achieves fast and accurate recognition of dynamic gestures in real-time tests.
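The abstract mentions embedding spatial pyramid pooling so that features are extracted at several scales. As a rough illustration of that general technique only, the sketch below max-pools a single 2-D feature map over grids of increasing resolution and concatenates the results into a fixed-length vector. It is not the authors' Transformer module: the function name `spatial_pyramid_pool`, the default `levels`, and the use of plain nested lists are all assumptions made for this example.

```python
import math


def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool one 2-D feature map over an n x n grid for each n in levels.

    Hypothetical helper for illustration; the output length is
    sum(n * n for n in levels) regardless of the input height and width.
    """
    height = len(feature_map)
    width = len(feature_map[0])
    pooled = []
    for n in levels:
        for i in range(n):
            # Row range of bin i: floor for the start, ceil for the end,
            # with at least one row so small inputs still fill every bin.
            r0 = (i * height) // n
            r1 = max(r0 + 1, math.ceil((i + 1) * height / n))
            for j in range(n):
                c0 = (j * width) // n
                c1 = max(c0 + 1, math.ceil((j + 1) * width / n))
                pooled.append(
                    max(
                        feature_map[r][c]
                        for r in range(r0, r1)
                        for c in range(c0, c1)
                    )
                )
    return pooled


# A 4 x 4 map whose entries are 0..15 in row-major order.
fm = [[r * 4 + c for c in range(4)] for r in range(4)]
print(spatial_pyramid_pool(fm, levels=(1, 2)))  # [15, 5, 7, 13, 15]
```

Because the number of bins depends only on `levels`, inputs of different sizes map to vectors of the same length, which is the property that makes this kind of pooling convenient for multi-scale feature extraction.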
Pages: 1096-1107
Number of pages: 12
Related papers
50 in total
  • [1] Real-time gesture recognition based on feature recalibration network with multi-scale information
    Cao, Zhengcai
    Xu, Xiaowen
    Hu, Biao
    Zhou, Meng
    Li, Qinglin
    NEUROCOMPUTING, 2019, 347 : 119 - 130
  • [2] Real-time detection algorithm for digital meters based on multi-scale feature fusion and GCS
    Hao, Zhaoming
    Zhang, Xiaoqiong
    Li, Hongyan
    Xu, Meng
    Zhang, Ziyang
    Wang, Zhan
    Wang, Weifeng
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (02)
  • [3] Real-time detection algorithm for digital meters based on multi-scale feature fusion and GCS
    Zhaoming Hao
    Xiaoqiong Zhang
    Hongyan Li
    Meng Xu
    Ziyang Zhang
    Zhan Wang
    Weifeng Wang
    Journal of Real-Time Image Processing, 2024, 21
  • [4] Gesture recognition algorithm based on multi-scale feature fusion in RGB-D images
    Sun, Ying
    Weng, Yaoqing
    Luo, Bowen
    Li, Gongfa
    Tao, Bo
    Jiang, Du
    Chen, Disi
    IET IMAGE PROCESSING, 2023, 17 (04) : 1280 - 1290
  • [5] Real-Time Object Recognition Based on Cortical Multi-scale Keypoints
    Terzic, Kasim
    Rodrigues, Joao M. F.
    Hans du Buf, J. M.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, IBPRIA 2013, 2013, 7887 : 314 - 321
  • [6] Real-Time Conveyor Belt Deviation Detection Algorithm Based on Multi-Scale Feature Fusion Network
    Zeng, Chan
    Zheng, Junfeng
    Li, Jiangyun
    ALGORITHMS, 2019, 12 (10)
  • [7] Camouflaged target detection using real-time video fusion algorithm based on multi-scale transforms
    Pillai, Sarath Somasekharan
    Swamy, M. N. S.
    2014 IEEE 27TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2014,
  • [8] A Real-Time Dynamic Gesture Recognition System
    Guo, Jiang
    Cheng, Jun
    Guo, Yu
    Pang, Jianxin
    MEASUREMENT TECHNOLOGY AND ENGINEERING RESEARCHES IN INDUSTRY, PTS 1-3, 2013, 333-335 : 849 - 855
  • [9] Real-Time Dynamic Hand Gesture Recognition
    Lai, Hsiang-Yueh
    Lai, Han-Jheng
    2014 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2014), 2014, : 658 - 661
  • [10] Adaptive Propagation Network Based on Multi-scale Information Fusion
    Ma, Qianli
    Wang, Chenzhi
    Fan, Zheng
    Qian, Yuhua
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 51 - 62