Real-Time Dynamic Gesture Recognition Algorithm Based on Adaptive Information Fusion and Multi-Scale Optimization Transformer

被引:1
|
作者
Lu, Guangda [1 ,2 ]
Sun, Wenhao [1 ,2 ]
Qin, Zhuanping [1 ,2 ]
Guo, Tinghang [1 ,2 ]
机构
[1] Tianjin Univ Technol & Educ, Sch Automat & Elect Engn, 1310 Dagu South Rd, Tianjin 300222, Peoples R China
[2] Tianjin Key Lab Informat Sensing & Intelligent Co, 1310 DaGu South Rd, Tianjin 300222, Peoples R China
关键词
dynamic gesture recognition; Transformer; optical flow; information fusion;
D O I
10.20965/jaciii.2023.p1096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gesture recognition is a popular technology in the field of computer vision and an important technical mean of achieving human-computer interaction. To address problems such as the limited long-range feature extraction capability of existing dynamic gesture recognition networks based on convolutional operators, we propose a dynamic gesture recognition algorithm based on spatial pyramid pooling Transformer and optical flow information fusion. We take advantage of Transformer's large receptive field to reduce model computation while improving the model's ability to extract features at different scales by embedding spatial pyramid pooling. We use the optical flow algorithm with the global motion aggregation module to obtain an optical flow map of hand motion, and to extract the key frames based on the similarity minimization principle. We also design an adaptive feature fusion method to fuse the spatial and temporal features of the dual channels. Finally, we demonstrate the effectiveness of model components on model recognition enhancement through ablation experiments. We conduct training and validation on the SCUT-DHGA dynamic gesture dataset and on a dataset we collected, and we perform real-time dynamic gesture recognition tests using the trained model. The results show that our algorithm achieves high accuracy even while keeping the parameters balanced. It also achieves fast and accurate recognition of dynamic gestures in real-time tests.
引用
收藏
页码:1096 / 1107
页数:12
相关论文
共 50 条
  • [41] Dynamic gesture recognition using PICA with multi-scale theory and HMM
    Wu, H
    IMAGE EXTRACTION, SEGMENTATION, AND RECOGNITION, 2001, 4550 : 132 - 139
  • [42] Real-Time Visual Place Recognition Based on Analyzing Distribution of Multi-scale CNN Landmarks
    Xin, Zhe
    Cui, Xiaoguang
    Zhang, Jixiang
    Yang, Yiping
    Wang, Yanqing
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2019, 94 (3-4) : 777 - 792
  • [43] Real-Time Visual Place Recognition Based on Analyzing Distribution of Multi-scale CNN Landmarks
    Zhe Xin
    Xiaoguang Cui
    Jixiang Zhang
    Yiping Yang
    Yanqing Wang
    Journal of Intelligent & Robotic Systems, 2019, 94 : 777 - 792
  • [44] A hybrid attention multi-scale fusion network for real-time semantic segmentation
    Ye, Baofeng
    Xue, Renzheng
    Wu, Qianlong
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [45] BOUNDARY CORRECTED MULTI-SCALE FUSION NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION
    Jiang, Tianjiao
    Jin, Yi
    Liang, Tengfei
    Wang, Xu
    Li, Yidong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1886 - 1890
  • [46] MAFormer: A transformer network with multi-scale attention fusion for visual recognition
    Sun, Huixin
    Wang, Yunhao
    Wang, Xiaodi
    Zhang, Bin
    Xin, Ying
    Zhang, Baochang
    Cao, Xianbin
    Ding, Errui
    Han, Shumin
    NEUROCOMPUTING, 2024, 595
  • [47] A Real-time Hand Gesture Recognition Algorithm For an Embedded System
    You Lei
    Wang Hongpeng
    Tan Dianxiong
    Wangjue
    2014 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2014), 2014, : 901 - 905
  • [48] A real-time applicable dynamic hand gesture recognition framework
    Kopinski, Thomas
    Gepperth, Alexander
    Handmann, Uwe
    2015 IEEE 18TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, : 2358 - 2363
  • [49] Multi-Scale Fusion Stereo Matching Algorithm Based on Adaptive Texture Region
    Chen, Yi
    Yu, Jiyan
    Yu, Hongsen
    Computer Engineering and Applications, 2023, 59 (18) : 198 - 206
  • [50] Adaptive Real-Time Gesture Recognition in a Dynamic Scenario for Human-Robot Collaborative Applications
    Scoccia, Cecilia
    Menchi, Giacomo
    Ciccarelli, Marianna
    Forlini, Matteo
    Papetti, Alessandra
    ADVANCES IN ITALIAN MECHANISM SCIENCE, IFTOMM ITALY 2022, 2022, 122 : 637 - 644