Real-Time Dynamic Gesture Recognition Algorithm Based on Adaptive Information Fusion and Multi-Scale Optimization Transformer

被引:1
|
作者
Lu, Guangda [1 ,2 ]
Sun, Wenhao [1 ,2 ]
Qin, Zhuanping [1 ,2 ]
Guo, Tinghang [1 ,2 ]
机构
[1] Tianjin Univ Technol & Educ, Sch Automat & Elect Engn, 1310 Dagu South Rd, Tianjin 300222, Peoples R China
[2] Tianjin Key Lab Informat Sensing & Intelligent Co, 1310 DaGu South Rd, Tianjin 300222, Peoples R China
关键词
dynamic gesture recognition; Transformer; optical flow; information fusion;
D O I
10.20965/jaciii.2023.p1096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gesture recognition is a popular technology in the field of computer vision and an important technical mean of achieving human-computer interaction. To address problems such as the limited long-range feature extraction capability of existing dynamic gesture recognition networks based on convolutional operators, we propose a dynamic gesture recognition algorithm based on spatial pyramid pooling Transformer and optical flow information fusion. We take advantage of Transformer's large receptive field to reduce model computation while improving the model's ability to extract features at different scales by embedding spatial pyramid pooling. We use the optical flow algorithm with the global motion aggregation module to obtain an optical flow map of hand motion, and to extract the key frames based on the similarity minimization principle. We also design an adaptive feature fusion method to fuse the spatial and temporal features of the dual channels. Finally, we demonstrate the effectiveness of model components on model recognition enhancement through ablation experiments. We conduct training and validation on the SCUT-DHGA dynamic gesture dataset and on a dataset we collected, and we perform real-time dynamic gesture recognition tests using the trained model. The results show that our algorithm achieves high accuracy even while keeping the parameters balanced. It also achieves fast and accurate recognition of dynamic gestures in real-time tests.
引用
收藏
页码:1096 / 1107
页数:12
相关论文
共 50 条
  • [31] A Polarizing Universal Multi-scale and Real-time Image Defogging Algorithm
    Lu Xiao-ning
    Liu Yang-yang
    Tan Zheng
    Lu Qun-bo
    ACTA PHOTONICA SINICA, 2019, 48 (08)
  • [32] REAL TIME HAND GESTURE RECOGNITION VIA FINGER-EMPHASIZED MULTI-SCALE DESCRIPTION
    Yang, Jianyu
    Zhu, Chen
    Yuan, Junsong
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 631 - 636
  • [33] Enhancement algorithm for surface electromyographic-based gesture recognition based on real-time fusion of muscle fatigue features
    Yan, Shijia
    Yang, Ye
    Yi, Peng
    Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2024, 41 (05): : 958 - 968
  • [34] Real-time segmentation method of billet infrared image based on multi-scale feature fusion
    Lixin Zhang
    Qingrong Nan
    Shengqin Bian
    Tao Liu
    Zhengguang Xu
    Scientific Reports, 12
  • [35] Real-time segmentation method of billet infrared image based on multi-scale feature fusion
    Zhang, Lixin
    Nan, Qingrong
    Bian, Shengqin
    Liu, Tao
    Xu, Zhengguang
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [36] Real-Time Detection and Motion Recognition of Human Moving Objects Based on Deep Learning and Multi-Scale Feature Fusion in Video
    Gong, Meimei
    Shu, Yiming
    IEEE ACCESS, 2020, 8 : 25811 - 25822
  • [37] Position-Invariant, Real-Time Gesture Recognition Based on Dynamic Time Warping
    Bodiroza, Sasa
    Doisy, Guillaume
    Hafner, Verena Vanessa
    PROCEEDINGS OF THE 8TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI 2013), 2013, : 87 - +
  • [38] Adaptive Multi-scale Information Fusion Based on Dynamic Receptive Field for Image-to-image Translation
    Yin Mengxiao
    Lin Zhenfeng
    Yang Feng
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (08) : 2386 - 2394
  • [39] Real-time multi-scale tracking based on compressive sensing
    Yunxia Wu
    Ni Jia
    Jiping Sun
    The Visual Computer, 2015, 31 : 471 - 484
  • [40] Real-time multi-scale tracking based on compressive sensing
    Wu, Yunxia
    Jia, Ni
    Sun, Jiping
    VISUAL COMPUTER, 2015, 31 (04): : 471 - 484