Real-Time Dynamic Gesture Recognition Algorithm Based on Adaptive Information Fusion and Multi-Scale Optimization Transformer

被引:1
|
作者
Lu, Guangda [1 ,2 ]
Sun, Wenhao [1 ,2 ]
Qin, Zhuanping [1 ,2 ]
Guo, Tinghang [1 ,2 ]
机构
[1] Tianjin Univ Technol & Educ, Sch Automat & Elect Engn, 1310 Dagu South Rd, Tianjin 300222, Peoples R China
[2] Tianjin Key Lab Informat Sensing & Intelligent Co, 1310 DaGu South Rd, Tianjin 300222, Peoples R China
关键词
dynamic gesture recognition; Transformer; optical flow; information fusion;
D O I
10.20965/jaciii.2023.p1096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gesture recognition is a popular technology in the field of computer vision and an important technical mean of achieving human-computer interaction. To address problems such as the limited long-range feature extraction capability of existing dynamic gesture recognition networks based on convolutional operators, we propose a dynamic gesture recognition algorithm based on spatial pyramid pooling Transformer and optical flow information fusion. We take advantage of Transformer's large receptive field to reduce model computation while improving the model's ability to extract features at different scales by embedding spatial pyramid pooling. We use the optical flow algorithm with the global motion aggregation module to obtain an optical flow map of hand motion, and to extract the key frames based on the similarity minimization principle. We also design an adaptive feature fusion method to fuse the spatial and temporal features of the dual channels. Finally, we demonstrate the effectiveness of model components on model recognition enhancement through ablation experiments. We conduct training and validation on the SCUT-DHGA dynamic gesture dataset and on a dataset we collected, and we perform real-time dynamic gesture recognition tests using the trained model. The results show that our algorithm achieves high accuracy even while keeping the parameters balanced. It also achieves fast and accurate recognition of dynamic gestures in real-time tests.
引用
收藏
页码:1096 / 1107
页数:12
相关论文
共 50 条
  • [21] Real-Time Robotic Grasp Detection with Multi-Scale Feature Fusion
    Ma, Hao
    Yuan, Ding
    Cao, Zhe
    Yin, Jihao
    2020 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE-RCAR 2020), 2020, : 140 - 145
  • [22] Real-time face recognition based on pre-identification and multi-scale classification
    Min, Weidong
    Fan, Mengdan
    Li, Jing
    Han, Qing
    IET COMPUTER VISION, 2019, 13 (02) : 165 - 171
  • [23] An Approach to Dynamic Gesture Recognition for Real-Time Interaction
    Zhao, Jinli
    Chen, Tianding
    SIXTH INTERNATIONAL SYMPOSIUM ON NEURAL NETWORKS (ISNN 2009), 2009, 56 : 369 - 377
  • [24] A Real-Time Gait Phase Recognition Method Based on Multi-Information Fusion
    Zhang, Yue-Peng
    Cao, Guang-Zhong
    Ling, Zi-Qin
    He, Bin-Bin
    Cheng, Hao-Ran
    Li, Wen-Zhou
    Cao, Sheng-Bin
    2021 18TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2021, : 249 - 255
  • [25] Multi-scale feature adaptive fusion model for real-time detection in complex citrus orchard environments
    Zhang, Yunfeng
    Li, Li
    Chun, Changpin
    Wen, Yifeng
    Xu, Gang
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 219
  • [26] Robust human gesture recognition by leveraging multi-scale feature fusion
    Deng, Minwei
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 83
  • [27] Real-Time Dynamic Gesture Recognition based on Boundary-Constraint Dynamic Time Warping
    Cheng, Chunling
    Liu, Yangjunwu
    Yang, Jian
    Zhu, Tao
    Ye, Feng
    PROCEEDINGS OF THE 2019 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON), 2019, : 545 - 551
  • [28] Real-Time Gesture Recognition Based on Kinect
    Bao Zhiqiang
    Lu Chengang
    LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (03)
  • [29] Fusion multi-scale Transformer skin lesion segmentation algorithm
    Liang L.-M.
    Zhou L.-S.
    Yin J.
    Sheng X.-Q.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2024, 54 (04): : 1086 - 1098
  • [30] A novel lightweight multi-scale feature fusion segmentation algorithm for real-time cervical lesion screening
    Yang, Jiahui
    Zhang, Ying
    Fan, Wenlong
    Wang, Jie
    Zhang, Xinhe
    Liu, Chunhui
    Liu, Shuang
    Xue, Linyan
    SCIENTIFIC REPORTS, 2025, 15 (01):