An attention-based RGBD dual-branch gesture recognition network

被引:0
|
作者
Chen, Bo [1 ,2 ]
Xie, Pengwei [1 ,2 ]
Hao, Nan [1 ,2 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Key Lab Complex Syst Intelligent Control & Decis, Beijing 100081, Peoples R China
关键词
Gesture Recognition; RGBD feature fusion; attention mechanism; real-time;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we use a hierarchical architecture based on detector-classifier for gesture recognition task. During the operation of the architecture, the detector,which is essentially the switch of the classifier,is always running. When the output of the detector is true, then the classifier is activated and returns a classification label for the input video stream. Our work focuses on the improvement of detectors and classifiers. In the detector, we introduce an attention mechanism to guide the network to focus on the space and channel where the gesture is located. For the classifier, based on the RGB information stream, we use an independent branch to extract the features of the depth stream, and finally merge the two branches. Because gestures move in a three-dimensional space, depth information can make up for the lack of RGB information. Experiments show that on the Egogesture test set, our detector achieves 98.86% accuracy on RGB input, while the classifier achieves 93.85% accuracy. At the same time, our gesture recognition architecture can fully meet the real-time requirements.
引用
收藏
页码:8022 / 8027
页数:6
相关论文
共 50 条
  • [31] A dual-branch and dual attention transformer and CNN hybrid network for ultrasound image segmentation
    Zhang, Chong
    Wang, Lingtong
    Wei, Guohui
    Kong, Zhiyong
    Qiu, Min
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [32] Attention-Based Gesture Recognition Using Commodity WiFi Devices
    Gu, Yu
    Yan, Huan
    Zhang, Xiang
    Wang, Yantong
    Huang, Jinyang
    Ji, Yusheng
    Ren, Fuji
    IEEE SENSORS JOURNAL, 2023, 23 (09) : 9685 - 9696
  • [33] Dual-Branch Multimodal Fusion Network for Driver Facial Emotion Recognition
    Wang, Le
    Chang, Yuchen
    Wang, Kaiping
    APPLIED SCIENCES-BASEL, 2024, 14 (20):
  • [34] Heterogeneous Dual-Branch Emotional Consistency Network for Facial Expression Recognition
    Mao, Shasha
    Zhang, Yuanyuan
    Yan, Dandan
    Chen, Puhua
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 566 - 570
  • [35] Dual-Branch Network With a Subtle Motion Detector for Microaction Recognition in Videos
    Mi, Yang
    Zhang, Xingyuan
    Li, Zhongguo
    Wang, Song
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 6194 - 6208
  • [36] Contrastive dual-branch network for long-tailed visual recognition
    Miao, Jie
    Zhai, Junhai
    Han, Ling
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
  • [37] DBT: multimodal emotion recognition based on dual-branch transformer
    Yufan Yi
    Yan Tian
    Cong He
    Yajing Fan
    Xinli Hu
    Yiping Xu
    The Journal of Supercomputing, 2023, 79 : 8611 - 8633
  • [38] DBT: multimodal emotion recognition based on dual-branch transformer
    Yi, Yufan
    Tian, Yan
    He, Cong
    Fan, Yajing
    Hu, Xinli
    Xu, Yiping
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (08): : 8611 - 8633
  • [39] Multiscaled Multi-Head Attention-Based Video Transformer Network for Hand Gesture Recognition
    Garg, Mallika
    Ghosh, Debashis
    Pradhan, Pyari Mohan
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 80 - 84
  • [40] A Dual Attention-based Modality-Collaborative Fusion Network for Emotion Recognition
    Zhang, Xiaoheng
    Li, Yang
    INTERSPEECH 2023, 2023, : 1468 - 1472