Spatial-Temporal Attention Res-TCN for Skeleton-Based Dynamic Hand Gesture Recognition

被引:74
|
作者
Hou, Jingxuan [1 ]
Wang, Guijin [1 ]
Chen, Xinghao [1 ]
Xue, Jing-Hao [2 ]
Zhu, Rui [3 ]
Yang, Huazhong [1 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] UCL, London, England
[3] Univ Kent, Canterbury, Kent, England
关键词
Dynamic hand gesture recognition; Spatial-Temporal Attention; Temporal Convolutional Networks; NEURAL-NETWORKS;
D O I
10.1007/978-3-030-11024-6_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamic hand gesture recognition is a crucial yet challenging task in computer vision. The key of this task lies in an effective extraction of discriminative spatial and temporal features to model the evolutions of different gestures. In this paper, we propose an end-to-end Spatial-Temporal Attention Residual Temporal Convolutional Network (STA-Res-TCN) for skeleton-based dynamic hand gesture recognition, which learns different levels of attention and assigns them to each spatia-ltemporal feature extracted by the convolution filters at each time step. The proposed attention branch assists the networks to adaptively focus on the informative time frames and features while exclude the irrelevant ones that often bring in unnecessary noise. Moreover, our proposed STA-Res-TCN is a lightweight model that can be trained and tested in an extremely short time. Experiments on DHG-14/28 Dataset and SHREC'17 Track Dataset show that STA-Res-TCN outperforms state-of-the-art methods on both the 14 gestures setting and the more complicated 28 gestures setting.
引用
收藏
页码:273 / 286
页数:14
相关论文
共 50 条
  • [1] Spatial temporal graph convolutional networks for skeleton-based dynamic hand gesture recognition
    Li, Yong
    He, Zihang
    Ye, Xiang
    He, Zuguo
    Han, Kangrong
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2019, 2019 (01)
  • [2] Spatial-Temporal Dynamic Graph Attention Network for Skeleton-Based Action Recognition
    Rahevar, Mrugendrasinh
    Ganatra, Amit
    Saba, Tanzila
    Rehman, Amjad
    Bahaj, Saeed Ali
    IEEE ACCESS, 2023, 11 : 21546 - 21553
  • [3] Spatial temporal graph convolutional networks for skeleton-based dynamic hand gesture recognition
    Yong Li
    Zihang He
    Xiang Ye
    Zuguo He
    Kangrong Han
    EURASIP Journal on Image and Video Processing, 2019
  • [4] Skeleton-based Dynamic hand gesture recognition
    De Smedt, Quentin
    Wannous, Hazem
    Vandeborre, Jean-Philippe
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 1206 - 1214
  • [5] Spatial-temporal graph attention networks for skeleton-based action recognition
    Huang, Qingqing
    Zhou, Fengyu
    He, Jiakai
    Zhao, Yang
    Qin, Runze
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (05)
  • [6] Spatial--Temporal Synchronous Transformer for Skeleton-Based Hand Gesture Recognition
    Zhao, Dongdong
    Li, Hongli
    Yan, Shi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1403 - 1412
  • [7] Skeleton-based action recognition with local dynamic spatial-temporal aggregation
    Hu, Lianyu
    Liu, Shenglan
    Feng, Wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 232
  • [8] Spatial-Temporal gated graph attention network for skeleton-based action recognition
    Rahevar, Mrugendrasinh
    Ganatra, Amit
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 929 - 939
  • [9] Dynamic Spatial-temporal Hypergraph Convolutional Network for Skeleton-based Action Recognition
    Wang, Shengqin
    Zhang, Yongji
    Qi, Hong
    Zhao, Minghao
    Jiang, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2147 - 2152
  • [10] Dynamic spatial-temporal topology graph network for skeleton-based action recognition
    Chen, Lian
    Lu, Ke
    Niu, Zehai
    Wei, Runchen
    Xue, Jian
    MULTIMEDIA SYSTEMS, 2024, 30 (06)