An Attention-Aware Model for Human Action Recognition on Tree-Based Skeleton Sequences

Cited by: 1
Authors
Ding, Runwei [1 ]
Liu, Chang [1 ]
Liu, Hong [1 ,2 ]
Affiliations
[1] Peking Univ, Shenzhen Grad Sch, Shenzhen, Peoples R China
[2] Peking Univ, Key Lab Machine Percept, Beijing, Peoples R China
Source
SOCIAL ROBOTICS, ICSR 2018 | 2018 / Vol. 11357
Funding
National Natural Science Foundation of China;
Keywords
Human action recognition; Skeleton; Attention-aware model; Tri-directional Tree Traversal Map (TTTM);
DOI
10.1007/978-3-030-05204-1_56
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Skeleton-based human action recognition (HAR) has attracted considerable research attention because of its robustness to variations in location and appearance. However, most existing methods treat the whole skeleton as a fixed pattern and do not consider the importance of different skeleton joints for action recognition. In this paper, a novel CNN-based attention-aware network is proposed. First, to describe the semantic meaning of skeletons and learn the discriminative joints over time, an attention generation network named Global Attention Network (GAN) is proposed to generate attention masks. Then, to encode the spatial structure of skeleton sequences, we design a tree-based traversal (TTTM) rule, which can represent the skeleton structure, as a convolution unit of the main network. Finally, the GAN and the main network are cascaded into a single network that is trained in an end-to-end manner. Experiments show that the TTTM and GAN complement each other, and that the whole network improves on the state of the art: its classification accuracy was 83.6% and 89.5% on the NTU-RGBD CV and CS datasets, outperforming other methods.
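The abstract does not spell out the TTTM rule or the form of the attention masks, so the following is only a minimal sketch of the two general ideas it mentions: ordering joints by walking the skeleton tree, and weighting joints with a mask. The toy skeleton, the joint names, the plain depth-first walk, and the helpers `traverse` and `apply_mask` are all assumptions made for illustration; this is not the paper's tri-directional traversal or its attention network.

```python
# Hypothetical 6-joint skeleton; joint names and tree shape are illustrative only.
SKELETON = {
    "spine": ["head", "l_shoulder", "r_shoulder"],
    "head": [],
    "l_shoulder": ["l_hand"],
    "l_hand": [],
    "r_shoulder": ["r_hand"],
    "r_hand": [],
}


def traverse(tree, root):
    """Depth-first walk that re-emits the parent after each child.

    Consecutive entries in the result are always physically connected
    joints, so a 1-D ordering keeps some of the tree structure.
    (Generic DFS, assumed here; not the TTTM rule from the paper.)
    """
    order = [root]
    for child in tree[root]:
        order.extend(traverse(tree, child))
        order.append(root)  # step back to the parent before the next branch
    return order


def apply_mask(features, mask):
    """Scale each per-joint feature by its attention weight (element-wise)."""
    return [f * m for f, m in zip(features, mask)]


order = traverse(SKELETON, "spine")
print(order)
```

In this sketch the walk visits some joints more than once, which is the price of keeping neighbouring entries adjacent in the skeleton; a mask passed to `apply_mask` would then need one weight per entry of the traversal order.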
Pages: 569 - 579
Page count: 11