Prompt-supervised dynamic attention graph convolutional network for skeleton-based action recognition

被引:0
|
作者
Zhu, Shasha [1 ]
Sun, Lu [1 ]
Ma, Zeyuan [1 ]
Li, Chenxi [1 ]
He, Dongzhi [1 ]
机构
[1] Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
关键词
Skeleton-based action recognition; Graph convolutional network; Attention mechanism; Dynamic convolution; Prompt learning;
D O I
10.1016/j.neucom.2024.128623
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skeleton-based action recognition is a core task in the field of video understanding. Skeleton sequences are characterized by high information density, low redundancy, and clear structural information, thereby facilitating the analysis of complex relationships among human behaviors more readily than other modalities. Although existing studies have encoded skeleton data and achieved positive outcomes, they have often overlooked the precise high-level semantic information inherent in the action descriptions. To address this issue, this paper proposes a prompt-supervised dynamic attention graph convolutional network (PDA-GCN). Specifically, the PDA-GCN incorporates a prompt supervision (PS) module that leverages a pre-trained large-scale language model (LLM) as a knowledge engine and retains the generated text features as prompts to provide additional supervision during model training, enhancing the model's ability to discern analogous actions with negligible computational cost. In addition, for the purpose of bolstering the learning of discriminative features, a dynamic attention graph convolution (DA-GC) module is presented. This module utilizes self-attention mechanism to adaptively infer intrinsic relationships between joints and integrates dynamic convolution to strengthen the emphasis on local information. This dual focus on both global context and local details further amplifies the efficiency and effectiveness of the model. Extensive experiments, conducted on the widely-used skeleton-based action recognition datasets NTU RGB+D 60 and NTU RGB+D 120, demonstrate that the PDA-GCN surpasses known state-of-the-art methods, achieving accuracies of 93.4% on the NTU RGB+D 60 cross-subject split and 90.7% on the NTU RGB+D 120 cross-subject split.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition
    Si, Chenyang
    Chen, Wentao
    Wang, Wei
    Wang, Liang
    Tan, Tieniu
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1227 - 1236
  • [2] Independent Dual Graph Attention Convolutional Network for Skeleton-Based Action Recognition
    Huo, Jinze
    Cai, Haibin
    Meng, Qinggang
    NEUROCOMPUTING, 2024, 583
  • [3] Skeleton-Based Action Recognition with Shift Graph Convolutional Network
    Cheng, Ke
    Zhang, Yifan
    He, Xiangyu
    Chen, Weihan
    Cheng, Jian
    Lu, Hanqing
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 180 - 189
  • [4] Graph convolutional network with STC attention and adaptive normalization for skeleton-based action recognition
    Zhou, Haiyun
    Xiang, Xuezhi
    Qiu, Yujian
    Liu, Xuzhao
    IMAGING SCIENCE JOURNAL, 2023, 71 (07): : 636 - 646
  • [5] A tri-attention enhanced graph convolutional network for skeleton-based action recognition
    Li, Xingming
    Zhai, Wei
    Cao, Yang
    IET COMPUTER VISION, 2021, 15 (02) : 110 - 121
  • [6] A lightweight graph convolutional network for skeleton-based action recognition
    Dinh-Tan Pham
    Quang-Tien Pham
    Tien-Thanh Nguyen
    Thi-Lan Le
    Hai Vu
    Multimedia Tools and Applications, 2023, 82 : 3055 - 3079
  • [7] Shallow Graph Convolutional Network for Skeleton-Based Action Recognition
    Yang, Wenjie
    Zhang, Jianlin
    Cai, Jingju
    Xu, Zhiyong
    SENSORS, 2021, 21 (02) : 1 - 14
  • [8] Ghost Graph Convolutional Network for Skeleton-based Action Recognition
    Jang, Sungjun
    Lee, Heansung
    Cho, Suhwan
    Woo, Sungmin
    Lee, Sangyoun
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2021,
  • [9] A lightweight graph convolutional network for skeleton-based action recognition
    Pham, Dinh-Tan
    Pham, Quang-Tien
    Nguyen, Tien-Thanh
    Le, Thi-Lan
    Vu, Hai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (02) : 3055 - 3079
  • [10] Shuffle Graph Convolutional Network for Skeleton-Based Action Recognition
    Yu, Qiwei
    Dai, Yaping
    Hirota, Kaoru
    Shao, Shuai
    Dai, Wei
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (05) : 790 - 800