Prompt-supervised dynamic attention graph convolutional network for skeleton-based action recognition

被引:0
|
作者
Zhu, Shasha [1 ]
Sun, Lu [1 ]
Ma, Zeyuan [1 ]
Li, Chenxi [1 ]
He, Dongzhi [1 ]
机构
[1] Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
关键词
Skeleton-based action recognition; Graph convolutional network; Attention mechanism; Dynamic convolution; Prompt learning;
D O I
10.1016/j.neucom.2024.128623
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skeleton-based action recognition is a core task in the field of video understanding. Skeleton sequences are characterized by high information density, low redundancy, and clear structural information, thereby facilitating the analysis of complex relationships among human behaviors more readily than other modalities. Although existing studies have encoded skeleton data and achieved positive outcomes, they have often overlooked the precise high-level semantic information inherent in the action descriptions. To address this issue, this paper proposes a prompt-supervised dynamic attention graph convolutional network (PDA-GCN). Specifically, the PDA-GCN incorporates a prompt supervision (PS) module that leverages a pre-trained large-scale language model (LLM) as a knowledge engine and retains the generated text features as prompts to provide additional supervision during model training, enhancing the model's ability to discern analogous actions with negligible computational cost. In addition, for the purpose of bolstering the learning of discriminative features, a dynamic attention graph convolution (DA-GC) module is presented. This module utilizes self-attention mechanism to adaptively infer intrinsic relationships between joints and integrates dynamic convolution to strengthen the emphasis on local information. This dual focus on both global context and local details further amplifies the efficiency and effectiveness of the model. Extensive experiments, conducted on the widely-used skeleton-based action recognition datasets NTU RGB+D 60 and NTU RGB+D 120, demonstrate that the PDA-GCN surpasses known state-of-the-art methods, achieving accuracies of 93.4% on the NTU RGB+D 60 cross-subject split and 90.7% on the NTU RGB+D 120 cross-subject split.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Selective directed graph convolutional network for skeleton-based action recognition
    Ke, Chengyuan
    Liu, Sheng
    Feng, Yuan
    Chen, Shengyong
    PATTERN RECOGNITION LETTERS, 2025, 190 : 141 - 146
  • [22] Scale Adaptive Graph Convolutional Network for Skeleton-Based Action Recognition
    Wang X.
    Zhong Y.
    Jin L.
    Xiao Y.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2022, 55 (03): : 306 - 312
  • [23] Feature reconstruction graph convolutional network for skeleton-based action recognition
    Huang, Junhao
    Wang, Ziming
    Peng, Jian
    Huang, Feihu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [24] Temporal Refinement Graph Convolutional Network for Skeleton-Based Action Recognition
    Zhuang T.
    Qin Z.
    Ding Y.
    Deng F.
    Chen L.
    Qin Z.
    Raymond Choo K.-K.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (04): : 1586 - 1598
  • [25] EchoGCN: An Echo Graph Convolutional Network for Skeleton-Based Action Recognition
    Qian, Weiwen
    Huang, Qian
    Li, Chang
    Chen, Zhongqi
    Mao, Yingchi
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), (245-261):
  • [26] Pyramidal Graph Convolutional Network for Skeleton-Based Human Action Recognition
    Li, Fanjia
    Zhu, Aichun
    Liu, Zhongyu
    Huo, Yu
    Xu, Yonggang
    Hua, Gang
    IEEE SENSORS JOURNAL, 2021, 21 (14) : 16183 - 16191
  • [27] Spatial adaptive graph convolutional network for skeleton-based action recognition
    Qilin Zhu
    Hongmin Deng
    Applied Intelligence, 2023, 53 : 17796 - 17808
  • [28] Pose Refinement Graph Convolutional Network for Skeleton-Based Action Recognition
    Li, Shijie
    Yi, Jinhui
    Abu Farha, Yazan
    Gall, Juergen
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 1028 - 1035
  • [29] Adaptive Attention Memory Graph Convolutional Networks for Skeleton-Based Action Recognition
    Liu, Di
    Xu, Hui
    Wang, Jianzhong
    Lu, Yinghua
    Kong, Jun
    Qi, Miao
    SENSORS, 2021, 21 (20)
  • [30] Attention adjacency matrix based graph convolutional networks for skeleton-based action recognition
    Xie, Jun
    Miao, Qiguang
    Liu, Ruyi
    Xin, Wentian
    Tang, Lei
    Zhong, Sheng
    Gao, Xuesong
    NEUROCOMPUTING, 2021, 440 (440) : 230 - 239