An improved spatial temporal graph convolutional network for robust skeleton-based action recognition

被引:14
|
作者
Xing, Yuling [1 ]
Zhu, Jia [2 ]
Li, Yu [1 ]
Huang, Jin [1 ]
Song, Jinlong [1 ]
机构
[1] South China Normal Univ, 55 Zhongshan Ave West, Guangzhou, Peoples R China
[2] Zhejiang Normal Univ, Key Lab Intelligent Educ Technol & Applicat Zheji, Hangzhou, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Adaptive graph; Multi-scale; Occlusion and noise;
D O I
10.1007/s10489-022-03589-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skeleton-based action recognition methods using complete human skeletons have achieved remarkable performance, but the performance of these methods could significantly deteriorate when critical joints or frames of the skeleton sequence are occluded or disrupted. However, the acquisition of incomplete and noisy human skeletons is inevitable in realistic environments. In order to strengthen the robustness of action recognition model, we propose an Improved Spatial Temporal Graph Convolutional Network (IST-GCN) model, including three modules, namely Multi-dimension Adaptive Graph Convolutional Network (Md-AGCN), Enhanced Attention Mechanism (EAM) and Multi-Scale Temporal Convolutional Network (MS-TCN). Specifically, the Md-AGCN module can first adaptively adjust the graph structure according to different layers and the spatial dimension, temporal dimension, and channel dimension of different action samples to establish corresponding connections for long-range joints with dependencies. Then, the EAM module can focus on important information based on spatial domain, temporal domain and channel to further strengthen the dependencies between important joints. Finally, the MS-TCN module is used to enlarge the receptive field to extract more latent temporal dependencies. The comprehensive experiments on NTU-RGB+D and NTU-RGB+D 120 datasets demonstrate that our approach possesses outstanding performance in terms of both accuracy and robustness when skeleton samples are incomplete and noisy compared with the state-of-the-art (SOTA) approach. Moreover, the parameters and computational complexity of our model are far less than those of the existing approaches.
引用
收藏
页码:4592 / 4608
页数:17
相关论文
共 50 条
  • [21] A lightweight graph convolutional network for skeleton-based action recognition
    Pham, Dinh-Tan
    Pham, Quang-Tien
    Nguyen, Tien-Thanh
    Le, Thi-Lan
    Vu, Hai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (02) : 3055 - 3079
  • [22] Shuffle Graph Convolutional Network for Skeleton-Based Action Recognition
    Yu, Qiwei
    Dai, Yaping
    Hirota, Kaoru
    Shao, Shuai
    Dai, Wei
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (05) : 790 - 800
  • [23] Feedback Graph Convolutional Network for Skeleton-Based Action Recognition
    Yang, Hao
    Yan, Dan
    Zhang, Li
    Sun, Yunda
    Li, Dong
    Maybank, Stephen J.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 164 - 175
  • [24] Hierarchical Graph Convolutional Network for Skeleton-Based Action Recognition
    Huang, Linjiang
    Huang, Yan
    Ouyang, Wanli
    Wang, Liang
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 93 - 102
  • [25] Multiple temporal scale aggregation graph convolutional network for skeleton-based action recognition
    Li, Xuanfeng
    Lu, Jian
    Zhou, Jian
    Liu, Wei
    Zhang, Kaibing
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [26] Improved Graph Convolutional Network with Enriched Graph Topology Representation for Skeleton-Based Action Recognition
    Alsarhan, Tamam
    Harfoushi, Osama
    Shdefat, Ahmed Younes
    Mostafa, Nour
    Alshinwan, Mohammad
    Ali, Ahmad
    ELECTRONICS, 2023, 12 (04)
  • [27] Fast Temporal Graph Convolutional Model for Skeleton-Based Action Recognition
    Nan, Mihai
    Florea, Adina Magda
    SENSORS, 2022, 22 (19)
  • [28] Multi-scale Spatial and Temporal Feature Aggregation Graph Convolutional Network for Skeleton-Based Action Recognition
    Du, Yifei
    Zhang, Mingliang
    Li, Bin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII, 2025, 15037 : 511 - 524
  • [29] Temporal segment graph convolutional networks for skeleton-based action recognition
    Ding, Chongyang
    Wen, Shan
    Ding, Wenwen
    Liu, Kai
    Belyaev, Evgeny
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 110
  • [30] Dynamic Spatial-temporal Hypergraph Convolutional Network for Skeleton-based Action Recognition
    Wang, Shengqin
    Zhang, Yongji
    Qi, Hong
    Zhao, Minghao
    Jiang, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2147 - 2152