Enhancing Robustness of Viewpoint Changes in 3D Skeleton-Based Human Action Recognition

被引:3
|
作者
Park, Jinyoon [1 ,2 ]
Kim, Chulwoong [2 ]
Kim, Seung-Chan [1 ]
机构
[1] Sungkyunkwan Univ, Dept Sport Interact Sci, Machine Learning Syst Lab, Suwon 16419, South Korea
[2] TAIIPA Taean AI Ind Promot Agcy, Taean 32154, South Korea
关键词
action recognition; machine learning; feature learning; skeletal data; data augmentation;
D O I
10.3390/math11153280
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Previous research on 3D skeleton-based human action recognition has frequently relied on a sequence-wise viewpoint normalization process, which adjusts the view directions of all segmented action sequences. This type of approach typically demonstrates robustness against variations in viewpoint found in short-term videos, a characteristic commonly encountered in public datasets. However, our preliminary investigation of complex action sequences, such as discussions or smoking, reveals its limitations in capturing the intricacies of such actions. To address these view-dependency issues, we propose a straightforward, yet effective, sequence-wise augmentation technique. This strategy enhances the robustness of action recognition models, particularly against changes in viewing direction that mainly occur within the horizontal plane (azimuth) by rotating human key points around either the z-axis or the spine vector, effectively creating variations in viewing directions. We scrutinize the robustness of this approach against real-world viewpoint variations through extensive empirical studies on multiple public datasets, including an additional set of custom action sequences. Despite the simplicity of our approach, our experimental results consistently yield improved action recognition accuracies. Compared to the sequence-wise viewpoint normalization method used with advanced deep learning models like Conv1D, LSTM, and Transformer, our approach showed a relative increase in accuracy of 34.42% for the z-axis and 10.86% for the spine vector.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] 3D skeleton-based action recognition by representing motion capture sequences as 2D-RGB images
    Laraba, Sohaib
    Brahimi, Mohammed
    Tilmanne, Joelle
    Dutoit, Thierry
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2017, 28 (3-4)
  • [42] Skeleton Graph Scattering Networks for 3D Skeleton-based Human Motion Prediction
    Li, Maosen
    Chen, Siheng
    Liu, Zihui
    Zhang, Zijing
    Xie, Lingxi
    Tian, Qi
    Zhang, Ya
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 854 - 864
  • [43] Physics-Augmented Autoencoder for 3D Skeleton-Based Gait Recognition
    Guo, Hongji
    Ji, Qiang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 19570 - 19581
  • [44] Fourier analysis on robustness of graph convolutional neural networks for skeleton-based action recognition
    Tanaka, Nariki
    Kera, Hiroshi
    Kawamoto, Kazuhiko
    Computer Vision and Image Understanding, 2024, 240
  • [45] Fast 3D-graph convolutional networks for skeleton-based action recognition
    Zhang, Guohao
    Wen, Shuhuan
    Li, Jiaqi
    Che, Haijun
    APPLIED SOFT COMPUTING, 2023, 145
  • [46] Fourier analysis on robustness of graph convolutional neural networks for skeleton-based action recognition
    Tanaka, Nariki
    Kera, Hiroshi
    Kawamoto, Kazuhiko
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
  • [47] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
    Yu, Lubin
    Tian, Lianfang
    Du, Qiliang
    Bhutto, Jameel Ahmed
    APPLIED INTELLIGENCE, 2023, 53 (12) : 14838 - 14854
  • [48] Unsupervised 3D Skeleton-Based Action Recognition using Cross-Attention with Conditioned Generation Capabilities
    Lerch, David J.
    Zhong, Zeyun
    Martin, Manuel
    Voit, Michael
    Beyerer, Juergen
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 202 - 211
  • [49] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
    Lubin Yu
    Lianfang Tian
    Qiliang Du
    Jameel Ahmed Bhutto
    Applied Intelligence, 2023, 53 : 14838 - 14854
  • [50] RELATIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION
    Zheng, Wu
    Li, Lin
    Zhang, Zhaoxiang
    Huang, Yan
    Wang, Liang
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 826 - 831