Enhancing Robustness of Viewpoint Changes in 3D Skeleton-Based Human Action Recognition

被引:3
|
作者
Park, Jinyoon [1 ,2 ]
Kim, Chulwoong [2 ]
Kim, Seung-Chan [1 ]
机构
[1] Sungkyunkwan Univ, Dept Sport Interact Sci, Machine Learning Syst Lab, Suwon 16419, South Korea
[2] TAIIPA Taean AI Ind Promot Agcy, Taean 32154, South Korea
关键词
action recognition; machine learning; feature learning; skeletal data; data augmentation;
D O I
10.3390/math11153280
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Previous research on 3D skeleton-based human action recognition has frequently relied on a sequence-wise viewpoint normalization process, which adjusts the view directions of all segmented action sequences. This type of approach typically demonstrates robustness against variations in viewpoint found in short-term videos, a characteristic commonly encountered in public datasets. However, our preliminary investigation of complex action sequences, such as discussions or smoking, reveals its limitations in capturing the intricacies of such actions. To address these view-dependency issues, we propose a straightforward, yet effective, sequence-wise augmentation technique. This strategy enhances the robustness of action recognition models, particularly against changes in viewing direction that mainly occur within the horizontal plane (azimuth) by rotating human key points around either the z-axis or the spine vector, effectively creating variations in viewing directions. We scrutinize the robustness of this approach against real-world viewpoint variations through extensive empirical studies on multiple public datasets, including an additional set of custom action sequences. Despite the simplicity of our approach, our experimental results consistently yield improved action recognition accuracies. Compared to the sequence-wise viewpoint normalization method used with advanced deep learning models like Conv1D, LSTM, and Transformer, our approach showed a relative increase in accuracy of 34.42% for the z-axis and 10.86% for the spine vector.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Image representation of pose-transition feature for 3D skeleton-based action recognition
    Thien Huynh-The
    Hua, Cam-Hao
    Trung-Thanh Ngo
    Kim, Dong-Seong
    INFORMATION SCIENCES, 2020, 513 : 112 - 126
  • [22] MCTD: Motion-Coordinate-Time Descriptor for 3D Skeleton-Based Action Recognition
    Liang, Qi
    Wang, Feng
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 577 - 587
  • [23] Contrastive Mask Learning for Self-Supervised 3D Skeleton-Based Action Recognition
    Zhang, Haoyuan
    SENSORS, 2025, 25 (05)
  • [24] Human Action Recognition Based on Quaternion 3D Skeleton Representation
    Xu Haiyang
    Kong Jun
    Jiang Min
    LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (02)
  • [25] 3D PostureNet: A unified framework for skeleton-based posture recognition
    Liu, Jianbo
    Wang, Ying
    Liu, Yongcheng
    Xiang, Shiming
    Pan, Chunhong
    PATTERN RECOGNITION LETTERS, 2020, 140 (140) : 143 - 149
  • [26] Hidden States Exploration for 3D Skeleton-based Gesture Recognition
    Liu, Xin
    Shi, Henglin
    Hong, Xiaopeng
    Chen, Haoyu
    Tao, Dacheng
    Zhao, Guoying
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1846 - 1855
  • [27] Revisiting Skeleton-based Action Recognition
    Duan, Haodong
    Zhao, Yue
    Chen, Kai
    Lin, Dahua
    Dai, Bo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2959 - 2968
  • [28] Action Tree Convolutional Networks: Skeleton-Based Human Action Recognition
    Liu, Wenjie
    Zhang, Ziyi
    Han, Bing
    Zhu, Chenhui
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 783 - 792
  • [29] Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition
    Yang, Jianyu
    Liu, Wu
    Yuan, Junsong
    Mei, Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 883 - 898
  • [30] InfoGCN: Representation Learning for Human Skeleton-based Action Recognition
    Chi, Hyung-gun
    Ha, Myoung Hoon
    Chi, Seunggeun
    Lee, Sang Wan
    Huang, Qixing
    Ramani, Karthik
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20154 - 20164