Enhancing Robustness of Viewpoint Changes in 3D Skeleton-Based Human Action Recognition

被引:3
|
作者
Park, Jinyoon [1 ,2 ]
Kim, Chulwoong [2 ]
Kim, Seung-Chan [1 ]
机构
[1] Sungkyunkwan Univ, Dept Sport Interact Sci, Machine Learning Syst Lab, Suwon 16419, South Korea
[2] TAIIPA Taean AI Ind Promot Agcy, Taean 32154, South Korea
关键词
action recognition; machine learning; feature learning; skeletal data; data augmentation;
D O I
10.3390/math11153280
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Previous research on 3D skeleton-based human action recognition has frequently relied on a sequence-wise viewpoint normalization process, which adjusts the view directions of all segmented action sequences. This type of approach typically demonstrates robustness against variations in viewpoint found in short-term videos, a characteristic commonly encountered in public datasets. However, our preliminary investigation of complex action sequences, such as discussions or smoking, reveals its limitations in capturing the intricacies of such actions. To address these view-dependency issues, we propose a straightforward, yet effective, sequence-wise augmentation technique. This strategy enhances the robustness of action recognition models, particularly against changes in viewing direction that mainly occur within the horizontal plane (azimuth) by rotating human key points around either the z-axis or the spine vector, effectively creating variations in viewing directions. We scrutinize the robustness of this approach against real-world viewpoint variations through extensive empirical studies on multiple public datasets, including an additional set of custom action sequences. Despite the simplicity of our approach, our experimental results consistently yield improved action recognition accuracies. Compared to the sequence-wise viewpoint normalization method used with advanced deep learning models like Conv1D, LSTM, and Transformer, our approach showed a relative increase in accuracy of 34.42% for the z-axis and 10.86% for the spine vector.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Fusion sampling networks for skeleton-based human action recognition
    Chen, Guannan
    Wei, Shimin
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (05)
  • [32] Skeleton-Based Human Action Recognition via Screw Matrices
    DING Wenwen
    LIU Kai
    XU Biao
    CHENG Fei
    Chinese Journal of Electronics, 2017, 26 (04) : 790 - 796
  • [33] STAR: An STGCN ARchitecture for Skeleton-Based Human Action Recognition
    Wu, Weiwei
    Tu, Fengbin
    Niu, Mengqi
    Yue, Zhiheng
    Liu, Leibo
    Wei, Shaojun
    Li, Xiangyu
    Hu, Yang
    Yin, Shouyi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (06) : 2370 - 2383
  • [34] Unsupervised Temporal Adaptation in Skeleton-Based Human Action Recognition
    Tian, Haitao
    Payeur, Pierre
    ALGORITHMS, 2024, 17 (12)
  • [35] Skeleton-based Human Action Recognition in a Thermal Comfort Context
    Martins, John
    Flanigan, Katherine A.
    McComb, Christopher
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, : 377 - 384
  • [36] Skeleton-based Human Action Recognition using Basis Vectors
    Asteriadis, Stylianos
    Daras, Petros
    8TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS (PETRA 2015), 2015,
  • [37] Skeleton-Based Human Action Recognition via Screw Matrices
    Ding Wenwen
    Liu Kai
    Xu Biao
    Cheng Fei
    CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (04) : 790 - 796
  • [38] Skeleton-Based Human Action Recognition:History,Status and Prospects
    Bian, Cunling
    Lyu, Weigang
    Feng, Wei
    Computer Engineering and Applications, 2024, 60 (20) : 1 - 29
  • [39] Pixel Convolutional Networks for Skeleton-Based Human Action Recognition
    Change, Zhichao
    Wang, Jiangyun
    Han, Liang
    METHODS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, 2018, 946 : 513 - 523
  • [40] SKELETON-BASED MODELING OF 3D SURFACES
    Jankauskas, Kestutis
    Noreika, Algirdas
    INFORMATION TECHNOLOGIES' 2009, 2009, : 235 - 242