Enhancing Robustness of Viewpoint Changes in 3D Skeleton-Based Human Action Recognition

Cited by: 3

Authors:

Park, Jinyoon [1,2]

Kim, Chulwoong [2]

Kim, Seung-Chan [1]

Affiliations:

[1] Sungkyunkwan Univ, Dept Sport Interact Sci, Machine Learning Syst Lab, Suwon 16419, South Korea

[2] TAIIPA Taean AI Ind Promot Agcy, Taean 32154, South Korea

Source:

MATHEMATICS | 2023 / Vol. 11 / Issue 15

Keywords:

action recognition; machine learning; feature learning; skeletal data; data augmentation

DOI:

10.3390/math11153280

Chinese Library Classification (CLC) number:

O1 [Mathematics]

Discipline classification code:

0701; 070101

Abstract:

Previous research on 3D skeleton-based human action recognition has frequently relied on a sequence-wise viewpoint normalization process, which adjusts the view directions of all segmented action sequences. This type of approach typically demonstrates robustness against variations in viewpoint found in short-term videos, a characteristic commonly encountered in public datasets. However, our preliminary investigation of complex action sequences, such as discussions or smoking, reveals its limitations in capturing the intricacies of such actions. To address these view-dependency issues, we propose a straightforward, yet effective, sequence-wise augmentation technique. This strategy enhances the robustness of action recognition models, particularly against changes in viewing direction that mainly occur within the horizontal plane (azimuth) by rotating human key points around either the z-axis or the spine vector, effectively creating variations in viewing directions. We scrutinize the robustness of this approach against real-world viewpoint variations through extensive empirical studies on multiple public datasets, including an additional set of custom action sequences. Despite the simplicity of our approach, our experimental results consistently yield improved action recognition accuracies. Compared to the sequence-wise viewpoint normalization method used with advanced deep learning models like Conv1D, LSTM, and Transformer, our approach showed a relative increase in accuracy of 34.42% for the z-axis and 10.86% for the spine vector.
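The rotation-based augmentation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the joint indices `hip` and `neck`, the choice of the first frame to define the spine vector, and the uniform angle range of [-pi, pi] are all assumptions made for illustration. It also assumes the z-axis is the vertical axis and that a sequence is a list of frames, each a list of (x, y, z) joint tuples.

```python
import math
import random


def rotation_matrix(axis, angle):
    """Rodrigues' formula: 3x3 rotation by `angle` radians about `axis`."""
    norm = math.sqrt(sum(a * a for a in axis))
    x, y, z = (a / norm for a in axis)
    c, s = math.cos(angle), math.sin(angle)
    t = 1.0 - c
    return [
        [c + x * x * t, x * y * t - z * s, x * z * t + y * s],
        [y * x * t + z * s, c + y * y * t, y * z * t - x * s],
        [z * x * t - y * s, z * y * t + x * s, c + z * z * t],
    ]


def rotate_sequence(seq, axis, angle, center=(0.0, 0.0, 0.0)):
    """Apply the same rotation to every joint of every frame."""
    R = rotation_matrix(axis, angle)
    out = []
    for frame in seq:
        new_frame = []
        for joint in frame:
            p = [joint[i] - center[i] for i in range(3)]
            q = [sum(R[r][k] * p[k] for k in range(3)) + center[r]
                 for r in range(3)]
            new_frame.append(tuple(q))
        out.append(new_frame)
    return out


def augment(seq, mode="z", hip=0, neck=1, rng=random):
    """Rotate a whole sequence by one random angle (hypothetical helper).

    mode="z":     rotate about the global z-axis through the origin.
    mode="spine": rotate about the hip-to-neck vector of the first frame,
                  passing through the hip joint (assumed joint indices).
    """
    angle = rng.uniform(-math.pi, math.pi)  # assumed sampling range
    if mode == "z":
        return rotate_sequence(seq, (0.0, 0.0, 1.0), angle)
    h, n = seq[0][hip], seq[0][neck]
    axis = tuple(n[i] - h[i] for i in range(3))
    return rotate_sequence(seq, axis, angle, center=h)
```

Because a single angle is drawn per sequence and applied to all frames, the motion itself is unchanged and only the apparent viewing direction varies. Rotating about the spine vector rather than the z-axis keeps the augmentation tied to the body's own orientation, which matters when the subject is not upright.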


Pages: 17
