Modeling the Uncertainty for Self-supervised 3D Skeleton Action Representation Learning

被引：19

作者：

Su, Yukun ^{[1
]}

Lin, Guosheng ^{[2
]}

Sun, Ruizhou ^{[1
]}

Hao, Yun ^{[1
]}

Wu, Qingyao ^{[1
]}

机构：

[1] South China Univ Technol, Guangzhou, Peoples R China

[2] Nanyang Technol Univ, Singapore, Singapore

来源：

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年

基金：

新加坡国家研究基金会; 中国国家自然科学基金;

关键词：

self-supervised; 3D skeleton action; uncertainty; probabilistic embedding; space;

D O I：

10.1145/3474085.3475248

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Self-supervised learning (SSL) has been proved very effective in learning representations from unlabeled data in language and vision domains. Yet, very few instrumental self-supervised approaches exist for 3D skeleton action understanding, and directly applying the existing SSL methods from other domains for skeleton action learning may suffer from misalignment of representations and some limitations. In this paper, we consider that a good representation learning encoder can distinguish the underlying features of different actions, which can make the similar motions closer while pushing the dissimilar motions away. There exists, however, some uncertainties in the skeleton actions due to the inherent ambiguity of 3D skeleton pose in different viewpoints or the sampling algorithm in contrastive learning, thus, it is ill-posed to differentiate the action features in the deterministic embedding space. To address these issues, we rethink the distance between action features and propose to model each action representation into the probabilistic embedding space to alleviate the uncertainties upon encountering the ambiguous 3D skeleton inputs. To validate the effectiveness of the proposed method, extensive experiments are conducted on Kinetics, NTU60, NTU120, and PKUMMD datasets with several alternative network architectures. Experimental evaluations demonstrate the superiority of our approach and through which, we can gain significant performance improvement without using extra labeled data.

引用

页码：769 / 778

页数：10

共 50 条

[31] Self-Supervised 3D Representation Learning of Dressed Humans From Social Media Videos
Jafarian, Yasamin
Park, Hyun Soo
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8969 - 8983
[32] Collaboratively Self-Supervised Video Representation Learning for Action Recognition
Zhang, Jie
Wan, Zhifan
Hu, Lanqing
Lin, Stephen
Wu, Shuzhe
Shan, Shiguang
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 1895 - 1907
[33] 3D Human Pose Machines with Self-Supervised Learning
Wang, Keze
Lin, Liang
Jiang, Chenhan
Qian, Chen
Wei, Pengxu
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (05) : 1069 - 1082
[34] Self-Supervised Learning of Detailed 3D Face Reconstruction
Chen, Yajing
Wu, Fanzi
Wang, Zeyu
Song, Yibing
Ling, Yonggen
Bao, Linchao
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8696 - 8705
[35] Visual Reinforcement Learning With Self-Supervised 3D Representations
Ze, Yanjie
Hansen, Nicklas
Chen, Yinbo
Jain, Mohit
Wang, Xiaolong
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (05) : 2890 - 2897
[36] Self-Supervised Online Learning of Appearance for 3D Tracking
Lee, Bhoram
Lee, Daniel D.
2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 4930 - 4937
[37] Self-Supervised Deep Learning for 3D Gravity Inversion
Li, Yinshuo
Jia, Zhuo
Lu, Wenkai
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[38] Self-Supervised Deep Learning for 3D Gravity Inversion
Li, Yinshuo
Jia, Zhuo
Lu, Wenkai
IEEE Transactions on Geoscience and Remote Sensing, 2022, 60
[39] Joint Supervised and Self-Supervised Learning for 3D Real World Challenges
Alliegro, Antonio
Boscaini, Davide
Tommasi, Tatiana
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6718 - 6725
[40] Self-Supervised Representation Learning for Skeleton-Based Group Activity Recognition
Bian, Cunling
Feng, Wei
Wang, Song
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5990 - 5998

← 1 2 3 4 5 →