Spatio-Temporal Self-supervision for Few-Shot Action Recognition

被引：0

作者：

Yu, Wanchuan ^{[1
]}

Guo, Hanyu ^{[1
]}

Yan, Yan ^{[1
]}

Li, Jie ^{[2
]}

Wang, Hanzi ^{[1
]}

机构：

[1] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart City, Xiamen, Peoples R China

[2] Xidian Univ, Sch Elect Engn, Video & Image Proc Syst Lab, Xian, Peoples R China

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I | 2024年 / 14425卷

基金：

中国国家自然科学基金;

关键词：

Few-shot learning; Action recognition; Self-supervised learning;

D O I：

10.1007/978-981-99-8429-9_7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Few-shot action recognition aims to classify unseen action classes with limited labeled training samples. Most current works follow the metric learning technology to learn a good embedding and an appropriate comparison metric. Due to the limited labeled data, the generalization of embedding networks is limited when employing the meta-learning process with episodic tasks. In this paper, we aim to repurpose self-supervised learning to learn a more generalized few-shot embedding model. Specifically, a Spatio-Temporal Self-supervision (STS) framework for few-shot action recognition is proposed to generate self-supervision loss at the spatial and temporal levels as auxiliary losses. By this means, the proposed STS can provide a robust representation for few-shot action recognition. Furthermore, we propose a Spatio-Temporal Aggregation (STA) module that accounts for the spatial information relationship among all frames within a video sequence to achieve optimal video embedding. Experiments on several challenging few-shot action recognition benchmarks show the effectiveness of the proposed method in achieving state-of-the-art performance for few-shot action recognition.

引用

页码：84 / 96

页数：13

共 50 条

[41] Task Adaptive Modeling for Few-shot Action Recognition
Wang, Jiayi
Jin, Yi
Feng, Songhe
Li, Yidong
2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
[42] Matching Compound Prototypes for Few-Shot Action Recognition
Huang, Yifei
Yang, Lijin
Chen, Guo
Zhang, Hongjie
Lu, Feng
Sato, Yoichi
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 3977 - 4002
[43] Anomalous Action Recognition Research for Few-shot Learning
Qi, Yufei
Liu, Ting
Fu, Yuzhuo
PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 1306 - 1310
[44] EdgeFont: Enhancing style and content representations in few-shot font generation with multi-scale edge self-supervision
Wang, Yefei
Xiong, Kangyue
Yuan, Yiyang
Zeng, Jinshan
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 262
[45] Enhancing Few-Shot Action Recognition Using Skeleton Temporal Alignment and Adversarial Training
Xu, Qingyang
Yang, Jianjun
Zhang, Hongyi
Jie, Xin
Bandara, Danushka
IEEE ACCESS, 2024, 12 : 31745 - 31755
[46] Hierarchical Task-aware Temporal Modeling and Matching for few-shot action recognition
Zhan, Yucheng
Pan, Yijun
Wu, Siying
Zhang, Yueyi
Sun, Xiaoyan
NEUROCOMPUTING, 2025, 624
[47] Hierarchical compositional representations for few-shot action recognition
Li, Changzhen
Zhang, Jie
Wu, Shuzhe
Jin, Xin
Shan, Shiguang
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
[48] Advances in Few-Shot Action Recognition: A Comprehensive Review
Ruan, Zanxi
Wei, Yingmei
Yuan, Yifei
Li, Yu
Guo, Yanming
Xie, Yuxiang
2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 390 - 398
[49] Motion-modulated Temporal Fragment Alignment Network For Few-Shot Action Recognition
Wu, Jiamin
Zhang, Tianzhu
Zhang, Zhe
Wu, Feng
Zhang, Yongdong
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9141 - 9150
[50] A Generative Approach to Zero-Shot and Few-Shot Action Recognition
Mishra, Ashish
Verma, Vinay Kumar
Reddy, M. Shiva Krishna
Arulkumar, S.
Rai, Piyush
Mittal, Anurag
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 372 - 380

← 1 2 3 4 5 →